Designing a Semantic Ground Truth for Mathematical Formulae
Sexton, Alan; Sorge, Volker; Suzuki, Masakazu
- Towards a Digital Mathematics Library. Paris, France, July 7-8th, 2010, Publisher: Masaryk University Press(Brno, Czech Republic), page 37-42
Access Full Article
topAbstract
topHow to cite
topSexton, Alan, Sorge, Volker, and Suzuki, Masakazu. "Designing a Semantic Ground Truth for Mathematical Formulae." Towards a Digital Mathematics Library. Paris, France, July 7-8th, 2010. Brno, Czech Republic: Masaryk University Press, 2010. 37-42. <http://eudml.org/doc/220332>.
@inProceedings{Sexton2010,
abstract = {We report on a new project to design a semantic ground truth set for mathematical document analysis. The ground truth set will be generated by annotating recognised mathematical symbols with respect to both their global meaning in the context of the considered documents and their local function within the particular mathematical formula they occur. The aim of our work is to have a reliable database available for semantic classification during the formula recognition process with the aim of enabling correct interpretations of mathematical formulae and generating semantic markup such as Content MathML.},
author = {Sexton, Alan, Sorge, Volker, Suzuki, Masakazu},
booktitle = {Towards a Digital Mathematics Library. Paris, France, July 7-8th, 2010},
keywords = {Content MathML; OCR},
location = {Brno, Czech Republic},
pages = {37-42},
publisher = {Masaryk University Press},
title = {Designing a Semantic Ground Truth for Mathematical Formulae},
url = {http://eudml.org/doc/220332},
year = {2010},
}
TY - CLSWK
AU - Sexton, Alan
AU - Sorge, Volker
AU - Suzuki, Masakazu
TI - Designing a Semantic Ground Truth for Mathematical Formulae
T2 - Towards a Digital Mathematics Library. Paris, France, July 7-8th, 2010
PY - 2010
CY - Brno, Czech Republic
PB - Masaryk University Press
SP - 37
EP - 42
AB - We report on a new project to design a semantic ground truth set for mathematical document analysis. The ground truth set will be generated by annotating recognised mathematical symbols with respect to both their global meaning in the context of the considered documents and their local function within the particular mathematical formula they occur. The aim of our work is to have a reliable database available for semantic classification during the formula recognition process with the aim of enabling correct interpretations of mathematical formulae and generating semantic markup such as Content MathML.
KW - Content MathML; OCR
UR - http://eudml.org/doc/220332
ER -
References
top- Aly, W., Uchida, S., Fujiyoshi, A., Suzuki, M., Statistical classification of spatial relationships among mathematical symbols, In: Proceedings of ICDAR 2009, pages 1350–1354. IEEE Society Press, 2009. (2009)
- Baker, J., Sexton, A., Sorge, V., A linear grammar approach to mathematical formula recognition from PDF, In: Proceedings of Intelligent Computer Mathematics, LNAI. Springer Verlag, Germany, 2009. (2009)
- Baker, J., Sexton, A., Sorge, V., Faithful mathematical formula recognition from PDF documents, In: Proceedings of DAS 2010, 2010. Forthcoming. (2010)
- Buswell, S., Caprotti, O., Carlisle, D. P., Dewar, M. C., Gaëtano, M., Kohlhase, M., The OpenMath Standard, The OpenMath Society, June 2004. (2004)
- Suzuki, M., Tamari, F., Fukuda, R., Uchida, S., Kanahori, T., Infty—an integrated OCR system for mathematical documents, In: Proceedings of ACM Symposium on Document Engineering, pages 95–104. ACM Press, 2003. (2003)
- Suzuki, M., Uchida, S., Nomura, A., A ground-truthed mathematical character and symbol image database, In: Proceedings of ICDAR 2005, pages 675–679. IEEE Society Press, 2005. (2005)
- The American Mathematical Society, 2000 Mathematics Subject Classification, 2000. http://www.ams.org/msc/. (2000)
- Beusekom, J. van, Shafait, F., Breuel, T. M., Automated OCR ground truth generation, In: Proceedings of DAS 2008, Sep 2008. (2008)
NotesEmbed ?
topTo embed these notes on your page include the following JavaScript code on your page where you want the notes to appear.