Designing a Semantic Ground Truth for Mathematical Formulae

Sexton, Alan; Sorge, Volker; Suzuki, Masakazu

  • Towards a Digital Mathematics Library. Paris, France, July 7-8th, 2010, Publisher: Masaryk University Press(Brno, Czech Republic), page 37-42

Abstract

top
We report on a new project to design a semantic ground truth set for mathematical document analysis. The ground truth set will be generated by annotating recognised mathematical symbols with respect to both their global meaning in the context of the considered documents and their local function within the particular mathematical formula they occur. The aim of our work is to have a reliable database available for semantic classification during the formula recognition process with the aim of enabling correct interpretations of mathematical formulae and generating semantic markup such as Content MathML.

How to cite

top

Sexton, Alan, Sorge, Volker, and Suzuki, Masakazu. "Designing a Semantic Ground Truth for Mathematical Formulae." Towards a Digital Mathematics Library. Paris, France, July 7-8th, 2010. Brno, Czech Republic: Masaryk University Press, 2010. 37-42. <http://eudml.org/doc/220332>.

@inProceedings{Sexton2010,
abstract = {We report on a new project to design a semantic ground truth set for mathematical document analysis. The ground truth set will be generated by annotating recognised mathematical symbols with respect to both their global meaning in the context of the considered documents and their local function within the particular mathematical formula they occur. The aim of our work is to have a reliable database available for semantic classification during the formula recognition process with the aim of enabling correct interpretations of mathematical formulae and generating semantic markup such as Content MathML.},
author = {Sexton, Alan, Sorge, Volker, Suzuki, Masakazu},
booktitle = {Towards a Digital Mathematics Library. Paris, France, July 7-8th, 2010},
keywords = {Content MathML; OCR},
location = {Brno, Czech Republic},
pages = {37-42},
publisher = {Masaryk University Press},
title = {Designing a Semantic Ground Truth for Mathematical Formulae},
url = {http://eudml.org/doc/220332},
year = {2010},
}

TY - CLSWK
AU - Sexton, Alan
AU - Sorge, Volker
AU - Suzuki, Masakazu
TI - Designing a Semantic Ground Truth for Mathematical Formulae
T2 - Towards a Digital Mathematics Library. Paris, France, July 7-8th, 2010
PY - 2010
CY - Brno, Czech Republic
PB - Masaryk University Press
SP - 37
EP - 42
AB - We report on a new project to design a semantic ground truth set for mathematical document analysis. The ground truth set will be generated by annotating recognised mathematical symbols with respect to both their global meaning in the context of the considered documents and their local function within the particular mathematical formula they occur. The aim of our work is to have a reliable database available for semantic classification during the formula recognition process with the aim of enabling correct interpretations of mathematical formulae and generating semantic markup such as Content MathML.
KW - Content MathML; OCR
UR - http://eudml.org/doc/220332
ER -

References

top
  1. Aly, W., Uchida, S., Fujiyoshi, A., Suzuki, M., Statistical classification of spatial relationships among mathematical symbols, In: Proceedings of ICDAR 2009, pages 1350–1354. IEEE Society Press, 2009. (2009) 
  2. Baker, J., Sexton, A., Sorge, V., A linear grammar approach to mathematical formula recognition from PDF, In: Proceedings of Intelligent Computer Mathematics, LNAI. Springer Verlag, Germany, 2009. (2009) 
  3. Baker, J., Sexton, A., Sorge, V., Faithful mathematical formula recognition from PDF documents, In: Proceedings of DAS 2010, 2010. Forthcoming. (2010) 
  4. Buswell, S., Caprotti, O., Carlisle, D. P., Dewar, M. C., Gaëtano, M., Kohlhase, M., The OpenMath Standard, The OpenMath Society, June 2004. (2004) 
  5. Suzuki, M., Tamari, F., Fukuda, R., Uchida, S., Kanahori, T., Infty—an integrated OCR system for mathematical documents, In: Proceedings of ACM Symposium on Document Engineering, pages 95–104. ACM Press, 2003. (2003) 
  6. Suzuki, M., Uchida, S., Nomura, A., A ground-truthed mathematical character and symbol image database, In: Proceedings of ICDAR 2005, pages 675–679. IEEE Society Press, 2005. (2005) 
  7. The American Mathematical Society, 2000 Mathematics Subject Classification, 2000. http://www.ams.org/msc/. (2000) 
  8. Beusekom, J. van, Shafait, F., Breuel, T. M., Automated OCR ground truth generation, In: Proceedings of DAS 2008, Sep 2008. (2008) 

NotesEmbed ?

top

You must be logged in to post comments.

To embed these notes on your page include the following JavaScript code on your page where you want the notes to appear.

Only the controls for the widget will be shown in your chosen language. Notes will be shown in their authored language.

Tells the widget how many notes to show per page. You can cycle through additional notes using the next and previous controls.

    
                

Note: Best practice suggests putting the JavaScript code just before the closing </body> tag.