MathML-aware Article Conversion from LaTeX

Stamerjohanns, Heinrich; Ginev, Deyan; David, Catalin; Misev, Dimitar; Zamdzhiev, Vladimir; Kohlhase, Michael

  • Towards a Digital Mathematics Library. Grand Bend, Ontario, Canada, July 8-9th, 2009. Publisher: Masaryk University Press (Brno), pages 109-120

Abstract

Over the last two decades, publishing in Mathematics and in the theoretical areas of Computer Science and Physics has predominantly used TeX/LaTeX as a formatting language. This large corpus of born-digital material is both a boon (LaTeX is a semi-semantic format whose source often contains indications of the author’s intentions) and a problem (TeX is Turing-complete, and authors use this freedom to employ thousands of styles and millions of user macros). Several tools have been developed to convert TeX/LaTeX documents to XML-based, i.e. Web- and DML-compatible, formats. Different DML projects use different tools, and the selection seems largely accidental. To put the choice of converters for DML projects on a more solid footing and to encourage competition and feature convergence, we survey the market. In this paper we investigate and compare five LaTeX-to-XML transformers in three dimensions: (a) ergonomic factors such as documentation and ease of installation, (b) coverage, and (c) quality of the resulting documents (in particular the MathML parts).
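As a rough illustration of what dimensions (b) and (c) could look like in practice, the sketch below checks whether a converter's output is well-formed XML and counts the MathML `math` elements it contains. It is only a minimal sketch under stated assumptions: the function names `mathml_stats` and `coverage` and the two sample strings are hypothetical, and the paper's actual evaluation procedure is not described in this record.

```python
import xml.etree.ElementTree as ET

# Namespace URI defined by the W3C MathML recommendation.
MATHML_NS = "http://www.w3.org/1998/Math/MathML"


def mathml_stats(xml_text):
    """Return (well_formed, math_count) for one converter output.

    Hypothetical helper: well_formed is False when the text does not
    parse as XML; math_count is the number of MathML <math> elements.
    """
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        return False, 0
    math_tag = f"{{{MATHML_NS}}}math"
    return True, sum(1 for _ in root.iter(math_tag))


def coverage(outputs):
    """Fraction of outputs that parse as well-formed XML (assumed metric)."""
    if not outputs:
        return 0.0
    ok = sum(1 for text in outputs if mathml_stats(text)[0])
    return ok / len(outputs)


# Made-up sample outputs, not taken from any real converter run.
sample = (
    '<html xmlns="http://www.w3.org/1999/xhtml"><body><p>Let '
    '<math xmlns="http://www.w3.org/1998/Math/MathML">'
    "<mi>x</mi><mo>=</mo><mn>1</mn></math>.</p></body></html>"
)
broken = "<html><body><math><mi>x</mi></body></html>"  # unclosed <math>

if __name__ == "__main__":
    print(mathml_stats(sample))
    print(mathml_stats(broken))
    print(coverage([sample, broken]))
```

Well-formedness is a much weaker property than validity against the MathML schema, so a check like this would at best be a first filter before a closer look at formula quality.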

How to cite


Stamerjohanns, Heinrich, et al. "MathML-aware Article Conversion from LaTeX." Towards a Digital Mathematics Library. Grand Bend, Ontario, Canada, July 8-9th, 2009. Brno: Masaryk University Press, 2009. 109-120. <http://eudml.org/doc/220017>.

@inProceedings{Stamerjohanns2009,
abstract = {Over the last two decades, publishing in Mathematics and in the theoretical areas of Computer Science and Physics has predominantly used TeX/LaTeX as a formatting language. This large corpus of born-digital material is both a boon (LaTeX is a semi-semantic format whose source often contains indications of the author’s intentions) and a problem (TeX is Turing-complete, and authors use this freedom to employ thousands of styles and millions of user macros). Several tools have been developed to convert TeX/LaTeX documents to XML-based, i.e. Web- and DML-compatible, formats. Different DML projects use different tools, and the selection seems largely accidental. To put the choice of converters for DML projects on a more solid footing and to encourage competition and feature convergence, we survey the market. In this paper we investigate and compare five LaTeX-to-XML transformers in three dimensions: (a) ergonomic factors such as documentation and ease of installation, (b) coverage, and (c) quality of the resulting documents (in particular the MathML parts).},
author = {Stamerjohanns, Heinrich and Ginev, Deyan and David, Catalin and Misev, Dimitar and Zamdzhiev, Vladimir and Kohlhase, Michael},
booktitle = {Towards a Digital Mathematics Library. Grand Bend, Ontario, Canada, July 8-9th, 2009},
keywords = {University of Western Ontario; XML},
location = {Brno},
pages = {109--120},
publisher = {Masaryk University Press},
title = {MathML-aware Article Conversion from LaTeX},
url = {http://eudml.org/doc/220017},
year = {2009},
}

TY - CLSWK
AU - Stamerjohanns, Heinrich
AU - Ginev, Deyan
AU - David, Catalin
AU - Misev, Dimitar
AU - Zamdzhiev, Vladimir
AU - Kohlhase, Michael
TI - MathML-aware Article Conversion from LaTeX
T2 - Towards a Digital Mathematics Library. Grand Bend, Ontario, Canada, July 8-9th, 2009
PY - 2009
CY - Brno
PB - Masaryk University Press
SP - 109
EP - 120
AB - Over the last two decades, publishing in Mathematics and in the theoretical areas of Computer Science and Physics has predominantly used TeX/LaTeX as a formatting language. This large corpus of born-digital material is both a boon (LaTeX is a semi-semantic format whose source often contains indications of the author’s intentions) and a problem (TeX is Turing-complete, and authors use this freedom to employ thousands of styles and millions of user macros). Several tools have been developed to convert TeX/LaTeX documents to XML-based, i.e. Web- and DML-compatible, formats. Different DML projects use different tools, and the selection seems largely accidental. To put the choice of converters for DML projects on a more solid footing and to encourage competition and feature convergence, we survey the market. In this paper we investigate and compare five LaTeX-to-XML transformers in three dimensions: (a) ergonomic factors such as documentation and ease of installation, (b) coverage, and (c) quality of the resulting documents (in particular the MathML parts).
KW - University of Western Ontario; XML
UR - http://eudml.org/doc/220017
ER -

References

  1. Ausbrooks, Ron, Mathematical Markup Language (MathML) version 2.0 (second edition). W3C recommendation, World Wide Web Consortium, 2003.
  2. Anghelache, Romeo, Hermes discontinued. Project page at http://humanist.roua.org/2009/01/01/hermes-paused/, seen May 2009.
  3. Anghelache, Romeo, Hermes website. Project page at http://hermes.roua.org/, seen May 2009.
  4. arxmliv build system. http://arxmliv.kwarc.info.
  5. arXiv.org e-Print archive. Web page at http://www.arxiv.org, seen December 2007.
  6. Bouche, Thierry, Cedrics: When CEDRAM meets Tralics. In Sojka, Petr, editor, Towards Digital Mathematics Library, Proceedings of the DML 2008 workshop, pages 153–165. Masaryk University, Brno, 2008.
  7. CeCILL license. http://www.cecill.info/, seen May 2009.
  8. Digital library of mathematical functions. Project page at http://dlmf.nist.gov/, seen May 2009. Zbl1130.65045
  9. Grimm, Jose, Tralics, a LaTeX to XML translator, 2003.
  10. Kohlhase, Michael, Şucan, Ioan, A search engine for mathematical formulae. In Ida, Tetsuo, Calmet, Jacques, and Wang, Dongming, editors, Proceedings of Artificial Intelligence and Symbolic Computation, AISC’2006, number 4120 in LNAI, pages 241–253. Springer Verlag, 2006. Zbl1156.68306
  11. Miller, Bruce, LaTeXML website. http://dlmf.nist.gov/LaTeXML/, seen May 2009.
  12. Munavalli, Rajesh, Miner, Robert, Mathfind: a math-aware search engine. In SIGIR ’06: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, pages 735–735, New York, NY, USA, 2006. ACM Press.
  13. Plaice and Yannis Haralambous, Omega website.
  14. EDP Sciences, lxir website. http://www.lxir-latex.org/, seen May 2009.
  15. Stamerjohanns, Heinrich, Ginev, Deyan, David, Catalin, Misev, Dimitar, Zamdzhiev, Vladimir, Kohlhase, Michael, A comparison study of MathML-aware LaTeX converters. KWARC report, Jacobs University Bremen, 2009.
  16. Stamerjohanns, Heinrich, Kohlhase, Michael, Transforming the arXiv to XML. In Autexier, Serge et al., editors, Intelligent Computer Mathematics, 9th International Conference, MKM 2008 Birmingham, UK, July 28 – August 1, 2008, Proceedings, number 5144 in LNAI, pages 574–582. Springer Verlag, 2008. Zbl1166.68364
  17. TeX4HT website. http://www.cse.ohio-state.edu/~gurari/TeX4ht/, seen May 2009.
  18. Tralics website. http://www-sop.inria.fr/miaou/tralics/, seen May 2009.
  19. TtM website. Project page at http://hutchinson.belmont.ma.us/tth/mml/, seen May 2009.
  20. Validator website. http://homepage.mac.com/rcrews/software/validator/, seen May 2009.
  21. Watt, Stephen, MathML at ORCCA. Project page at http://www.orcca.on.ca/MathML/, seen May 2009.
  22. W3C Math WG, MathML software – converters. http://www.w3.org/Math/Software/mathml_software_cat_converters.html, seen May 2009.
