Displaying similar documents to “Extracting Precise Data on the Mathematical Content of PDF Documents”

Producing MathML with Tralics

Grimm, José

We describe here how Tralics can be used to convert LaTeX documents into XML or HTML. It uses an ad hoc DTD (a simplification of the TEI), but the translation of math formulas conforms to the Presentation MathML 2.0 recommendation. We explain how to run and parametrize the software. We give an overview of the various MathML constructs and how they are rendered by different browsers.

Mathematical Formulae Recognition and Logical Structure Analysis of Mathematical Papers

Suzuki, Masakazu

In most cases, current online journals in mathematics are supplied as PDF files, with print images of the papers in front and hidden OCR text behind them to provide keyword search. The embedded hidden text usually does not include good information about the mathematical formulae in the papers. We can say that, for the future development of DML, it is desirable to include, in the digitised journals, more structured information of the content of mathematical...