Displaying similar documents to “Ongoing Efforts to Generate “Tagged PDF” using pdfTeX”

Improving Mathematics Retrieval

Kamali, Shahab, Tompa, Frank Wm.

Similarity:

Despite the popularity of storing mathematical objects on the web, searching for mathematical expressions is extremely limited. Conventional retrieval systems are inadequate for mathematical expressions, because they are not tuned for text with complex structures that include only a few distinct terms. Surprisingly current approaches to the problem of retrieving mathematical information do not include a formal definition of the similarity between two expressions, and thus fail to find...

Web Interface and Collection for Mathematical Retrieval : WebMIaS and MREC

Líška, Martin, Sojka, Petr, Růžička, Michal, Mravec, Petr

Similarity:

We demonstrate searching of mathematical expressions in technical digital libraries on a MREC collection of 439,423 real scientific documents with more than 158 million mathematical formulae. Our solution—the WebMIaS system—allows the retrieval of mathematical expressions written in TeX or MathML. TeX queries are converted on-the-fly into tree representations of Presentation MathML, which is used for indexing. WebMIaS allows complex queries composed of plain text and mathematical formulae,...

CEDRICS: When CEDRAM Meets Tralics

Bouche, Thierry

Similarity:

We describe CEDRICS, a general purpose system for automated journal production entirely based on a LaTeX input format. We show how the very basic ideas that initiated the whole effort turned into an efficient system because of the ability of LaTeX markup to parametrise simultaneously and without compromise high typographical quality for the PDF output as well as accurate XML metadata with (presentation) MathML formulas. This was made possible by the availability of two entirely independent...