Displaying similar documents to “Producing MathML with Tralics”

Extracting Precise Data on the Mathematical Content of PDF Documents

Baker, Josef B., Sexton, Alan P., Sorge, Volker

Similarity:

As more and more scientific documents become available in PDF format, their automatic analysis becomes increasingly important. We present a procedure that extracts mathematical symbols from PDF documents by examining both the original PDF file and a rasterized version. This provides more precise information than is available either directly from the PDF file or by traditional character recognition techniques. The data can then be used to improve mathematical parsing methods that transform...

Conversion of TeX Documents to PDF

Pejović, Aleksandar, Mijajlović, Žarko

Similarity:

We discuss in some detail some of the drawbacks of PDF files obtained from mathematical papers prepared in TeX, particularly concerning indexing, copy/paste and OCR capabilities.