Displaying similar documents to “Towards a Flexible Author Name Disambiguation Framework”

Towards Reverse Engineering of PDF Documents

Baker, Josef B., Sexton, Alan P., Sorge, Volker

We present a progress report on our ongoing project of reverse engineering scientific PDF documents. The aim is to obtain mathematical markup that can be used as source for regenerating a document that resembles the original as closely as possible. This source can then be a basis for further document processing. Our current tool uses specialised PDF extraction together with image analysis to produce near-perfect input for parsing mathematical formulae. Applying a linear grammar and specific...