Small Scale Retrodigitization

Doob, Michael

  • Towards Digital Mathematics Library. Birmingham, United Kingdom, July 27th, 2008, Publisher: Masaryk University(Brno), page 103-113

Abstract

top
The digitization of papers born in the print-only era is vital for the health of the mathematical record. Many large scale retrodigitization projects are underway and, at this point, probably more that half of the mathematical history has been finished. Many smaller journals and books remain to be done. This paper gives a framework within which these may also be completed. It uses the digitization of the Canadian Journal of Mathematics (53,000 pages), completed as a one-man project over a few months, as the working example. The project described herein not only may be used as a model for similar efforts but also indicates some interesting problems yet to be solved.

How to cite

top

Doob, Michael. "Small Scale Retrodigitization." Towards Digital Mathematics Library. Birmingham, United Kingdom, July 27th, 2008. Brno: Masaryk University, 2008. 103-113. <http://eudml.org/doc/220476>.

@inProceedings{Doob2008,
abstract = {The digitization of papers born in the print-only era is vital for the health of the mathematical record. Many large scale retrodigitization projects are underway and, at this point, probably more that half of the mathematical history has been finished. Many smaller journals and books remain to be done. This paper gives a framework within which these may also be completed. It uses the digitization of the Canadian Journal of Mathematics (53,000 pages), completed as a one-man project over a few months, as the working example. The project described herein not only may be used as a model for similar efforts but also indicates some interesting problems yet to be solved.},
author = {Doob, Michael},
booktitle = {Towards Digital Mathematics Library. Birmingham, United Kingdom, July 27th, 2008},
keywords = {home retrodigitization; NUMDAM},
location = {Brno},
pages = {103-113},
publisher = {Masaryk University},
title = {Small Scale Retrodigitization},
url = {http://eudml.org/doc/220476},
year = {2008},
}

TY - CLSWK
AU - Doob, Michael
TI - Small Scale Retrodigitization
T2 - Towards Digital Mathematics Library. Birmingham, United Kingdom, July 27th, 2008
PY - 2008
CY - Brno
PB - Masaryk University
SP - 103
EP - 113
AB - The digitization of papers born in the print-only era is vital for the health of the mathematical record. Many large scale retrodigitization projects are underway and, at this point, probably more that half of the mathematical history has been finished. Many smaller journals and books remain to be done. This paper gives a framework within which these may also be completed. It uses the digitization of the Canadian Journal of Mathematics (53,000 pages), completed as a one-man project over a few months, as the working example. The project described herein not only may be used as a model for similar efforts but also indicates some interesting problems yet to be solved.
KW - home retrodigitization; NUMDAM
UR - http://eudml.org/doc/220476
ER -

References

top
  1. [unknown], The home web site for ArXiv is http://arxiv.org/ and is hosted by the Cornell University Library. The history of ArXiv is given in the article at http://en.wikipedia.org/wiki/ArXiv. 
  2. [unknown], http://www.ceic.math.ca/Publications/retro_bestpractices.pdf. 
  3. Dennis, Keith, [unknown], has had some encouraging results using Perl scripts developed by his working group at Cornell. His software has only been circulated informally. Zbl0527.16007
  4. Dennis, K., Michler, G. O., Schneider, G., Suzuki, M., Automatic reference linking in distributed digital libraries, , CVPRW 2003, Conference on Computer Vision and Pattern Recognition Workshop, paper #26, Volume 3 (Workshop on Document Image Analysis and Retrieval), 5 pp. (2003). (2003) 
  5. Ewing, J., Measuring Journals, . Notices of the AMS, 1049–1053, (2006). (2006) Zbl1142.00304
  6. [unknown], The project location is http://code.google.com/p/tesseract-ocr. 
  7. [unknown], Described at http://en.wikipedia.org/wiki/Tesseract_(software)and announced at http://google-code-updates.blogspot.com/2006/08/announcing-tesseract-ocr.html. 
  8. [unknown], The main site is at http://www.imagemagick.org/script/index.php. 
  9. [unknown], A full description of this project is at http://minidml.mathdoc.fr/. 
  10. NUMDAM 
  11. [unknown], See http://en.wikipedia.org/wiki/OCRopus. 
  12. [unknown], The home page for this software is http://www.pdfhacks.com/pdftk/. 
  13. [unknown], Documentation for the hyperref package can be found both at http://en.wikibooks.org/wiki/LaTeX/Packages/Hyperref and at http://www.tug.org/applications/hyperref/. 
  14. [unknown], http://www-sop.inria.fr/apics/tralics specifically translates LaTeX to XML. 
  15. [unknown], http://www.unicode.org/charts contains a list of the standard character names. 

NotesEmbed ?

top

You must be logged in to post comments.

To embed these notes on your page include the following JavaScript code on your page where you want the notes to appear.

Only the controls for the widget will be shown in your chosen language. Notes will be shown in their authored language.

Tells the widget how many notes to show per page. You can cycle through additional notes using the next and previous controls.

    
                

Note: Best practice suggests putting the JavaScript code just before the closing </body> tag.