bdim has been in operation since the fall of 2010 and has grown slowly over the last year. I will report here: 1) on what is new in bdim with respect to the first presentation at DML 2010; 2) on some of the technical aspects of our implementation; 3) on the projects for the near future; 4) on some of the issues related to a possible integration of bdim with EuDML.
We present bdim (Biblioteca Digitale Italiana di Matematica), the Italian mathematics digitization project. The project was started by SIMAI (Società Italiana di Matematica Applicata e Industriale) and UMI (Unione Matematica Italiana) with initial support from the Biblioteca Digitale Italiana and the Italian Ministry of Beni and Attività Culturali and with the help of Numdam. At the moment bdim consists of approximately 1,300 articles, 11,000 pages (articles from Bollettino Unione Matematica Italiana,...
The quality of a digital mathematical library depends on the formats and quality of the data it offers. We show several enhancements of the (meta)data of the Czech Digital Mathematics Library DML-CZ. We discuss a possible minimalist modification of regular LaTeX documents that would simplify generating basic metadata describing the article in an XML/MathML format. We also show a proof of concept of a method that enables us to include LaTeX source code of mathematical expressions into pdfTeX-generated PDFs...
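To illustrate the idea of deriving basic metadata from a regular LaTeX document, here is a minimal sketch in Python. It is not the DML-CZ method: it assumes the source uses plain `\title{...}` and `\author{...}` macros without nested braces, and the function name `basic_metadata` and the XML element names are hypothetical.

```python
import re
import xml.etree.ElementTree as ET


def basic_metadata(latex_source):
    """Collect \\title{...} and \\author{...} arguments into a small XML record.

    Assumption: macro arguments contain no nested braces.
    """
    article = ET.Element("article")  # hypothetical root element name
    for macro in ("title", "author"):
        pattern = r"\\%s\{([^{}]*)\}" % macro
        for match in re.finditer(pattern, latex_source):
            ET.SubElement(article, macro).text = match.group(1).strip()
    return ET.tostring(article, encoding="unicode")


source = r"\title{On Sets}\author{A. Author}"
print(basic_metadata(source))
```

A real converter would need a proper TeX parser, since titles often contain macros and nested groups that a regular expression cannot handle.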
We report on a new project to design a semantic ground truth set for mathematical document analysis. The ground truth set will be generated by annotating recognised mathematical symbols with respect to both their global meaning in the context of the considered documents and their local function within the particular mathematical formula in which they occur. The aim of our work is to have a reliable database available for semantic classification during the formula recognition process with the aim of enabling...
This paper describes an effort to develop a metadata element set for the exchange of descriptive metadata about mathematical literature. The approach taken uses the Dublin Core Application Profile (DCAP) framework, based on the DC Abstract Model. A fully developed DCAP for mathematical literature would be valuable, as both a guide and constraint in the creation of metadata records suitable for harvesting via OAI or sharing through other means. Adhering to the DCAP model would also enhance global...
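As a rough illustration of the kind of record such a profile would constrain, the sketch below builds a simple Dublin Core description with Python's standard library. It shows plain Dublin Core elements only, not the application profile described in the paper; the `record` wrapper element, the function name `dc_record`, and the sample values are hypothetical.

```python
import xml.etree.ElementTree as ET

# Namespace of the Dublin Core Metadata Element Set, version 1.1.
DC = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("dc", DC)


def dc_record(fields):
    """Build a record whose children are dc:* elements.

    `fields` maps a Dublin Core element name to a list of values,
    since every Dublin Core element is repeatable.
    """
    record = ET.Element("record")  # hypothetical wrapper element
    for name, values in fields.items():
        for value in values:
            ET.SubElement(record, "{%s}%s" % (DC, name)).text = value
    return record


example = dc_record({
    "title": ["An Example Article"],   # made-up sample values
    "creator": ["A. Author"],
    "subject": ["54E35"],
})
print(ET.tostring(example, encoding="unicode"))
```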
The paper describes the background, the expected functionalities, and the architecture design goals of the European Digital Mathematics Library (EuDML), an infrastructure system aimed at integrating the mathematical contents available online throughout Europe, allowing for both extensive and specialized mathematics resource discovery. The three-year project to build the EuDML, partially funded by the European Commission, started in February 2010.
This work has been partially supported by Grant No. DO 02-275, 16.12.2008, Bulgarian NSF, Ministry of Education and Science. Hausdorff distance (HD) seems to be the most efficient instrument for measuring how far two compact non-empty subsets of a metric space are from each other. This paper considers the possibilities provided by HD and some of its modifications, used recently by many authors, for measuring resemblance between binary text images. Summarizing part of the existing word image matching methods, relied...
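For finite point sets, the Hausdorff distance is the larger of the two directed distances, where the directed distance from A to B is the greatest distance from a point of A to its nearest point in B. The following is a minimal brute-force sketch of that classical definition, not one of the modifications the paper surveys. It assumes the Euclidean metric, and the helper `foreground_pixels`, which treats a binary image as a list of rows of 0/1 values, is a hypothetical convenience.

```python
from math import dist  # Euclidean distance, Python 3.8+


def directed_hausdorff(a, b):
    """Greatest distance from a point of `a` to its nearest point in `b`."""
    return max(min(dist(p, q) for q in b) for p in a)


def hausdorff(a, b):
    """Classical Hausdorff distance between two non-empty finite point sets."""
    if not a or not b:
        raise ValueError("both sets must be non-empty")
    return max(directed_hausdorff(a, b), directed_hausdorff(b, a))


def foreground_pixels(image):
    """(row, col) coordinates of the non-zero pixels of a binary image."""
    return [(r, c) for r, row in enumerate(image)
            for c, value in enumerate(row) if value]


a = [(0, 0), (1, 0)]
b = [(0, 0), (4, 0)]
print(hausdorff(a, b))  # 3.0: the point (4, 0) is 3 away from its nearest point in a
```

The brute-force form costs time proportional to the product of the two set sizes, and a single outlying pixel can dominate the result.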
This paper presents an alternative interface for browsing in the Czech Digital Mathematics Library (DML-CZ) using our Visual Browser web browsing tool. Using dynamic visualization, we have created a tool for browsing the library graphically. Visualization can help users orient themselves in complex data and at the same time reveal sometimes unexpected relationships among units; it at least speeds up browsing. This work follows the metadata processing undertaken on DML-CZ and visualizes all reasonable...
High-Energy Physics (HEP) has a long tradition in pioneering infrastructures for scholarly communication, and four leading laboratories are now rolling out the next-generation digital library for the field: INSPIRE. This is an evolution of the extraordinarily successful, 40-year-old SPIRES database. Based on the Invenio software, INSPIRE already provides seamless access to almost 1 million records, which will be expanded to cover multimedia, data, software, wikis. Services offered include citation...
At an exclusively online university such as the UOC, the need to communicate mathematics on the web is pressing. In an environment that does not allow for face-to-face communication, things implicitly communicated when using a blackboard, such as the canonical verbalization or handwriting of formulae, are lost, which becomes a big obstacle. Also, the editorial process for the creation of learning/teaching resources is suited for a generalist approach and, consequently, needs such as...
In most cases, current on-line journals in mathematics are supplied as PDF files with print images of the papers in front and hidden OCR'ed text behind them to provide a keyword search facility. The embedded hidden text usually does not include good information about the mathematical formulae in the papers. We can say that, for the future development of DML, it is desirable to include, in the digitised journals, more structured information of the content of mathematical papers, e.g....
For preparing and validating metadata for the Digital Mathematics Library DML-CZ, a new tool, the Metadata Editor, has been developed. This paper outlines the procedures for the linguistic and geographical localization of its components. Also mentioned are such aspects as the dynamic generation of editing forms based on the XML Schema, the validation procedures, and support for semiautomatic quality assurance procedures.