### 9th Asian Logic Conference, Novosibirsk, Russia, 16--19 August 2005.

We present a modular architecture for processing informal mathematical language as found in textbooks and mathematical publications. We point out its properties that are relevant to addressing three aspects of informal mathematical discourse: (i) the interleaved symbolic and natural language, (ii) the linguistic, domain, and notational context, and (iii) the imprecision of the informal language. The objective of the modular approach is to enable parameterisation of the system with respect to the natural language...

bdim has been in operation since the fall of 2010 and has been slowly growing in the last year. I will report here: 1) on what is new in bdim with respect to the first presentation in DML 2010; 2) on some of the technical aspects of our implementation; 3) on the projects for the near future; 4) on some of the issues related to possible integration of bdim with EuDML.

Experience in setting up a comprehensive journal processing system based on the TeX typesetting system with the CEDRAM workflow is described, following the example of the Archivum Mathematicum journal. The system automates the preparation of issues and simultaneously generates the materials needed for the Czech Digital Mathematics Library project (DML-CZ). The second part of the article describes the process of transformation of archival born-digital articles into a DML-CZ-suitable format.

We present bdim (Biblioteca Digitale Italiana di Matematica), the Italian project for the digitization of mathematics. The project was started by SIMAI (Società Italiana di Matematica Applicata e Industriale) and UMI (Unione Matematica Italiana) with initial support from the Biblioteca Digitale Italiana and the Italian Ministry of Beni and Attività Culturali and with the help of Numdam. At the moment bdim consists of approximately 1,300 articles, 11,000 pages (articles from Bollettino Unione Matematica Italiana,...

The paper describes the process of building the Czech Digital Mathematics Library (DML-CZ) on the DSpace system. First, DML-CZ is briefly introduced. Then we describe the DSpace system and its architecture, together with Manakin, a system for building user interfaces on top of DSpace. The first technical part of the paper covers the mapping of the DML-CZ structure onto DSpace structures, our importing tools, and the way of managing requested features which are not supported by the default...

We describe CEDRICS, a general purpose system for automated journal production entirely based on a LaTeX input format. We show how the very basic ideas that initiated the whole effort turned into an efficient system because of the ability of LaTeX markup to parametrise simultaneously and without compromise high typographical quality for the PDF output as well as accurate XML metadata with (presentation) MathML formulas. This was made possible by the availability of two entirely independent LaTeX...

In the present report, we show the current status of mathematical journals in Japan. Because most of their electronic editions have been loaded into various digital repositories that support OAI-PMH, we have been able to design a subject-based portal website and a visual user interface built on the harvested metadata of the journals. An outline of the portal website is also reported.
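As a rough illustration of how harvested OAI-PMH metadata can feed a subject-based portal, the following Python sketch builds a `ListRecords` request URL and groups Dublin Core titles by subject. It is not taken from the report: the repository URL, the helper names (`list_records_url`, `parse_records`, `group_by_subject`), and the sample response are hypothetical, and only the `verb`, `metadataPrefix`, and `set` parameters and the XML namespaces come from the OAI-PMH and Dublin Core specifications.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespaces defined by OAI-PMH 2.0 and unqualified Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "oai_dc": "http://www.openarchives.org/OAI/2.0/oai_dc/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None):
    """Build a ListRecords request URL (base_url is a hypothetical endpoint)."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec
    return base_url + "?" + urlencode(params)


def parse_records(xml_text):
    """Extract identifier, title and subjects from a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.iterfind(".//oai:record", NS):
        records.append({
            "identifier": rec.findtext(
                "oai:header/oai:identifier", default="", namespaces=NS),
            "title": rec.findtext(".//dc:title", default="", namespaces=NS),
            "subjects": [s.text for s in rec.iterfind(".//dc:subject", NS)],
        })
    return records


def group_by_subject(records):
    """Map each subject to the titles filed under it."""
    index = {}
    for rec in records:
        for subject in rec["subjects"]:
            index.setdefault(subject, []).append(rec["title"])
    return index


# Hypothetical response fragment, for illustration only.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Sample article</dc:title>
          <dc:subject>Number theory</dc:subject>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""

if __name__ == "__main__":
    print(list_records_url("https://repo.example.org/oai"))
    print(group_by_subject(parse_records(SAMPLE)))
```

A real harvester would additionally have to fetch the URL and follow `resumptionToken` elements for paged responses, which this sketch leaves out.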

The Special Issue of Kybernetika is devoted to the publication of selected peer-reviewed articles submitted by the participants of the Czech-Japanese Seminar in Applied Mathematics 2008 which took place on September 1-7, 2008 in Takachi-ho and Miyazaki, Japan. The Czech-Japanese Seminar in Applied Mathematics 2008 was organized by the Department of Applied Physics, Faculty of Engineering, University of Miyazaki. It was the fourth meeting in the series of the Czech-Japanese Seminars in Applied Mathematics....

The quality of a digital mathematical library depends on the formats and quality of the data it offers. We show several enhancements of the (meta)data of the Czech Digital Mathematics Library DML-CZ. We discuss a possible minimalist modification of regular LaTeX documents that would simplify generating basic metadata that describes the article in an XML/MathML format. We also show a proof of concept of a method that enables us to include LaTeX source code of mathematical expressions into pdfTeX-generated PDFs...

We report on a new project to design a semantic ground truth set for mathematical document analysis. The ground truth set will be generated by annotating recognised mathematical symbols with respect to both their global meaning in the context of the considered documents and their local function within the particular mathematical formula in which they occur. The aim of our work is to have a reliable database available for semantic classification during the formula recognition process with the aim of enabling...

This paper describes an effort to develop a metadata element set for the exchange of descriptive metadata about mathematical literature. The approach taken uses the Dublin Core Application Profile (DCAP) framework, based on the DC Abstract Model. A fully developed DCAP for mathematical literature would be valuable, as both a guide and constraint in the creation of metadata records suitable for harvesting via OAI or sharing through other means. Adhering to the DCAP model would also enhance global...

We describe an ongoing project carried out by the Mathematical Institute of the Serbian Academy of Sciences and Arts and the Faculty of Mathematics, Belgrade. The project concerns the building of electronic resources and the presentation of electronic editions of mathematical works in Serbia, including retro-digitization of old books, articles, and other mathematical works, and the development of the corresponding virtual library. The resources built in the project are freely accessible through the Internet.

The aim of the DML-CZ project (2005–2009; Czech Academy of Sciences, Masaryk University in Brno, Charles University in Prague, Czech Republic) is to investigate, develop and apply techniques, methods and tools that would allow the creation of the Czech Digital Mathematics Library. The most important tool developed and used in the course of the project is the Metadata Editor, a complex web-based system supporting all essential steps in the development of the article-oriented digital library: integration...

The paper describes the background, the expected functionalities, and the architecture design goals of the European Digital Mathematics Library (EuDML), an infrastructure system that aims to integrate the mathematical content available online throughout Europe, allowing for both extensive and specialized mathematics resource discovery. The three-year project to build the EuDML, partially funded by the European Commission, started in February 2010.

The WWW has become the main source of mathematical knowledge. Currently available full-text search engines can be used on these documents, but they are deficient in almost all cases. By applying axioms and equivalent transformations, and by using different notation, each formula can be expressed in numerous ways. Most of these documents do not contain semantic information; therefore, precise mathematical interpretation is impossible. On the other hand, semantic information can help to give more precise information....
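To make concrete the observation that one formula can be written in many ways, here is a minimal Python sketch that reduces expressions to a canonical form under commutativity and associativity of `+` and `*` only. It is an illustration under stated assumptions, not the method of the paper: formulas are assumed to be written in Python expression syntax, the names `canonical` and `same_formula` are hypothetical, and other axioms (distributivity, notation variants) are not handled.

```python
import ast

# Operators treated as commutative and associative in this sketch.
COMMUTATIVE = (ast.Add, ast.Mult)


def canonical(node):
    """Return a nested-tuple form with operands of + and * flattened and sorted."""
    if isinstance(node, ast.Expression):
        return canonical(node.body)
    if isinstance(node, ast.Name):
        return ("var", node.id)
    if isinstance(node, ast.Constant):
        return ("num", node.value)
    if isinstance(node, ast.BinOp):
        op = type(node.op).__name__
        left, right = canonical(node.left), canonical(node.right)
        if isinstance(node.op, COMMUTATIVE):
            operands = []
            for child in (left, right):
                if child[0] == op:
                    # Associativity: merge nested sums or products.
                    operands.extend(child[1])
                else:
                    operands.append(child)
            # Commutativity: operand order is irrelevant, so sort it away.
            return (op, tuple(sorted(operands, key=repr)))
        return (op, (left, right))
    raise ValueError("unsupported syntax: " + type(node).__name__)


def same_formula(a, b):
    """True if a and b agree up to reordering and regrouping of + and *."""
    return canonical(ast.parse(a, mode="eval")) == canonical(ast.parse(b, mode="eval"))


if __name__ == "__main__":
    print(same_formula("a + b*c", "c*b + a"))
    print(same_formula("a - b", "b - a"))
```

A plain full-text index would treat `a + b*c` and `c*b + a` as unrelated strings, while comparing canonical forms matches them; subtraction is left order-sensitive, so `a - b` and `b - a` stay distinct.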

As more and more scientific documents become available in PDF format, their automatic analysis becomes increasingly important. We present a procedure that extracts mathematical symbols from PDF documents by examining both the original PDF file and a rasterized version. This provides more precise information than is available either directly from the PDF file or by traditional character recognition techniques. The data can then be used to improve mathematical parsing methods that transform the mathematics...