Displaying similar documents to “The Unreasonable Effectiveness of Pattern Generation”

Towards New Czechoslovak Hyphenation Patterns

Petr Sojka, Ondřej Sojka (2020)

Zpravodaj Československého sdružení uživatelů TeXu

Space- and time-effective segmentation and hyphenation of natural languages stay at the core of every document preparation system, web browser, or mobile rendering system. Recently, the unreasonable effectiveness of pattern generation has been shown: it is possible to use hyphenation patterns to solve the dictionary problem for a single language without compromise. In this article, we will show how we applied the marvelous effectiveness of patgen for the generation of the new Czechoslovak...

Communication with www in Czech

Lukáš Svoboda, Luboš Popelínský (2004)

Kybernetika

This paper describes UIO, a multi-domain question-answering system for the Czech language that looks for answers on the web. UIO draws on two fields, namely natural language interfaces to databases and question answering. In its current version, UIO can be used for asking questions about train and coach timetables, cinema and theatre performances, currency exchange rates, name-days, and on the Diderot Encyclopaedia. Much effort has been put into making the addition of a new domain...

A Roadmap for Universal Syllabic Segmentation

Ondřej Sojka, Petr Sojka, Jakub Máca (2023)

Zpravodaj Československého sdružení uživatelů TeXu

Space- and time-effective segmentation (word hyphenation) of natural languages remains at the core of every document rendering system, be it TeX, a web browser, or a mobile operating system. In most languages, segmentation mimicking syllabic pronunciation is a pragmatic preference today. As language switching is often not marked in rendered texts, the typesetting engine needs universal syllabic segmentation. In this article, we show the feasibility of this idea by offering a prototype solution...