Currently displaying 1 – 3 of 3

Showing per page

Order by Relevance | Title | Year of publication

The Unreasonable Effectiveness of Pattern Generation

Petr SojkaOndřej Sojka — 2019

Zpravodaj Československého sdružení uživatelů TeXu

Languages are constantly evolving, and so are their hyphenation rules and needs. The effectiveness and utility of TeX's hyphenation have been proven by its usage in almost all typesetting systems in use today. The current Czech hyphenation patterns were generated in 1995, and no hyphenated word database was freely available. We have developed a new Czech word database and have used the patgen program to generate new effective Czech hyphenation patterns efficiently and evaluated their generalization...

Towards New Czechoslovak Hyphenation Patterns

Petr SojkaOndřej Sojka — 2020

Zpravodaj Československého sdružení uživatelů TeXu

Space- and time-effective segmentation and hyphenation of natural languages stay at the core of every document preparation system, web browser, or mobile rendering system. Recently, the unreasonable effectiveness of pattern generation has been shown - it is possible to use hyphenation patterns to solve the dictionary problem for a single language without compromise. In this article, we will show how we applied the marvelous effectiveness of patgen for the generation of the new Czechoslovak hyphenation...

A Roadmap for Universal Syllabic Segmentation

Ondřej SojkaPetr SojkaJakub Máca — 2023

Zpravodaj Československého sdružení uživatelů TeXu

Space- and time-effective segmentation (word hyphenation) of natural languages remains at the core of every document rendering system, be it TeX, web browser, or mobile operating system. In most languages, segmentation mimicking syllabic pronunciation is a pragmatic preference today. As language switching is often not marked in rendered texts, the typesetting engine needs universal syllabic segmentation. In this article, we show the feasibility of this idea by offering a prototype solution to two...

Page 1

Download Results (CSV)