An OCR system for the Unified Northern Alphabet

Partanen N, Rießler M (2019)
In: The fifth International Workshop on Computational Linguistics for Uralic Languages. Pirinen TA, Kaalep H-J, Tyers FM, Association for Computational Linguistics (Eds); Tartu: Association for Computational Linguistics: 77-89.

Konferenzbeitrag | Veröffentlicht | Englisch
 
Download
OA 5.21 MB
Autor*in
Partanen, Niko; Rießler, MichaelUniBi
Herausgeber*in
Pirinen, Tommi A.; Kaalep, Heiki-Jaan; Tyers, Francis M.
herausgebende Körperschaft
Association for Computational Linguistics
Abstract / Bemerkung
This paper presents experiments done in order to build a functional OCR model for the Unified Northern Alphabet. This writing system was used between 1931 and 1937 for 16 (Uralic and non-Uralic) minority languages spoken in the Soviet Union. The character accuracy of the developed model reaches more than 98% and clearly shows cross-linguistic applicability. The tests described here therefore also include general guidelines for the amount of training data needed to boot-strap an OCR system under similar conditions.
Erscheinungsjahr
2019
Titel des Konferenzbandes
The fifth International Workshop on Computational Linguistics for Uralic Languages
Seite(n)
77-89
Konferenz
The fifth International Workshop on Computational Linguistics for Uralic Language
Konferenzort
Tartu
Konferenzdatum
2019-01-07 – 2019-01-08
ISBN
978-1-948087-92-6
Page URI
https://pub.uni-bielefeld.de/record/2933276

Zitieren

Partanen N, Rießler M. An OCR system for the Unified Northern Alphabet. In: Pirinen TA, Kaalep H-J, Tyers FM, Association for Computational Linguistics, eds. The fifth International Workshop on Computational Linguistics for Uralic Languages. Tartu: Association for Computational Linguistics; 2019: 77-89.
Partanen, N., & Rießler, M. (2019). An OCR system for the Unified Northern Alphabet. In T. A. Pirinen, H. - J. Kaalep, F. M. Tyers, & Association for Computational Linguistics (Eds.), The fifth International Workshop on Computational Linguistics for Uralic Languages (pp. 77-89). Tartu: Association for Computational Linguistics.
Partanen, N., and Rießler, M. (2019). “An OCR system for the Unified Northern Alphabet” in The fifth International Workshop on Computational Linguistics for Uralic Languages, Pirinen, T. A., Kaalep, H. - J., Tyers, F. M., and Association for Computational Linguistics eds. (Tartu: Association for Computational Linguistics), 77-89.
Partanen, N., & Rießler, M., 2019. An OCR system for the Unified Northern Alphabet. In T. A. Pirinen, et al., eds. The fifth International Workshop on Computational Linguistics for Uralic Languages. Tartu: Association for Computational Linguistics, pp. 77-89.
N. Partanen and M. Rießler, “An OCR system for the Unified Northern Alphabet”, The fifth International Workshop on Computational Linguistics for Uralic Languages, T.A. Pirinen, et al., eds., Tartu: Association for Computational Linguistics, 2019, pp.77-89.
Partanen, N., Rießler, M.: An OCR system for the Unified Northern Alphabet. In: Pirinen, T.A., Kaalep, H.-J., Tyers, F.M., and Association for Computational Linguistics (eds.) The fifth International Workshop on Computational Linguistics for Uralic Languages. p. 77-89. Association for Computational Linguistics, Tartu (2019).
Partanen, Niko, and Rießler, Michael. “An OCR system for the Unified Northern Alphabet”. The fifth International Workshop on Computational Linguistics for Uralic Languages. Ed. Tommi A. Pirinen, Heiki-Jaan Kaalep, Francis M. Tyers, and Association for Computational Linguistics. Tartu: Association for Computational Linguistics, 2019. 77-89.
Alle Dateien verfügbar unter der/den folgenden Lizenz(en):
Creative Commons Namensnennung 4.0 International Public License (CC-BY 4.0):
Volltext(e)
Access Level
OA Open Access
Zuletzt Hochgeladen
2019-09-06T09:19:05Z
MD5 Prüfsumme
ab96c8662dc550937b5612a9268ff554

Link(s) zu Volltext(en)
Access Level
OA Open Access

Export

Markieren/ Markierung löschen
Markierte Publikationen

Open Data PUB

Suchen in

Google Scholar
ISBN Suche