Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly

Bunzeck B, Zarrieß S (2024)
In: Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning. Qiu A, Noble B, Pagmar D, Maraev V, Ilinykh N (Eds); Kerrville, TX: Association for Computational Linguistics: 39-55.

Konferenzbeitrag | Englisch
 
Download
OA 3.70 MB
Herausgeber*in
Qiu, Amy; Noble, Bill; Pagmar, David; Maraev, Vladislav; Ilinykh, Nikolai
Abstract / Bemerkung
Syntactic learning curves in LMs are usually reported as relatively stable and power law-shaped. By analyzing the learning curves of different LMs on various syntactic phenomena using both small self-trained llama models and larger pre-trained pythia models, we show that while many phenomena do follow typical power law curves, others exhibit S-shaped, U-shaped, or erratic patterns. Certain syntactic paradigms remain challenging even for large models, resulting in persistent preference for ungrammatical sentences. Most phenomena show similar curves for their paradigms, but the existence of diverging patterns and oscillations indicates that average curves mask important developments, underscoring the need for more detailed analyses of individual learning trajectories.
Erscheinungsjahr
2024
Titel des Konferenzbandes
Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning
Seite(n)
39-55
Konferenz
CLASP Conference on Multimodality and Interaction in Language Learning
Konferenzort
Gothenburg, Sweden
Konferenzdatum
2024-10-14 – 2024-10-15
Page URI
https://pub.uni-bielefeld.de/record/2993430

Zitieren

Bunzeck B, Zarrieß S. Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly. In: Qiu A, Noble B, Pagmar D, Maraev V, Ilinykh N, eds. Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning. Kerrville, TX: Association for Computational Linguistics; 2024: 39-55.
Bunzeck, B., & Zarrieß, S. (2024). Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly. In A. Qiu, B. Noble, D. Pagmar, V. Maraev, & N. Ilinykh (Eds.), Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning (pp. 39-55). Kerrville, TX: Association for Computational Linguistics.
Bunzeck, Bastian, and Zarrieß, Sina. 2024. “Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly”. In Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning, ed. Amy Qiu, Bill Noble, David Pagmar, Vladislav Maraev, and Nikolai Ilinykh, 39-55. Kerrville, TX: Association for Computational Linguistics.
Bunzeck, B., and Zarrieß, S. (2024). “Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly” in Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning, Qiu, A., Noble, B., Pagmar, D., Maraev, V., and Ilinykh, N. eds. (Kerrville, TX: Association for Computational Linguistics), 39-55.
Bunzeck, B., & Zarrieß, S., 2024. Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly. In A. Qiu, et al., eds. Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning. Kerrville, TX: Association for Computational Linguistics, pp. 39-55.
B. Bunzeck and S. Zarrieß, “Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly”, Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning, A. Qiu, et al., eds., Kerrville, TX: Association for Computational Linguistics, 2024, pp.39-55.
Bunzeck, B., Zarrieß, S.: Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly. In: Qiu, A., Noble, B., Pagmar, D., Maraev, V., and Ilinykh, N. (eds.) Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning. p. 39-55. Association for Computational Linguistics, Kerrville, TX (2024).
Bunzeck, Bastian, and Zarrieß, Sina. “Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly”. Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning. Ed. Amy Qiu, Bill Noble, David Pagmar, Vladislav Maraev, and Nikolai Ilinykh. Kerrville, TX: Association for Computational Linguistics, 2024. 39-55.
Alle Dateien verfügbar unter der/den folgenden Lizenz(en):
Creative Commons Namensnennung - Nicht-kommerziell - Weitergabe unter gleichen Bedingungen 3.0 Unported (CC BY-NC-SA 3.0):
Volltext(e)
Access Level
OA Open Access
Zuletzt Hochgeladen
2024-10-13T19:05:43Z
MD5 Prüfsumme
bd855621fd76e2a7f1e7d5a04be2151a


Link(s) zu Volltext(en)
Access Level
OA Open Access

Export

Markieren/ Markierung löschen
Markierte Publikationen

Open Data PUB

Suchen in

Google Scholar