Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly
Bunzeck B, Zarrieß S (2024)
In: Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning. Qiu A, Noble B, Pagmar D, Maraev V, Ilinykh N (Eds); Kerrville, TX: Association for Computational Linguistics: 39-55.
Conference paper | English
Author(s)
Bunzeck, Bastian;
Zarrieß, Sina
Editor(s)
Qiu, Amy;
Noble, Bill;
Pagmar, David;
Maraev, Vladislav;
Ilinykh, Nikolai
Abstract / Note
Syntactic learning curves in LMs are usually reported as relatively stable and power law-shaped. By analyzing the learning curves of different LMs on various syntactic phenomena using both small self-trained llama models and larger pre-trained pythia models, we show that while many phenomena do follow typical power law curves, others exhibit S-shaped, U-shaped, or erratic patterns. Certain syntactic paradigms remain challenging even for large models, resulting in persistent preference for ungrammatical sentences. Most phenomena show similar curves for their paradigms, but the existence of diverging patterns and oscillations indicates that average curves mask important developments, underscoring the need for more detailed analyses of individual learning trajectories.
Year of publication
2024
Title of the proceedings
Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning
Page(s)
39-55
Copyright / Licenses
Conference
CLASP Conference on Multimodality and Interaction in Language Learning
Conference location
Gothenburg, Sweden
Conference date
2024-10-14 – 2024-10-15
Page URI
https://pub.uni-bielefeld.de/record/2993430
Cite
Bunzeck B, Zarrieß S. Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly. In: Qiu A, Noble B, Pagmar D, Maraev V, Ilinykh N, eds. Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning. Kerrville, TX: Association for Computational Linguistics; 2024: 39-55.
Bunzeck, B., & Zarrieß, S. (2024). Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly. In A. Qiu, B. Noble, D. Pagmar, V. Maraev, & N. Ilinykh (Eds.), Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning (pp. 39-55). Kerrville, TX: Association for Computational Linguistics.
Bunzeck, Bastian, and Zarrieß, Sina. 2024. “Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly”. In Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning, ed. Amy Qiu, Bill Noble, David Pagmar, Vladislav Maraev, and Nikolai Ilinykh, 39-55. Kerrville, TX: Association for Computational Linguistics.
Bunzeck, B., and Zarrieß, S. (2024). “Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly” in Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning, Qiu, A., Noble, B., Pagmar, D., Maraev, V., and Ilinykh, N. eds. (Kerrville, TX: Association for Computational Linguistics), 39-55.
Bunzeck, B., & Zarrieß, S., 2024. Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly. In A. Qiu, et al., eds. Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning. Kerrville, TX: Association for Computational Linguistics, pp. 39-55.
B. Bunzeck and S. Zarrieß, “Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly”, Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning, A. Qiu, et al., eds., Kerrville, TX: Association for Computational Linguistics, 2024, pp.39-55.
Bunzeck, B., Zarrieß, S.: Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly. In: Qiu, A., Noble, B., Pagmar, D., Maraev, V., and Ilinykh, N. (eds.) Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning. p. 39-55. Association for Computational Linguistics, Kerrville, TX (2024).
Bunzeck, Bastian, and Zarrieß, Sina. “Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly”. Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning. Ed. Amy Qiu, Bill Noble, David Pagmar, Vladislav Maraev, and Nikolai Ilinykh. Kerrville, TX: Association for Computational Linguistics, 2024. 39-55.
All files are available under the following license(s):
Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
Full text(s)
Name
2024.clasp-1.7.pdf
3.70 MB
Access Level
Open Access
Last uploaded
2024-10-13T19:05:43Z
MD5 checksum
bd855621fd76e2a7f1e7d5a04be2151a
Link(s) to full text(s)
Access Level
Open Access