12 Publikationen
-
-
-
2025 | Konferenzbeitrag | PUB-ID: 3000275Bunzeck, B., Duran, D., Schade, L., and Zarrieß, S. (2025). “Small Language Models Also Work With Small Vocabularies: Probing the Linguistic Abilities of Grapheme- and Phoneme-Based Baby Llamas” in Proceedings of the 31st International Conference on Computational Linguistics, Rambow, O., Wanner, L., Apidianaki, M., Al-Khalifa, H., Eugenio, B. D., and Schockaert, S. eds. (Abu Dhabi, UAE: Association for Computational Linguistics), 6039-6048.PUB | PDF | Download (ext.)
-
2024 | Konferenzbeitrag | PUB-ID: 3001254Bunzeck, B., Duran, D., Schade, L., and Zarrieß, S. (2024). “Graphemes vs. phonemes: battling it out in character-based language models” in The 2nd BabyLM Challenge at the 28th Conference on Computational Natural Language Learning, Hu, M. Y., Mueller, A., Ross, C., Williams, A., Linzen, T., Zhuang, C., Choshen, L., Cotterell, R., Warstadt, A., and Wilcox, E. G. eds. (Miami, FL, USA: Association for Computational Linguistics), 54-64.PUB | PDF | Download (ext.)
-
2024 | Konferenzbeitrag | PUB-ID: 2993430Bunzeck, B., and Zarrieß, S. (2024). “Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly” in Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning, Qiu, A., Noble, B., Pagmar, D., Maraev, V., and Ilinykh, N. eds. (Kerrville, TX: Association for Computational Linguistics), 39-55.PUB | PDF | Download (ext.)
-
-
2024 | Konferenzbeitrag | PUB-ID: 2994136Bunzeck, B., and Zarrieß, S. (2024). “The SlayQA benchmark of social reasoning: testing gender-inclusive generalization with neopronouns” in Proceedings of the 2nd GenBench Workshop on Generalisation (Benchmarking) in NLP, Hupkes, D., Dankers, V., Batsuren, K., Kazemnejad, A., Christodoulopoulos, C., Giulianelli, M., and Cotterell, R. eds. (Miami, Florida, USA: Association for Computational Linguistics), 42-53.PUB | Download (ext.)
-
2023 | Datenpublikation | PUB-ID: 2993810Wojcik, P., Bunzeck, B., and Zarrieß, S. (2023). Replication Data for: "The Wikipedia Republic of Literary Characters". Harvard Dataverse.PUB | Dateien verfügbar | DOI
-
-
2023 | Konferenzbeitrag | Veröffentlicht | PUB-ID: 2985109Bunzeck, B., and Zarrieß, S. (2023). “GPT-wee: How Small Can a Small Language Model Really Get?” in Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning, Warstadt, A., Mueller, A., Choshen, L., Wilcox, E., Zhuang, C., Ciro, J., Mosquera, R., Paranjabe, B., Williams, A., Linzen, T., et al. eds. ( Stroudsburg, PA: Association for Computational Linguistics), 35-46.PUB | PDF | DOI | Download (ext.)
-
-
2023 | Konferenzbeitrag | Veröffentlicht | PUB-ID: 2982902Bunzeck, B., and Zarrieß, S. (2023). “Entrenchment Matters: Investigating Positional and Constructional Sensitivity in Small and Large Language Models” in Proceedings of the 2023 CLASP Conference on Learning with Small Data (LSD), Breitholtz, E., Lappin, S., Loaiciga, S., Ilinykh, N., and Dobnik, S. eds. (Stroudsburg, PA: Association for Computational Linguistics), 25-37.PUB | PDF