PUB - Publications at Bielefeld University

Bastian Bunzeck

bastian.bunzeck@uni-bielefeld.de
https://orcid.org/0000-0002-1832-4068

PEVZ-ID

419963705

12 publications

[12]

2025 | Preprint | PUB-ID: 3001572
Do Construction Distributions Shape Formal Language Learning In German BabyLMs?
Bunzeck, Bastian, Do Construction Distributions Shape Formal Language Learning In German BabyLMs? arXiv:2503.11593, 2025
[11]

2025 | Preprint | PUB-ID: 3000929
Subword models struggle with word learning, but surprisal hides it
Bunzeck, Bastian, Subword models struggle with word learning, but surprisal hides it. arXiv:2502.12835, 2025
[10]

2025 | Conference paper | PUB-ID: 3000275
Small Language Models Also Work With Small Vocabularies: Probing the Linguistic Abilities of Grapheme- and Phoneme-Based Baby Llamas
Bunzeck, Bastian, Small Language Models Also Work With Small Vocabularies: Probing the Linguistic Abilities of Grapheme- and Phoneme-Based Baby Llamas. Proceedings of the 31st International Conference on Computational Linguistics. Abu Dhabi, UAE, 2025
[9]

2024 | Conference paper | PUB-ID: 3001254
Graphemes vs. phonemes: battling it out in character-based language models
Bunzeck, Bastian, Graphemes vs. phonemes: battling it out in character-based language models. The 2nd BabyLM Challenge at the 28th Conference on Computational Natural Language Learning. Miami, FL, USA, 2024
[8]

2024 | Conference paper | PUB-ID: 2993430
Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly
Bunzeck, Bastian, Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly. Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning. Kerrville, TX, 2024
[7]

2024 | Journal article | Published | PUB-ID: 2999608
The richness of the stimulus: Constructional variation and development in child-directed speech
Bunzeck, Bastian, The richness of the stimulus: Constructional variation and development in child-directed speech. First Language, 2024
[6]

2024 | Conference paper | PUB-ID: 2994136
The SlayQA benchmark of social reasoning: testing gender-inclusive generalization with neopronouns
Bunzeck, Bastian, The SlayQA benchmark of social reasoning: testing gender-inclusive generalization with neopronouns. Proceedings of the 2nd GenBench Workshop on Generalisation (Benchmarking) in NLP. Miami, Florida, USA, 2024
[5]

2023 | Data publication | PUB-ID: 2993810
Replication Data for: "The Wikipedia Republic of Literary Characters"
Wojcik, Paula, Replication Data for: "The Wikipedia Republic of Literary Characters". 2023
[4]

2023 | Journal article | Published | PUB-ID: 2980942
The Wikipedia Republic of Literary Characters
Wojcik, Paula, The Wikipedia Republic of Literary Characters. Journal of Cultural Analytics 8 (2), 2023
[3]

2023 | Conference paper | Published | PUB-ID: 2985109
GPT-wee: How Small Can a Small Language Model Really Get?
Bunzeck, Bastian, GPT-wee: How Small Can a Small Language Model Really Get? Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning. Stroudsburg, PA, 2023
[2]

2023 | Journal article | Published | PUB-ID: 2980943
Hexatomic: An extensible, OS-independent platform for deep multi-layer linguistic annotation of corpora
Druskat, Stephan, Hexatomic: An extensible, OS-independent platform for deep multi-layer linguistic annotation of corpora. Journal of Open Source Software 8 (86), 2023
[1]

2023 | Conference paper | Published | PUB-ID: 2982902
Entrenchment Matters: Investigating Positional and Constructional Sensitivity in Small and Large Language Models
Bunzeck, Bastian, Entrenchment Matters: Investigating Positional and Constructional Sensitivity in Small and Large Language Models. Proceedings of the 2023 CLASP Conference on Learning with Small Data (LSD). Stroudsburg, PA, 2023
