Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood
Xu Y, Wang Y, An H, Liu Z, Li Y (2024)
In: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. Al-Onaizan Y, Bansal M, Chen Y-N (Eds); Miami, Florida, USA: Association for Computational Linguistics: 10108-10121.
Konferenzbeitrag | Englisch
Download
Es wurden keine Dateien hochgeladen. Nur Publikationsnachweis!
Autor*in
Xu, Yang;
Wang, YuUniBi;
An, Hao;
Liu, Zhichen;
Li, Yongyuan
Herausgeber*in
Al-Onaizan, Yaser;
Bansal, Mohit;
Chen, Yun-Nung
Einrichtung
Abstract / Bemerkung
Human and model-generated texts can be distinguished by examining the magnitude of likelihood in language. However, it is becoming increasingly difficult as language model{'}s capabilities of generating human-like texts keep evolving. This study provides a new perspective by using the relative likelihood values instead of absolute ones, and extracting useful features from the spectrum-view of likelihood for the human-model text detection task. We propose a detection procedure with two classification methods, supervised and heuristic-based, respectively, which results in competitive performances with previous zero-shot detection methods and a new state-of-the-art on short-text detection. Our method can also reveal subtle differences between human and model languages, which find theoretical roots in psycholinguistics studies.
Erscheinungsjahr
2024
Titel des Konferenzbandes
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Seite(n)
10108-10121
Konferenz
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Konferenzort
Miami
Page URI
https://pub.uni-bielefeld.de/record/2994137
Zitieren
Xu Y, Wang Y, An H, Liu Z, Li Y. Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood. In: Al-Onaizan Y, Bansal M, Chen Y-N, eds. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. Miami, Florida, USA: Association for Computational Linguistics; 2024: 10108-10121.
Xu, Y., Wang, Y., An, H., Liu, Z., & Li, Y. (2024). Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood. In Y. Al-Onaizan, M. Bansal, & Y. - N. Chen (Eds.), Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (pp. 10108-10121). Miami, Florida, USA: Association for Computational Linguistics.
Xu, Yang, Wang, Yu, An, Hao, Liu, Zhichen, and Li, Yongyuan. 2024. “Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood”. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, ed. Yaser Al-Onaizan, Mohit Bansal, and Yun-Nung Chen, 10108-10121. Miami, Florida, USA: Association for Computational Linguistics.
Xu, Y., Wang, Y., An, H., Liu, Z., and Li, Y. (2024). “Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood” in Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Al-Onaizan, Y., Bansal, M., and Chen, Y. - N. eds. (Miami, Florida, USA: Association for Computational Linguistics), 10108-10121.
Xu, Y., et al., 2024. Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood. In Y. Al-Onaizan, M. Bansal, & Y. - N. Chen, eds. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. Miami, Florida, USA: Association for Computational Linguistics, pp. 10108-10121.
Y. Xu, et al., “Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood”, Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Y. Al-Onaizan, M. Bansal, and Y.-N. Chen, eds., Miami, Florida, USA: Association for Computational Linguistics, 2024, pp.10108-10121.
Xu, Y., Wang, Y., An, H., Liu, Z., Li, Y.: Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood. In: Al-Onaizan, Y., Bansal, M., and Chen, Y.-N. (eds.) Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. p. 10108-10121. Association for Computational Linguistics, Miami, Florida, USA (2024).
Xu, Yang, Wang, Yu, An, Hao, Liu, Zhichen, and Li, Yongyuan. “Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood”. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. Ed. Yaser Al-Onaizan, Mohit Bansal, and Yun-Nung Chen. Miami, Florida, USA: Association for Computational Linguistics, 2024. 10108-10121.