Detecting subtle differences between human and model languages using spectrum of relative likelihood
Y. Xu, Y. Wang, H. An, Z. Liu, Y. Li, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, ACL, Miami, FL, USA, 2024, pp. 10108–10121.
Download (ext.)
Conference Paper
| Published
| English
Author
Xu, Yang;
Wang, Yu;
An, Hao;
Liu, Zhichen;
Li, Yongyuan
Department
Abstract
Human and model-generated texts can be distinguished by examining the magnitude of likelihood in language. However, it is becoming increasingly difficult as language model's capabilities of generating human-like texts keep evolving. This study provides a new perspective by using the relative likelihood values instead of absolute ones, and extracting useful features from the spectrum-view of likelihood for the human-model text detection task. We propose a detection procedure with two classification methods, supervised and heuristic-based, respectively, which results in competitive performances with previous zero-shot detection methods and a new state-of-the-art on short-text detection. Our method can also reveal subtle differences between human and model languages, which find theoretical roots in psycholinguistics studies.
Publishing Year
Proceedings Title
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Page
10108–10121
Conference
2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)
Conference Location
Miami, FL, USA
Conference Date
2024-11-12 – 2024-11-16
LibreCat-ID
Cite this
Xu Y, Wang Y, An H, Liu Z, Li Y. Detecting subtle differences between human and model languages using spectrum of relative likelihood. In: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. ACL; 2024:10108–10121. doi:10.18653/v1/2024.emnlp-main.564
Xu, Y., Wang, Y., An, H., Liu, Z., & Li, Y. (2024). Detecting subtle differences between human and model languages using spectrum of relative likelihood. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 10108–10121. https://doi.org/10.18653/v1/2024.emnlp-main.564
@inproceedings{Xu_Wang_An_Liu_Li_2024, place={Miami, FL, USA}, title={Detecting subtle differences between human and model languages using spectrum of relative likelihood}, DOI={10.18653/v1/2024.emnlp-main.564}, booktitle={Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing}, publisher={ACL}, author={Xu, Yang and Wang, Yu and An, Hao and Liu, Zhichen and Li, Yongyuan}, year={2024}, pages={10108–10121} }
Xu, Yang, Yu Wang, Hao An, Zhichen Liu, and Yongyuan Li. “Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood.” In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 10108–10121. Miami, FL, USA: ACL, 2024. https://doi.org/10.18653/v1/2024.emnlp-main.564.
Y. Xu, Y. Wang, H. An, Z. Liu, and Y. Li, “Detecting subtle differences between human and model languages using spectrum of relative likelihood,” in Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Miami, FL, USA, 2024, pp. 10108–10121, doi: 10.18653/v1/2024.emnlp-main.564.
Xu, Yang, et al. “Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood.” Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, ACL, 2024, pp. 10108–10121, doi:10.18653/v1/2024.emnlp-main.564.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]
Link(s) to Main File(s)
Access Level
Closed Access
