Detecting subtle differences between human and model languages using spectrum of relative likelihood

Xu, Yang; Wang, Yu; An, Hao; Liu, Zhichen; Li, Yongyuan

Detecting subtle differences between human and model languages using spectrum of relative likelihood

Y. Xu, Y. Wang, H. An, Z. Liu, Y. Li, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, ACL, Miami, FL, USA, 2024, pp. 10108–10121.

Download (ext.)

https://doi.org/10.18653/v1/2024.emnlp-main.564

DOI

10.18653/v1/2024.emnlp-main.564

Conference Paper | Published | English

Author

Xu, Yang; Wang, Yu; An, Hao; Liu, Zhichen; Li, Yongyuan

Department

Sonderforschungsbereich Transregio 318

Project

TRR 318; TP A02: Verstehensprozess einer Erklärung beobachten und auswerten

Abstract

Human and model-generated texts can be distinguished by examining the magnitude of likelihood in language. However, it is becoming increasingly difficult as language model's capabilities of generating human-like texts keep evolving. This study provides a new perspective by using the relative likelihood values instead of absolute ones, and extracting useful features from the spectrum-view of likelihood for the human-model text detection task. We propose a detection procedure with two classification methods, supervised and heuristic-based, respectively, which results in competitive performances with previous zero-shot detection methods and a new state-of-the-art on short-text detection. Our method can also reveal subtle differences between human and model languages, which find theoretical roots in psycholinguistics studies.

Publishing Year

2024

Proceedings Title

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing

Page

10108–10121

Conference

2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)

Conference Location

Miami, FL, USA

Conference Date

2024-11-12 – 2024-11-16

LibreCat-ID

61177

Cite this

Xu Y, Wang Y, An H, Liu Z, Li Y. Detecting subtle differences between human and model languages using spectrum of relative likelihood. In: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. ACL; 2024:10108–10121. doi:10.18653/v1/2024.emnlp-main.564

Xu, Y., Wang, Y., An, H., Liu, Z., & Li, Y. (2024). Detecting subtle differences between human and model languages using spectrum of relative likelihood. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 10108–10121. https://doi.org/10.18653/v1/2024.emnlp-main.564

@inproceedings{Xu_Wang_An_Liu_Li_2024, place={Miami, FL, USA}, title={Detecting subtle differences between human and model languages using spectrum of relative likelihood}, DOI={10.18653/v1/2024.emnlp-main.564}, booktitle={Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing}, publisher={ACL}, author={Xu, Yang and Wang, Yu and An, Hao and Liu, Zhichen and Li, Yongyuan}, year={2024}, pages={10108–10121} }

Xu, Yang, Yu Wang, Hao An, Zhichen Liu, and Yongyuan Li. “Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood.” In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 10108–10121. Miami, FL, USA: ACL, 2024. https://doi.org/10.18653/v1/2024.emnlp-main.564.

Y. Xu, Y. Wang, H. An, Z. Liu, and Y. Li, “Detecting subtle differences between human and model languages using spectrum of relative likelihood,” in Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Miami, FL, USA, 2024, pp. 10108–10121, doi: 10.18653/v1/2024.emnlp-main.564.

Xu, Yang, et al. “Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood.” Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, ACL, 2024, pp. 10108–10121, doi:10.18653/v1/2024.emnlp-main.564.

All files available under the following license(s):

Copyright Statement:

This Item is protected by copyright and/or related rights. [...]

Link(s) to Main File(s)

URL

https://doi.org/10.18653/v1/2024.emnlp-main.564

Access Level

Closed Access

Export

Marked Publications

Open Data LibreCat

Search this title in

Google Scholar