CausalQA: A Benchmark for Causal Question Answering

Bondarenko, Alexander; Wolska, Magdalena; Heindorf, Stefan; Blübaum, Lukas; Ngonga Ngomo, Axel-Cyrille; Stein, Benno; Braslavski, Pavel; Hagen, Matthias; Potthast, Martin

CausalQA: A Benchmark for Causal Question Answering

A. Bondarenko, M. Wolska, S. Heindorf, L. Blübaum, A.-C. Ngonga Ngomo, B. Stein, P. Braslavski, M. Hagen, M. Potthast, in: Proceedings of the 29th International Conference on Computational Linguistics, International Committee on Computational Linguistics, Gyeongju, Republic of Korea, 2022, pp. 3296–3308.

Download (ext.)

https://aclanthology.org/2022.coling-1.291.pdf

Conference Paper | English

Author

Bondarenko, Alexander; Wolska, Magdalena; Heindorf, Stefan^LibreCat ; Blübaum, Lukas; Ngonga Ngomo, Axel-Cyrille^LibreCat; Stein, Benno; Braslavski, Pavel; Hagen, Matthias; Potthast, Martin

Department

Data Science / Heinz Nixdorf Institut
Data Science Junior Research Group

Project

PC2: Computing Resources Provided by the Paderborn Center for Parallel Computing

Abstract

At least 5% of questions submitted to search engines ask about cause-effect relationships in some way. To support the development of tailored approaches that can answer such questions, we construct Webis-CausalQA-22, a benchmark corpus of 1.1 million causal questions with answers. We distinguish different types of causal questions using a novel typology derived from a data-driven, manual analysis of questions from ten large question answering (QA) datasets. Using high-precision lexical rules, we extract causal questions of each type from these datasets to create our corpus. As an initial baseline, the state-of-the-art QA model UnifiedQA achieves a ROUGE-L F1 score of 0.48 on our new benchmark.

Publishing Year

2022

Proceedings Title

Proceedings of the 29th International Conference on Computational Linguistics

Page

3296–3308

LibreCat-ID

33739

Cite this

Bondarenko A, Wolska M, Heindorf S, et al. CausalQA: A Benchmark for Causal Question Answering. In: Proceedings of the 29th International Conference on Computational Linguistics. International Committee on Computational Linguistics; 2022:3296–3308.

Bondarenko, A., Wolska, M., Heindorf, S., Blübaum, L., Ngonga Ngomo, A.-C., Stein, B., Braslavski, P., Hagen, M., & Potthast, M. (2022). CausalQA: A Benchmark for Causal Question Answering. Proceedings of the 29th International Conference on Computational Linguistics, 3296–3308.

@inproceedings{Bondarenko_Wolska_Heindorf_Blübaum_Ngonga Ngomo_Stein_Braslavski_Hagen_Potthast_2022, place={Gyeongju, Republic of Korea}, title={CausalQA: A Benchmark for Causal Question Answering}, booktitle={Proceedings of the 29th International Conference on Computational Linguistics}, publisher={International Committee on Computational Linguistics}, author={Bondarenko, Alexander and Wolska, Magdalena and Heindorf, Stefan and Blübaum, Lukas and Ngonga Ngomo, Axel-Cyrille and Stein, Benno and Braslavski, Pavel and Hagen, Matthias and Potthast, Martin}, year={2022}, pages={3296–3308} }

Bondarenko, Alexander, Magdalena Wolska, Stefan Heindorf, Lukas Blübaum, Axel-Cyrille Ngonga Ngomo, Benno Stein, Pavel Braslavski, Matthias Hagen, and Martin Potthast. “CausalQA: A Benchmark for Causal Question Answering.” In Proceedings of the 29th International Conference on Computational Linguistics, 3296–3308. Gyeongju, Republic of Korea: International Committee on Computational Linguistics, 2022.

A. Bondarenko et al., “CausalQA: A Benchmark for Causal Question Answering,” in Proceedings of the 29th International Conference on Computational Linguistics, 2022, pp. 3296–3308.

Bondarenko, Alexander, et al. “CausalQA: A Benchmark for Causal Question Answering.” Proceedings of the 29th International Conference on Computational Linguistics, International Committee on Computational Linguistics, 2022, pp. 3296–3308.

All files available under the following license(s):

Copyright Statement:

This Item is protected by copyright and/or related rights. [...]

Link(s) to Main File(s)

URL

https://aclanthology.org/2022.coling-1.291.pdf

Access Level

Closed Access

Export

Marked Publications

Open Data LibreCat

Search this title in

Google Scholar