Towards comparable ratings: Exploring bias in German physician reviews

Kersting, Joschka; Maoro, Falk; Geierhos, Michaela

Towards comparable ratings: Exploring bias in German physician reviews

J. Kersting, F. Maoro, M. Geierhos, Data & Knowledge Engineering 148 (2023).

Download

Kersting 2023.pdf 1.38 MB

Download (ext.)

https://doi.org/10.1016/j.datak.2023.102235

DOI

10.1016/j.datak.2023.102235

Journal Article | Published | English

Author

Kersting, Joschka^LibreCat; Maoro, Falk; Geierhos, Michaela

Department

Digitale Kulturwissenschaften (bis 2020)

Project

SFB 901: SFB 901: On-The-Fly Computing - Individualisierte IT-Dienstleistungen in dynamischen Märkten
SFB 901 - B: SFB 901 - Project Area B
SFB 901 - B1: SFB 901 - Parametrisierte Servicespezifikation (Subproject B1)

Abstract

In this study, we evaluate the impact of gender-biased data from German-language physician reviews on the fairness of fine-tuned language models. For two different downstream tasks, we use data reported to be gender biased and aggregate it with annotations. First, we propose a new approach to aspect-based sentiment analysis that allows identifying, extracting, and classifying implicit and explicit aspect phrases and their polarity within a single model. The second task we present is grade prediction, where we predict the overall grade of a review on the basis of the review text. For both tasks, we train numerous transformer models and evaluate their performance. The aggregation of sensitive attributes, such as a physician’s gender and migration background, with individual text reviews allows us to measure the performance of the models with respect to these sensitive groups. These group-wise performance measures act as extrinsic bias measures for our downstream tasks. In addition, we translate several gender-specific templates of the intrinsic bias metrics into the German language and evaluate our fine-tuned models. Based on this set of tasks, fine-tuned models, and intrinsic and extrinsic bias measures, we perform correlation analyses between intrinsic and extrinsic bias measures. In terms of sensitive groups and effect sizes, our bias measure results show different directions. Furthermore, correlations between measures of intrinsic and extrinsic bias can be observed in different directions. This leads us to conclude that gender-biased data does not inherently lead to biased models. Other variables, such as template dependency for intrinsic measures and label distribution in the data, must be taken into account as they strongly influence the metric results. Therefore, we suggest that metrics and templates should be chosen according to the given task and the biases to be assessed.

Keywords

Language model fairness; Aspect phrase classification; Grade prediction; Physician reviews

Publishing Year

2023

Journal Title

Data & Knowledge Engineering

Volume

148

Article Number

102235

ISSN

0169-023X

Financial disclosure

Article Processing Charge funded by the Deutsche Forschungsgemeinschaft.

LibreCat-ID

53801

Cite this

Kersting J, Maoro F, Geierhos M. Towards comparable ratings: Exploring bias in German physician reviews. Data & Knowledge Engineering. 2023;148. doi:10.1016/j.datak.2023.102235

Kersting, J., Maoro, F., & Geierhos, M. (2023). Towards comparable ratings: Exploring bias in German physician reviews. Data & Knowledge Engineering, 148, Article 102235. https://doi.org/10.1016/j.datak.2023.102235

@article{Kersting_Maoro_Geierhos_2023, title={Towards comparable ratings: Exploring bias in German physician reviews}, volume={148}, DOI={10.1016/j.datak.2023.102235}, number={102235}, journal={Data & Knowledge Engineering}, publisher={Elsevier}, author={Kersting, Joschka and Maoro, Falk and Geierhos, Michaela}, year={2023} }

Kersting, Joschka, Falk Maoro, and Michaela Geierhos. “Towards Comparable Ratings: Exploring Bias in German Physician Reviews.” Data & Knowledge Engineering 148 (2023). https://doi.org/10.1016/j.datak.2023.102235.

J. Kersting, F. Maoro, and M. Geierhos, “Towards comparable ratings: Exploring bias in German physician reviews,” Data & Knowledge Engineering, vol. 148, Art. no. 102235, 2023, doi: 10.1016/j.datak.2023.102235.

Kersting, Joschka, et al. “Towards Comparable Ratings: Exploring Bias in German Physician Reviews.” Data & Knowledge Engineering, vol. 148, 102235, Elsevier, 2023, doi:10.1016/j.datak.2023.102235.

All files available under the following license(s):

Creative Commons Attribution 4.0 International Public License (CC-BY 4.0):