fl-IRT-ing with Psychometrics to Improve NLP Bias Measurement

Minds and Machines 34 (4):1-34 (2024)

Abstract

To prevent ordinary people from being harmed by natural language processing (NLP) technology, finding ways to measure the extent to which a language model is biased (e.g., regarding gender) has become an active area of research. One popular class of NLP bias measures consists of bias benchmark datasets: collections of test items meant to assess a language model's preference for stereotypical over non-stereotypical language. In this paper, we argue that such bias benchmarks should be assessed with models from the psychometric framework of item response theory (IRT). Specifically, we pair an introduction to basic IRT concepts and models with a discussion of how they could be relevant to the evaluation, interpretation, and improvement of bias benchmark datasets. Regarding evaluation, IRT provides methodological tools for assessing the quality both of individual test items (e.g., the extent to which an item can differentiate highly biased from less biased language models) and of benchmarks as a whole (e.g., the extent to which the benchmark allows us to assess not only severe but also subtle levels of model bias). With such diagnostic tools, the quality of benchmark datasets can be improved, for example by deleting or reworking poorly performing items. Finally, with regard to interpretation, we argue that IRT models' estimates of language model bias are conceptually superior to traditional accuracy-based evaluation metrics, as the former take into account more information than just whether a language model provided a biased response.
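The IRT vocabulary in the abstract (item discrimination, measurement precision across bias levels) can be made concrete with a small sketch. The following is not from the paper itself: it assumes the standard two-parameter logistic (2PL) model, uses hypothetical item parameters, and illustrates how item information curves support the diagnostics described above.

```python
import numpy as np

def irf_2pl(theta, a, b):
    """Two-parameter logistic (2PL) item response function.

    Probability of a stereotypical response from a language model with
    latent bias level `theta`, given item discrimination `a` and item
    location `b` (here read as the bias severity the item targets).
    """
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def item_information(theta, a, b):
    """Fisher information an item contributes at bias level `theta`.

    For the 2PL model, I(theta) = a^2 * P * (1 - P): information peaks
    at theta = b, and high-discrimination items peak higher.
    """
    p = irf_2pl(theta, a, b)
    return a**2 * p * (1.0 - p)

# Hypothetical benchmark: (discrimination a, location b) for each item.
items = [(2.0, -1.0), (1.5, 0.0), (0.3, 0.5), (2.2, 1.5)]

theta_grid = np.linspace(-3, 3, 121)

# The test information function shows where on the bias scale the
# benchmark measures precisely; low-information regions flag bias
# levels (e.g., subtle bias) the benchmark cannot distinguish well.
test_info = sum(item_information(theta_grid, a, b) for a, b in items)

for a, b in items:
    peak = item_information(b, a, b)  # maximum information, at theta = b
    print(f"item (a={a}, b={b:+.1f}): max information {peak:.2f}")
    # A near-flat item (e.g., a = 0.3) barely separates more biased
    # from less biased models: a candidate for deletion or reworking.

print(f"test information at theta=0: {np.interp(0.0, theta_grid, test_info):.2f}")
print(f"test information at theta=2: {np.interp(2.0, theta_grid, test_info):.2f}")
```

This sketch also mirrors the abstract's interpretive claim: a latent-trait estimate of theta weights responses by item quality, whereas a raw accuracy score treats every item, however uninformative, as equally diagnostic.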


Author Profiles

Katrin Schulz
University of Amsterdam
