I beg to differ: how disagreement is handled in the annotation of legal machine learning data sets

Artificial Intelligence and Law 32 (3):839-862 (2024)
  Copy   BIBTEX

Abstract

Legal documents, like contracts or laws, are subject to interpretation. Different people can have different interpretations of the very same document. Large parts of judicial branches all over the world are concerned with settling disagreements that arise, in part, from these different interpretations. In this context, it only seems natural that during the annotation of legal machine learning data sets, disagreement, how to report it, and how to handle it should play an important role. This article presents an analysis of the current state-of-the-art in the annotation of legal machine learning data sets. The results of the analysis show that all of the analysed data sets remove all traces of disagreement, instead of trying to utilise the information that might be contained in conflicting annotations. Additionally, the publications introducing the data sets often do provide little information about the process that derives the “gold standard” from the initial annotations, often making it difficult to judge the reliability of the annotation process. Based on the state-of-the-art, the article provides easily implementable suggestions on how to improve the handling and reporting of disagreement in the annotation of legal machine learning data sets.

Other Versions

No versions found

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 100,830

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Towards a machine understanding of Malawi legal text.Amelia V. Taylor & Eva Mfutso-Bengo - 2023 - Artificial Intelligence and Law 31 (1):1-11.
Extractive summarisation of legal texts.Ben Hachey & Claire Grover - 2006 - Artificial Intelligence and Law 14 (4):305-345.

Analytics

Added to PP
2023-06-29

Downloads
19 (#1,067,153)

6 months
9 (#464,038)

Historical graph of downloads
How can I increase my downloads?