In Conversation with Artificial Intelligence: Aligning language Models with Human Values

Atoosa Kasirzadeh

PhilArchive

More download options

In Conversation with Artificial Intelligence: Aligning language Models with Human Values

Atoosa Kasirzadeh

Philosophy and Technology 36 (2):1-24 (2023) Copy BIBT_EX

Abstract

Large-scale language technologies are increasingly used in various forms of communication with humans across different contexts. One particular use case for these technologies is conversational agents, which output natural language text in response to prompts and queries. This mode of engagement raises a number of social and ethical questions. For example, what does it mean to align conversational agents with human norms or values? Which norms or values should they be aligned with? And how can this be accomplished? In this paper, we propose a number of steps that help answer these questions. We start by developing a philosophical analysis of the building blocks of linguistic communication between conversational agents and human interlocutors. We then use this analysis to identify and formulate ideal norms of conversation that can govern successful linguistic communication between humans and conversational agents. Furthermore, we explore how these norms can be used to align conversational agents with human values across a range of different discursive domains. We conclude by discussing the practical implications of our proposal for the design of conversational agents that are aligned with these norms and values.

Cite

Plain text

BibTeX

Formatted text

Zotero

EndNote

Reference Manager

RefWorks

Options

Edit

Mark as duplicate

Find it on Scholar

Request removal from index

Revision history

Author's Profile

Atoosa Kasirzadeh

Carnegie Mellon University

Keywords

Philosophy of Technology

Reprint years

DOI

10.1007/s13347-023-00606-x

Other Versions

No versions found

My notes

Analytics

Added to PP
2022-12-06

Downloads
1,537 (#10,757)

6 months
195 (#18,191)

Historical graph of downloads

How can I increase my downloads?

Author's Profile

Atoosa Kasirzadeh

Carnegie Mellon University

Citations of this work

ChatGPT: deconstructing the debate and moving it forward.Mark Coeckelbergh & David J. Gunkel - 2024 - AI and Society 39 (5):2221-2231.

Varieties of Bias.Gabbrielle M. Johnson - 2024 - Philosophy Compass (7):e13011.

Personhood and AI: Why large language models don’t understand us.Jacob Browning - 2023 - AI and Society 39 (5):2499-2506.

Mapping the Ethics of Generative AI: A Comprehensive Scoping Review.Thilo Hagendorff - 2024 - Minds and Machines 34 (4):1-27.

AI Enters Public Discourse: a Habermasian Assessment of the Moral Status of Large Language Models.Paolo Monti - 2024 - Ethics and Politics 61 (1):61-80.

View all 14 citations / Add more citations

References found in this work

Science as Social Knowledge: Values and Objectivity in Scientific Inquiry.Helen E. Longino - 1990 - Princeton University Press.

IX.—Essentially Contested Concepts.W. B. Gallie - 1956 - Proceedings of the Aristotelian Society 56 (1):167-198.

Inductive risk and values in science.Heather Douglas - 2000 - Philosophy of Science 67 (4):559-579.

The Scientist Qua Scientist Makes Value Judgments.Richard Rudner - 1953 - Philosophy of Science 20 (1):1-6.

Four Decades of Scientific Explanation.Wesley C. Salmon & Anne Fagot-Largeault - 1989 - History and Philosophy of the Life Sciences 16 (2):355.

View all 28 references / Add more references

Applied ethics	Epistemology	History of Western Philosophy	Meta-ethics	Metaphysics	Normative ethics
Philosophy of biology	Philosophy of language	Philosophy of mind	Philosophy of religion	Science Logic and Mathematics	More ...

In Conversation with Artificial Intelligence: Aligning language Models with Human Values

Abstract

Author's Profile

Categories

Keywords

Reprint years

DOI

Other Versions

Links

PhilArchive

External links

Through your library

My notes

Similar books and articles

Analytics

Author's Profile

Citations of this work

References found in this work