Abstract
How do we ensure that future generally intelligent AI share our values? This is the value-alignment problem. It is a weighty matter. After all, if AI are indifferent to our wellbeing, or worse, actively hostile toward us, then they pose an existential threat to humanity. Some philosophers have argued that one important way to mitigate this threat is to develop only AI that share our values, or whose values 'align with' ours. However, there is no guarantee that this policy will be universally implemented; 'bad actors', in particular, are likely to flout it. In this paper, I show how the predictive processing model of the mind, currently ascendant in cognitive science, may ameliorate the value-alignment problem. In essence, I argue that there is a plurality of reasons why any future generally intelligent AI will possess a predictive processing cognitive architecture (e.g. because we decide to build them that way; because it is the only possible cognitive architecture that can underpin general intelligence; because it is the easiest way to create AI). I also argue that if future generally intelligent AI possess a predictive processing cognitive architecture, then they will come to share our pro-moral motivations (valuing humanity as an end, avoiding maleficent actions, etc.), regardless of their initial motivation set. Consequently, such AI will pose a minimal threat to humanity. I conclude, then, that the value-alignment problem is significantly ameliorated under the assumption that future generally intelligent AI will possess a predictive processing cognitive architecture.