Challenges to the Omohundro–Bostrom framework for AI motivations

Olle Häggström

doi:10.1108/fs-04-2018-0039

Loading next page...

References (25)

(2016)
Brett Hall tells us not to worry about AI Armageddon
R. Perdue (2017)
Superintelligence and Natural Resources: Morality and Technology in a Brave New World
Society & Natural Resources, 30
Max Tegmark (2014)
Friendly Artificial Intelligence: The Physics Challenge
ArXiv, abs/1409.0813
(2011)
Existential depression in gifted individuals
S. Armstrong, A. Sandberg, N. Bostrom (2012)
Thinking Inside the Box: Controlling and Using an Oracle AI
Minds and Machines, 22
D. Philpott (2002)
Moral Realism
The Review of Politics, 64
Policy desiderata in the development of machine superintelligence
Remarks on artificial intelligence and rational optimism
S. Omohundro (2008)
The Basic AI Drives
Superintelligence, N. Bostrom, A. Dafoe, Carrick Flynn, Paul Christiano, Jack Clark, Rebecca Crootof, Richard Danzig, Dan Dewey, E. Drexler, Sebastian Farquhar, Sophie-Charlotte Fischer, Mahendra Prasad, A. Sandberg, Carl Shulman, N. Soares, M. Stehlik (2016)
Policy Desiderata in the Development of Machine
Mark Roojen (2004)
Moral Cognitivism vs. Non-Cognitivism
Eliezer Yudkowsky (2006)
Artificial Intelligence as a Positive and Negative Factor in Global Risk
S. Omohundro (2012)
Rational Artificial Intelligence for the Greater Good
A. Clark, D. Chalmers (1998)
The Extended Mind
Analysis, 58
(2010)
The AI in a box boxes you
N. Bostrom (2012)
The Superintelligent Will: Motivation and Instrumental Rationality in Advanced Artificial Agents
Minds and Machines, 22
D. Brink (1997)
Moral Motivation
Ethics, 108
R. Karpinski, Audrey Kolb, Nicole Tetreault, T. Borowski (2018)
High intelligence: A risk factor for psychological and physiological overexcitabilities
Intelligence, 66
(2016)
Yes, we are worried about the existential risk of artificial intelligence
J. Danaher (2015)
Why AI Doomsayers are Like Sceptical Theists and Why it Matters
Minds and Machines, 25
V. Müller, N. Bostrom (2013)
Future Progress in Artificial Intelligence: A Survey of Expert Opinion
(2012)
Leakproofing the singularity
D. Bourget, D. Chalmers (2014)
What do philosophers believe?
Philosophical Studies, 170
Olle Häggström (2018)
Strategies for an Unfriendly Oracle AI with Reset Button
Artificial Intelligence Safety and Security
S. Cave, Seán ÓhÉigeartaigh (2017)
An AI Race for Strategic Advantage: Rhetoric and Risks
Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society

Publisher: Emerald Publishing
Copyright: © Emerald Publishing Limited
ISSN: 1463-6689
DOI: 10.1108/fs-04-2018-0039
Publisher site: See Article on Publisher Site

Abstract

This paper aims to contribute to the futurology of a possible artificial intelligence (AI) breakthrough, by reexamining the Omohundro–Bostrom theory for instrumental vs final AI goals. Does that theory, along with its predictions for what a superintelligent AI would be motivated to do, hold water?Design/methodology/approachThe standard tools of systematic reasoning and analytic philosophy are used to probe possible weaknesses of Omohundro–Bostrom theory from four different directions: self-referential contradictions, Tegmark’s physics challenge, moral realism and the messy case of human motivations.FindingsThe two cornerstones of Omohundro–Bostrom theory – the orthogonality thesis and the instrumental convergence thesis – are both open to various criticisms that question their validity and scope. These criticisms are however far from conclusive: while they do suggest that a reasonable amount of caution and epistemic humility is attached to predictions derived from the theory, further work will be needed to clarify its scope and to put it on more rigorous foundations.Originality/valueThe practical value of being able to predict AI goals and motivations under various circumstances cannot be overstated: the future of humanity may depend on it. Currently, the only framework available for making such predictions is Omohundro–Bostrom theory, and the value of the present paper is to demonstrate its tentative nature and the need for further scrutiny.

Journal

foresight – Emerald Publishing

Published: Mar 11, 2019

Keywords: Artificial intelligence; Instrumental goals; Omohundro-Bostrom theory; Superintelligence

Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Challenges to the Omohundro–Bostrom framework for AI motivations

Challenges to the Omohundro–Bostrom framework for AI motivations

Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Challenges to the Omohundro–Bostrom framework for AI motivations

Challenges to the Omohundro–Bostrom framework for AI motivations

References (25)

Abstract

Journal

Recommended Articles

There are no references for this article.

Our policy towards the use of cookies