Skip to main content

Moral Philosophy of Artificial General Intelligence: Agency and Responsibility

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13154))

Abstract

The European Parliament recently proposed to grant the personhood of autonomous AI, which raises fundamental questions concerning the ethical nature of AI. Can they be moral agents? Can they be morally responsible for actions and their consequences? Here we address these questions, focusing upon, inter alia, the possibilities of moral agency and moral responsibility in artificial general intelligence; moral agency is a precondition for moral responsibility (which is, in turn, a precondition for legal punishment). In the first part of the paper we address the moral agency status of AI in light of traditional moral philosophy, especially Kant’s, Hume’s, and Strawson’s, and clarify the possibility of Moral AI (i.e., AI with moral agency) by discussing the Ethical Turing Test, the Moral Chinese Room Argument, and Weak and Strong Moral AI. In the second part we address the moral responsibility status of AI, and thereby clarify the possibility of Responsible AI (i.e., AI with moral responsibility). These issues would be crucial for AI-pervasive technosociety in the (possibly near) future, especially for post-human society after the development of artificial general intelligence.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Caliskan, A., et al.: Semantics derived automatically from language corpora contain human-like biases. Science 356, 183–186 (2017)

    Article  Google Scholar 

  2. Cole, D.: The Chinese room argument. In: The Stanford Encyclopedia of Philosophy (2009)

    Google Scholar 

  3. Denis, L.: Kant and Hume on morality. In: Stanford Encyclopedia of Philosophy (2008)

    Google Scholar 

  4. European Parliament report with recommendations to the Commission on Civil Law Rules on Robotics (2015/2103(INL))

    Google Scholar 

  5. Floridi, L., Sanders, J.: On the morality of artificial agents. Minds Mach. 14, 349–379 (2004)

    Article  Google Scholar 

  6. Götz, M.J., et al.: Criminality and antisocial behaviour in unselected men with sex chromosome abnormalities. Psychol. Med. 29, 953–962 (1999)

    Article  Google Scholar 

  7. Sheppard, B.H., et al.: Organizational Justice. Macmillan, Basingstoke (1992)

    Google Scholar 

  8. Hume, D.: A Treatise of Human Nature, pp. 1739–1740 (2003)

    Google Scholar 

  9. Hume, D.: An Enquiry Concerning the Principles of Morals. A. Millar (1751)

    Google Scholar 

  10. Kant, I.: Groundwork for the Metaphysics of Morals, James W. Ellington (trans.). Hackett Publishing Company, Indianapolis (1785)

    Google Scholar 

  11. Kant, I.: Critique of Pure Reason, trans. Paul Guyer and Allen Wood. Cambridge University Press, Cambridge (1998)

    Google Scholar 

  12. Kauppinen, A.: Moral sentimentalism. In: Stanford Encyclopedia of Philosophy (2014)

    Google Scholar 

  13. Maruyama, Y.: Reasoning about fuzzy belief and common belief: with emphasis on incomparable beliefs. In: Proceedings of IJCAI 2011, pp. 1008–1013 (2011)

    Google Scholar 

  14. Maruyama, Y.: Dualities for algebras of Fitting’s many-valued modal logics. Fundamenta Informaticae 106, 273–294 (2011)

    Article  MathSciNet  Google Scholar 

  15. Maruyama, Y.: From operational chu duality to coalgebraic quantum symmetry. In: Heckel, R., Milius, S. (eds.) CALCO 2013. LNCS, vol. 8089, pp. 220–235. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-40206-7_17

    Chapter  Google Scholar 

  16. Maruyama, Y.: Full lambek hyperdoctrine: categorical semantics for first-order substructural logics. In: Libkin, L., Kohlenbach, U., de Queiroz, R. (eds.) WoLLIC 2013. LNCS, vol. 8071, pp. 211–225. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-39992-3_19

    Chapter  Google Scholar 

  17. Maruyama, Y.: Duality theory and categorical universal logic: with emphasis on quantum structures. In: Proceedings of the Tenth Quantum Physics and Logic Conference, EPTCS, vol. 171, pp. 100–112 (2014)

    Google Scholar 

  18. Maruyama, Y.: Prior’s tonk, notions of logic, and levels of inconsistency: vindicating the pluralistic unity of science in the light of categorical logical positivism. Synthese 193, 3483–3495 (2016)

    Article  MathSciNet  Google Scholar 

  19. Maruyama, Y.: AI, quantum information, and external semantic realism: Searle’s observer-relativity and Chinese room, revisited. Fund. Issues Artif. Intell. Synth. Libr. 376, 115–127 (2016)

    MathSciNet  Google Scholar 

  20. Maruyama, Y.: Meaning and duality: from categorical logic to quantum physics. DPhil Thesis, University of Oxford (2017)

    Google Scholar 

  21. Maruyama, Y.: Quantum pancomputationalism and statistical data science: from symbolic to statistical AI, and to quantum AI. In: Müller, V.C. (ed.) PT-AI 2017. SAPERE, vol. 44, pp. 207–211. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-96448-5_20

    Chapter  Google Scholar 

  22. Maruyama, Y.: Compositionality and contextuality: the symbolic and statistical theories of meaning. In: Bella, G., Bouquet, P. (eds.) CONTEXT 2019. LNCS (LNAI), vol. 11939, pp. 161–174. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-34974-5_14

    Chapter  Google Scholar 

  23. Maruyama, Y.: The conditions of artificial general intelligence: logic, autonomy, resilience, integrity, morality, emotion, embodiment, and embeddedness. In: Goertzel, B., Panov, A.I., Potapov, A., Yampolskiy, R. (eds.) AGI 2020. LNCS (LNAI), vol. 12177, pp. 242–251. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-52152-3_25

    Chapter  Google Scholar 

  24. Maruyama, Y.: The categorical integration of symbolic and statistical AI: quantum NLP and applications to cognitive and machine bias problems. In: Abraham, A., Siarry, P., Ma, K., Kaklauskas, A. (eds.) ISDA 2019. AISC, vol. 1181, pp. 466–476. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-49342-4_45

    Chapter  Google Scholar 

  25. Maruyama, Y.: Post-truth AI and big data epistemology: from the genealogy of artificial intelligence to the nature of data science as a new kind of science. In: Abraham, A., Siarry, P., Ma, K., Kaklauskas, A. (eds.) ISDA 2019. AISC, vol. 1181, pp. 540–549. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-49342-4_52

    Chapter  Google Scholar 

  26. Maruyama, Y.: Quantum physics and cognitive science from a Wittgensteinian perspective: Bohr’s Classicism, Chomsky’s Universalism, and Bell’s Contextualism. In: Wuppuluri, S., da Costa, N. (eds.) WITTGENSTEINIAN (adj.). TFC, pp. 375–407. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-27569-3_20

    Chapter  Google Scholar 

  27. Maruyama, Y.: Symbolic and statistical theories of cognition: towards integrated artificial intelligence. In: Cleophas, L., Massink, M. (eds.) SEFM 2020. LNCS, vol. 12524, pp. 129–146. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-67220-1_11

    Chapter  Google Scholar 

  28. Maruyama, Y.: Learning, development, and emergence of compositionality in natural language processing. In: Proceedings of IEEE ICDL, pp. 1–7 (2021)

    Google Scholar 

  29. Maruyama, Y.: Fibred algebraic semantics for a variety of non-classical first-order logics and topological logical translation. J. Symb. Logic 86, 1–27 (2021)

    Article  MathSciNet  Google Scholar 

  30. McLear, C.K: Philosophy of mind. In: Internet Encyclopedia of Philosophy

    Google Scholar 

  31. Noorman, M.: Computing and moral responsibility. In: Stanford Encyclopedia of Philosophy (2018)

    Google Scholar 

  32. Searle, J.R.: Minds, brains, and programs. Behav. Brain Sci. 3, 417–457 (1980)

    Article  Google Scholar 

  33. Searle, J.R.: Intentionality: an essay in the philosophy of mind. In: CUP (1983)

    Google Scholar 

  34. Searle, J.R.: The Rediscovery of the Mind. MIT Press, Cambridge (1992)

    Book  Google Scholar 

  35. Siewert, C.: Consciousness and intentionality. In: Stanford Encyclopedia of Philosophy (2016)

    Google Scholar 

  36. Smith, A.: Theory of Moral Sentiments, A. Millar (1761)

    Google Scholar 

  37. Strawson, P.F.: Freedom and resentment. Proc. Brit. Acad. 48, 1–25 (1962)

    Article  Google Scholar 

  38. Khoury, A.C.: Moral responsibility. J. Value Inquiry 48(4), 573–575 (2014). https://doi.org/10.1007/s10790-014-9457-6

    Article  Google Scholar 

  39. Turney, P., Pantel, P.: From frequency to meaning: vector space models of semantics. J. Artif. Intell. Res. 37, 141–188 (2010)

    Article  MathSciNet  Google Scholar 

  40. Williams, G.: Responsibility. In: Internet Encyclopedia of Philosophy (1995)

    Google Scholar 

Download references

Acknowledgements

This work was supported by the Moonshot R&D Programme (JST; JPMJMS2033). Special thanks to Professor Seth Lazar for his mentoring and guidance and for leading the Humanising Machine Intelligence project.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yoshihiro Maruyama .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Maruyama, Y. (2022). Moral Philosophy of Artificial General Intelligence: Agency and Responsibility. In: Goertzel, B., Iklé, M., Potapov, A. (eds) Artificial General Intelligence. AGI 2021. Lecture Notes in Computer Science(), vol 13154. Springer, Cham. https://doi.org/10.1007/978-3-030-93758-4_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-93758-4_15

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-93757-7

  • Online ISBN: 978-3-030-93758-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics