Backward vs. Forward-Oriented Decision Making in the Iterated Prisoner’s Dilemma: A Comparison Between Two Connectionist Models

Lalev, Emilian; Grinberg, Maurice

doi:10.1007/978-3-540-74262-3_19

Emilian Lalev¹ &
Maurice Grinberg¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4520))

Included in the following conference series:

Workshop on Anticipatory Behavior in Adaptive Learning Systems

930 Accesses
1 Citations

Abstract

We compare the performance of two connectionist models developed to account for some specific aspects of the decision making process in the Iterated Prisoner’s Dilemma Game. Both models are based on common recurrent network architecture. The first of them uses a backward-oriented reinforcement learning algorithm for learning to play the game while the second one makes its move decisions based on generated predictions about future games, moves and payoffs. Both models involve prediction of the opponent move and of the expected payoff and have an in-built autoassociator in their architecture aimed at more efficient payoff matrix representation. The results of the simulations show that the model with explicit anticipation about game outcomes could reproduce the experimentally observed dependency of the cooperation rate on the so-called cooperation index thus showing the importance of anticipation in modeling the actual decision making process in human participants. The role of the models’ building blocks and mechanisms is investigated and discussed. Comparisons with experiments with human participants are presented.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Deep multiagent reinforcement learning: challenges and directions

Article Open access 19 October 2022

Analysing factorizations of action-value networks for cooperative multi-agent reinforcement learning

Article Open access 07 June 2021

Deep Reinforcement Learning for FlipIt Security Game

References

Colman, A.: Cooperation, Psychological Game Theory, and Limitations of Rationality in Social Interaction. Behav. Brain Sci. 26, 139–153 (2003)
Google Scholar
Hristova, E., Grinberg, M.: Context Effects On Judgment Scales in the Prisoner’s Dilemma Game. In: Proceedings of the 1st European Conference on Cognitive Economics. ECCE1, Gif-sur-Yvette, France (2004)
Google Scholar
Hristova, E., Grinberg, M.: Investigation of Context Effects in Iterated Prisoner’s Dilemma Game. In: Dey, A.K., Kokinov, B., Leake, D., Turner, R. (eds.) CONTEXT 2005. LNCS (LNAI), vol. 3554, pp. 183–196. Springer, Heidelberg (2005)
Google Scholar
Hristova, E., Grinberg, M.: Information Acquisition in the Iterated Prisoner’s Dilemma Game: An Eye-tracking Study. In: Proceedings of the 27th Annual Conference of the Cognitive Science Society, Erlbaum, Hillsdale (2005)
Google Scholar
Grinberg, M., Hristova, E., Popova, M.: Applicability of Eye-Tracking Information Acquisition Methods for Studying the Strategy Dynamics in the Iterated Prisoner’s Dilemma Game. Position paper in the workshop: What have eye movements told us so far, and what is next. In: CogSci 2006, The 28th Annual Conference of the Cognitive Science Society, Vancouver (July 26-29, 2006)
Google Scholar
Rapoport, A., Chammah, A.: Prisoner’s Dilemma: A Study in Conflict and Cooperation. University of Michigan Press, Ann Arbor (1965)
Google Scholar
Erev, I., Roth, A.: Simple reinforcement learning models and reciprocation in the prisoner’s dilemma game. In: Gigerenzer, G., Selten, R. (eds.) Bounded rationality: the adaptive toolbox, MIT Press, Cambridge, Mass (2001)
Google Scholar
Camerer, C., Ho, T.-H., Chong, J.: Sophisticated EWA Learning and Strategic Teaching in Repeated Games. J. Econ. Theory 104, 137–188 (2002)
Article MATH Google Scholar
Macy, M.W., Flache, A.: Learning Dynamics in Social Dilemmas. PNAS 99(suppl. 3), 7229–7236 (2002)
Article Google Scholar
Taiji, M., Ikegami, T.: Dynamics of Internal Models in Game Players. Physica D 134, 253–266 (1999)
MATH MathSciNet Google Scholar
Elman, J.L.: Finding structure in time. Cognitive Science 14, 179–211 (1990)
Article Google Scholar
Leydesdorff, L.L., Dubois, D.: Anticipation in Social Systems. International Journal of Computing Anticipatory Systems 15, 203–216 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Central and East European Center for Cognitive Science, New Bulgarian University, 21 Montevideo Street, 1618 Sofia, Bulgaria
Emilian Lalev & Maurice Grinberg

Authors

Emilian Lalev
View author publications
You can also search for this author in PubMed Google Scholar
Maurice Grinberg
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Martin V. Butz Olivier Sigaud Giovanni Pezzulo Gianluca Baldassarre

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lalev, E., Grinberg, M. (2007). Backward vs. Forward-Oriented Decision Making in the Iterated Prisoner’s Dilemma: A Comparison Between Two Connectionist Models. In: Butz, M.V., Sigaud, O., Pezzulo, G., Baldassarre, G. (eds) Anticipatory Behavior in Adaptive Learning Systems. ABiALS 2006. Lecture Notes in Computer Science(), vol 4520. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74262-3_19

Download citation

DOI: https://doi.org/10.1007/978-3-540-74262-3_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74261-6
Online ISBN: 978-3-540-74262-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Backward vs. Forward-Oriented Decision Making in the Iterated Prisoner’s Dilemma: A Comparison Between Two Connectionist Models

Abstract

Access this chapter

Preview

Similar content being viewed by others

Deep multiagent reinforcement learning: challenges and directions

Analysing factorizations of action-value networks for cooperative multi-agent reinforcement learning

Deep Reinforcement Learning for FlipIt Security Game

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Backward vs. Forward-Oriented Decision Making in the Iterated Prisoner’s Dilemma: A Comparison Between Two Connectionist Models

Abstract

Access this chapter

Preview

Similar content being viewed by others

Deep multiagent reinforcement learning: challenges and directions

Analysing factorizations of action-value networks for cooperative multi-agent reinforcement learning

Deep Reinforcement Learning for FlipIt Security Game

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation