An Open Framework for the Reproducible Study of the Iterated Prisoner’s Dilemma

Vincent Knight; Owen Campbell; Marc Harper; Karol Langner; James Campbell; Thomas Campbell; Alex Carney; Martin Chorley; Cameron Davidson-Pilon; Kristian Glass; Nikoleta Glynatsi; Tomáš Ehrlich; Martin Jones; Georgios Koutsovoulos; Holly Tibble; Jochen Müller; Geraint Palmer; Piotr Petunov; Paul Slavin; Timothy Standen; Luis Visintini; Karl Molden

Software Metapapers

An Open Framework for the Reproducible Study of the Iterated Prisoner’s Dilemma

Authors

Vincent Knight
Owen Campbell
Marc Harper
Karol Langner
James Campbell
Thomas Campbell
Alex Carney
Martin Chorley
Cameron Davidson-Pilon
Kristian Glass
Nikoleta Glynatsi
Tomáš Ehrlich
Martin Jones
Georgios Koutsovoulos
Holly Tibble
Jochen Müller
Geraint Palmer
Piotr Petunov
Paul Slavin
Timothy Standen
Luis Visintini
Karl Molden

Abstract

The Axelrod library is an open source Python package that allows for reproducible game theoretic research into the Iterated Prisoner’s Dilemma. This area of research began in the 1980s but suffers from a lack of documentation and test code. The goal of the library is to provide such a resource, with facilities for the design of new strategies and interactions between them, as well as conducting tournaments and ecological simulations for populations of strategies.

With a growing collection of 139 strategies, the library is a also a platform for an original tournament that, in itself, is of interest to the game theoretic community.

This paper describes the Iterated Prisoner’s Dilemma, the Axelrod library and its development, and insights gained from some novel research.

Keywords:

Year: 2016

Volume: 4 Issue: 1

Page/Article: e35

DOI: 10.5334/jors.125

Submitted on Apr 5, 2016

Accepted on Jul 1, 2016

Published on Aug 31, 2016

Peer Reviewed

CC BY 4.0

(1) Overview

Introduction

Several Iterated Prisoner’s Dilemma tournaments have generated much interest; Axelrod’s original tournaments [, ], two 2004 anniversary tournaments [], and the Stewart and Plotkin 2012 tournament [], following the discovery of zero-determinant strategies. Subsequent research has spawned a number of papers (many of which are referenced throughout this paper), but rarely are the results reproducible. Amongst well-known tournaments, in only one case is the full original source code available (Axelrod’s second tournament [], in FORTRAN). In no cases is the available code well-documented, easily modifiable, or released with significant test suites.

To complicate matters further, a new strategy is often studied in isolation with opponents chosen by the creator of that strategy. Often such strategies are not sufficiently described to enable reliable recreation (in the absence of source code), with [] being a notable counter-example. In some cases, strategies are revised without updates to their names or published implementations [, ]. As such, the results cannot be reliably replicated and therefore have not met the basic scientific criterion of falsifiability.

This paper introduces a software package: the Axelrod-Python library. The Axelrod-Python project has the following stated goals:

To enable the reproduction of Iterated Prisoner’s Dilemma research as easily as possible
To produce the de-facto tool for any future Iterated Prisoner’s Dilemma research
To provide as simple a means as possible for anyone to define and contribute new and original Iterated Prisoner’s Dilemma strategies

The presented library is partly motivated by an ongoing discussion in the academic community about reproducible research [, , , ], and is:

Open: all code is released under an MIT license;
Reproducible and well-tested: at the time of writing there is an excellent level of integrated tests with 99.73% coverage (including property based tests: [])
Well-documented: all features of the library are documented for ease of use and modification
Extensive: 139 strategies are included, with infinitely-many available in the case of parametrised strategies
Extensible: easy to modify to include new strategies and to run new tournaments

Review of the literature

As stated in []: “few works in social science have had the general impact of [Axelrod’s study of the evolution of cooperation]”. In 1980, Axelrod wrote two papers: [, ] which describe a computer tournament that has been a major influence on subsequent game theoretic work [, , , , , , , , , , , , , , , , , , ]. As described in [] this work has not only had impact in mathematics but has also led to insights in biology (for example in [], a real tournament where Blue Jays are the participants is described) and in particular in the study of evolution.

The tournament is based on an iterated game (see [] or similar for details) where two players repeatedly play the normal form game of (1) in full knowledge of each other’s playing history to date. An excellent description of the one shot game is given in [] which is paraphrased below:

Two players must choose between Cooperate (C) and Defect (D):

If both choose C, they receive a payoff of R (Reward);
If both choose D, they receive a payoff of P (Punishment);
If one chooses C and the other D, the defector receives a payoff of T (Temptation) and the cooperator a payoff of S (Sucker).

and the following reward matrix results from the Cartesian product of two decision vectors 〈C, D〉,

(1)

(R, R S, T T, S P, P) such that T > R > P > S and 2 R > T + S

M1 \documentclass[10pt]{article} \usepackage{wasysym} \usepackage[substack]{amsmath} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage[mathscr]{eucal} \usepackage{mathrsfs} \usepackage{pmc} \usepackage[Euler]{upgreek} \pagestyle{empty} \oddsidemargin -1.0in \begin{document} \[ \left( {\begin{array}{*{20}{c}} {R,R}&{S,T}\\ {T,S}&{P,P} \end{array}} \right){\rm{ such}}\;{\rm{that }} T > R > P > S{\rm{ and }}2R > T + S \] \end{document}

The game of (1) is called the Prisoner’s Dilemma. Specific numerical values of (R, S, T, P) = (3, 0, 5, 1) are often used in the literature [, ], although any satisfying the conditions in 1 will yield similar results. Axelrod’s tournaments (and further implementations of these) are sometimes referred to as Iterated Prisoner’s Dilemma (IPD) tournaments. An incomplete representative overview of published tournaments is given in Table 1.

Table 1

An overview of a selection of published tournaments. Not all tournaments were ‘standard’ round robins; for more details see the indicated references.

Year	Reference	Number of Strategies	Type	Source Code

1979	[]	13	Standard	Not immediately available
1979	[]	64	Standard	Available in FORTRAN
1991	[]	13	Noisy	Not immediately available
2002	[]	16	Wildlife	Not a computer based tournament
2005	[]	223	Varied	Not available
2012	[]	13	Standard	Not fully available

In [] a description is given of how incomplete information can be used to enhance cooperation, in a similar approach to the proof of the Folk theorem for repeated games []. This aspect of incomplete information is also considered in [, , ] where “noisy” tournaments randomly flip the choice made by a given strategy. In [], incomplete information is considered in the sense of a probabilistic termination of each round of the tournament.

As mentioned before, IPD tournaments have been studied in an evolutionary context: [, , , ] consider this in a traditional evolutionary game theory context.

These works investigate particular evolutionary contexts within which cooperation can evolve and persist. This can be in the context of direct interactions between strategies or population dynamics for populations of many players using a variety of strategies, which can lead to very different results. For example, in [] a machine learning algorithm in a population context outperforms strategies described in [] and [] that are claimed to dominate any evolutionary opponent in head-to-head interactions.

Further to these evolutionary ideas, [, ] are examples of using machine learning techniques to evolve particular strategies. In [], Axelrod describes how similar techniques are used to genetically evolve a high performing strategy from a given set of strategies. Note that in his original work, Axelrod only used a base strategy set of 12 strategies for this evolutionary study. This is noteworthy as the library now boasts over 139 strategies that are readily available for a similar analysis.

Implementation and architecture

Description of the Axelrod Python package

The library is written in Python (http://www.python.org/) which is a popular language in the academic community with libraries developed for a variety of uses including:

Algorithmic Game Theory [] (http://gambit.sourceforge.net/).
Astrophysics [] (http://www.astropy.org/);
Data manipulation [] (http://pandas.pydata.org/);
Machine learning [] (http://scikit-learn.org/);
Mathematics [] (http://www.sagemath.org/);
Visualisation [] (http://matplotlib.org/);

Furthermore, in [] Python is described as an appropriate language for the reproduction of Iterated Prisoner’s Dilemma tournaments due to its object oriented nature and readability.

The library itself is available at https://github.com/Axelrod-Python/Axelrod.

This is a hosted git repository. Git is a version control system which is one of the recommended aspects of reproducible research [, ].

As stated in the Introduction, one of the main goals of the library is to allow for the easy contribution of strategies. Doing this requires the writing of a simple Python class (which can inherit from other predefined classes). All components of the library are automatically tested using a combination of unit, property and integration tests. These tests are run as new features are added to the library to ensure compatibility (they are also run automatically using travis-ci.org). When submitting a strategy, a simple test is required which ensures the strategy behaves as expected. Full contribution guidelines can be found in the documentation, which is also part of the library itself and is hosted using readthedocs.org. As an example, Figures 1 and 2 show the source code for the Grudger strategy as well as its corresponding test.

Figure 1

Source code for the Grudger strategy.

Figure 2

Test code for the Grudger strategy.

You can see an overview of the structure of the source code in Figure 3. This shows the parallel collection of strategies and their tests. Furthermore the underlying engine for the library is a class for tournaments which lives in the tournament.py module. This class is responsible for coordinating the play of generated matches (from the match.py module). This generation of matches is the responsibility of a match generator class (in the match_generator.py module) which is designed in such a way as to be easily modifiable to create new types of tournaments. This is described further in a tutorial in the documentation which shows how to easily create a tournament where players only play each other with probability 0.5. This will be discussed further in the reuse section of this paper.

Figure 3

An overview of the source code.

To date the library has had contributions from 26 contributors from a variety of backgrounds which are not solely academic. These contributions have been mostly in terms of strategies. One strategy is the creation of an undergraduate mathematics student with little prior knowledge of programming. Multiple other strategies were written by a 15 year old secondary school student. Both of these students are authors of this paper. As well as these strategy contributions, vital architectural improvements to the library itself have also been contributed.

(2) Availability

Operating system

The Axelrod library runs on all major operating systems: Linux, Mac OS X and Windows.

Programming language

The library is continuously tested for compatibility with Python 2.7 and the two most recent python 3 releases.

Additional system requirements

There are no specific additional system requirements.

Support

Support is readily available in multiple forms:

An online chat channel: https://gitter.im/Axelrod-Python/Axelrod.
An email group: https://groups.google.com/forum/#!topic/axelrod-python.

Dependencies

The following Python libraries are required dependencies:

Numpy 1.9.2
Matplotlib 1.4.2 (only a requirement if graphical output is required)
Tqdm 3.4.0
Hypothesis 3.0 (only a requirement for development)

List of contributors

The names of all the contributors are not known: as these were mainly done through Github and some have not provided their name or responded to a request for further details. Here is an incomplete list:

Owen Campbell
Marc Harper
Vincent Knight
Karol M. Langner
James Campbell
Thomas Campbell
Martin Jones
Georgios Koutsovoulos
Holly Tibble
Jochen Müller
Geraint Palmer
Paul Slavin
Alex Carney
Martin Chorley
Cameron Davidson-Pilon
Kristian Glass
Nikoleta Glynatsi
Tomáš Ehrlich
Timothy Standen
Luis Visintini
Karl Molden
Jason Young
Andy Boot
Anna Barriscale

Software location

Code repository

Name: Github

Identifier: https://github.com/Axelrod-Python/Axelrod

Licence: MIT

Date published: 2015-02-16

Reuse potential

The Axelrod library has been designed with sustainable software practices in mind. There is an extensive documentation suite: axelrod.readthedocs.org/en/latest/. Furthermore, there is a growing set of example Jupyter notebooks available here: https://github.com/Axelrod-Python/Axelrod-notebooks.

The availability of a large number of strategies makes this tool an excellent and obvious example of the benefits of open research which should positively impact the game theory community. This is evidently true already as the library has been used to study and create interesting and powerful new strategies.

Installation of the library is straightforward via standard python installation repositories (https://pypi.python.org/pypi). The package name is axelrod and can thus be installed by calling: pip install axelrod on all major operating systems (Windows, OS X and Linux).

Figure 4 shows a very simple example of using the library to create a basic tournament giving the graphical output shown in Figure 5.

Figure 4

A simple set of commands to create a demonstration tournament. The output is shown in Figure 5.

Figure 5

The results from a simple tournament.

New strategies, tournaments and implications

Due to the open nature of the library the number of strategies included has grown at a fast pace, as can be seen in Figure 6.

Figure 6

The number of strategies included in the library.

Nevertheless, due to previous research being done in an irreproducible manner with, for example, no source code and/or vaguely described strategies, not all previous tournaments can yet be reproduced. In fact, some of the early tournaments might be impossible to reproduce as the source code is apparently forever lost. This library aims to ensure reproducibility in the future.

One tournament that is possible to reproduce is that of []. The strategies used in that tournament are the following:

Cooperator
Defector
ZD-Extort-2
Joss: 0.9
Hard Tit For Tat
Hard Tit For 2 Tats
Tit For Tat
Grudger
Tit For 2 Tats
Win-Stay Lose-Shift
Random: 0.5
ZD-GTFT-2
GTFT: 0.33
Hard Prober
Prober
Prober 2
Prober 3
Calculator
Hard Go By Majority

This can be reproduced as shown in Figure 8, which gives the plot of Figure 7. Note that slight differences with the results of [] are due to stochastic behaviour of some strategies.

Figure 7

The results from [], reproduced with the Axelrod library.

Figure 8

Source code for reproducing the tournament of [].

In parallel to the Python library, a tournament is being kept up to date that pits all available strategies against each other. Figure 9 shows the results from the full tournament which can also be seen (in full detail) here: http://axelrod-tournament.readthedocs.org/. Data sets are also available showing the plays of every match that takes place. Note that to recreate this tournament simply requires changing a single line of the code shown in Figure 4, changing:

>>> strategies = [s() for s in axelrod.demo_strategies]}

to:

>>> strategies = [s() for s in axelrod.ordinary_strategies]}.

Figure 9

Results from the library tournament (2016-06-13).

The current winning strategy is new to the research literature: Looker Up. This is a strategy that maps a given set of states to actions. The state space is defined generically by m, n so as to map states to actions as shown in (2).

(2)

((C, D, D, D, C, D, D, C) ︸ m first actions  by opponent, ((C, C), (C, C)) ︷ n last pairs of actions) → D

M2 \documentclass[10pt]{article} \usepackage{wasysym} \usepackage[substack]{amsmath} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage[mathscr]{eucal} \usepackage{mathrsfs} \usepackage{pmc} \usepackage[Euler]{upgreek} \pagestyle{empty} \oddsidemargin -1.0in \begin{document} \[ (\underbrace {(C,D,D,D,C,D,D,C)}_{m\;{\rm{first}}\;{\rm{actions}}\;\,{\rm{by}}\;{\rm{opponent}}},\overbrace {((C,\,C),\,\,(C,\,C))}^{n\;{\rm{last}}\;{\rm{pairs}}\;{\rm{of}}\;{\rm{actions}}}) \to D \] \end{document}

The example of (2) is an incomplete illustration of the mapping for m = 8, n = 2. Intuitively, this state space uses the initial plays of the opponent to gain some information about its intentions whilst still taking into account the recent play. The actual winning strategy is an instance of the framework for m = n = 2 for which a particle swarm algorithm has been used to train it. The second placed strategy was trained with an evolutionary algorithm []. In [] experiments are described that evaluate how the second placed strategy behaves in environments other than those in which it was trained and it continues to perform strongly.

There are various other insights that have been gained from ongoing open research on the library, details can be found in []. These include:

A closer look at zero determinant strategies, showing that extortionate strategies obtain a large number of wins: the number of times they outscore an opponent during a given match. However these do not perform particularly well from the overall tournament ranking point of view. This is relevant given the findings of [] in which zero determinant strategies are shown to be able to perform better than any other strategy. This finding extends to noisy tournaments (which are also implemented in the library).
This negative relationship between wins and performance does not generalise. There are some strategies that perform well, both in terms of matches won and overall performance: Back stabber, Double crosser, Looker Up, and Fool Me Once. These strategies continue to perform well in noisy tournaments, however some of these have knowledge of the length of the game (Back stabber and Double crosser). This is not necessary to rank well in both wins and score as demonstrated by Looker Up and Fool Me Once.
Strategies like Looker Up and Meta Hunter seem to be generally cooperative yet still exploit naive strategies. The Meta Hunter strategy is a particular type of Meta strategy which uses a variety of other strategy behaviours to choose a best action. These strategies perform very well in general and continue to do so in noisy tournaments.

Conclusion

This paper has presented a game theoretic software package that aims to address reproducibility of research into the Iterated Prisoner’s Dilemma. The open nature of the development of the library has lead rapidly to the inclusion of many well known strategies, many novel strategies, and new and recapitulated insights.

The capabilities of the library mentioned above are not at all comprehensive, a list of the current abilities include:

Noisy tournaments.
Tournaments with probabilistic ending of interactions.
Ecological analysis of tournaments.
Moran processes.
Morality metrics based on [].
Transformation of strategies (in effect giving an infinite number of strategies).
Classification of strategies according to multiple dimensions.
Gathering of full interaction history for all interactions.
Parallelization of computations for tournaments with a high computational cost.

These capabilities are constantly being updated.

Acknowledgements

The authors would like to thank all contributors. Also, they thank Robert Axelrod himself for his well wishes with the library.

Competing Interests

The authors declare that they have no competing interests.

References

Astropy Collaboration et al. (2013). “Astropy: A community Python package for astronomy” Astronomy and Astrophysics Oct. 2013558: A33. DOI: https://doi.org/10.1051/0004-6361/201322068 arXiv: 1307.6212 [as-tro-ph.IM].
Axelrod, R (1980). “Effective Choice in the Prisoner’s Dilemma” Journal of Conflict Resolution 24(1): 3–25, DOI: https://doi.org/10.1177/002200278002400301
Axelrod, R (1980). “More Effective Choice in the Prisoner’s Dilemma” Journal of Conflict Resolution 24(3): 379–403, 0022-0027 DOI: https://doi.org/10.1177/002200278002400301
Axelrod, R M (2006). The Evolution of Cooperation. Basic books.
Banks, J S and Sundaram, R K (1990). “Repeated games, finite automata, and complexity” Games and Economic Behavior 2(2): 97–117, 08998256 DOI: https://doi.org/10.1016/0899-8256(90)90024-O
Bendor, J, Kramer, R M and Stout, S (1991). “When in doubt …: Cooperation in a noisy prisoner’s dilemma” Journal of Conflict Resolution 35(4): 691–719, 0022-0027 DOI: https://doi.org/10.1177/0022002791035004007
Boyd, R and Lorberbaum, J P (1987). “No pure strategy is evolutionarily stable in the repeated Prisoner’s Dilemma game” Nature 327: 58–59, 0028-0836 DOI: https://doi.org/10.1038/327058a0
Chellapilla, K and Fogel, D B (1999). “Evolution, neural networks, games, and intelligence” Proceedings of the IEEE : 1471–1496, 87(9)00189219 DOI: https://doi.org/10.1109/5.784222
Crick, T et al. (2014). ““Share and Enjoy”: Publishing Useful and Usable Scientific Models” arXiv: 1409.0367.
David, B F (1993). “Evolving Behaviors in the Iterated Prisoner’s Dilemma” Evol. Comput : 77–97, 1(1)1063-6560 DOI: https://doi.org/10.1162/evco.1993.1.1.77
Doebeli, M and Hauert, C (2005). “Models of cooperation based on the Prisoner’s Dilemma and the Snowdrift game” Ecology Letters : 748–766, 8(7)1461023X DOI: https://doi.org/10.1111/j.1461-0248.2005.00773.x
Ellison, G (1994). “Cooperation in the prisoner’s dilemma with anonymous random matching” Review of Economic Studies (567): 588.61(3)00346527 DOI: https://doi.org/10.2307/2297904
Gotts, N, Polhill, J and Law, A (2003). “Agent-based simulation in the study of social dilemmas” Artificial Intelligence Review 19: 3–92, 0269-2821 DOI: https://doi.org/10.1023/A:1022120928602
Harper, M (2015). Marc Harper Codes, http://marcharper.codes/2015-11-17/ipd2.html
Hilbe, C, Nowak, M A and Traulsen, A (2013). “Adaptive Dynamics of Extortion and Compliance” PLoS ONE 8(11): e77886.1932-6203 DOI: https://doi.org/10.1371/journal.pone.0077886
Hong, N P C et al. (2015). “Top Tips to Make Your Research Irreproducible” : 5–6. arXiv: 1504.00062.
Hunter, J D (2007). “Matplotlib: A 2D graphics environment” Computing In Science & Engineering : 90–95, 9(3) DOI: https://doi.org/10.1109/MCSE.2007.55
Isaac, A (2008). “Simulating Evolutionary Games: A Python-Based Introduction” Journal of Artificial Societies and Social Simulation 11(3): 8.14607425
Jones, M (2015). Evolving strategies for an Iterated Prisoner’s Dilemma tournament, http://mojones.net/evolving-strategies-for-an-iterated-prisoners-dilemma-tournament.html
Kendall, G, Yao, X and Chong, S Y (2007). The iterated prisoners’ dilemma: 20 years on. World Scientific Publishing Co., Inc..
Knight, V (2015). http://vknight.org/unpeudemath/gametheory/2015/11/28/Experimenting-with-a-high-performing-evolved-strategy-in-other-environments.html
Koutsovoulos, G (2016). Optimising the LookerUp strategy for an Iterated Prisoner’s Dilemma tournament,
Kraines, D and Kraines, V (1989). “Pavlov and the prisoner’s dilemma” Theory and Decision 26(1): 47–79, 00405833 DOI: https://doi.org/10.1007/BF00134056
Lee, C, Harper, M and Fryer, D (2015). “The Art of War: Beyond Memory-one Strategies in Population Games” Plos One 10(3): e0120625.1932-6203 DOI: https://doi.org/10.1371/journal.pone.0120625
Li, J (2007). “How to design a strategy to win an IPD tournament” The iterated prisoners dilemma 20: 89–104, DOI: https://doi.org/10.1142/9789812770684_0004
Li, J, Hingston, P and Kendall, G (2011). “Engineering design of strategies for winning iterated prisoner’s dilemma competitions” Computational Intelligence and AI in Games, IEEE Transactions on 3(4): 348–360, DOI: https://doi.org/10.1109/tciaig.2011.2166268
Lorberbaum, J P (1994). “No strategy is evolutionarily stable in the repeated Prisoner’s Dilemma game” Journal of Theoretical Biology 168(2): 117–130, DOI: https://doi.org/10.1006/jtbi.1994.1092
Maclver, D R (2016). Hypothesis 3.0.3, https://github.com/DRMacIver/hypothesis
Maschler, M, Solan, E and Zamir, S (2013). Game theory. Cambridge University Press, 10039781107005488 DOI: https://doi.org/10.1017/CBO9780511794216
Mckelvey, R et al. (2006). Gambit: Software tools for game theory, Tech. rep..
McKinney, W (2010). van der Walt, S. and Millman, J. eds. “Data Structures for Statistical Computing in Python” Proceedings of the 9th Python in Science Conference. : 51–56.
Milgrom, P, Roberts, J and Wilson, R (1982). “Rational Cooperation in the Finitely Repeated Prisoners’ Dilemma” Journal of Economic Theory 252: 245–252.
Molander, P (1985). “The optimal level of generosity in a selfish, uncertain environment” The Journal of Conflict Resolution : 611–618, 29(4)0022-0027 DOI: https://doi.org/10.1177/0022002785029004004
Murnighan, J K et al. (1983). “Expecting Continued Play in Prisoner’s Dilemma Games” 27(2): 279–300.
Pedregosa, F et al. (2011). “Scikit-learn: Machine Learning in Python” Journal of Machine Learning Research 12: 2825–2830.
Press, W H and Dyson, F J (2012). “Iterated Prisoner’s Dilemma contains strategies that dominate any evolutionary opponent” Proceedings of the National Academy of Sciences 109(26): 10409–10413, 0027-8424 DOI: https://doi.org/10.1073/pnas.1206569109
Prlik, A and Procter, J B (2012). “Ten Simple Rules for the Open Development of Scientific Software” PLoS Computational Biology 8(12): e1002802.1553-7358 DOI: https://doi.org/10.1371/journal.pcbi.1002802
Sandve, G K et al. (2013). “Ten Simple Rules for Reproducible Computational Research” PLoS Computational Biology 9(10): 1–4, 1553734X DOI: https://doi.org/10.1371/journal.pcbi.1003285
Singer-Clark, T (2014). “Morality Metrics On Iterated Prisoners Dilemma Players”
Slany, W and Kienreich, W (2007). “On some winning strategies for the iterated prisoners dilemma” The iterated prisoners dilemma, : 171–204.
Stephens, D W, McLinn, C M and Stevens, J R (2002). “Discounting and reciprocity in an Iterated Prisoner’s Dilemma.” Science New York, N.Y.: 298(5601): 2216–2218, 00368075 DOI: https://doi.org/10.1126/science.1078498
Stewart, A J and Plotkin, J B (2012). “Extortion and cooperation in the Prisoner’s Dilemma” Proceedings of the National Academy of Sciences 109(26): 10134–10135, 0027-8424 DOI: https://doi.org/10.1073/pnas.1208087109
The Sage Developers (). Sage Mathematics Software (Version 7.0), http://www.sagemath.org 7.0.

[B1] Astropy Collaboration et al. (2013). “Astropy: A community Python package for astronomy” Astronomy and Astrophysics Oct. 2013558: A33. DOI: https://doi.org/10.1051/0004-6361/201322068 arXiv: 1307.6212 [as-tro-ph.IM].

[B2] Axelrod, R (1980). “Effective Choice in the Prisoner’s Dilemma” Journal of Conflict Resolution 24(1): 3–25, DOI: https://doi.org/10.1177/002200278002400301

[B3] Axelrod, R (1980). “More Effective Choice in the Prisoner’s Dilemma” Journal of Conflict Resolution 24(3): 379–403, 0022-0027 DOI: https://doi.org/10.1177/002200278002400301

[B4] Axelrod, R M (2006). The Evolution of Cooperation. Basic books.

[B5] Banks, J S and Sundaram, R K (1990). “Repeated games, finite automata, and complexity” Games and Economic Behavior 2(2): 97–117, 08998256 DOI: https://doi.org/10.1016/0899-8256(90)90024-O

[B6] Bendor, J, Kramer, R M and Stout, S (1991). “When in doubt …: Cooperation in a noisy prisoner’s dilemma” Journal of Conflict Resolution 35(4): 691–719, 0022-0027 DOI: https://doi.org/10.1177/0022002791035004007

[B7] Boyd, R and Lorberbaum, J P (1987). “No pure strategy is evolutionarily stable in the repeated Prisoner’s Dilemma game” Nature 327: 58–59, 0028-0836 DOI: https://doi.org/10.1038/327058a0

[B8] Chellapilla, K and Fogel, D B (1999). “Evolution, neural networks, games, and intelligence” Proceedings of the IEEE : 1471–1496, 87(9)00189219 DOI: https://doi.org/10.1109/5.784222

[B9] Crick, T et al. (2014). ““Share and Enjoy”: Publishing Useful and Usable Scientific Models” arXiv: 1409.0367.

[B10] David, B F (1993). “Evolving Behaviors in the Iterated Prisoner’s Dilemma” Evol. Comput : 77–97, 1(1)1063-6560 DOI: https://doi.org/10.1162/evco.1993.1.1.77

[B11] Doebeli, M and Hauert, C (2005). “Models of cooperation based on the Prisoner’s Dilemma and the Snowdrift game” Ecology Letters : 748–766, 8(7)1461023X DOI: https://doi.org/10.1111/j.1461-0248.2005.00773.x

[B12] Ellison, G (1994). “Cooperation in the prisoner’s dilemma with anonymous random matching” Review of Economic Studies (567): 588.61(3)00346527 DOI: https://doi.org/10.2307/2297904

[B13] Gotts, N, Polhill, J and Law, A (2003). “Agent-based simulation in the study of social dilemmas” Artificial Intelligence Review 19: 3–92, 0269-2821 DOI: https://doi.org/10.1023/A:1022120928602

[B14] Harper, M (2015). Marc Harper Codes, http://marcharper.codes/2015-11-17/ipd2.html

[B15] Hilbe, C, Nowak, M A and Traulsen, A (2013). “Adaptive Dynamics of Extortion and Compliance” PLoS ONE 8(11): e77886.1932-6203 DOI: https://doi.org/10.1371/journal.pone.0077886

[B16] Hong, N P C et al. (2015). “Top Tips to Make Your Research Irreproducible” : 5–6. arXiv: 1504.00062.

[B17] Hunter, J D (2007). “Matplotlib: A 2D graphics environment” Computing In Science & Engineering : 90–95, 9(3) DOI: https://doi.org/10.1109/MCSE.2007.55

[B18] Isaac, A (2008). “Simulating Evolutionary Games: A Python-Based Introduction” Journal of Artificial Societies and Social Simulation 11(3): 8.14607425

[B19] Jones, M (2015). Evolving strategies for an Iterated Prisoner’s Dilemma tournament, http://mojones.net/evolving-strategies-for-an-iterated-prisoners-dilemma-tournament.html

[B20] Kendall, G, Yao, X and Chong, S Y (2007). The iterated prisoners’ dilemma: 20 years on. World Scientific Publishing Co., Inc..

[B21] Knight, V (2015). http://vknight.org/unpeudemath/gametheory/2015/11/28/Experimenting-with-a-high-performing-evolved-strategy-in-other-environments.html

[B22] Koutsovoulos, G (2016). Optimising the LookerUp strategy for an Iterated Prisoner’s Dilemma tournament,

[B23] Kraines, D and Kraines, V (1989). “Pavlov and the prisoner’s dilemma” Theory and Decision 26(1): 47–79, 00405833 DOI: https://doi.org/10.1007/BF00134056

[B24] Lee, C, Harper, M and Fryer, D (2015). “The Art of War: Beyond Memory-one Strategies in Population Games” Plos One 10(3): e0120625.1932-6203 DOI: https://doi.org/10.1371/journal.pone.0120625

[B25] Li, J (2007). “How to design a strategy to win an IPD tournament” The iterated prisoners dilemma 20: 89–104, DOI: https://doi.org/10.1142/9789812770684_0004

[B26] Li, J, Hingston, P and Kendall, G (2011). “Engineering design of strategies for winning iterated prisoner’s dilemma competitions” Computational Intelligence and AI in Games, IEEE Transactions on 3(4): 348–360, DOI: https://doi.org/10.1109/tciaig.2011.2166268

[B27] Lorberbaum, J P (1994). “No strategy is evolutionarily stable in the repeated Prisoner’s Dilemma game” Journal of Theoretical Biology 168(2): 117–130, DOI: https://doi.org/10.1006/jtbi.1994.1092

[B28] Maclver, D R (2016). Hypothesis 3.0.3, https://github.com/DRMacIver/hypothesis

[B29] Maschler, M, Solan, E and Zamir, S (2013). Game theory. Cambridge University Press, 10039781107005488 DOI: https://doi.org/10.1017/CBO9780511794216

[B30] Mckelvey, R et al. (2006). Gambit: Software tools for game theory, Tech. rep..

[B31] McKinney, W (2010). van der Walt, S. and Millman, J. eds. “Data Structures for Statistical Computing in Python” Proceedings of the 9th Python in Science Conference. : 51–56.

[B32] Milgrom, P, Roberts, J and Wilson, R (1982). “Rational Cooperation in the Finitely Repeated Prisoners’ Dilemma” Journal of Economic Theory 252: 245–252.

[B33] Molander, P (1985). “The optimal level of generosity in a selfish, uncertain environment” The Journal of Conflict Resolution : 611–618, 29(4)0022-0027 DOI: https://doi.org/10.1177/0022002785029004004

[B34] Murnighan, J K et al. (1983). “Expecting Continued Play in Prisoner’s Dilemma Games” 27(2): 279–300.

[B35] Pedregosa, F et al. (2011). “Scikit-learn: Machine Learning in Python” Journal of Machine Learning Research 12: 2825–2830.

[B36] Press, W H and Dyson, F J (2012). “Iterated Prisoner’s Dilemma contains strategies that dominate any evolutionary opponent” Proceedings of the National Academy of Sciences 109(26): 10409–10413, 0027-8424 DOI: https://doi.org/10.1073/pnas.1206569109

[B37] Prlik, A and Procter, J B (2012). “Ten Simple Rules for the Open Development of Scientific Software” PLoS Computational Biology 8(12): e1002802.1553-7358 DOI: https://doi.org/10.1371/journal.pcbi.1002802

[B38] Sandve, G K et al. (2013). “Ten Simple Rules for Reproducible Computational Research” PLoS Computational Biology 9(10): 1–4, 1553734X DOI: https://doi.org/10.1371/journal.pcbi.1003285

[B39] Singer-Clark, T (2014). “Morality Metrics On Iterated Prisoners Dilemma Players”

[B40] Slany, W and Kienreich, W (2007). “On some winning strategies for the iterated prisoners dilemma” The iterated prisoners dilemma, : 171–204.

[B41] Stephens, D W, McLinn, C M and Stevens, J R (2002). “Discounting and reciprocity in an Iterated Prisoner’s Dilemma.” Science New York, N.Y.: 298(5601): 2216–2218, 00368075 DOI: https://doi.org/10.1126/science.1078498

[B42] Stewart, A J and Plotkin, J B (2012). “Extortion and cooperation in the Prisoner’s Dilemma” Proceedings of the National Academy of Sciences 109(26): 10134–10135, 0027-8424 DOI: https://doi.org/10.1073/pnas.1208087109

[B43] The Sage Developers (). Sage Mathematics Software (Version 7.0), http://www.sagemath.org 7.0.

Journal of Open Research Software

Software Metapapers

An Open Framework for the Reproducible Study of the Iterated Prisoner’s Dilemma

Abstract

(1) Overview

Introduction

Review of the literature

Implementation and architecture

Description of the Axelrod Python package

(2) Availability

Operating system

Programming language

Additional system requirements

Support

Dependencies

List of contributors

Software location

Archive

Code repository

Reuse potential

New strategies, tournaments and implications

Conclusion

Acknowledgements

Competing Interests

References