更改

囚徒困境 (查看源代码)

2021年1月25日 (一) 19:49的版本

删除5字节、 2021年1月25日 (一) 19:49

→‎Stochastic iterated prisoner's dilemma

第567行：第567行：

===Stochastic iterated prisoner's dilemma===

−

~~随机的重复囚徒困境~~

+

随机重复囚徒困境

第643行：第643行：

虽然勒索零决定策略在人口众多的情况下并不稳定，但另一种宽松的零决定策略既稳定又稳健。事实上，当人口不算太少的时候，这些策略可以取代任何其他零决定策略，甚至在一系列针对重复囚徒困境的广泛通用策略（包括“获胜-保持-输”的转换策略）中表现良好。亚历山大·斯图尔特 Alexander Stewart和约书亚·普洛特金 Joshua Plotkin在2013年的捐赠博弈中证明了这一点。<ref name=Stewart2013>{{cite journal|last=Stewart|first=Alexander J.|author2=Joshua B. Plotkin|title=From extortion to generosity, evolution in the Iterated Prisoner's Dilemma|journal=[[Proceedings of the National Academy of Sciences of the United States of America]]|year=2013|doi=10.1073/pnas.1306246110|pmid=24003115|volume=110|issue=38|pages=15348–53|bibcode=2013PNAS..11015348S|pmc=3780848}}</ref>宽松的策略会与其他合作的玩家合作，面对背叛，慷慨的玩家比他的对手失去更多的效用。宽松策略是零决定策略和所谓的“好”策略的交集，阿金(2013) <ref name=Akin2013>{{cite arxiv|last=Akin|first=Ethan|title=Stable Cooperative Solutions for the Iterated Prisoner's Dilemma|year=2013|page=9|class=math.DS|eprint=1211.0969}} {{bibcode|2012arXiv1211.0969A}}</ref> Among good strategies, the generous (ZD) subset performs well when the population is not too small. If the population is very small, defection strategies tend to dominate.将这两种策略定义为玩家对过去的相互合作作出回应，并在至少获得合作预期收益的情况下平均分配预期收益的策略。在好的策略中，当总体不太小时，宽松(零决定)子集表现良好。如果总体很少，背叛策略往往占主导地位。<ref name=Stewart2013 />

−

===Continuous iterated prisoner's dilemma===

Vicky

99

个编辑