Changes

70 bytes added, 23:15, 14 December 2020 (Mon)
Line 755:
Given a state s and an action a, r and s are the reward and the state after executing (s, a),
and a' ranges over the set of actions.<ref>Torrey, L. Crowd Simulation Via Multi-agent Reinforcement Learning. In: ''Proceedings of the Sixth AAAI Conference On Artificial Intelligence and Interactive Digital Entertainment''. AAAI Press, Menlo Park (2010)</ref>
--[[用户:WildBoar|WildBoar]] ([[用户讨论:WildBoar|talk]]) In the sentence about r and s, I think s should be s'.
+ --[[用户:Vicky|Vicky]] ([[用户讨论:Vicky|talk]]) Agreed, it should be s'.
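
To illustrate the point raised in the comments above, here is a minimal sketch of a standard tabular Q-learning update in which the state reached after executing (s, a) is written s' (`s_next` in the code), so that it stays distinct from s. The function name `q_update`, the learning rate `alpha`, the discount factor `gamma`, and the toy states and actions are assumptions made for illustration; none of them come from the article or from the cited paper.

```python
from collections import defaultdict


def q_update(Q, s, a, r, s_next, actions, alpha=0.5, gamma=0.9):
    """One tabular Q-learning step for the transition (s, a) -> (r, s_next).

    alpha and gamma are illustrative defaults, not values from the article.
    """
    # Best estimated value over every candidate action a' in the next state s'.
    best_next = max(Q[(s_next, a_next)] for a_next in actions)
    # Move Q(s, a) toward the target r + gamma * max_a' Q(s', a').
    Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])
    return Q[(s, a)]


# Hypothetical toy setup: two states, two actions, all values start at 0.
Q = defaultdict(float)
actions = ["left", "right"]
Q[("s1", "right")] = 2.0  # pretend s1 already has a known good action

value = q_update(Q, "s0", "left", r=1.0, s_next="s1", actions=actions)
print(value)
```

Note that the maximum is taken over the actions available in `s_next`, not in `s`; using s in both places would make the update refer to the state the agent has already left.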
    
== Crowd rendering and animation ==