Decentralized partially observable Markov decision process

Revision as of 09:16, 24 April 2022 (Sunday)

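A decentralized partially observable Markov decision process (Dec-POMDP)^[1]^[2] is a model for coordinated decision-making among multiple agents. It is a probabilistic model that can account for uncertainty in action outcomes, sensors, and communication in real-world settings.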

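The model generalizes the Markov decision process (MDP) and the partially observable Markov decision process (POMDP) to settings with multiple decentralized agents.^[3]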
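To illustrate the generalization, a Dec-POMDP is commonly described by a set of agents, a set of states, one action set and one observation set per agent, a joint transition function, a joint observation function, and a single reward shared by all agents. The following is a minimal Python sketch under that formulation. The class name `DecPOMDP`, its field names, and the toy two-agent problem built by `make_toy_model` are hypothetical and are not taken from the cited sources.

```python
import random
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

State = str
JointAction = Tuple[str, ...]   # one action per agent
JointObs = Tuple[str, ...]      # one observation per agent


@dataclass
class DecPOMDP:
    """Hypothetical container for a finite Dec-POMDP (illustrative names only)."""
    n_agents: int
    states: List[State]
    actions: List[List[str]]        # actions[i] is the action set of agent i
    observations: List[List[str]]   # observations[i] is the observation set of agent i
    transition: Callable[[State, JointAction], Dict[State, float]]
    observe: Callable[[JointAction, State], Dict[JointObs, float]]
    reward: Callable[[State, JointAction], float]  # one reward shared by all agents

    def step(self, state: State, joint_action: JointAction, rng: random.Random):
        """Sample the next state and joint observation; return them with the shared reward."""
        r = self.reward(state, joint_action)
        next_dist = self.transition(state, joint_action)
        next_state = rng.choices(list(next_dist), weights=list(next_dist.values()))[0]
        obs_dist = self.observe(joint_action, next_state)
        joint_obs = rng.choices(list(obs_dist), weights=list(obs_dist.values()))[0]
        return next_state, joint_obs, r


def make_toy_model() -> DecPOMDP:
    """Hypothetical two-agent, two-state problem used only for illustration."""
    flip = {"left": "right", "right": "left"}

    def transition(state, joint_action):
        # The state flips only when both agents choose "switch".
        if joint_action == ("switch", "switch"):
            return {flip[state]: 1.0}
        return {state: 1.0}

    def observe(joint_action, next_state):
        # Agent 0 sees the new state; agent 1 receives an uninformative observation.
        return {(next_state, "none"): 1.0}

    def reward(state, joint_action):
        # Shared reward: 1 when both agents pick the same action, otherwise 0.
        return 1.0 if joint_action[0] == joint_action[1] else 0.0

    return DecPOMDP(
        n_agents=2,
        states=["left", "right"],
        actions=[["stay", "switch"], ["stay", "switch"]],
        observations=[["left", "right"], ["none"]],
        transition=transition,
        observe=observe,
        reward=reward,
    )


model = make_toy_model()
next_state, joint_obs, r = model.step("left", ("switch", "switch"), random.Random(0))
print(next_state, joint_obs, r)
```

In this sketch, setting `n_agents` to 1 gives the structure of a POMDP, and if that single agent's observation always reveals the state, the structure of an MDP.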

References

[1] Bernstein, Daniel S.; Givan, Robert; Immerman, Neil; Zilberstein, Shlomo. The Complexity of Decentralized Control of Markov Decision Processes. Math. Oper. Res. November 2002, 27 (4): 819–840. ISSN 0364-765X. S2CID 1195261. arXiv:1301.3836. doi:10.1287/moor.27.4.819.297.
[2] Oliehoek, Frans A.; Amato, Christopher. A Concise Introduction to Decentralized POMDPs. SpringerBriefs in Intelligent Systems. 2016. ISBN 978-3-319-28927-4. S2CID 3263887. doi:10.1007/978-3-319-28929-8.
[3] Oliehoek, Frans A.; Amato, Christopher. A Concise Introduction to Decentralized POMDPs. Springer. 2016-06-03. ISBN 978-3-319-28929-8.
