Expectation Maximization for Average Reward Decentralized POMDPs
J. Pajarinen, and J. Peltonen. Machine Learning and Knowledge Discovery in Databases, volume 8188 of Lecture Notes in Computer Science, Springer Berlin Heidelberg, (2013)
DOI: 10.1007/978-3-642-40988-2\_9