Publications

Point-based POMDP Solving with Factored Value Function Approximation

Tiago S. Veiga, Matthijs T. J. Spaan, and Pedro U. Lima. Point-based POMDP Solving with Factored Value Function Approximation. In Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, pp. 2512–2518, 2014.

Download

pdf [908.5kB]  

Abstract

Partially observable Markov decision processes (POMDPs) provide a principled mathematical framework for modeling autonomous decision-making problems. A POMDP solution is often represented by a value function comprised of a set of vectors. In the case of factored models, the size of these vectors grows exponentially with the number of state factors, leading to scalability issues. We consider an approximate value function representation based on a linear combination of basis functions. In particular, we present a backup operator that can be used in any point-based POMDP solver. Furthermore, we show how under certain conditions independence between observation factors can be exploited for large computational gains. We experimentally verify our contributions and show that they have the potential to improve point-based methods in policy quality and solution size.

BibTeX Entry

@InProceedings{Veiga14aaai,
  author =       {Tiago S. Veiga and Matthijs T. J. Spaan and Pedro
                  U. Lima},
  title =        {Point-based {POMDP} Solving with Factored Value
                  Function Approximation},
  booktitle =    {Proceedings of the Twenty-Eighth AAAI Conference on
                  Artificial Intelligence},
  year =         2014,
  pages =        {2512--2518}
}

Note: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

Generated by bib2html.pl (written by Patrick Riley) on Thu Feb 29, 2024 16:15:45 UTC