papers

Other sources: HAL, DBLP, Google Scholar.

Journal papers

Heuristic Search Value Iteration for zero-sum Stochastic Games.
O. Buffet, J. Dibangoye, A. Saffidine, and V. Thomas.
IEEE Transactions on Games, 2020. [www] [bib]
Goal Probability Analysis in MDP Probabilistic Planning:
Exploring and Enhancing the State of the Art.
M. Steinmetz, J. Hoffmann, and O. Buffet.
Journal of Artificial Intelligence Research, volume 57, 2016, p. 229-271. [www] [pdf] [bib]
Optimally Solving Dec-POMDPs as Continuous-State MDPs.
J. Dibangoye, C. Amato, O. Buffet and F. Charpillet.
Journal of Artificial Intelligence Research, volume 55, 2016, p. 443-497. [www] [pdf] [bib]
Intersections intelligentes pour le contrôle de véhicules sans pilote.
Coordination locale et optimisation globale.
M. Tlig, O. Buffet, O. Simonin.
Revue d'Intelligence Artificielle (RIA), volume 30(3), 2016, p. 353-382. [www] [pdf] ^[ria] [bib]
The Factored Policy Gradient Planner.
O. Buffet and D. Aberdeen.
Artificial Intelligence, volume 173(5-6), 2009, p. 722-747. [www] [bib]
Reachability Analysis for Uncertain SSPs.
O. Buffet.
International Journal on Artificial Intelligence Tools, volume 16(4), 2007, p. 725-749. [bib]
Shaping Multi-Agent Systems with Gradient Reinforcement Learning.
O. Buffet, A. Dutech, F. Charpillet.
Autonomous Agents and Multi-Agent Systems Journal, volume 15(2), 2007, p. 197-220. [click me] [bib]
Etude de différentes combinaisons de comportements adaptatives.
O. Buffet, A. Dutech, F. Charpillet.
Revue d'Intelligence Artificielle (RIA), volume 20(2-3), 2006, p. 311-344. [pdf] ^[ria] [bib]
Développement autonome des comportements de base d'un agent.
O. Buffet, A. Dutech, F. Charpillet.
Revue d'Intelligence Artificielle (RIA), volume 19(4-5), 2005, p. 603-632.
(extension of the paper of the same title published at the CAp'04 conference) [pdf] ^[ria] [bib]
Nidification hivernale de la chouette hulotte (strix haluco) en Lorraine.
O. Buffet, Y. Muller.
CICONIA 27 (3), 2003, p. 129-130. [bib]

^[ria] Note that Revue d'Intelligence Artificielle (RIA) has been sold by Elsevier (cf. here and there), so "pre-sale" papers are no longer accessible (e.g., DOI links are broken).

International conferences and workshops

Solving infinite-horizon Dec-POMDPs using Finite State Controllers within JESP.
Y. You, V. Thomas, F. Colas, and O. Buffet.
Proceedings of the 33rd International Conference on Tools with Artificial Intelligence (ICTAI-21), held virtually, 2021. [pdf] [bib]
K-N-MOMDPs: Towards Interpretable Solutions for Adaptive Management.
J. Ferrer-Mestres, T. G. Dietterich, O. Buffet, and I. Chadès.
Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI-21), held virtually, 2021. [doi] [bib]
Solving K-MDPs.
J. Ferrer-Mestres, T. G. Dietterich, O. Buffet, and I. Chadès.
Proceedings of the 30th International Conference on Automated Planning and Scheduling (ICAPS-20), not Nancy, France, 2020. [pdf] [doi] [bib]
Monte Carlo Information-Oriented Planning.
V. Thomas, G. Hutin, and O. Buffet.
Proceedings of the 24th European Conference on Artificial Intelligence (ECAI-20), not Santiago de Compostela, Spain, 2020. [pdf] [bib]
Optimally Solving Two-Agent Decentralized POMDPs Under One-Sided Information Sharing.
Y. Xie, J. Dibangoye, and O. Buffet.
Proceedings of the 37th International Conference on Machine Learning (ICML-20), not Vienna, Austria, 2020. [www] [pdf] [bib]
rho-POMDPs have Lipschitz-Continuous epsilon-Optimal Value Functions.
M. Fehr, O. Buffet, V. Thomas, and J. Dibangoye.
Advances in Neural Information Processing Systems 31 (NIPS-18), Montreal, Canada, 2018. [www] [pdf] [suppl.zip] [bib]
Learning to Act in Decentralized Partially Observable MDPs.
J. Dibangoye and O. Buffet.
PMLR, Proceedings of the 35th International Conference on Machine Learning (ICML-18), Stockholm, Sweden, 2018. [www] [pdf] [bib]
Revisiting Goal Probability Analysis in Probabilistic Planning.
M. Steinmetz, J. Hoffmann, and O. Buffet.
Proceedings of the 26th International Conference on Automated Planning and Scheduling (ICAPS'16), London, UK, 2016. [pdf] [pdf-tr] [bib]
Structural Results for Cooperative Decentralized Control Models.
J. Dibangoye, O. Buffet, O. Simonin.
Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI-15), Buenos-Aires, Argentina, 2015. [pdf] [bib]
Exploiting separability in multiagent planning with continuous-state MDPs (extended abstract).
J. Dibangoye, C. Amato, O. Buffet, F. Charpillet.
Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI-15) [Best Papers From Sister Conferences Track], Buenos-Aires, Argentina, 2015. [pdf] [bib]
Error-bounded Approximations for Infinite-Horizon Discounted Decentralized POMDPs.
J. Dibangoye, O. Buffet, F. Charpillet.
Proceedings of the European Conference on Machine Learning (ECML/PKDD-14), Nancy, France, 2014. [pdf] [bib]
Learning Pruning Rules for Heuristic Search Planning.
M. Krajňanský, J. Hoffmann, O. Buffet, A. Fern.
Proceedings of the Twenty-first European Conference on Artificial Intelligence (ECAI-14), Prague, Czech Republic, 2014. [pdf] [bib]
Simultaneous Tracking and Activity Recognition (STAR) using Advanced Agent-Based Behavioral Simulations.
A. Fansi Tchango, V. Thomas, O. Buffet, F. Flacher, A. Dutech.
Proceedings of the Twenty-first European Conference on Artificial Intelligence (ECAI-14), Prague, Czech Republic, 2014. [pdf] [bib]
Stop-Free Strategies for Traffic Networks: Decentralized On-line Optimization.
M. Tlig, O. Buffet, O. Simonin.
Proceedings of the Twenty-first European Conference on Artificial Intelligence (PAIS/ECAI-14), Prague, Czech Republic, 2014. [pdf] [bib]
Towards the Usage of Advanced Behavioral Simulations for Simultaneous Tracking and Activity Recognition.
A. Fansi Tchango, V. Thomas, O. Buffet, F. Flacher, A. Dutech.
Proceedings of the Seventh European Starting AI Researcher Symposium (STAIRS-14), Prague, Czech Republic, 2014. [pdf] [bib]
Tracking Multiple Interacting Targets Using a Joint Probabilistic Data Association Filter.
A. Fansi Tchango, V. Thomas, O. Buffet, A. Dutech, F. Flacher.
Proceedings of the Seventeenth International Conference on Information Fusion (Fusion-14), Salamanca, Spain, 2014. [pdf] [bib]
Decentralized Traffic Management: A Synchronization-Based Intersection Control.
M. Tlig, O. Buffet, O. Simonin.
Proceedings of the Third International Conference on Advanced Logistics and Transport (ICALT-14) / Symposium on Intelligent Transportation Systems (ITS), Hammamet, Tunisia, 2014. [pdf] [bib]
Exploiting separability in multi-agent planning with continuous-state MDPs.
J. Dibangoye, C. Amato, O. Buffet, F. Charpillet.
Proceedings of the Thirteenth International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS-14), Paris, France, 2014. [Best Paper] [pdf] [bib]
Simulation-Based Behavior Tracking of Pedestrians in Partially Observed Indoor Environments.
A. Fansi Tchango, V. Thomas, O. Buffet, F. Flacher, A. Dutech.
Proceedings of the Thirteenth International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS-14), Paris, France, 2014. [pdf] [bib]
Optimally Solving Dec-POMDPs as Continuous-State MDPs.
J. Dibangoye, C. Amato, O. Buffet, F. Charpillet.
Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence (IJCAI-13), Beijing, China, 2013. [pdf] [bib] [French version published in JFPDA-13]
Adaptive Management of Migratory Birds under Sea Level Rise.
S. Nicol, T. Iwamura, O. Buffet, I. Chadès.
Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence (IJCAI-13), Beijing, China, 2013. [pdf] [bib]
Abstraction Pathologies in Markov Decision Processes.
M. Tagorti, B. Scherrer, O. Buffet, J. Hoffmann.
Proceedings of the ICAPS-13 workshop on Heuristics and Search for Domain-Independent Planning (HSDIP-13), Roma, Italy, 2013. [pdf] [bib] [also published in JFPDA-13]
Reactive coordination rules for traffic optimization in road sharing problems.
M. Tlig, O. Buffet, O. Simonin.
Proceedings of the PAAMS Workshop on Agent-based Approaches for the Transportation Modelling and Optimisation (AATMO-13), Salamanca, Spain, 2013. [pdf] [bib]
Optimistic Heuristics for MineSweeper.
W.-T. Lin, O. Buffet, C.-S. Lee, O. Teytaud.
Proceedings of the International Computer Symposium (ICS-12), Hualien, Taiwan, 2012. [pdf] [bib]
Cooperative Behaviors for the Self-Regulation of Autonomous Vehicles in Space Sharing Conflicts.
M. Tlig, O. Buffet, O. Simonin.
Proceedings of the Twenty-Fourth International Conference on Tools with Artificial Intelligence (ICTAI-12), Athens, Greece, 2012. [pdf] [bib]
Near-Optimal BRL using Optimistic Local Transitions.
M. Araya-López, V. Thomas, O. Buffet.
Proceedings of the Twenty-Ninth International Conference on Machine Learning (ICML-12), Edinburgh, Scotland, 2012. [pdf] [www] [bib]
MOMDPs: a Solution for Modelling Adaptive Management Problems.
I. Chadès, J. Carwardine, T.G. Martin, S. Nicol, R. Sabbadin, O. Buffet.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence (AAAI-12), Toronto, Canada, 2012. [pdf] [bib]
POMDPs Make Better Hackers: Accounting for Uncertainty in Penetration Testing.
C. Sarraute, O. Buffet, J. Hoffmann.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence (AAAI-12), Toronto, Canada, 2012. [pdf] [www] [bib]
Optimal Priority Assignment Algorithms for Probabilistic Real-Time Systems.
D. Maxim, O. Buffet, L. Santinelli, L. Cucu-Grosjean, R. Davis.
Proceedings of the 19th International Conference on Real-Time and Network Systems (RTNS-11), Nantes, France, 2011. [pdf] [bib]
Active Learning of MDP Models.
M. Araya-López, O. Buffet, V. Thomas, F. Charpillet.
Proceedings of the Ninth European Workshop on Reinforcement Learning (EWRL-11), Athens, Greece, 2011. [pdf] [bib]
Formalizing and Solving Information Collection Problems with Autonomous Sensor Systems.
M. Godichaud, E. Chanthery, O. Buffet, M. Contat.
Proceedings of the Eighteenth IFAC World congress (IFAC-11), Milano, Italy, 2011. [pdf] [bib]
Penetration Testing == POMDP Solving?
C. Sarraute, O. Buffet, J. Hoffmann.
Working Notes for the 2011 IJCAI Workshop on Intelligent Security (SecArt-11), Barcelona, Spain, 2011. [pdf] [bib]
Impact of job dropping on the probabilistic schedulability of uniprocessor deterministic real-time systems.
O. Buffet, L. Cucu-Grosjean.
Online Proceedings of the Workshop EVOLVE - A bridge between Probability, Set Oriented Numerics and Evolutionary Computation, Bourglinster Castle, Luxembourg, 2011. [pdf] [bib]
A POMDP Extension with Belief-dependent Rewards.
M. Araya-López, O. Buffet, V. Thomas, F. Charpillet.
Advances in Neural Information Processing Systems 23 (NIPS-10), Vancouver, Canada, 2010.
[the extended version has been published as INRIA research report #7433] [pdf] [ext.pdf] [poster] [bib]
A Closer Look at MOMDPs.
M. Araya-López, V. Thomas, O. Buffet, F. Charpillet.
Proceedings of the Twenty-Second IEEE International Conference on Tools with Artificial Intelligence (ICTAI-10), Arras, France, 2010. [pdf] [bib]
From "I like" to "I prefer" in Collaborative Filtering.
A. Brun, A. Hamad, O. Buffet, A. Boyer.
Proceedings of the Twenty-Second IEEE International Conference on Tools with Artificial Intelligence (ICTAI-10), Arras, France, 2010. [Poster presentation.] [pdf] [bib]
Towards Preference Relations in Recommender Systems.
A. Brun, A. Hamad, O. Buffet and A. Boyer.
ECML/PKDD Workshop on Preference Learning (PL-10), Barcelona, Spain, 2010. [pdf] [bib]
Impact of job dropping on the schedulability of uniprocessor probabilistic real-time systems with variable execution times.
O. Buffet, L. Cucu-Grosjean.
Proceedings of the First International Real-Time Scheduling Open Problems Seminar (RTSOPS 2010), joint workshop with the 22nd Euromicro International Conference on Real-Time Systems (ECRTS 2010), Brussels, Belgium, 2010. [pdf] [bib]
All that Glitters is not Gold: Using Landmarks for Reward Shaping in FPG.
O. Buffet and J. Hoffmann.
Proceedings of the ICAPS'10 Workshop on Planning and Scheduling under Uncertainty (PSUWS), Toronto, Canada, 2010. [pdf] [bib]
Influence of Different Execution Models on Patrolling Ant Behaviors: from Agents to Robots.
A. Glad, O. Simonin, O. Buffet, F. Charpillet.
Proceedings of the Ninth International Conference on Autonomous Agents and MultiAgent Systems (AAMAS'10), Toronto, Canada, 2010. [pdf] [bib]
Global Multiprocessor Real-Time Scheduling as a Constraint Satisfaction Problem.
L. Cucu-Grosjean, O. Buffet.
Proceedings of the ICPP'09 Workshop on Real-time systems on multicore platforms: Theory and Practice (XRTS'09), Vienna, Austria, 2009. [pdf] [bib]
Self-Organization of Patrolling-Ant Algorithms.
A. Glad, O. Buffet, O. Simonin, F. Charpillet.
Proceedings of the Third International Conference on Self-Adaptive and Self-Organizing Systems (SASO'09), San Francisco, CA, USA, 2009. [pdf] [bib]
Theoretical Study of Ant-based Algorithms for Multi-Agent Patrolling.
A. Glad, O. Simonin, O. Buffet, F. Charpillet.
Proceedings of the Eighteenth European Conference on Artificial Intelligence (ECAI'08), Patras, Greece, 2008. [pdf] [bib]
FF+FPG: Guiding a Policy-Gradient Planner.
O. Buffet, D. Aberdeen.
Proceedings of the Seventeenth International Conference on Automated Planning and Scheduling (ICAPS'07), Providence, USA, 2007. [pdf] [bib]
Temporal Probabilistic Planning with Policy-Gradients.
D. Aberdeen, O. Buffet.
Proceedings of the Seventeenth International Conference on Automated Planning and Scheduling (ICAPS'07), Providence, USA, 2007. [pdf] [bib]
Policy-Gradients for PSRs and POMDPs.
D. Aberdeen, O. Buffet, O. Thomas.
Proceedings of the Eleventh International Conference on Artificial Intelligence and Statistics (AISTATS'07), San Juan, Puerto Rico, 2007. [pdf] [bib]
Factored Planning using Decomposition Trees.
A. Kelareva, O. Buffet, J. Huang, S. Thiébaux.
Proceedings of the Twentieth International Joint Conference on Artificial Intelligence (IJCAI'07), Hyderabad, India, 2007. [pdf] [bib]
Policy-Gradient for Robust Planning.
O. Buffet, D. Aberdeen.
Proceedings of the ECAI'06 Workshop on Planning, Learning and Monitoring with Uncertainty and Dynamic Worlds (PLMUDW'06), Riva del Garda, Italy, 2006. [pdf] [bib]
The Factored Policy Gradient planner (IPC-06 Version). [winner of the probabilistic track]
O. Buffet, D. Aberdeen.
Proceedings of the Fifth International Planning Competition, Lake District, Cumbria, UK, 2006. [pdf] [bib]
Reachability Analysis for Uncertain SSPs.
O. Buffet.
Proceedings of the Seventeenth IEEE International Conference on Tools with Artificial Intelligence (ICTAI'05), Hong-Kong, China, 2005. [pdf] [bib]
A Two-Teams Approach for Robust Probabilistic Temporal Planning.
O. Buffet, D. Aberdeen.
Proceedings of the ECML'05 workshop on Reinforcement Learning in Non-Stationary Environments, Porto, Portugal, 2005. [pdf] [bib]
Robust Planning with (L)RTDP.
O. Buffet, D. Aberdeen.
Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence (IJCAI'05), Edinburgh, Scotland, 2005. [pdf] [bib]
Fast Reachability Analysis for Uncertain SSPs.
O. Buffet.
Proceedings of the IJCAI'05 Workshop on Planning and Learning in A Priori Unknown or Dynamic Domains, Edinburgh, Scotland, 2005. [pdf] [bib]
Simulation Methods for Uncertain Decision-Theoretic Planning.
D. Aberdeen, O. Buffet.
Proceedings of the IJCAI'05 Workshop on Planning and Learning in A Priori Unknown or Dynamic Domains, Edinburgh, Scotland, 2005. [pdf] [bib]
Dynamic Programming using Quantum Search for Optimizing Petri Net Models.
S. Naguleswaran, L. White, O. Buffet.
The IFORS Triennial 2005 Conference, Hawaii, USA, 2005. [bib]
Self-Growth of Basic Behaviors in an Action Selection Based Agent.
O. Buffet, A. Dutech, F. Charpillet.
From Animals to Animats 8: Proceedings of the Eighth International Conference on Simulation of Adaptive Behavior (SAB'04), Los Angeles, CA, USA, 2004. [ps.gz] [pdf] [bib]
A Self-Made Agent Based on Action-Selection.
O. Buffet, A. Dutech.
Proceedings of the Sixth European Workshop on Reinforcement Learning (EWRL'03), Nancy, France, 2003. [ps.gz] [pdf] [bib]
Automatic Generation of an Agent's Basic Behaviors.
O. Buffet, A. Dutech, F. Charpillet.
Proceedings of the Second International Joint Conference on Autonomous Agents & Multi-Agent Systems (AAMAS'03), Melbourne, Australia, 2003. [ps.gz] [pdf] [bib]
Adaptive Combination of Behaviors in an Agent.
O. Buffet, A. Dutech, F. Charpillet.
Proceedings of the Fifteenth European Conference on Artificial Intelligence (ECAI'02), Lyon, France, 2002. [ps.gz] [pdf] [bib]
Learning to weigh basic behaviors in Scalable Agents.
O. Buffet, A. Dutech, F. Charpillet.
Proceedings of the First International Joint Conference on Autonomous Agents & Multi-Agent Systems (AAMAS'02), Bologna, Italy, 2002. [Poster presentation.] [ps.gz] [pdf] [bib]
Looking for Scalable Agents.
O. Buffet, A. Dutech.
Proceedings of the Fifth European Workshop on Reinforcement Learning (EWRL'01), Utrecht, Netherlands, 2001. [ps.gz] [pdf] [bib]
Multi-Agent Systems by Incremental Gradient Reinforcement Learning.
A. Dutech, O. Buffet, F. Charpillet.
Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence (IJCAI'01), Seattle, 2001. [pdf] [ps.gz] [bib]
Incremental Reinforcement Learning for designing Multi-Agent Systems.
O. Buffet, A. Dutech, F. Charpillet.
Proceedings of the Fifth International Conference on Autonomous Agents (Agents'01), Montréal, 2001. [Poster presentation.] [ps.gz] [pdf] [bib]

National conferences and workshops

HSVI pour zs-POSG usant de propriétés de convexité, concavité, et Lipschitz-continuité.
A. Delage, O. Buffet, J. Dibangoye.
Actes des seizièmes journées francophones planification, décision, apprentissage pour la conduite de systèmes (JFPDA-21), not Bordeaux, France, 2021. [bib]
Résolution de Dec-POMDP à horizon infini à l’aide de contrôleurs à états finis dans JESP.
Y. You, V. Thomas, F. Colas, O. Buffet.
Actes des seizièmes journées francophones planification, décision, apprentissage pour la conduite de systèmes (JFPDA-21), not Bordeaux, France, 2021. [bib]
Multiagent Planning and Learning As Mixed-Integer Linear Programming.
J. Dibangoye, O. Buffet.
Actes des quinzièmes journées francophones planification, décision, apprentissage pour la conduite de systèmes (JFPDA-20), not Angers, France, 2020. [bib]
Sur le principe d’optimalité de Bellman pour les zs-POSG.
O. Buffet, J. Dibangoye, A. Delage, A. Saffidine, V. Thomas.
Actes des quinzièmes journées francophones planification, décision, apprentissage pour la conduite de systèmes (JFPDA-20), not Angers, France, 2020. [bib] [long version in arXiv/CoRR-20]
Planification Monte Carlo orientée information.
V. Thomas, G. Hutin, O. Buffet.
Actes des quatorzièmes journées francophones planification, décision, apprentissage pour la conduite de systèmes (JFPDA-19), Toulouse, France, 2019. [bib]
Recherche heuristique pour jeux stochastiques (à somme nulle).
O. Buffet, J. Dibangoye, A. Saffidine, V. Thomas.
Actes des treizièmes journées francophones planification, décision, apprentissage pour la conduite de systèmes (JFPDA-18), Nancy, France, 2018. [bib]
Learning to Act in Decentralized Partially Observable MDPs.
J. Dibangoye and O. Buffet.
Actes des treizièmes journées francophones planification, décision, apprentissage pour la conduite de systèmes (JFPDA-18), Nancy, France, 2018. [bib] [also published in ICML-18]
MDP s-lipschitziens et ρ-POMDP non-convexes.
O. Buffet, V. Thomas and J. Dibangoye.
Actes des douzièmes journées francophones planification, décision, apprentissage pour la conduite de systèmes (JFPDA-17), Caen, France, 2017. [bib] [fixed and published in NIPS-18]
Revisiting Goal Probability Analysis in Probabilistic Planning.
M. Steinmetz, J. Hoffmann, and O. Buffet.
Actes des onzièmes journées francophones planification, décision, apprentissage pour la conduite de systèmes (JFPDA-16), Grenoble, France, 2016. [bib] [also published in ICAPS-16]
Résultats structurels pour les modèles de contrôle décentralisé coopératif.
J. Dibangoye, O. Buffet, O. Simonin.
Actes des dixièmes journées francophones planification, décision, apprentissage pour la conduite de systèmes (JFPDA-15), Rennes, France, 2015. [bib] [also published in IJCAI-15]
Learning pruning rules for heuristic search planning.
M. Krajňanský, J. Hoffmann, O. Buffet, A. Fern.
Actes des neuvièmes journées francophones planification, décision, apprentissage pour la conduite de systèmes (JFPDA-14), Liège, Belgium, 2014. [bib] [also published in ECAI-14]
Abstraction Pathologies in Markov Decision Processes.
M. Tagorti, B. Scherrer, O. Buffet, J. Hoffmann.
Actes des huitièmes journées francophones planification, décision, apprentissage pour la conduite de systèmes (JFPDA-13), Lille, France, 2013. [pdf] [bib] [also published in HSDIP-13]
Active Diagnosis Through Belief-lookahead Information Gathering.
M. Araya-López, O. Buffet, V. Thomas.
Actes des huitièmes journées francophones planification, décision, apprentissage pour la conduite de systèmes (JFPDA-13), Lille, France, 2013. [pdf] [bib]
Résoudre des Dec-POMDP optimalement comme des MDP à espace d'états continu.
J. Dibangoye, C. Amato, O. Buffet, F. Charpillet.
Actes des huitièmes journées francophones planification, décision, apprentissage pour la conduite de systèmes (JFPDA-13), Lille, France, 2013. [pdf] [bib] [English version presented at IJCAI-13]
Synchronisation de véhicules autonomes aux croisements d'un réseau de routes.
M. Tlig, O. Buffet, O. Simonin.
Démonstration - Vingt-et-unième Journées francophones sur les systèmes multi-agents (JFSMA-13), Lille, France, 2013. [bib]
Synchronisation de véhicules autonomes aux croisements d'un réseau de routes.
M. Tlig, O. Buffet, O. Simonin.
Actes des onzièmes rencontres jeunes chercheurs en intelligence artificielle (RJCIA-13), Lille, France, 2013. [Best paper] [pdf] [bib]
BRL Quasi-Optimal à l'aide de Transitions Locales Optimistes.
M. Araya-López, V. Thomas, O. Buffet.
Actes des septièmes journées francophones planification, décision, apprentissage pour la conduite de systèmes (JFPDA-12), Nancy, France, 2012. [pdf] [bib]
Les POMDP font de meilleurs hackers: Tenir compte de l'incertitude dans les tests de pénétration.
C. Sarraute, O. Buffet, J. Hoffmann.
Actes des septièmes journées francophones planification, décision, apprentissage pour la conduite de systèmes (JFPDA-12), Nancy, France, 2012. [pdf] [bib]
Les POMDP: une solution pour modéliser des problèmes de gestion adaptative en biologie de la conservation.
I. Chadès, J. Carwardine, T.G. Martin, S. Nicol, O. Buffet.
Actes des sixièmes journées francophones planification, décision, apprentissage pour la conduite de systèmes (JFPDA'11), Rouen, France, 2011. [pdf] [bib]
Apprentissage actif de modèle de MDP.
M. Araya-López, O. Buffet, V. Thomas, F. Charpillet.
Actes des sixièmes journées francophones planification, décision, apprentissage pour la conduite de systèmes (JFPDA'11), Rouen, France, 2011. [pdf] [bib]
Une extension des POMDP avec des récompenses dépendant de l'état de croyance.
M. Araya-López, O. Buffet, V. Thomas, F. Charpillet.
Actes de la conférence francophone sur l'apprentissage automatique (CAp'11), Chambéry, France, 2011.
[French version of the NIPS-10 paper | best paper CAp'11] [pdf] [bib]
Apprentissage par Renforcement Développemental en Robotique Autonome.
L. Sarzyniec, O. Buffet, A. Dutech.
Actes de la conférence francophone sur l'apprentissage automatique (CAp'11), Chambéry, France, 2011. [pdf] [bib]
Recherche systématique pour l'ordonnancement temps réel global multiprocesseur.
O. Buffet, L. Cucu-Grosjean.
Actes du douzième congrès annuel de la Société française de Recherche Opérationnelle et d'Aide à la Décision (ROADEF'11), Saint-Etienne, France, 2011. [pdf] [bib]
Formalisation et résolution de problèmes d'acquisition d'informations par des systèmes autonomes.
M. Godichaud, E. Chanthery, O. Buffet, M. Contat.
Actes du douzième congrès annuel de la Société française de Recherche Opérationnelle et d'Aide à la Décision (ROADEF'11), Saint-Etienne, France, 2011. [pdf] [bib]
Des POMDPs avec des variables d'état visibles.
M. Araya-López, V. Thomas, O. Buffet, F. Charpillet.
Actes des cinquièmes journées francophones planification, décision, apprentissage pour la conduite de systèmes (JFPDA'10), Besançon, France, 2010. [pdf] [bib]
Vers l'utilisation de relations de préférence pour le filtrage collaboratif.
A. Brun, A. Hamad, O. Buffet, A. Boyer.
Actes du dix-septième congrès francophone AFRIF-AFIA sur la Reconnaissance des Formes et l'Intelligence Artificielle (RFIA'10), Caen, France, 2010. [pdf] [bib]
Auto-organisation dans les algorithmes fourmis pour la patrouille multi-agent.
A. Glad, O. Buffet, O. Simonin, F. Charpillet.
Actes des quatrièmes journées francophones planification, décision, apprentissage pour la conduite de systèmes (JFPDA'09), Paris, France, 2009. [pdf] [bib]
FF+FPG: Guider un planificateur basé sur une méthode de gradient.
O. Buffet, D. Aberdeen.
Actes des deuxièmes journées francophones planification, décision, apprentissage pour la conduite de systèmes (JFPDA'07), Grenoble, France, 2007. [bib]
Planification robuste à l'aide d'une montée de gradient.
O. Buffet, D. Aberdeen.
Actes de la Conférence d'Apprentissage (CAp'06), Trégastel, France, 2006. [pdf] [bib]
Planification Robuste avec (L)RTDP.
O. Buffet, D. Aberdeen.
Actes de la Conférence d'Apprentissage (CAp'05), Nice, France, 2005. [French version of IJCAI'05 paper] [pdf] [bib]
Développement autonome des comportements de base d'un agent.
A. Dutech, O. Buffet, F. Charpillet.
Actes de la Conférence d'Apprentissage (CAp'04), Montpellier, France, 2004. [pdf] [ps.gz] [bib]
Apprentissage par renforcement pour la conception de systèmes multi-Agents réactifs.
A. Dutech, O. Buffet, F. Charpillet.
Actes des Journées Francophones sur les Systèmes Multi-Agents (JFSMA'03), Hammamet, Tunisia, 2003. [ps.gz] [pdf] [bib]

Preprints / Technical reports

HSVI for zs-POSGs using Concavity, Convexity and Lipschitz Properties.
A. Delage, O. Buffet, and J. Dibangoye.
arXiv/CoRR abs/2110.14529, 2021. [url] [pdf] [bib]
Solving infinite-horizon Dec-POMDPs using Finite State Controllers within JESP (extended version).
Y. You, V. Thomas, F. Colas, and O. Buffet.
arXiv/CoRR abs/2109.08755, 2021. [extended version of the ICTAI 2021 paper] [url] [pdf] [bib]
Monte Carlo Information-Oriented Planning (Revised version).
V. Thomas, G. Hutin, and O. Buffet.
arXiv/CoRR abs/2103.11345, 2021. [revised version of the ECAI 2020 paper] [url] [pdf] [bib]
On Bellman's Optimality Principle for zs-POSGs.
O. Buffet, J. Dibangoye, A. Delage, A. Saffidine, V. Thomas.
arXiv/CoRR abs/2006.16395, 2020. [url] [pdf] [bib]
Optimally solving Dec-POMDPs as Continuous-State MDPs: Theory and Algorithms.
J. Dibangoye, C. Amato, O. Buffet, F. Charpillet.
INRIA Research Report #8517, 2014. [pdf] [bib]
Decentralized Traffic Management: A Synchronization-Based Intersection Control -- extended version.
M. Tlig, O. Buffet, O. Simonin.
INRIA Research Report #8500, 2014 [extended version of ICALT'14 paper]. [pdf] [bib]
Near-Optimal BRL using Optimistic Local Transitions (Extended Version).
M. Araya-López, V. Thomas, O. Buffet.
INRIA Research Report #7965, 2012 [extended version of ICML'12 paper]. [pdf] [bib]
A POMDP Extension with Belief-dependent Rewards (Extended Version).
M. Araya-López, O. Buffet, V. Thomas, F. Charpillet.
INRIA Research Report #7433, 2010 [extended version of NIPS'10 paper]. [pdf] [bib]
Systematic Searches for Global Multiprocessor Real-Time Scheduling.
O. Buffet, L. Cucu-Grosjean.
INRIA Research Report #7386, 2010. [pdf] [bib]
Rapport Agata: Proposition de modélisation pour le suivi de situation et la prise de décision.
O. Buffet.
LAAS/CNRS - CNES, Projet Agata, 19 November 2007. [bib]
Rapport Agata T0+5 mois: Proposition de modélisation pour le suivi de situation et la prise de décision.
O. Buffet.
LAAS/CNRS - CNES, Projet Agata, 7 June 2007. [bib]
Robust Probabilistic Temporal Planning: Dynamic Programming vs Policy Search.
O. Buffet, D. Aberdeen.
National ICT Australia, September 2005. [bib]
Robust (L)RTDP: Reachability Analysis.
O. Buffet.
National ICT Australia, December 2004. [pdf] [ps.gz] [bib]
Planning with Robust (L)RTDP.
O. Buffet, D. Aberdeen.
National ICT Australia, November 2004. [pdf] [ps.gz] [bib]
Apprentissage par renforcement pour la conception de systèmes multi-Agents réactifs.
O. Buffet.
PhD thesis progress report. 2002. [pdf] [ps.gz] [bib]

Theses

Prise de décision séquentielle dans l’incertain : Exploiter la structure et rester dans le cadre.
O. Buffet.
Habilitation thesis (habilitation à diriger des recherches). 2017. [pdf] [bib]
Une double approche modulaire de l'apprentissage par renforcement pour des agents intelligents adaptatifs.
O. Buffet.
PhD thesis. 2003. [ps.gz] [pdf] [bib]
Apprentissage par renforcement dans un système multi-agents.
O. Buffet.
DEA thesis. 2000. [pdf] [ps.gz] [bib]

Books (editions)

Proceedings of the Thirtieth International Conference on Automated Planning and Scheduling, Nancy, France, October 26-30, 2020.
J. Christopher Beck, O. Buffet, J. Hoffmann, E. Karpas and S. Sohrabi (Eds), AAAI Press. [bib] [www]
Proceedings of the ICAPS'10 workshop on Planning and Scheduling Under Uncertainty (13 May 2010),
J. Bidot, D. Bryce, O. Buffet, H. Palacios, S. Sanner (Eds). 2010. [bib] [www]
Markov Decision Processes in Artificial Intelligence.
O. Sigaud and O. Buffet, Eds,
ISTE/Wiley, 2010. [bib]
Processus décisionnels de Markov en intelligence artificielle, volume 1.
O. Sigaud and O. Buffet, Eds,
Hermes Science Publishing, 2008. [bib]
Processus décisionnels de Markov en intelligence artificielle, volume 2.
O. Buffet and O. Sigaud, Eds,
Hermes Science Publishing, 2008. [bib]
Proceedings of the ICAPS'07 workshop on Artificial Intelligence Planning and Learning (22 September 2007),
U. Kuter, D. Aberdeen, O. Buffet and P. Stone (Eds). 2007. [bib]
Proceedings of the ECAI'06 Workshop on Planning, Learning and Monitoring with Uncertainty and Dynamic Worlds (PLMUDW'06),
A. Botea, O. Buffet and M. Zanella (Eds). 2006. [bib]
Proceedings of the Sixth European Workshop on Reinforcement Learning.
A. Dutech and O. Buffet (Eds). 2003. [www] [bib]

Book Chapters

Reinforcement Learning.
O. Buffet, O. Pietquin and P. Weng
A Guided Tour of Artificial Intelligence Research, volume 1: Knowledge Representation, Reasoning and Learning.
Springer, 2020. [bib]
3- Policy-Gradient Algorithms.
O. Buffet
Markov Decision Processes in Artificial Intelligence.
ISTE/Wiley, 2010. [bib]
15- Operations Planning.
S. Thiébaux and O. Buffet
Markov Decision Processes in Artificial Intelligence.
ISTE/Wiley, 2010. [bib]
3- Méthodes de gradient pour la recherche de politiques paramétrées.
O. Buffet
Processus décisionnels de Markov en intelligence artificielle, volume 2.
Hermes Science Publishing, 2008. [bib]
8- Planification d'opérations.
S. Thiébaux and O. Buffet
Processus décisionnels de Markov en intelligence artificielle, volume 2.
Hermes Science Publishing, 2008. [bib]

Submissions

Last modified: Mon Mar 1 10:10:52 CEST 2018
