| Authors |
Title |
Abstract |
| L Dorard and J Shawe-Taylor |
Gaussian Process Bandits and Monte-Carlo Planning |
[pdf] |
| P Hennig, DH Stern, and T Graepel |
Bayesian Quadratic Reinforcement Learning |
[pdf] |
| I Baldwin and P Newman |
Learning to Plan |
[pdf] |
| A Guez and J Pineau |
Performing Tasks while doing Simultaneous Localization and Mapping |
[pdf] |
| SRHM Tonk and HJ Kappen |
Using graphical model inference for agent opponent modeling |
[pdf] |
| M Toussaint |
Pros and Cons of truncated Gaussian EP in the context of Approximate Inference Control |
[pdf] |
| N Jetchev and M Toussaint |
Trajectory Prediction and Parallel Exploration of Movement Trajectories for Online Planning |
[pdf] |
| S Ross and JA Bagnell |
Efficient Reductions for Imitation Learning |
[pdf] |
| MG Azar and HJ Kappen |
Dynamic Policy Programming by Kullback Leibler Divergence Minimization |
[pdf] |
| B van den Broek, W Wiegerink, and B Kappen |
Stochastic Optimal Control by Hybrid Monte Carlo Sampling |
[ps] |
| M Hoffman, P Carbonetto, N de Freitas, and A Doucet |
Inference strategies for solving semi-Markov decision processes |
[pdf] |
| A Boularias and B Chaib-draa |
Learning Probabilistic Models via Inverse Planning |
[pdf] |
| K Dvijotham and E Todorov |
Efficient Algorithms for Inverse Optimal Control |
[pdf] |
| K Dvijotham and E Todorov |
Optimal control problems with simplified HJB/Bellman equations |
[pdf] |
| BD Ziebart, JA Bagnell, and AK Dey |
Purposeful Adaptive Behavior Modeling for Robotics |
[pdf] |
| G Comanici and D Precup |
Optimal Time-Scales for Reinforcement Learning Behavior Strategies |
[pdf] |
| B Boots, SM Siddiqi, and GJ Gordon |
Closing the Learning-Planning Loop with PSRs |
[pdf] |
| J Tang, A Singh, N Goehausen, and P Abbeel |
Learning Parameterized Maneuvers for Autonomous Helicopter Aerobatics from
Expert Demonstration |
[pdf] |
| A Shademan, AM Farahmand, and M Jägersand |
Statistically Robust Outlier Detection and Jacobian Learning in
Uncalibrated Visual Servoing |
[pdf] |
| T Erez and WD Smart |
Deterministic, locally-quadratic value-function approximation over belief space |
[pdf] |
| Y Tassa |
Solution of Linear Bellman Equations with a Mixture-of-Gaussians Approximation |
[pdf] |
| IR Fasel, P Ruvulo, T Wu, and J Movellan |
Infomax Control for Social Robots |
[pdf] |