Markov decision process

Bootstrapping Simulation-Based Algorithms with a Suboptimal Policy

Autonomous Assistance

This project investigates the challenge of building helpers for two-party collaboration.

Bootstrapping Monte Carlo Tree Search with an Imperfect Heuristic