UROP Openings

Reinforcement Learning Model Development and Comparison

Term:

Summer

Department:

4: Architecture

Faculty Supervisor:

Takehiko Nagakura

Faculty email:

takehiko@MIT.EDU

Apply by:

Contact:

Paloma Francisca Gonzalez Rojas <palomagr@mit.edu>

Project Description

We are using human trajectory data from drone videos from Machu Picchu to feed to a Reinforcement Learning Model in Unity3D. We have two training approaches to compare, Exploratory RL and Imitation Learning. For each a training scene is set up with rewards and a curated label system. Such scenes have been set up, and the main task is to test them in order to do a quantitative/qualitative comparison.