Bio

I am a third-year Computer Science PhD student at Georgia Tech, advised by Dhruv Batra,
and working closely with Devi Parikh. My research focuses on deep learning
and its applications in building agents that can see (computer vision),
think (reasoning/interpretability), talk (language modeling), and
act (reinforcement learning).

I’ve spent three wonderful semesters as an intern at Facebook AI Research: Summer 2017 and Spring 2018 at Menlo Park, working with Georgia Gkioxari, Devi Parikh, and Dhruv Batra on training embodied agents for navigation and question answering in simulated environments (see embodiedqa.org), and Summer 2018 at Montréal, working with Mike Rabbat and Joelle Pineau on emergent communication protocols in large-scale multi-agent reinforcement learning.

On the side, I built neural-vqa, an efficient Torch implementation of visual question answering (and its extension, neural-vqa-attention),
and I maintain aideadlin.es (countdowns to a bunch of CV/NLP/ML/AI conference deadlines)
and several other side projects (HackFlowy, graf, etc.).
I also help maintain Erdős, a competitive math learning platform I created during my undergrad.
I often tweet, and post pictures from my travels on Instagram and Tumblr.

Torch implementation of an attention-based visual question answering model (Yang et al., CVPR16).
The model looks at an image, reads a question, and comes up with an answer to the question and a heatmap of where it looked in the image to answer it.
Some results here.

Open-source clone of WorkFlowy.com, a beautiful, list-based note-taking website that has a 500-item monthly limit on the free tier :-(. "Make lists. Not war." :-)

Another fun hackathon-winning project, built during Yahoo! HackU! 2012: a WebRTC-based P2P video chat that was faster than any other video chat provider at the time (before Google launched Hangouts).