In all cases, data was recorded using a pair of AVT Marlins F033C mounted on a chariot respectively a car, with a resolution of 640 x 480 (bayered), and a framerate of 13--14 FPS. For each dataset, we provide the unbayered images for both cameras, the camera calibration, and if available, the set of bounding box annotations. Depth maps were created based on this data using the publicy available belief-propagation-based stereo algorithm of Huttenlocher and Felzenszwalb (note: this has no occlusion handling built in, if you know of a better, publicly available stereo algorithm, please contact me).