Speech/Song Database – old

Welcome to the RAVDESS website

The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS) is a set of 24 actors (12 male, 12 female) speaking and singing with various emotions, in a North American English accent. The RAVDESS is freely available for scientific-research and general use under a Non-commercial Creative Commons License Information. Please visit the Download page to register for and access the database.

The RAVDESS contains 7,356 high-quality video recordings of emotionally-neutral statements, spoken and sung with a range of emotions. The speech set consists of the 8 emotional expressions: neutral, calm, happy, sad, angry, fearful, surprise, and disgust. The song set consists of the 6 emotional expressions: neutral, calm, happy, sad, angry, and fearful. All emotions except neutral are expressed at two levels of emotional intensity: normal and strong. There are 2,452 unique vocalizations, all of which is available in three modality formats: full audio-video (720p, H.264), video-only, and audio-only (wave). The database has been validated in a perceptual experiment involving 297 participants. For more information, see the Design Features page.

More information about the database, including examples, can be found on the Supplemental Data page. Additional online material includes: validation data, vocal acoustic analyses, and facial motion tracking analyses, which can be found on the Supplemental Data page.

Citations

Please use the following reference when you cite the RAVDESS. Please note, this conference paper citation is temporary until the journal paper is accepted for publication.