Academic Commons Search Resultshttps://academiccommons.columbia.edu/catalog?action=index&controller=catalog&f%5Bauthor_facet%5D%5B%5D=Sheh%2C+Alexander&f%5Bdepartment_facet%5D%5B%5D=Electrical+Engineering&format=rss&fq%5B%5D=has_model_ssim%3A%22info%3Afedora%2Fldpd%3AContentAggregator%22&q=&rows=500&sort=record_creation_date+desc
Academic Commons Search Resultsen-usChord Recognition and Segmentation Using EM-trained Hidden Markov Modelshttps://academiccommons.columbia.edu/catalog/ac:149652
Sheh, Alexander; Ellis, Daniel P. W.http://hdl.handle.net/10022/AC:P:14009Thu, 12 Jul 2012 15:11:25 +0000An overview of work on chord recognition and EM-trained Hidden Markov Models.Artificial intelligence, Electrical engineeringde171Electrical EngineeringPresentationsChord Segmentation and Recognition using EM-Trained Hidden Markov Modelshttps://academiccommons.columbia.edu/catalog/ac:148748
Sheh, Alexander; Ellis, Daniel P. W.http://hdl.handle.net/10022/AC:P:13736Fri, 29 Jun 2012 16:31:17 +0000Automatic extraction of content description from commercial audio recordings has a number of important applications, from indexing and retrieval through to novel musicological analyses based on very large corpora of recorded performances. Chord sequences are a description that captures much of the character of a piece in a compact form and using a modest lexicon. Chords also have the attractive property that a piece of music can (mostly) be segmented into time intervals that consist of a single chord, much as recorded speech can (mostly) be segmented into time intervals that correspond to specific words. In this work, we build a system for automatic chord transcription using speech recognition tools. For features we use "pitch class profile" vectors to emphasize the tonal content of the signal, and we show that these features far outperform cepstral coefficients for our task. Sequence recognition is accomplished with hidden Markov models (HMMs) directly analogous to subword models in a speech recognizer, and trained by the same Expectation-Maximization (EM) algorithm. Crucially, this allows us to use as input only the chord sequences for our training examples, without requiring the precise timings of the chord changes— which are determined automatically during training. Our results on a small set of 20 early Beatles songs show frame-level accuracy of around 75% on a forced-alignment task.Electrical engineering, Artificial intelligencede171Electrical EngineeringArticles