Theory and Evaluation of a Bayesian Music Structure Extractor

Abdallah, Samer; Noland, Katy; Sandler, Mark; Casey, Michael A.; and Rhodes, Christophe. 2005. 'Theory and Evaluation of a Bayesian Music Structure Extractor'. In: International Conference on Music Information Retrieval. London, United Kingdom 11 - 15 September 2005. [Conference or Workshop Item]
Copy

We introduce a new model for extracting end points of music structure segments, such as intro, verse, chorus, break and so forth, from recorded music. Our methods are applied to the problem of grouping audio features into continuous structural segments with start and end times corresponding as closely as possible to a ground truth of independent human structure judgements. Our work extends previous work on automatic summarization and structure extraction by providing a model for segment end-points posed in a Bayesian framework. Methods to infer parameters to the model using Expectation Maximization and Maximum Likelihood methods are discussed. The model identifies all the segments in a song, not just the chorus or longest segment. We discuss the theory and implementation of the model and evaluate the model in an automatic structure segmentation experiment against a ground truth of human judgements. Our results shows a segment boundary intersection rate break-even point of approximately 80%.


picture_as_pdf
segmentation.pdf
subject
Published Version

View Download

Atom BibTeX OpenURL ContextObject in Span OpenURL ContextObject Dublin Core Dublin Core MPEG-21 DIDL Data Cite XML EndNote HTML Citation METS MODS RIOXX2 XML Reference Manager Refer ASCII Citation
Export

Downloads