An Efficient Voice Transcription Scheme for Music Retrieval

IEEE International Conference on Multimedia and Ubiquitous Engineering (MUE 2007)

Byeong-jun Han(1), Seungmin Rho(2), Eenjun Hwang(1)

(1) School of Electrical Engineering, Korea University.
(2) Graduate School of Information and Communication, Artificial Intelligence Laboratory.

Abstract

In this paper, we propose a new scheme for transcribing sung or hummed queries into a sequence of pitch and duration pairs automatically for efficient music retrieval. More specifically, we present two novel methods called WAE (Windowed Average Energy) and dynamic threshold method for ADF onsets for note segmentation and onset/offset detection in acoustic signal, respectively. The former improves previous energy-based approaches such as AE by defining small but coherent windows with local and global threshold values. The latter also improves the traditional global/local threshold method. By performing various experiments on our prototype music retrieval system, we show the effectiveness of our proposed scheme.

Full paper

Available on IEEE Xplore.
Available on IEEE Computer Society Digital Library.
Linked on DBLP.

Presentation

Presentation by Byeong-jun Han.
Download (PDF, 558KB)

Last Updated: 2007-09-28


All rights reserved by Byeong-jun, Han. 2005-2008.