Music Information Retrieval

1555 Words4 Pages

Abstract:

This paper describes a retrieval method for a significant amount of music tracks. It takes a melody sung or a text input as a search clue which is sent over the internet to retrieve the song from the database. This system uses a variety of procedures to support multiple search modes. The thorough indexing method allows for rapid recovery of the song. Integration of these components within a single architecture shall improve performance and functionality.

Overview of Objectives:

The aim of this paper is to utilise and develop existing techniques for Music Information Retrieval. The system shall be able to retrieve a song from either a text query or a melody query. This melody can be sung, a riff or humming. Browsing the database shall also be available to the user. The database shall reside on a server. For the purposes of this paper, it shall contain 10,000 songs. However, taking into consideration all the features of this system, I would expect it to be able to process a database 1000% larger.

Text

Query

Search Engines

Text Matching

Theme Matching

Phonetic Matching

Markov Representation

Music

Archive

User

Interface

Aural

Query

Figure 1 – Name That Tune Architecture

Functional Description of the System:

The system shall have a graphical interface containing 2 search engines:

1. Text Input – User may enter the title, line of song, artist

2. Aural Input – User may sing, hum, play a riff

The user would choose their preferable search method.

The aural input is processed through a Speech Recognition System. This comprises of 2 components:

1. An acoustic model – using statistical modelling using hidden Markov models. 2. A language model – model of the expected word sequences. This vocabulary will contain 80,000 words.

The songs shall be contained in a database with a corresponding value recorded in an index. This would contain 4 sections:

1. Traditional Text Matching

2. Theme Matching

3. Phonetic Matching

4. Markov Representation

There are other techniques, but after researching I feel that these are the most productive.

Sections 1 and 2 deal with the Text Input and 3 and 4 with the Aural Input. The file structure shall be that of the inverted file. This stores term locations. It shall also include term proximity. The most effective method to compute term locations is to use a hashing algorithm which will be employed in this system.

As all lyrics shall be recorded with each song, the Text Matching search shall be the most accurate. However, many users may not input the correct words and incorrect songs would be displayed.

This method shall use general information retrieval techniques.

More about Music Information Retrieval

Open Document