A computational model of music transcription
The purpose of this research was to create a computational model of music transcription. The resulting computer system processed natural musical sounds and automatically produced the music notation symbols that represented those sounds. The learned human skill of transcribing music is one of the most sophisticated auditory pattern-recognition tasks that humans perform. Two related signal-to-symbol machine-perception disciplines are automatic speech recognition and computer vision.
In the computational-model approach, hypotheses are implemented in precise algorithmic form on the computer. Any single algorithm, in turn, must work in harmony with a constellation of other algorithms that together form the integrated system. The robustness of the system was tested using unconstrained music played on a variety of musical instruments.
A bottom-up (i.e., data-driven) approach was implemented in this working system. The digitized sound signal from monophonic (one-part) music was first transformed into its spectral representation, which formed the basis for extracting the time-varying partials. Next, the time-varying pitch was established from these partials. Musical notes were then segmented by pitch and amplitude edge operators applied to the pitch information. The discrete acoustical events thus produced were classified into music-notation note symbols (representing pitch and duration). Finally, the musical information was presented in the graphical printed form of music familiar to millions of musicians.
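The segmentation and classification stages described above can be illustrated with a minimal Python sketch. It is not the system's actual implementation: the function names (`segment_notes`, `classify`), the thresholds (`amp_floor`, `pitch_jump`), the quarter-beat rounding, and the use of MIDI note numbers are all assumptions made for illustration. The sketch starts from a frame-wise pitch track that is assumed to have been computed already, and it skips the spectral analysis and partial extraction.

```python
import math

NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def hz_to_midi(freq_hz):
    """Convert a frequency in Hz to a fractional MIDI note number (A4 = 440 Hz = 69)."""
    return 69.0 + 12.0 * math.log2(freq_hz / 440.0)


def midi_to_name(midi):
    """Name an integer MIDI note number, e.g. 69 -> 'A4'."""
    return f"{NOTE_NAMES[midi % 12]}{midi // 12 - 1}"


def segment_notes(pitch_hz, amplitude, amp_floor=0.05, pitch_jump=0.5):
    """Split frame-wise tracks into (start_frame, end_frame, mean_midi) events.

    Hypothetical edge rules: an event ends when the amplitude drops below
    amp_floor (amplitude edge) or when the pitch moves more than pitch_jump
    semitones away from the previous frame (pitch edge).
    """
    events = []
    start, midis = None, []
    for i, (freq, amp) in enumerate(zip(pitch_hz, amplitude)):
        voiced = amp >= amp_floor and freq > 0
        midi = hz_to_midi(freq) if voiced else None
        # Close the open event on an amplitude edge or a pitch edge.
        if start is not None and (not voiced or abs(midi - midis[-1]) > pitch_jump):
            events.append((start, i, sum(midis) / len(midis)))
            start, midis = None, []
        if voiced and start is None:
            start, midis = i, [midi]  # open a new event at this frame
        elif voiced:
            midis.append(midi)  # extend the current event
    if start is not None:
        events.append((start, len(pitch_hz), sum(midis) / len(midis)))
    return events


def classify(events, frame_s, beat_s):
    """Map events to (note name, duration in beats rounded to a quarter beat)."""
    notes = []
    for start, end, mean_midi in events:
        beats = round((end - start) * frame_s / beat_s * 4) / 4
        notes.append((midi_to_name(round(mean_midi)), beats))
    return notes


# Synthetic pitch track: A4, B4, a short silence, then C5 (50 ms frames, 120 bpm).
pitch = [440.0] * 10 + [494.0] * 10 + [0.0] * 2 + [523.25] * 20
amp = [0.5] * 20 + [0.0] * 2 + [0.5] * 20
print(classify(segment_notes(pitch, amp), frame_s=0.05, beat_s=0.5))
# -> [('A4', 1.0), ('B4', 1.0), ('C5', 2.0)]
```

Rounding the mean pitch of each event to the nearest semitone means that small frame-to-frame pitch errors do not change the resulting note symbol.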
The backbone of the system was the pitch-detection and note-segmentation method. Highly precise pitch tracking was found to be unnecessary, although context was important in determining the time-varying pitch.
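The abstract does not say how context was used. As one possible illustration only, the sketch below applies a running median to a frame-wise pitch track, so that each frame's pitch is decided with reference to its neighbors. The median filter is a swapped-in, commonly used technique and not necessarily the author's method. The function name `smooth_pitch` and the `half_window` parameter are hypothetical.

```python
from statistics import median


def smooth_pitch(pitch_hz, half_window=2):
    """Replace each frame's pitch with the median of the surrounding frames.

    Assumption: isolated outliers (for example a one-frame octave error)
    are less trustworthy than the pitch of the neighboring frames.
    """
    n = len(pitch_hz)
    smoothed = []
    for i in range(n):
        lo = max(0, i - half_window)
        hi = min(n, i + half_window + 1)
        smoothed.append(median(pitch_hz[lo:hi]))
    return smoothed


# A single-frame octave jump (440 Hz inside a 220 Hz note) is removed.
print(smooth_pitch([220.0, 220.0, 440.0, 220.0, 220.0, 220.0]))
# -> [220.0, 220.0, 220.0, 220.0, 220.0, 220.0]
```

A filter of this kind trades fine pitch detail for stability, which fits the observation that highly precise tracking was not required.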
The automatic transcription system yielded notation that closely followed the original music performance. Additional research is necessary to incorporate the higher-level musical knowledge that appears essential for the proper presentation of music notation. Other, more ambitious goals include automatic polyphonic music transcription.