Music Plagiarism Detection via Bipartite Graph Matching

Nowadays, with the prevalence of social media and music creation tools, musical pieces are spreading much quickly, and music creation is getting much easier. The increasing number of musical pieces have made the problem of music plagiarism prominent. There is an urgent need for a tool that can detect music plagiarism automatically. Researchers have proposed various methods to extract low-level and high-level features of music and compute their similarities. However, low-level features such as cepstrum coefficients have weak relation with the copyright protection of musical pieces. Existing algorithms considering high-level features fail to detect the case in which two musical pieces are not quite similar overall, but have some highly similar regions. This paper proposes a new method named MESMF, which innovatively converts the music plagiarism detection problem into the bipartite graph matching task. It can be solved via the maximum weight matching and edit distances model. We design several kinds of melody representations and the similarity computation methods according to the music theory. The proposed method can deal with the shift, swapping, transposition, and tempo variance problems in music plagiarism. It can also effectively pick out the local similar regions from two musical pieces with relatively low global similarity. We collect a new music plagiarism dataset from real legally-judged music plagiarism cases and conduct detailed ablation studies. Experimental results prove the excellent performance of the proposed algorithm. The source code and our dataset are available at https://anonymous.4open.science/r/a41b8fb4-64cf4190-a1e1-09b7499a15f5/

[1]  Julien Allali,et al.  Adaption of String Matching Algorithms for Identification of Near-Duplicate Music Documents , 2007, PAN.

[2]  Remco C. Veltkamp,et al.  Searching notated polyphonic music using transportation distances , 2004, MULTIMEDIA '04.

[3]  David Sankoff,et al.  Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison , 1983 .

[4]  Patrick A. V. Hall,et al.  Approximate String Matching , 1994, Encyclopedia of Algorithms.

[5]  Delfina Malandrino,et al.  Fuzzy vectorial-based similarity detection of music plagiarism , 2017, 2017 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[6]  François Pachet,et al.  The Cuidado music browser: an end-to-end electronic music distribution system , 2006, Multimedia Tools and Applications.

[7]  Delfina Malandrino,et al.  A computational intelligence text-based detection system of music plagiarism , 2017, 2017 4th International Conference on Systems and Informatics (ICSAI).

[8]  Emmanuel Vincent,et al.  The 2005 Music Information retrieval Evaluation Exchange (MIREX 2005): Preliminary Overview , 2005, ISMIR.

[9]  Ian H. Witten,et al.  Searching digital music libraries , 2002, Inf. Process. Manag..

[10]  Daniel Müllensiefen,et al.  Court decisions on music plagiarism and the predictive value of similarity algorithms , 2009 .

[11]  Mauro Vallati,et al.  Symbolic Melodic Similarity: State of the Art and Future Challenges , 2016, Computer Music Journal.

[12]  Daniel Gärtner,et al.  Audio forensics meets Music Information Retrieval — A toolbox for inspection of music plagiarism , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).

[13]  Harold W. Kuhn,et al.  The Hungarian method for the assignment problem , 1955, 50 Years of Integer Programming.

[14]  Nicola Lettieri,et al.  Visualization of Music Plagiarism: Analysis and Evaluation , 2016, 2016 20th International Conference Information Visualisation (IV).

[15]  Alicja Wieczorkowska,et al.  Music Information Retrieval , 2009, Encyclopedia of Data Warehousing and Mining.

[16]  Thomas G. Szymanski,et al.  A fast algorithm for computing longest common subsequences , 1977, CACM.

[17]  A. Tversky Features of Similarity , 1977 .

[18]  Seth Pettie,et al.  Linear-Time Approximation for Maximum Weight Matching , 2014, JACM.

[19]  Justin Zobel,et al.  Melodic matching techniques for large music databases , 1999, MULTIMEDIA '99.

[20]  David Sankoff,et al.  Comparison of musical sequences , 1990, Comput. Humanit..

[21]  Antonio Esposito,et al.  Music Plagiarism at a Glance: Metrics of Similarity and Visualizations , 2017, 2017 21st International Conference Information Visualisation (IV).

[22]  Mert Bay,et al.  Audio Cover Song Identification: MIREX 2006-2007 Results and Analyses , 2008, ISMIR.

[23]  François Pachet,et al.  Deep learning for music generation: challenges and directions , 2018, Neural Comput. Appl..

[24]  Stefan M. Rüger,et al.  Robust Polyphonic Music Retrieval with N-grams , 2003, Journal of Intelligent Information Systems.