Unweighted fusion in microphone forensics using a decision tree and linear logistic regression models

For the exemplarily chosen domain of microphone forensics we show that media forensics can strongly benefit from combining statistical pattern recognition (using supervised classification) and unweighted information fusion (on the example of match-, rank- and decision level fusion). The practical results presented show that, by using a carefully selected fusion strategy and two multi-class classifiers (a decision tree and linear logistic regression models), the accuracy achieved in practical testing can be increased to 100%. This result is based on first tests on two sets of four and seven different microphones. For each of those microphones ten reference samples are recorded in ten different locations and are used in the ratio 80% to 20% for supervised training and testing by the two classifiers. The overall positive tendency indicates that microphone forensics might become an important security mechanism for the verification of source authenticity. Recent gunshot classification approaches, which try to determine the gun used in gunshot audio recordings, have the problem that they rely on carefully controlled conditions, amongst them the fact that the microphone used for all evaluations has to remain the same. A microphone classification approach as introduced here would allow for similarity estimation for microphones and thereby would enable exchanging microphones in such a gunshot classification approach without complete loss of confidence. Furthermore microphone forensics could be used in provenance verification of digital audio media to verify the microphone used for recordings to be submitted into secure long term archiving systems.

[1]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..

[2]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[3]  Robert C. Maher,et al.  Acoustical Characterization of Gunshots , 2007 .

[4]  Jana Dittmann,et al.  Microphone Classification Using Fourier Coefficients , 2009, Information Hiding.

[5]  B. Atal Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification. , 1974, The Journal of the Acoustical Society of America.

[6]  Jana Dittmann,et al.  Digital audio forensics: a first practical evaluation on microphone and environment classification , 2007, MM&Sec.

[7]  Jana Dittmann,et al.  Impact of feature selection in classification for hidden channel detection on the example of audio data hiding , 2008, MM&Sec '08.

[8]  Ian Witten,et al.  Data Mining , 2000 .

[9]  Eibe Frank,et al.  Logistic Model Trees , 2003, ECML.

[10]  Jan P. Allebach,et al.  Survey of Scanner and Printer Forensics at Purdue University , 2008, IWCF.

[11]  Arun Ross,et al.  Handbook of Multibiometrics (International Series on Biometrics) , 2006 .

[12]  Miroslav Goljan,et al.  Using sensor pattern noise for camera model identification , 2008, 2008 15th IEEE International Conference on Image Processing.

[13]  Jana Dittmann,et al.  Sensometrics: identifying pen digitizers by statistical multimedia signal processing , 2007, Electronic Imaging.

[14]  Arun Ross,et al.  Handbook of Multibiometrics , 2006, The Kluwer international series on biometrics.