论文信息 - Automatic detection of deception in child-produced speech using syntactic complexity features

Automatic detection of deception in child-produced speech using syntactic complexity features

It is important that the testimony of children be admissible in court, especially given allegations of abuse. Unfortunately, children can be misled by interrogators or might offer false information, with dire consequences. In this work, we evaluate various parameterizations of five classifiers (including support vector machines, neural networks, and random forests) in deciphering truth from lies given transcripts of interviews with 198 victims of abuse between the ages of 4 and 7. These evaluations are performed using a novel set of syntactic features, including measures of complexity. Our results show that sentence length, the mean number of clauses per utterance, and the StajnerMitkov measure of complexity are highly informative syntactic features, that classification accuracy varies greatly by the age of the speaker, and that accuracy up to 91.7% can be achieved by support vector machines given a sufficient amount of data.

Frank Rudzicz | Maria Yancheva

[1] T. Lyon,et al. Truth induction in young maltreated children: the effects of oath-taking and reassurance on true and false disclosures. , 2008, Child abuse & neglect.

[2] J. Pennebaker,et al. Lying Words: Predicting Deception from Linguistic Styles , 2003, Personality & social psychology bulletin.

[3] Andreas Stolcke,et al. Distinguishing deceptive from non-deceptive speech , 2005, INTERSPEECH.

[4] Michael Lewis,et al. Deception in 3-Year-Olds , 1989 .

[5] Yejin Choi,et al. Syntactic Stylometry for Deception Detection , 2012, ACL.

[6] L. Gillam,et al. “I Don’t Know Where He is Not”: Does Deception Research yet Offer a Basis for Deception Detectives? , 2012 .

[7] Detecting deceit via analyses of verbal and nonverbal behavior in children and adults , 2004 .

[8] James J. Lindsay,et al. Cues to deception. , 2003, Psychological bulletin.

[9] Fuhui Long,et al. Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10] Carlo Strapparava,et al. The Lie Detector: Explorations in the Automatic Recognition of Deceptive Language , 2009, ACL.

[11] Dan Klein,et al. Accurate Unlexicalized Parsing , 2003, ACL.