INVocD: Identifier name vocabulary dataset

INVocD is a database of the identifier name declarations and vocabulary found in 60 FLOSS Java projects where the source code structure is recorded and the identifier name vocabulary is made directly available, offering advantages for identifier name research over conventional source code models. The database has been used to support a range of research projects from identifier name analysis to concept location, and provides many opportunities to researchers. INVocD may be downloaded from http://oro.open.ac.uk/36992.

[1]  Timothy Lethbridge,et al.  The Dagstuhl Middle Metamodel: A Schema For Reverse Engineering , 2004, ateM.

[2]  Einar W. Høst,et al.  The Java Programmer's Phrase Book , 2009, SLE.

[3]  Michel Wermelinger,et al.  Relating developers' concepts and artefact vocabulary in a financial software module , 2011, 2011 27th IEEE International Conference on Software Maintenance (ICSM).

[4]  Yijun Yu,et al.  Mining java class naming conventions , 2011, 2011 27th IEEE International Conference on Software Maintenance (ICSM).

[5]  Nicolas Anquetil,et al.  MSE and FAMIX 3.0: an Interexchange Format and Source Code Model Family , 2011 .

[6]  Yijun Yu,et al.  Improving the Tokenisation of Identifier Names , 2011, ECOOP.

[7]  Yann-Gaël Guéhéneuc,et al.  Can Lexicon Bad Smells Improve Fault Prediction? , 2012, 2012 19th Working Conference on Reverse Engineering.