A hybrid-token and textual based approach to find similar code segments

In this paper we propose a system capable of finding similar source code segments which are called as code clones. Such system would help maintainer solve their problem in a fast and efficient manner. It has been found that a code consist of 7 to 23 percent of cloning which make task difficult. So to solve such a problem we propose a hybrid approach of token and textual to find code cloning solving the problem encountered in tool named BOREAS which was not capable of finding type-3 types of code clones. BOREAS tool so designed was capable to match the speed that of tree matching algorithm and hence this tool would extend the performance over the present tool.

[1]  Brenda S. Baker,et al.  On finding duplication and near-duplication in large software systems , 1995, Proceedings of 2nd Working Conference on Reverse Engineering.

[2]  Susan Horwitz,et al.  Using Slicing to Identify Duplication in Source Code , 2001, SAS.

[3]  Shinji Kusumoto,et al.  CCFinder: A Multilinguistic Token-Based Code Clone Detection System for Large Scale Source Code , 2002, IEEE Trans. Software Eng..

[4]  Stéphane Ducasse,et al.  A language independent approach for detecting duplicated code , 1999, Proceedings IEEE International Conference on Software Maintenance - 1999 (ICSM'99). 'Software Maintenance for Business Change' (Cat. No.99CB36360).

[5]  J. Howard Johnson,et al.  Identifying redundancy in source code using fingerprints , 1993, CASCON.

[6]  K. Ghosh,et al.  India , 1988, The Lancet.

[7]  Yang Yuan,et al.  Boreas: an accurate and scalable token-based approach to code clone detection , 2012, 2012 Proceedings of the 27th IEEE/ACM International Conference on Automated Software Engineering.

[8]  Seung-won Hwang,et al.  Integrating code search into the development session , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[9]  Zhendong Su,et al.  DECKARD: Scalable and Accurate Tree-Based Detection of Code Clones , 2007, 29th International Conference on Software Engineering (ICSE'07).