Efficient token based clone detection with flexible tokenization