Data augmentation using Heuristic Masked Language Modeling