TEL'M: Test and Evaluation of Language Models