Categorizing Languages and Speakers: Processes of Erasure in Data Treatment and Presentation