Large vocabulary word scoring as a basis for transcription generation