论文信息 - Constructing a speech audio–video corpus by aligning long segments of speech and text - 字舞流文

Constructing a speech audio–video corpus by aligning long segments of speech and text

Anton Konushin | I. Karpukhin