Method, system and computer for realizing sound-and-caption synchronization in video file

The invention provides a method, a system and a computer for realizing sound-and-caption synchronization in a video file. The method comprise the following steps of: acquiring a first sound and a first caption of the currently played video file, wherein the first sound is not matched with the first caption, the first sound corresponds to a first time stamp in the video file, and the first captioncorresponds to a second time stamp in the video file; calculating the similarity of the first sound and the first caption to obtain a result; when the result shows that the similarity is greater thana threshold value, comparing the first time stamp with the second time stamp to obtain a time difference value; and adjusting the first time stamp and the second time stamp according to the time difference value to enable the first sound and the first caption in the video file to be synchronously output. Due to the adoption of the method, the system and the computer for realizing the sound-and-caption synchronization in the video file, automatic synchronization of the sound and the caption can be realized when the played sound and the current caption are asynchronous, so user experience is greatly improved.