Automatic Text Summarization of Video Lectures Using Subtitles

Text summarization can be defined as a process of reducing a text document using computer program in order to generate a summary of original document that consists of most important things covered in that. An example of summarization technology is search engines such as Google. This paper orients for analyzing and producing text summary of video lectures by harnessing the subtitles file provided along with the lectures. Extractive text summarization method has been adopted to produce the summaries from the source subtitles. This would help user in deciding whether a particular lecture is relevant to them or not, thereby saving their time and aiding them in quick decision making. Experiments were conducted on various subtitle files belonging to different lectures, and it has been found that extractive text summarization reduces the content of original subtitle file up to sixty percent by tf-idf approach.