Instructional Video Summarization with Transformers: A Curriculum Learning Approach for ASR-Generated Transcripts
Abstract
This paper addresses abstractive summarization of instructional video transcripts. The proposed method uses a document-level encoder built on a transformer architecture to improve the fluency and generalizability of generated summaries across diverse video content. Training draws on a unique dataset of over 5,000 extracted transcripts and applies specific fine-tuning and order-preserving techniques. Evaluation with Content F1 and human judgments shows that the generated summaries reach quality comparable to human-authored text, providing concise and informative overviews for online educational platforms.