ASR (Automatic Speech Recognition)The AI technology that converts spoken audio into written text. The core engine behind any auto subtitle generator.Burned-in Subtitles (Hardcoded)Captions permanently rendered into the video pixels. Cannot be turned off by the viewer. Best for social media where platform caption support is unreliable.Closed Captions (CC)Subtitles stored as a separate text file (SRT, VTT) that viewers can toggle on/off. Required for YouTube, streaming platforms, and accessibility compliance.SRT FileSubRip Subtitle format. A plain text file with subtitle text and timestamps. The most universal subtitle file format, supported by all major video platforms.VTT FileWeb Video Text Tracks. HTML5 web standard for subtitle files. Used by browsers, streaming players, and the Web Accessibility Initiative (WAI).Whisper AIOpenAI's open-source speech recognition model trained on 680,000 hours of audio. The current gold standard for auto subtitle generation accuracy.CPS (Characters Per Second)Reading speed metric for subtitles. Standard is 17 CPS for general audiences, 20 CPS for children's content, 21 CPS for adult drama.SDH (Subtitles for the Deaf and Hard of Hearing)Subtitles that include not just speech but also non-speech audio descriptions: [door slams], [music playing], [phone buzzing].TimestampThe in-and-out time codes for each subtitle segment. Format: HH:MM:SS,mmm --> HH:MM:SS,mmm in SRT. Precision to milliseconds.VFR (Variable Frame Rate)Video recorded at a non-constant frame rate. Common with screen recordings and some phones. Can cause subtitle sync drift if not processed correctly.