By the numbers

95–98%
Transcription accuracy
On clear audio with standard accents
<60s
Average generation time
For videos under 60 seconds
50+
Supported languages
Auto-detected by Whisper AI
1 500+
Active creators
Trust Scenith for video captions
0
Watermarks on 720p exports
Always free, always clean

Ready? Takes under 2 minutes

Start Adding
Subtitles
Right Now

No account required to preview. Free-tier users download at 720p with no watermark. Upgrade for 1080p → 4K.

Open Free Subtitle Generator
No software install No credit card Works on mobile Instant MP4 download

Step-by-step

How to Add Subtitles to a
Video Online in 4 Steps

The entire workflow lives in your browser — upload, generate, edit, export. No account setup, no queue, no complicated timeline editors.

Upload Your Video

Drag-and-drop or select an MP4, MOV, AVI, MKV, or WMV file up to 500 MB. Your video is stored securely and processed on Scenith servers — nothing is shared with third parties.

AI Generates Captions

Click "Generate Subtitles". OpenAI Whisper analyses the audio track, transcribes speech, timestamps every word, and groups captions into readable segments — typically done in under 60 seconds for a 1-minute clip.

Edit Text, Timing & Style

Open any caption in the timeline editor to fix text, nudge start/end times, or swap styles. Change font family, colour, size, stroke, background box, letter-spacing, and exact X/Y position. Preview updates live in the video player.

Download Your MP4

Hit "Process Subtitles" and Scenith renders the final video with captions permanently burned in. Download your MP4 in the quality your plan supports — 720p on the free tier, up to 4K on Studio.

The case for captions

Why Every Video Needs
Subtitles in 2026

Silent Viewing Is the Default on Mobile

Studies consistently show that over 85% of social media videos are watched without sound in public settings. Instagram, Facebook, and TikTok autoplay videos muted by default. If your video relies on spoken dialogue to carry meaning, a viewer scrolling in a café or on a commute will simply scroll past it.

Adding subtitles online for free is therefore not optional for modern content — it is the minimum viable accessibility layer that separates videos that convert from videos that get skipped.

Research insight

Facebook's internal data showed that videos with captions earned 12% more watch time on average than uncaptioned equivalents. The effect is even larger on short-form vertical video.

Accessibility Is a Legal and Ethical Obligation

466 million people worldwide live with disabling hearing loss, according to the WHO. In the United States, the Americans with Disabilities Act (ADA) and the Web Content Accessibility Guidelines (WCAG 2.1 AA) both require captions for publicly available video content at Level 1.2.2.

Educational institutions receiving federal funding are legally required to caption all instructional video under Section 508. Major streaming platforms, corporate intranets, and government websites face similar mandates globally. Tools that let you add subtitles to videos online for free remove the cost barrier from compliance.

SEO Uplift You Can Actually Measure

Search engines cannot watch video — but they can read caption text. When you burn subtitles into a video and also embed a transcript on the page, you give crawlers rich semantic content. YouTube's algorithm ranks captioned videos higher in suggested results and the text is indexed by Google Video Search.

An analysis of 10,000 YouTube videos found those with accurate closed captions received 7.32% more views within 90 days of upload compared to uncaptioned equivalents — purely from algorithmic uplift, independent of engagement metrics.

Language Barriers and Global Reach

English is the first language of roughly 400 million people, yet the internet has 5.4 billion users. Adding subtitles in the native language of your target audience — or even providing accurate English captions for ESL audiences — dramatically expands the addressable market for any piece of video content.

AI subtitle generators that support 50+ languages, like Scenith's Whisper-powered tool, let a solo creator add multilingual captions to their videos for free, something that previously required hiring professional translators and caption-formatting agencies.

Creator economy stat

Creators who add subtitles in at least one additional language report 23–45% more international engagement on average, based on data from 2 000+ YouTube channels in 2025.

Cognitive Load and Information Retention

Educational psychologists refer to dual-coding theory: when learners process information through both visual text and audio simultaneously, retention improves by up to 40% compared to audio alone. This is why every serious e-learning platform — Coursera, Udemy, LinkedIn Learning — mandates subtitles on all instructional video content.

For marketing content, subtitles reinforce key messages: brand names, product features, call-to-action phrases. A viewer who both hears and reads "Free 14-day trial" is measurably more likely to act on that prompt than a viewer who only hears it.

Platform Algorithm Signals

TikTok's algorithm explicitly rewards high completion rate. Captions increase completion rate for muted viewers (who would otherwise drop off as soon as they realise they cannot follow the content). LinkedIn reports that video posts with captions achieve 3× higher engagement than those without. Instagram Reels with on-screen text are currently boosted by the Explore algorithm in 2026 as the platform doubles down on accessibility signals.

What you get

Everything You Need to
Subtitle Videos Online

Scenith is not a stripped-down free tool with a paywall behind every feature. The core subtitle workflow — upload, generate, edit, export — is entirely free up to 720p.

Whisper AI Transcription — 95–98% Accuracy

OpenAI's Whisper model is trained on 680 000 hours of multilingual audio. It handles accents, background noise, overlapping speech, and technical vocabulary better than any other open-weight speech recognition model available in 2026. You get a production-quality draft transcript in under a minute for most videos — not the rough approximation you get from browser-native speech APIs.

50+ Languages

Live In-Player Preview

Every styling change — colour, font, size, position — updates instantly in the embedded video player. No blind exports to check your work.

Real-Time

25+ Style Presets

Choose from professionally designed subtitle templates — cinematic white-stroke, TikTok bold-pop, documentary yellow, minimal lowercase, and more.

Custom Font & Typography

Swap font family, adjust weight, letter-spacing, line height, and border stroke width. Apply changes to one caption or all at once.

Pixel-Level Positioning

Drag or type exact X/Y coordinates. Useful for vertical 9:16 videos where bottom placement must clear lower-third graphics or safe-zone text.

Undo Delete

Accidentally deleted a caption segment? The undo stack brings it back — even after a backend save — without regenerating the whole transcript.

Background Box with Opacity Control

Add a semi-transparent background rectangle behind caption text for readability on busy footage. Control fill colour, opacity, padding, corner radius, and border independently — great for the social-card style popular on LinkedIn and Instagram in 2026.

Accessibility-first

Export from 720p to 4K

Free tier exports at 720p without watermark. Creator Lite unlocks 1080p, Creator unlocks 2K, and Studio plan goes all the way to 4K — always burned into a clean H.264 MP4 compatible with every platform.

No Watermark on Free

Platform-specific guidance

Subtitle Specs for Every Platform

Different platforms have different safe zones, aspect ratios, and viewer behaviours. Here is what to set when you add subtitles to video online for each major channel in 2026.

YouTube

Use bottom-centre placement with 10% margin from edges. White text + black stroke reads on any thumbnail. Caption text helps YouTubes search index rank your video.

Aspect: 16:9 or 9:16 ShortsFont size: 24–32px recommendedPosition: Bottom centre, Y: 85%

Instagram Reels

Vertical 9:16 only. Place captions in the middle 60% of the frame — avoid the top (where the username sits) and bottom (where the action buttons overlap).

Aspect: 9:16 verticalSafe zone: Y: 20%–80%Style: Bold + background box

TikTok

Large, high-contrast text is rewarded by TikToks completion-rate signal. Bold sans-serif at 130%+ scale performs best. Many creators add captions at the top 1/3 and bottom 1/3 simultaneously for dual-pane content.

Aspect: 9:16Rec. font size: Large / 140% scaleAvoid: Bottom 20% (UI chrome)

LinkedIn

Square (1:1) or landscape. Professional, minimal subtitle styles outperform flashy ones. LinkedIns algorithm gives a 3× engagement boost to captioned native video vs uncaptioned — add them every time.

Aspect: 1:1 or 16:9Style: Minimal / cleanImpact: +300% engagement

Facebook

Facebook autoplays muted in the News Feed. Captions are the only way viewers see your message before they choose to unmute. Keep captions large and avoid relying on audio for context.

Aspect: 16:9 or 1:1Muted views: ~85% of totalCTA: Repeat key phrases in captions

LMS / E-Learning

Accuracy matters more than style here. Use simple sans-serif, 1.4 line-height, and slow reading pace (≤17 chars/sec). Add [sound effect] descriptors for accessibility compliance.

WCAG req.: 1.2.2 AA captionsSpeed: ≤17 chars/secondContrast: 4.5:1 minimum

Who this is for

Real-World Uses for
Online Subtitle Tools

From solo creators to enterprise teams, anyone who publishes video content benefits from adding subtitles — here is how different professionals use the tool.

Content Creation

YouTubers & Influencers

Growing a channel means making content accessible to international audiences and viewers who watch silently. AI subtitle generation lets solo creators caption every video in minutes — not hours — maintaining the upload frequency the algorithm rewards.

  • Auto-subtitle every upload
  • Boost watch time with captions
  • Reach international audience
Film & Production

Independent Filmmakers

Festival submissions, streaming pitches, and distributor screeners increasingly require accessible captions. Generating a subtitle track from AI and fine-tuning timing and style takes a fraction of the time of manual captioning services — at zero cost on the free tier.

  • Festival-ready captions
  • Custom cinematic styles
  • Multi-language export
Corporate

Marketing & Comms Teams

Product demo videos, explainer ads, and brand content all need captions for LinkedIn, YouTube, and OTT distribution. Subtitle generation removes the dependency on agency timelines — caption a video in-house in under 5 minutes.

  • Brand-consistent subtitle styles
  • No agency dependency
  • ADA-compliant output
Education

Educators & Course Creators

Section 508, ADA, and institutional accessibility policies mandate captions for instructional video. AI generation achieves the accuracy most educators need for draft review, with manual editing to correct technical terms specific to the discipline.

  • WCAG 2.1 compliant captions
  • Supports ESL students
  • 40% better retention
NGO / Nonprofit

Mission-Driven Organisations

Nonprofits communicate globally with limited budgets. Being able to add subtitles to videos online for free — across 50+ languages with no watermark — is a genuine force-multiplier for organisations whose audiences span multiple continents and languages.

  • 50+ languages for free
  • No budget required
  • Reach underserved communities
Podcasting

Podcasters & Interview Shows

Long-form interview clips repurposed for social media perform significantly better with captions. AI transcription of a 5-minute clip is accurate enough for a clean social media cut with only minor editing — enabling a production workflow no assistant required.

  • Repurpose long-form content
  • Accurate speaker transcription
  • Social-ready in minutes

Global reach

Auto-Subtitle in 50+ Languages
— No Extra Cost

Whisper AI automatically detects the spoken language in your video. You do not need to select a language before generating. All 50+ languages are included in the free tier — no upsell, no language pack to buy.

EnglishHindiSpanishFrenchGermanMandarinJapanesePortugueseArabicKoreanItalianRussianTurkishDutchPolishSwedishNorwegianDanishFinnishGreekHebrewThaiVietnameseIndonesianMalaySwahiliUrduBengaliPunjabiTamilTeluguKannadaMarathiGujaratiRomanianHungarianCzechSlovakUkrainianCatalanCroatianSerbianBulgarianLithuanianLatvianEstonianSlovenianAlbanianMacedonianBosnian+ more

How language detection works

Whisper analyses the first 30 seconds of audio to identify the predominant language. If your video switches languages mid-way (code-switching), the model follows the dominant language for the full transcript. For mixed-language content, we recommend segmenting clips before uploading.

Honest comparison

Scenith vs Other Ways to
Add Subtitles Online Free

Many tools claim to be free — then lock editing, export, or quality behind a paywall, or slap a watermark on every video. Here is an honest comparison.

FeatureScenith (Free)KapwingVEED.IORev.comYouTube Auto-CC
Cost to export✓ Free at 720pWatermark on freeWatermark on free$1.25/minFree
Watermark-free export✓ Yes (720p free)✗ Paid only✗ Paid only✓ Yes✓ (within YT only)
Custom font & colour✓ Full control⚠ Limited free⚠ Limited free✗ No✗ No
Burned-in MP4 export✓ Yes✓ Yes (paid)✓ Yes (paid)✓ Yes✗ No (captions only)
AI accuracy✓ Whisper 95–98%✓ Whisper✓ Proprietary✓ Human 99%⚠ 85–90%
50+ languages✓ All free⚠ Limited⚠ Limited⚠ Extra cost✓ Many
Manual timing edit✓ Yes✓ Yes✓ Yes✓ Yes✗ No
Works without login⚠ Login to process⚠ Login required⚠ Login required✗ Account✓ YT account
Max free resolution720p720p (watermarked)720p (watermarked)AnyN/A (caption file)

The watermark tax

Most "free" video tools export with a watermark unless you pay. Scenith exports 720p videos watermark-free on the free tier — because we believe professional-quality output should not require a subscription for creators just starting out.

Expert guide

The Complete 2026 Guide to
Subtitle Design Best Practices

Typography Rules That Maximise Readability

The wrong font choice can make accurately transcribed subtitles illegible. The rules professionals have refined over decades of broadcast captioning still hold in 2026:

  • Sans-serif only for screen. Serif fonts (Times, Georgia) add detail that degrades at small sizes on compressed video. Use Roboto, Outfit, Montserrat, or similar geometric sans-serifs.
  • Font weight: semi-bold to bold. Regular-weight text disappears against mid-tone backgrounds. Semi-bold (600) is the minimum; bold (700) is safer for mobile.
  • Minimum 24px at 1080p baseline. Scale proportionally: 720p minimum 18px, 4K minimum 48px. Scenith's scale slider handles this automatically.
  • Avoid all-caps for continuous reading. Capitalised text is useful for sound effects ([DOOR SLAMS]) but exhausting for full sentences. It slows reading speed by ~15%.
  • Line height 1.2–1.4×. Cramped line height strains eyes on small screens. Default line spacing of 1.3 is a safe universal baseline.

Colour and Contrast — WCAG in Practice

WCAG 2.1 Success Criterion 1.4.3 requires a contrast ratio of at least 4.5:1between text and background for normal text. For large text (18px+ bold), the minimum drops to 3:1. In practice, the simplest solution that always works is white text with a 2–3px black stroke — this creates the equivalent of a black background wherever the text sits, guaranteeing contrast against any scene.

  • White + black stroke: Universal. Works on every scene. Industry default.
  • Yellow (#FFE000) + no stroke: Film convention. High luminance stands out on dark scenes but can wash on bright scenes.
  • White on 60% black box: Modern social media style. Ensures readability while the box background moves with the text.
  • Avoid pure red or pure green: 8% of men have red-green colour vision deficiency. These colours have low contrast against each other and some backgrounds.

Timing and Reading Speed

The BBC's subtitle specification — one of the most studied in the world — sets a maximum reading speed of 17 words per second for subtitle text, which equates to roughly 180 words per minute. The minimum display duration for any caption segment should be 1.5 seconds, even for very short phrases, to allow viewers time to register the text.

Quick timing rules

Min duration: 1.5 sec · Max duration: 7 sec ·Max chars per line: 42 · Max lines: 2 ·Reading speed: ≤17 words/sec

Positioning: Safe Zones and Context Awareness

The bottom-centre position is standard because it sits below the action area in most shot compositions, leaving actors' faces and on-screen graphics unobstructed. However, it is not always optimal:

  • Documentary / interview: Lower-third name chyrons typically occupy bottom-left. Move subtitles to bottom-centre above the chyron, or use a separate top-centre position for the speaker's name.
  • Vertical video (9:16): The bottom 15% of mobile screens is consumed by platform UI (TikTok's heart/share buttons, Instagram's action bar). Position subtitles at Y: 70–80% to stay clear.
  • Action sequences: Fast-moving scenes often have important subject matter in the lower frame. Temporarily shifting subtitles to the top third helps in these moments.
  • Music videos: Lyrics often work best centred vertically, large, with minimal styling — treating the caption as an integrated design element rather than an overlay.

Breaking Lines Correctly

Poor line breaks are the most common complaint about AI-generated subtitles — not the accuracy, but the way the text wraps. The rules:

  1. Break at grammatical boundaries: after commas, conjunctions, or between subject and predicate.
  2. Never split a verb from its object ("she picked up / the phone" is bad; "she / picked up the phone" is acceptable).
  3. Keep prepositional phrases together: "in the morning" should not wrap between "in" and "the".
  4. Aim for roughly equal line lengths — a "pyramid" shape (shorter top, longer bottom) is preferred by broadcast standards.
  5. Avoid orphaned single words on a line unless they represent a dramatic pause: "He opened the door. He saw / her."

Sound Effect and Non-Speech Descriptions

For truly accessible captions, transcribe important non-speech audio in brackets. This is a legal requirement for WCAG 1.2.2 Level AA compliance and a practical aid for deaf viewers who would otherwise miss essential context:

  • [Phone ringing] — communicates plot context
  • [Tense orchestral music] — sets mood
  • [Crowd applauding] — conveys audience reaction
  • [Whispered] before dialogue — indicates delivery
  • [Background noise] or [Inaudible] — honest acknowledgement of poor audio

What creators say

Trusted by 1 500+ Creators
Worldwide

★★★★★
"I was spending $80/month on a captioning service. Scenith does the same job in literally minutes, for free. The Whisper accuracy on my interview podcast is genuinely impressive."
Marcus OyelaranPodcast Host · Lagos, Nigeria
★★★★★
"I add subtitles to every Reel now — engagement on my cooking videos doubled once the silent-scroll audience could follow along. The style presets saved me so much time."
Priya SharmaFood Creator · Mumbai, India
★★★★★
"We produce compliance training videos at work. Scenith gave us WCAG-compliant captions in under 5 minutes per video — what used to take a contractor a full day."
Daniel WeissL&D Manager · Frankfurt, Germany
★★★★★
"No watermark on the free plan was the dealbreaker for me. I showed the output to a client and they asked which agency we used. That says everything."
Camila FerreiraVideo Marketer · São Paulo, Brazil
★★★★★
"I film in Hindi and add English subtitles for international reach. Scenith nailed the Hindi transcription and the editor made adding English translations straightforward."
Arjun MehtaYouTube Educator · Delhi, India
★★★★★
"The positioning control is what sets this apart. I can subtitle vertical Reels without subtitles disappearing under the Instagram UI — small thing, huge difference."
Sophie ChenShort-Form Creator · Toronto, Canada

No account required to preview

Try it.
It's Free.

Upload a video, generate captions, edit styles, download MP4. Free tier — 720p, no watermark.

Add Subtitles Free — Open Tool

Common questions

Frequently Asked Questions
About Adding Subtitles Online

Can I add subtitles to a video online without downloading software?

Yes — Scenith runs entirely in your browser. Upload your video, generate captions with AI, customise styling, and download the finished MP4 with no desktop application required.

Is the free subtitle generator really free, or is there a catch?

The core workflow — upload, AI generation, editing, 720p export — is genuinely free with no watermark. The "catch" is that higher export resolutions (1080p, 2K, 4K) require a paid plan. You pay to go premium; the free tier is complete.

What video formats can I upload?

All major formats including MP4, MOV, AVI, MKV, WMV, and WebM. Maximum file size is 500 MB per upload. For longer or larger files, consider compressing to H.264 MP4 before uploading.

How accurate is the AI subtitle generation?

Whisper AI achieves 95–98% word accuracy on clear audio recorded in standard conditions. Accuracy drops for heavy background noise, overlapping speakers, thick regional accents, or highly technical vocabulary. Manual editing is quick for any corrections needed.

Can I edit the subtitle text after it is generated?

Yes. Click any caption segment in the timeline list to open the editor. You can correct text, adjust start/end timestamps, change styling, or delete the segment entirely. All changes auto-save before you export.

Do the subtitles get burned into the video permanently?

Yes — the "Process Subtitles" step renders captions permanently into the MP4 (also called "burning in" or "hard-coding"). This ensures subtitles display on every platform, even those without separate caption track support.

Can I add subtitles to a vertical 9:16 video?

Yes. Sceniths position controls let you set exact X/Y coordinates, so you can place subtitles in the centre of the frame rather than the bottom — clearing TikTok's and Instagram's lower UI chrome.

What languages are supported?

Whisper AI supports 50+ languages including English, Hindi, Spanish, French, German, Mandarin, Arabic, Japanese, Portuguese, Korean, Italian, Russian, Turkish, and many more. The language is auto-detected from the audio — no manual selection needed.

Is there a video length limit?

The current free tier supports videos up to 1 minute. Paid plans support longer content. For full feature details and limits, check the pricing page.

Are my uploaded videos kept private?

Videos are stored securely on Scenith servers for processing and are automatically deleted after 24–48 hours. Your content is never shared with third parties or used for training data.

Can I apply the same style to all subtitle segments at once?

Yes. If you make style edits without selecting a specific segment, changes apply globally to all captions. You can also click "Apply to All" for position changes made in the editor panel.

What is the difference between burned-in subtitles and closed captions?

Burned-in (open) subtitles are permanently part of the video image — they always show and cannot be turned off. Closed captions (SRT/VTT files) are separate text tracks that viewers can toggle on or off. Scenith currently produces burned-in subtitles; SRT export is on the roadmap.

More from Scenith

Subtitles are one piece of a complete video production workflow. Scenith's other tools slot in around them.

AI Voice Generator

Generate a professional voiceover for your video before adding subtitles — or create silent captions-only content with AI narration.

Open tool →

Free Image Editor

Design thumbnail images that match your subtitle aesthetic — same brand colours, same font family, consistent look across the content.

Open tool →

Video Tools

Explore Sceniths full suite of browser-based video tools — trim, filter, resize, compress — before or after adding your captions.

Open tool →

Stop waiting. Start captioning.

Upload your first video and get AI-generated subtitles in under 2 minutes — free, no software download, no watermark on 720p.

Open Free Tool