Multimodal video creation, reimagined
Seedance 2.0 accepts four types of input — images, videos, audio, and text — enabling cinematic generation with precise reference control. Set the visual style with an image, define motion with a video clip, establish the mood with audio, and direct it all through natural language.
From choreography to action sequences — explore what Seedance 2.0 can generate across different creative scenarios.
Rhythm + camera choreography
High-tempo movement with beat-aware cuts and character continuity under fast choreography.
Dialogue-driven short film
Emotion-driven scene pacing with close-up performance and natural conversational timing.
Audio-visual narrative
Stylized visual motifs synchronized with music cadence and atmosphere shifts.
Action choreography
Complex martial arts movement, directional camera transitions, and dramatic fight choreography.
Large-scale VFX
Wide-scene composition, destruction sequences, and continuity in effects-heavy narrative.
Creative scene flow
Smooth scene transitions, creative visual effects, and narrative continuity across shots.
Choose your entry mode, bind references with @, and iterate without starting over — all on one canvas.
Use First & Last Frames for anchor-driven shots, or All-in-One Reference for multimodal composition.
Upload assets that define composition, motion rhythm, and tone first — then add secondary references.
Assign clear roles — first frame, camera language, character identity, BGM — to each material in your prompt.
Use continuation and timeline-style edits to preserve narrative momentum across versions.
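As a concrete sketch of the role-assignment tip above, the snippet below builds a prompt that binds each uploaded material to a role with an @ reference. Seedance 2.0's actual binding syntax has not been published, so the handles and phrasing here are purely illustrative.

```python
# Hypothetical prompt showing how roles could be bound to uploaded
# materials with @ references. The handles (@img1, @vid1, @img2, @aud1)
# are illustrative placeholders, not documented Seedance 2.0 syntax.
prompt = (
    "@img1 is the first frame, @vid1 defines the camera language, "
    "@img2 fixes the character identity, @aud1 is the BGM. "
    "A dancer spins under neon light, cutting on each downbeat."
)
print(prompt)
```

The point is the structure, not the syntax: each reference is given exactly one clear role before the free-form scene description begins.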
Supported interaction modes
First & Last Frames
Best for anchor-frame generation with structured opening and ending control.
All-in-One Reference
Designed for multimodal mixing and precise asset orchestration in a single prompt.
A comprehensive upgrade in generation quality, controllability, and creative expression.
Combine text, images, videos, and audio in a single generation — each modality enriches the final output.
Upload reference images for composition, videos for camera language and motion, and audio for rhythm — the model understands and reproduces them.
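One way to picture a four-modal generation call is as a single request that carries all four input types at once. The payload below is a minimal sketch; the field names and file names are assumptions, since Seedance 2.0's public API has not yet been released.

```python
# Hypothetical four-modal request payload. Field names and file names
# are illustrative only; no public Seedance 2.0 API schema exists yet.
request = {
    "text": "Slow push-in on the dancer; cut on each downbeat.",
    "images": ["style_ref.png"],      # composition and visual style
    "videos": ["camera_move.mp4"],    # camera language and motion
    "audio": ["bgm.mp3"],             # rhythm and mood
}
```

Each modality carries a distinct role, mirroring the division of labor described above: images for composition, video for camera language, audio for rhythm, and text to direct the whole.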
Significantly smoother physical dynamics, more stable movement, and stronger scene-level visual coherence across frames.
Extend existing footage and build multi-shot narrative sequences — pick up where the last clip left off.
Support for character replacement, clip insertion, and timeline-style iteration — create and refine in the same workflow.
Optimized for choreography, emotional dialogue, music videos, action scenes, and VFX-driven micro stories.
Input limits, generation parameters, and interaction controls at a glance.
Seedance 2.0 is not yet available — join the waitlist to be notified at launch.
Pricing and availability will be announced after integration and quality validation are complete.
Seedance 2.0 is ByteDance's next-generation video model that supports four-modal input — images, videos, audio, and text — for controllable, cinematic video generation.