Startseite
Erkunden
Gemini Omni
google/gemini-omni-flash/text-to-video-developer
Gemini Omni Flash Text-to-Video Developer
Text-zu-Video
DEV

Gemini Omni Flash Text-to-Video Developer API by Google

google/gemini-omni-flash/text-to-video-developer
Text-to-video-developer

Gemini Omni Flash is Google's multimodal video generation model. This text-to-video variant generates high-quality cinematic videos from text prompts with support for multiple resolutions, aspect ratios, and controllable duration.

Gemini Omni Flash — Text to Video (Developer)

Model ID: google/gemini-omni-flash/text-to-video-developer

Gemini Omni is Google's multimodal video generation model designed to create high-quality video content from diverse input types. This variant accepts a text prompt only, making it ideal for pure creative generation where you describe the scene entirely through language.


Overview

Gemini Omni brings together Google's deep knowledge of physics, narrative logic, biology, culture, and visual composition to produce contextually coherent videos. Rather than simple clip synthesis, the model reasons about scene dynamics, camera language, and temporal flow to produce results that feel intentional and cinematic.

The developer tier provides direct API access with full control over generation parameters including resolution, aspect ratio, duration, and random seed.


Key Capabilities

  • Rich prompt understanding — Describe subjects, actions, camera movements, lighting, mood, and style in a single prompt of up to 20,000 characters.
  • Multi-resolution output — Generate at 720p, 1080p, or 4K.
  • Flexible aspect ratios — 16:9 landscape or 9:16 portrait.
  • Controllable duration — 4, 6, 8, or 10 seconds per generation.
  • Reproducible results — Set a fixed seed to reproduce or iterate on a specific generation.

Input Parameters

ParameterTypeRequiredDefaultDescription
modelstringYesgoogle/gemini-omni-flash/text-to-video-developerModel identifier
promptstringYesText description of the video. Max 20,000 characters.
durationintegerNo8Video length in seconds. Enum: 4, 6, 8, 10.
aspect_ratiostringNo16:9Output aspect ratio. Enum: 16:9, 9:16.
resolutionstringNo720pOutput resolution. Enum: 720p, 1080p, 4k.
seedintegerNo-1Random seed for reproducibility. -1 uses a random seed.

Use Cases

  • Creative storytelling — Generate cinematic scenes from narrative descriptions.
  • Concept visualization — Quickly visualize ideas, moods, or environments.
  • Storyboard prototyping — Turn scene descriptions into video drafts before full production.
  • Marketing assets — Produce short-form video content from copy briefs.
  • Educational content — Illustrate concepts, processes, or historical scenes through natural language.

Pricing

Pricing is based on output resolution and video duration.

ResolutionFormulaExample (8s)
720p / 1080p$0.2 + duration × $0.1$1
4k$1 + duration × $0.1$1.8

Formula: (resolution == "4k" ? $1 : $0.2) + duration × $0.1

720p and 1080p are identically priced. The 0.2/0.2 / 1 term is a fixed base charge per generation; $0.1 is the per-second rate applied to the requested duration.

Beginnen Sie mit 300+ Modellen,

Alle Modelle erkunden

Join our Discord community

Join the Discord community for the latest model updates, prompts, and support.