atlascloud/wan-2.2/image-to-video-lora

gambar-ke-video

Wan 2.2 Image-to-Video Lora API by Alibaba

atlascloud/wan-2.2/image-to-video-lora

Image-to-video-lora

Open and Advanced Large-Scale Video Generative Models.

INPUT

Prompt *

Prompt Negatif

Gambar *

Anda dapat drag & drop file atau klik untuk mengunggah

MAX:1

Resolusi

Durasi

Loras

MAKS: 3

High noise loras

MAKS: 3

Low noise loras

MAKS: 3

Seed

Wan 2.2 architecture splits into High Noise and Low Noise. Verify your LoRA type and place it in the correct field — mixing them up causes corrupted output or failure.
Civitai links require your API token appended: download_url&token=YOUR_TOKEN. Get it from Civitai Account Settings → API Keys.
Hugging Face links must point to a specific file (e.g. .safetensors), not a repository directory.

OUTPUT

Menunggu

Video yang dihasilkan akan muncul di sini

Konfigurasikan pengaturan Anda dan klik Jalankan untuk memulai

Permintaan Anda akan dikenakan biaya $0.04 per eksekusi. Dengan $10 Anda dapat menjalankan model ini sekitar 250 kali.

Berikut yang dapat Anda lakukan selanjutnya:

Seedance 2.0 Kling v3 Vidu Wan2.7

Parameter

Contoh kode
import requests
import time

# Step 1: Start video generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "atlascloud/wan-2.2/image-to-video-lora",  # Required. model name
    "image": "example_value",  # Required. The first-frame image URL for generating the video
    "prompt": "A beautiful sunset over the ocean with gentle waves",  # Required. The positive prompt for the generation
    "negative_prompt": "",  # The negative prompt to avoid certain content in the generation
    "resolution": "480p",  # The resolution of the generated video. options: 480p | 720p
    "duration": 5,  # The duration of the generated video in seconds. (min: 3, max: 10)
    "loras": [],  # List of LoRAs to apply (max 3)
    "high_noise_loras": [],  # List of high noise LoRAs to apply (max 3)
    "low_noise_loras": [],  # List of low noise LoRAs to apply (max 3)
    "seed": -1,  # The random seed to use for the generation
}

generate_response = requests.post(generate_url, headers=headers, json=data)
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers={"Authorization": "Bearer $ATLASCLOUD_API_KEY"})
        result = response.json()

        if result["data"]["status"] in ["completed", "succeeded"]:
            print("Generated video:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif result["data"]["status"] == "failed":
            raise Exception(result["data"]["error"] or "Generation failed")
        else:
            # Still processing, wait 2 seconds
            time.sleep(2)

video_url = check_status()

Instalasi

Instal paket dependensi yang diperlukan.

pip install requests

Autentikasi

Semua permintaan API memerlukan autentikasi melalui API key. Anda bisa mendapatkan API key dari dasbor Atlas Cloud.

export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP Headers

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

Jaga keamanan API key Anda

Jangan pernah mengekspos API key Anda di kode sisi klien atau repositori publik. Gunakan variabel lingkungan atau proxy backend sebagai gantinya.

Kirim permintaan

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

Kirim Permintaan

Kirim permintaan pembuatan asinkron. API mengembalikan prediction ID yang dapat Anda gunakan untuk memeriksa status dan mengambil hasil.

POST/api/v1/model/generateVideo

Isi Permintaan

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}

data = {
    "model": "atlascloud/wan-2.2/image-to-video-lora",
    "prompt": "A beautiful sunset over the ocean with gentle waves"
}

response = requests.post(url, headers=headers, json=data)
result = response.json()

print(f"Prediction ID: {result['data']['id']}")
print(f"Status: {result['data']['status']}")

Respons

{
  "code": 200,
  "data": {
    "id": "pred_abc123",
    "status": "processing",
    "model": "model-name",
    "created_at": "2025-01-01T00:00:00Z"
  }
}

Periksa Status

Polling prediction endpoint untuk memeriksa status permintaan Anda saat ini.

GET/api/v1/model/prediction/{prediction_id}

Contoh Polling

import requests
import time

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

while True:
    response = requests.get(url, headers=headers)
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status == "failed":
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    time.sleep(3)

Nilai Status

processingPermintaan masih diproses.

completedPembuatan selesai. Output tersedia.

succeededPembuatan berhasil. Output tersedia.

failedPembuatan gagal. Periksa field error.

Respons Selesai

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.mp4"
    ],
    "metrics": {
      "predict_time": 45.2
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}

Unggah File

Unggah file ke penyimpanan Atlas Cloud dan dapatkan URL yang dapat Anda gunakan dalam permintaan API Anda. Gunakan multipart/form-data untuk mengunggah.

POST/api/v1/model/uploadMedia

Contoh Unggah

import requests

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

with open("image.png", "rb") as f:
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")

Respons

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}

Input Schema

Parameter berikut diterima di isi permintaan.

Total: 10Wajib: 3Opsional: 7

Wan 2.2 architecture splits into High Noise and Low Noise. Verify your LoRA type and place it in the correct field — mixing them up causes corrupted output or failure.
Civitai links require your API token appended: download_url&token=YOUR_TOKEN. Get it from Civitai Account Settings → API Keys.
Hugging Face links must point to a specific file (e.g. .safetensors), not a repository directory.

modelstringrequired

model name

Default: "atlascloud/wan-2.2/image-to-video-lora"

imagestringrequired

The first-frame image URL for generating the video.

promptstringrequired

The positive prompt for the generation.

negative_promptstring

The negative prompt to avoid certain content in the generation.

Default: ""

resolutionstring

The resolution of the generated video.

Default: "480p"

480p720p

durationinteger

The duration of the generated video in seconds.

Default: 5Min: 3Max: 10

lorasarray[object]

List of LoRAs to apply (max 3). Module is auto-inferred from the safetensors filename.

Max items: 3

pathstringrequired

URL or the path to the LoRA weights (safetensors format).

scalenumber

The scale of the LoRA weights. This is used to scale the LoRA weight before merging it with the base model.

Default: 1Min: 0Max: 4

high_noise_lorasarray[object]

List of high noise LoRAs to apply (max 3). Loaded into the transformer (high noise stage).

Max items: 3

pathstringrequired

URL or the path to the LoRA weights (safetensors format).

scalenumber

The scale of the LoRA weights. This is used to scale the LoRA weight before merging it with the base model.

Default: 1Min: 0Max: 4

low_noise_lorasarray[object]

List of low noise LoRAs to apply (max 3). Loaded into transformer_2 (low noise stage).

Max items: 3

pathstringrequired

URL or the path to the LoRA weights (safetensors format).

scalenumber

The scale of the LoRA weights. This is used to scale the LoRA weight before merging it with the base model.

Default: 1Min: 0Max: 4

seedinteger

The random seed to use for the generation. -1 means a random seed will be used.

Default: -1

Contoh Isi Permintaan

{
  "model": "atlascloud/wan-2.2/image-to-video-lora",
  "image": "example_image",
  "prompt": "A beautiful landscape",
  "negative_prompt": "",
  "resolution": "480p",
  "duration": 5,
  "seed": -1
}

Output Schema

API mengembalikan respons prediction dengan URL output yang dihasilkan.

created_atstring

ISO timestamp of when the request was created.

idstring

Unique identifier for the prediction.

modelstring

Model ID used for the prediction.

outputsarray

Array of URLs to the generated content (empty when status is not completed).

statusstring

Status of the task: created, processing, completed, or failed.

Contoh Respons

{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.mp4"
  ],
  "metrics": {
    "predict_time": 45.2
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}

Atlas Cloud Skills

Atlas Cloud Skills mengintegrasikan 400+ model AI langsung ke asisten pengkodean AI Anda. Satu perintah untuk menginstal, lalu gunakan bahasa alami untuk menghasilkan gambar, video, dan mengobrol dengan LLM.

Klien yang Didukung

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ klien yang didukung

Instalasi

npx skills add AtlasCloudAI/atlas-cloud-skills

Atur API Key

Dapatkan API key dari dasbor Atlas Cloud dan atur sebagai variabel lingkungan.

export ATLASCLOUD_API_KEY="your-api-key-here"

Kemampuan

Setelah diinstal, Anda dapat menggunakan bahasa alami di asisten AI Anda untuk mengakses semua model Atlas Cloud.

Pembuatan GambarBuat gambar dengan model seperti Nano Banana 2, Z-Image, dan lainnya.

Pembuatan VideoBuat video dari teks atau gambar dengan Kling, Vidu, Veo, dll.

Obrolan LLMMengobrol dengan Qwen, DeepSeek, dan model bahasa besar lainnya.

Unggah MediaUnggah file lokal untuk pengeditan gambar dan alur kerja gambar-ke-video.

Pelajari lebih lanjut

github.com/AtlasCloudAI/atlas-cloud-skills

MCP Server

Atlas Cloud MCP Server menghubungkan IDE Anda dengan 400+ model AI melalui Model Context Protocol. Berfungsi dengan klien apa pun yang kompatibel dengan MCP.

Klien yang Didukung

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ klien yang didukung

Instalasi

npx -y atlascloud-mcp

Konfigurasi

Tambahkan konfigurasi berikut ke file pengaturan MCP di IDE Anda.

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

Alat yang Tersedia

atlas_generate_imageBuat gambar dari prompt teks.

atlas_generate_videoBuat video dari teks atau gambar.

atlas_chatMengobrol dengan model bahasa besar.

atlas_list_modelsJelajahi 400+ model AI yang tersedia.

atlas_quick_generatePembuatan konten satu langkah dengan pemilihan model terbaik otomatis.

atlas_upload_mediaUnggah file lokal untuk alur kerja API.

Pelajari lebih lanjut

github.com/AtlasCloudAI/mcp-server

Schema API

{
  "info": {
    "title": "AtlasCloud API",
    "version": "1.0.0",
    "description": "The AtlasCloud API."
  },
  "openapi": "3.0.0",
  "paths": {
    "/api/v1/model/generateVideo": {
      "post": {
        "requestBody": {
          "content": {
            "application/json": {
              "schema": {
                "$ref": "#/components/schemas/Input"
              }
            }
          },
          "required": true
        },
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "The request status."
          }
        }
      },
      "x-api-name": "model_run"
    },
    "/api/v1/model/result/{request_id}": {
      "get": {
        "parameters": [
          {
            "in": "path",
            "name": "request_id",
            "required": true,
            "schema": {
              "description": "Request ID",
              "type": "string"
            }
          }
        ],
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "Result of the request."
          }
        }
      },
      "x-api-name": "model_result"
    }
  },
  "components": {
    "schemas": {
      "Input": {
        "properties": {
          "model": {
            "type": "string",
            "description": "model name",
            "default": "atlascloud/wan-2.2/image-to-video-lora"
          },
          "image": {
            "description": "The first-frame image URL for generating the video.",
            "type": "string"
          },
          "prompt": {
            "description": "The positive prompt for the generation.",
            "type": "string"
          },
          "negative_prompt": {
            "description": "The negative prompt to avoid certain content in the generation.",
            "type": "string",
            "default": ""
          },
          "resolution": {
            "default": "480p",
            "description": "The resolution of the generated video.",
            "enum": [
              "480p",
              "720p"
            ],
            "type": "string"
          },
          "duration": {
            "default": 5,
            "description": "The duration of the generated video in seconds.",
            "type": "integer",
            "minimum": 3,
            "maximum": 10,
            "x-ui-component": "slider"
          },
          "loras": {
            "description": "List of LoRAs to apply (max 3). Module is auto-inferred from the safetensors filename.",
            "items": {
              "$ref": "#/components/schemas/LoraWeight"
            },
            "maxItems": 3,
            "type": "array",
            "x-ui-component": "loras"
          },
          "high_noise_loras": {
            "description": "List of high noise LoRAs to apply (max 3). Loaded into the transformer (high noise stage).",
            "items": {
              "$ref": "#/components/schemas/LoraWeight"
            },
            "maxItems": 3,
            "type": "array",
            "x-ui-component": "loras"
          },
          "low_noise_loras": {
            "description": "List of low noise LoRAs to apply (max 3). Loaded into transformer_2 (low noise stage).",
            "items": {
              "$ref": "#/components/schemas/LoraWeight"
            },
            "maxItems": 3,
            "type": "array",
            "x-ui-component": "loras"
          },
          "seed": {
            "default": -1,
            "description": "The random seed to use for the generation. -1 means a random seed will be used.",
            "type": "integer"
          }
        },
        "required": [
          "model",
          "image",
          "prompt"
        ],
        "type": "object",
        "x-order-properties": [
          "image",
          "prompt",
          "negative_prompt",
          "resolution",
          "duration",
          "loras",
          "high_noise_loras",
          "low_noise_loras",
          "seed"
        ]
      },
      "LoraWeight": {
        "properties": {
          "path": {
            "description": "URL or the path to the LoRA weights (safetensors format).",
            "type": "string"
          },
          "scale": {
            "default": 1,
            "description": "The scale of the LoRA weights. This is used to scale the LoRA weight before merging it with the base model.",
            "maximum": 4,
            "minimum": 0,
            "type": "number"
          }
        },
        "required": [
          "path"
        ],
        "type": "object",
        "x-order-properties": [
          "path",
          "scale"
        ]
      },
      "PredictionResponse": {
        "properties": {
          "created_at": {
            "description": "ISO timestamp of when the request was created.",
            "format": "date-time",
            "type": "string"
          },
          "has_nsfw_contents": {
            "description": "Array of boolean values indicating NSFW detection for each output.",
            "items": {
              "type": "boolean"
            },
            "type": "array"
          },
          "id": {
            "description": "Unique identifier for the prediction.",
            "type": "string"
          },
          "model": {
            "description": "Model ID used for the prediction.",
            "type": "string"
          },
          "outputs": {
            "description": "Array of URLs to the generated content (empty when status is not completed).",
            "items": {
              "type": "object"
            },
            "type": "array"
          },
          "status": {
            "description": "Status of the task: created, processing, completed, or failed.",
            "type": "string"
          },
          "urls": {
            "description": "Object containing related API endpoints.",
            "type": "object"
          }
        },
        "type": "object"
      }
    }
  },
  "servers": [
    {
      "url": "https://api.atlascloud.ai"
    }
  ]
}

Template Prompt untuk LLM

# atlascloud/wan-2.2/image-to-video-lora

> Open and Advanced Large-Scale Video Generative Models.


## Overview

- **Submit endpoint (POST)**: `https://api.atlascloud.ai/api/v1/model/generateVideo` — start an async generation; returns a `prediction_id`
- **Poll endpoint (GET)**: `https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}` — poll this until the prediction finishes
- **Model ID**: `atlascloud/wan-2.2/image-to-video-lora`


## API Information

This model can be used via our HTTP API or more conveniently via our client libraries.
See the input and output schema below, as well as the usage examples.


### Input Schema

The API accepts the following input parameters:

- **`image`** (`string`, _required_):
  The first-frame image URL for generating the video.

- **`prompt`** (`string`, _required_):
  The positive prompt for the generation.

- **`negative_prompt`** (`string`, _optional_):
  The negative prompt to avoid certain content in the generation.
  - Default: `""`

- **`resolution`** (`string`, _optional_):
  The resolution of the generated video.
  - Default: `"480p"`
  - Options: "480p", "720p"

- **`duration`** (`integer`, _optional_):
  The duration of the generated video in seconds.
  - Default: `5`
  - Min: 3
  - Max: 10

- **`loras`** (`array`, _optional_):
  List of LoRAs to apply (max 3). Module is auto-inferred from the safetensors filename.
  - Max items: 3

- **`high_noise_loras`** (`array`, _optional_):
  List of high noise LoRAs to apply (max 3). Loaded into the transformer (high noise stage).
  - Max items: 3

- **`low_noise_loras`** (`array`, _optional_):
  List of low noise LoRAs to apply (max 3). Loaded into transformer_2 (low noise stage).
  - Max items: 3

- **`seed`** (`integer`, _optional_):
  The random seed to use for the generation. -1 means a random seed will be used.
  - Default: `-1`



**Required Parameters Example**:

```json
{
  "model": "atlascloud/wan-2.2/image-to-video-lora",
  "image": "",
  "prompt": ""
}
```


**Full Example**:

```json
{
  "model": "atlascloud/wan-2.2/image-to-video-lora",
  "image": "",
  "prompt": "",
  "negative_prompt": "",
  "resolution": "480p",
  "duration": 5,
  "loras": [],
  "high_noise_loras": [],
  "low_noise_loras": [],
  "seed": -1
}
```


### Output Schema

The API returns the following output format:


- **`created_at`** (`string`, _optional_):
  ISO timestamp of when the request was created.

- **`has_nsfw_contents`** (`array[boolean]`, _optional_):
  Array of boolean values indicating NSFW detection for each output.

- **`id`** (`string`, _optional_):
  Unique identifier for the prediction.

- **`model`** (`string`, _optional_):
  Model ID used for the prediction.

- **`outputs`** (`array[object]`, _optional_):
  Array of URLs to the generated content (empty when status is not completed).

- **`status`** (`string`, _optional_):
  Status of the task: created, processing, completed, or failed.

- **`urls`** (`object`, _optional_):
  Object containing related API endpoints.



**Example Response**:

```json
{
  "created_at": "",
  "has_nsfw_contents": [],
  "id": "",
  "model": "",
  "outputs": [],
  "status": "",
  "urls": {}
}
```


## Usage Examples

### cURL

```bash
# Step 1: Start generation (async)
curl -X POST "https://api.atlascloud.ai/api/v1/model/generateVideo" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "atlascloud/wan-2.2/image-to-video-lora",
  "image": "",
  "prompt": "",
  "negative_prompt": "",
  "resolution": "480p",
  "duration": 5,
  "loras": [],
  "high_noise_loras": [],
  "low_noise_loras": [],
  "seed": -1
}'

# Response will contain: {"code": 200, "data": {"id": "prediction_id", "status": "processing"}}

# Step 2: Poll for result (replace {prediction_id} with the id returned above)
curl -X GET "https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY"

# Keep polling until status is "completed", "succeeded" or "failed"
# When completed, outputs will contain the generated content URL(s)
```

## Additional Resources

### Documentation

- [Model Playground](https://www.atlascloud.ai/models/atlascloud/wan-2.2/image-to-video-lora)

A group of intelligent rats sits around a wooden table in a cozy room, intently playing a card game. The rats hold their cards carefully, some nibbling on a stick while others eye the table with focused expressions. The dim light from a nearby candle flickers, casting shadows across the walls adorned with old portraits. The sound of cards shuffling and soft squeaks fill the air as the rats eagerly plot their next move, creating an atmosphere of quiet tension and excitement.

Memuat...

Wan 2.2: Open and Advanced Large-Scale Video Generative Model by Alibaba Wanxiang

Model Card Overview

Field	Description
Model Name	Wan 2.2 Image-to-Video LoRA
Developed by	Alibaba Tongyi Wanxiang Lab
Model Type	Image-to-Video Generation with LoRA Support
Resolution	480p, 720p (via VSR upscaling)
Frame Rate	30 fps
Duration	3–10 seconds
Related Links	GitHub: https://github.com/Wan-Video/Wan2.2, Hugging Face: https://huggingface.co/Wan-AI/Wan2.2-I2V-A14B, Paper (arXiv): https://arxiv.org/abs/2503.20314

Introduction

Wan 2.2 is a significant upgrade to the Wan series of foundational video models, designed to push the boundaries of generative AI in video creation. This image-to-video LoRA variant takes a reference image as the first frame and generates a high-quality video, with full support for custom LoRA weights to fine-tune the generation style, motion characteristics, or subject identity.

The model generates videos at 480p natively and supports 720p output via Video Super Resolution (VSR) upscaling, delivering smooth 30 fps playback at both resolutions.

Key Features & Innovations

Effective MoE Architecture: Wan 2.2 integrates a Mixture-of-Experts (MoE) architecture into the video diffusion model. Specialized expert models handle different stages of the denoising process, increasing model capacity without raising computational costs. The model has 27B total parameters with only 14B active during any given step.
Cinematic-Level Aesthetics: Trained on a meticulously curated dataset with detailed labels for cinematic properties like lighting, composition, and color tone. This allows generation of videos with precise and controllable artistic styles, achieving a professional, cinematic look.
Complex Motion Generation: Trained on a vastly expanded dataset (+65.6% more images and +83.2% more videos compared to Wan 2.1), Wan 2.2 demonstrates superior ability to generate complex and realistic motion with enhanced generalization across motions, semantics, and aesthetics.
Custom LoRA Support: This variant supports user-provided LoRA weights for fine-grained style and motion control. Three separate LoRA input channels are available:
- high_noise_loras — Applied to the high-noise expert (transformer stage), influencing overall structure and layout.
- low_noise_loras — Applied to the low-noise expert (transformer_2 stage), influencing fine details and textures.
- loras — General-purpose LoRA input where the module is auto-inferred from the safetensors filename.
VSR-Enhanced Output: All output videos are delivered at 30 fps. When 720p resolution is selected, the model leverages Video Super Resolution to upscale from a 480p base generation, preserving fine details while achieving higher resolution output.

Model Architecture

The architecture is built upon the Diffusion Transformer (DiT) paradigm with a Mixture-of-Experts (MoE) framework:

High-Noise Expert: Activated during initial denoising stages, establishing overall structure and layout.
Low-Noise Expert: Activated in later stages, refining details, textures, and fine-grained motion.

The transition between experts is dynamically determined by the signal-to-noise ratio (SNR) during generation. Custom LoRA weights can be applied to each expert independently, enabling precise control over different aspects of the generation pipeline.

Intended Use & Applications

Stylized Video Production: Generating videos with custom visual styles by applying LoRA weights trained on specific aesthetic data.
Character & Subject Consistency: Using identity-preserving LoRAs to maintain consistent characters across multiple video generations.
Cinematic Video Production: Generating high-fidelity video clips from reference images for short films, advertisements, or social media content.
Creative Experimentation: Combining multiple LoRAs to explore novel visual effects and motion styles.
Academic Research: Serving as a powerful foundation model for researchers exploring LoRA-based fine-tuning techniques in video generation.

Jelajahi Model Serupa

NEW

referensi-ke-video

HappyHorse-1.1 Reference-to-video

Generates videos from one to nine reference images and a text prompt, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.1 Text-to-video

Generates videos from text prompts with HappyHorse 1.1, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.1 Image-to-video

Animates a first-frame image into video with optional prompt guidance, 720P or 1080P output, and durations from 3 to 15 seconds.

HappyHorse-1.0 Image-to-video

Animates a first-frame image into video with optional prompt guidance, 720P or 1080P output, and durations from 3 to 15 seconds.

HappyHorse-1.0 Text-to-video

Generates videos from text prompts with HappyHorse 1.0, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.0 Reference-to-video

Generates videos from one to nine reference images and a text prompt, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.0 Video-edit

Edits an input video with text instructions and optional reference images, supporting 720P or 1080P output.

Wan-2.7 Image-to-video

Animates images into videos with first-frame, first-and-last-frame, video continuation, and audio-driven modes.

Wan-2.7 Text-to-video

Generates videos from text prompts with multi-shot narrative, audio generation, and sound-image synchronization.

Wan-2.7 Video-edit

Edits videos using text instructions, reference images, and style transfer with multi-modal input support.

Wan-2.7 Reference-to-video

Generates character-driven videos from reference images and videos, with multi-subject and voice-cloning support.