atlascloud/wan-2.2/image-to-video-lora

이미지를 비디오로

Wan 2.2 Image-to-Video Lora API by Alibaba

atlascloud/wan-2.2/image-to-video-lora

Image-to-video-lora

Open and Advanced Large-Scale Video Generative Models.

입력

프롬프트 *

네거티브 프롬프트

이미지 *

파일을 드래그 앤 드롭하거나 클릭하여 업로드할 수 있습니다

MAX:1

해상도

길이

Loras

최대: 3

High noise loras

최대: 3

Low noise loras

최대: 3

시드

Wan 2.2 architecture splits into High Noise and Low Noise. Verify your LoRA type and place it in the correct field — mixing them up causes corrupted output or failure.
Civitai links require your API token appended: download_url&token=YOUR_TOKEN. Get it from Civitai Account Settings → API Keys.
Hugging Face links must point to a specific file (e.g. .safetensors), not a repository directory.

출력

대기

생성된 비디오가 여기에 표시됩니다

설정을 구성하고 실행을 클릭하여 시작하세요

요청당 $0.04가 소요됩니다. $10로 이 모델을 약 250번 실행할 수 있습니다.

다음으로 할 수 있는 작업:

Seedance 2.0 Kling v3 Vidu Wan2.7

파라미터

코드 예시
import requests
import time

# Step 1: Start video generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "atlascloud/wan-2.2/image-to-video-lora",  # Required. model name
    "image": "example_value",  # Required. The first-frame image URL for generating the video
    "prompt": "A beautiful sunset over the ocean with gentle waves",  # Required. The positive prompt for the generation
    "negative_prompt": "",  # The negative prompt to avoid certain content in the generation
    "resolution": "480p",  # The resolution of the generated video. options: 480p | 720p
    "duration": 5,  # The duration of the generated video in seconds. (min: 3, max: 10)
    "loras": [],  # List of LoRAs to apply (max 3)
    "high_noise_loras": [],  # List of high noise LoRAs to apply (max 3)
    "low_noise_loras": [],  # List of low noise LoRAs to apply (max 3)
    "seed": -1,  # The random seed to use for the generation
}

generate_response = requests.post(generate_url, headers=headers, json=data)
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers={"Authorization": "Bearer $ATLASCLOUD_API_KEY"})
        result = response.json()

        if result["data"]["status"] in ["completed", "succeeded"]:
            print("Generated video:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif result["data"]["status"] == "failed":
            raise Exception(result["data"]["error"] or "Generation failed")
        else:
            # Still processing, wait 2 seconds
            time.sleep(2)

video_url = check_status()

설치

사용하는 언어에 필요한 패키지를 설치하세요.

pip install requests

인증

모든 API 요청에는 API 키를 통한 인증이 필요합니다. Atlas Cloud 대시보드에서 API 키를 받을 수 있습니다.

export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP 헤더

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

API 키를 안전하게 보관하세요

클라이언트 측 코드나 공개 저장소에 API 키를 노출하지 마세요. 대신 환경 변수 또는 백엔드 프록시를 사용하세요.

요청 제출

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

요청 제출

비동기 생성 요청을 제출합니다. API는 상태 확인 및 결과 조회에 사용할 수 있는 예측 ID를 반환합니다.

POST/api/v1/model/generateVideo

요청 본문

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateVideo"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}

data = {
    "model": "atlascloud/wan-2.2/image-to-video-lora",
    "prompt": "A beautiful sunset over the ocean with gentle waves"
}

response = requests.post(url, headers=headers, json=data)
result = response.json()

print(f"Prediction ID: {result['data']['id']}")
print(f"Status: {result['data']['status']}")

응답

{
  "code": 200,
  "data": {
    "id": "pred_abc123",
    "status": "processing",
    "model": "model-name",
    "created_at": "2025-01-01T00:00:00Z"
  }
}

상태 확인

예측 엔드포인트를 폴링하여 요청의 현재 상태를 확인합니다.

GET/api/v1/model/prediction/{prediction_id}

폴링 예시

import requests
import time

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

while True:
    response = requests.get(url, headers=headers)
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status == "failed":
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    time.sleep(3)

상태 값

processing요청이 아직 처리 중입니다.

completed생성이 완료되었습니다. 출력을 사용할 수 있습니다.

succeeded생성이 성공했습니다. 출력을 사용할 수 있습니다.

failed생성에 실패했습니다. 오류 필드를 확인하세요.

완료 응답

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.mp4"
    ],
    "metrics": {
      "predict_time": 45.2
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}

파일 업로드

Atlas Cloud 스토리지에 파일을 업로드하고 API 요청에 사용할 수 있는 URL을 받습니다. multipart/form-data를 사용하여 업로드합니다.

POST/api/v1/model/uploadMedia

업로드 예시

import requests

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

with open("image.png", "rb") as f:
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")

응답

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}

입력 Schema

다음 파라미터를 요청 본문에서 사용할 수 있습니다.

전체: 10필수: 3선택: 7

Wan 2.2 architecture splits into High Noise and Low Noise. Verify your LoRA type and place it in the correct field — mixing them up causes corrupted output or failure.
Civitai links require your API token appended: download_url&token=YOUR_TOKEN. Get it from Civitai Account Settings → API Keys.
Hugging Face links must point to a specific file (e.g. .safetensors), not a repository directory.

modelstringrequired

model name

Default: "atlascloud/wan-2.2/image-to-video-lora"

imagestringrequired

The first-frame image URL for generating the video.

promptstringrequired

The positive prompt for the generation.

negative_promptstring

The negative prompt to avoid certain content in the generation.

Default: ""

resolutionstring

The resolution of the generated video.

Default: "480p"

480p720p

durationinteger

The duration of the generated video in seconds.

Default: 5Min: 3Max: 10

lorasarray[object]

List of LoRAs to apply (max 3). Module is auto-inferred from the safetensors filename.

Max items: 3

pathstringrequired

URL or the path to the LoRA weights (safetensors format).

scalenumber

The scale of the LoRA weights. This is used to scale the LoRA weight before merging it with the base model.

Default: 1Min: 0Max: 4

high_noise_lorasarray[object]

List of high noise LoRAs to apply (max 3). Loaded into the transformer (high noise stage).

Max items: 3

pathstringrequired

URL or the path to the LoRA weights (safetensors format).

scalenumber

The scale of the LoRA weights. This is used to scale the LoRA weight before merging it with the base model.

Default: 1Min: 0Max: 4

low_noise_lorasarray[object]

List of low noise LoRAs to apply (max 3). Loaded into transformer_2 (low noise stage).

Max items: 3

pathstringrequired

URL or the path to the LoRA weights (safetensors format).

scalenumber

The scale of the LoRA weights. This is used to scale the LoRA weight before merging it with the base model.

Default: 1Min: 0Max: 4

seedinteger

The random seed to use for the generation. -1 means a random seed will be used.

Default: -1

요청 본문 예시

{
  "model": "atlascloud/wan-2.2/image-to-video-lora",
  "image": "example_image",
  "prompt": "A beautiful landscape",
  "negative_prompt": "",
  "resolution": "480p",
  "duration": 5,
  "seed": -1
}

출력 Schema

API는 생성된 출력 URL이 포함된 예측 응답을 반환합니다.

created_atstring

ISO timestamp of when the request was created.

idstring

Unique identifier for the prediction.

modelstring

Model ID used for the prediction.

outputsarray

Array of URLs to the generated content (empty when status is not completed).

statusstring

Status of the task: created, processing, completed, or failed.

응답 예시

{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.mp4"
  ],
  "metrics": {
    "predict_time": 45.2
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}

Atlas Cloud Skills

Atlas Cloud Skills는 400개 이상의 AI 모델을 AI 코딩 어시스턴트에 직접 통합합니다. 한 번의 명령으로 설치하고 자연어로 이미지, 동영상 생성 및 LLM과 대화할 수 있습니다.

지원 클라이언트

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ 지원 클라이언트

설치

npx skills add AtlasCloudAI/atlas-cloud-skills

API 키 설정

Atlas Cloud 대시보드에서 API 키를 받아 환경 변수로 설정하세요.

export ATLASCLOUD_API_KEY="your-api-key-here"

기능

설치 후 AI 어시스턴트에서 자연어를 사용하여 모든 Atlas Cloud 모델에 접근할 수 있습니다.

이미지 생성Nano Banana 2, Z-Image 등의 모델로 이미지를 생성합니다.

동영상 제작Kling, Vidu, Veo 등으로 텍스트나 이미지에서 동영상을 만듭니다.

LLM 채팅Qwen, DeepSeek 등 대규모 언어 모델과 대화합니다.

미디어 업로드이미지 편집 및 이미지-동영상 변환 워크플로우를 위해 로컬 파일을 업로드합니다.

더 알아보기

github.com/AtlasCloudAI/atlas-cloud-skills

MCP Server

Atlas Cloud MCP Server는 Model Context Protocol을 통해 IDE와 400개 이상의 AI 모델을 연결합니다. MCP 호환 클라이언트에서 사용할 수 있습니다.

지원 클라이언트

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ 지원 클라이언트

설치

npx -y atlascloud-mcp

설정

다음 설정을 IDE의 MCP 설정 파일에 추가하세요.

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

사용 가능한 도구

atlas_generate_image텍스트 프롬프트로 이미지를 생성합니다.

atlas_generate_video텍스트나 이미지로 동영상을 만듭니다.

atlas_chat대규모 언어 모델과 대화합니다.

atlas_list_models400개 이상의 사용 가능한 AI 모델을 탐색합니다.

atlas_quick_generate최적 모델을 자동 선택하여 한 번에 콘텐츠를 생성합니다.

atlas_upload_mediaAPI 워크플로우를 위해 로컬 파일을 업로드합니다.

더 알아보기

github.com/AtlasCloudAI/mcp-server

API 스키마

{
  "info": {
    "title": "AtlasCloud API",
    "version": "1.0.0",
    "description": "The AtlasCloud API."
  },
  "openapi": "3.0.0",
  "paths": {
    "/api/v1/model/generateVideo": {
      "post": {
        "requestBody": {
          "content": {
            "application/json": {
              "schema": {
                "$ref": "#/components/schemas/Input"
              }
            }
          },
          "required": true
        },
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "The request status."
          }
        }
      },
      "x-api-name": "model_run"
    },
    "/api/v1/model/result/{request_id}": {
      "get": {
        "parameters": [
          {
            "in": "path",
            "name": "request_id",
            "required": true,
            "schema": {
              "description": "Request ID",
              "type": "string"
            }
          }
        ],
        "responses": {
          "200": {
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/PredictionResponse"
                }
              }
            },
            "description": "Result of the request."
          }
        }
      },
      "x-api-name": "model_result"
    }
  },
  "components": {
    "schemas": {
      "Input": {
        "properties": {
          "model": {
            "type": "string",
            "description": "model name",
            "default": "atlascloud/wan-2.2/image-to-video-lora"
          },
          "image": {
            "description": "The first-frame image URL for generating the video.",
            "type": "string"
          },
          "prompt": {
            "description": "The positive prompt for the generation.",
            "type": "string"
          },
          "negative_prompt": {
            "description": "The negative prompt to avoid certain content in the generation.",
            "type": "string",
            "default": ""
          },
          "resolution": {
            "default": "480p",
            "description": "The resolution of the generated video.",
            "enum": [
              "480p",
              "720p"
            ],
            "type": "string"
          },
          "duration": {
            "default": 5,
            "description": "The duration of the generated video in seconds.",
            "type": "integer",
            "minimum": 3,
            "maximum": 10,
            "x-ui-component": "slider"
          },
          "loras": {
            "description": "List of LoRAs to apply (max 3). Module is auto-inferred from the safetensors filename.",
            "items": {
              "$ref": "#/components/schemas/LoraWeight"
            },
            "maxItems": 3,
            "type": "array",
            "x-ui-component": "loras"
          },
          "high_noise_loras": {
            "description": "List of high noise LoRAs to apply (max 3). Loaded into the transformer (high noise stage).",
            "items": {
              "$ref": "#/components/schemas/LoraWeight"
            },
            "maxItems": 3,
            "type": "array",
            "x-ui-component": "loras"
          },
          "low_noise_loras": {
            "description": "List of low noise LoRAs to apply (max 3). Loaded into transformer_2 (low noise stage).",
            "items": {
              "$ref": "#/components/schemas/LoraWeight"
            },
            "maxItems": 3,
            "type": "array",
            "x-ui-component": "loras"
          },
          "seed": {
            "default": -1,
            "description": "The random seed to use for the generation. -1 means a random seed will be used.",
            "type": "integer"
          }
        },
        "required": [
          "model",
          "image",
          "prompt"
        ],
        "type": "object",
        "x-order-properties": [
          "image",
          "prompt",
          "negative_prompt",
          "resolution",
          "duration",
          "loras",
          "high_noise_loras",
          "low_noise_loras",
          "seed"
        ]
      },
      "LoraWeight": {
        "properties": {
          "path": {
            "description": "URL or the path to the LoRA weights (safetensors format).",
            "type": "string"
          },
          "scale": {
            "default": 1,
            "description": "The scale of the LoRA weights. This is used to scale the LoRA weight before merging it with the base model.",
            "maximum": 4,
            "minimum": 0,
            "type": "number"
          }
        },
        "required": [
          "path"
        ],
        "type": "object",
        "x-order-properties": [
          "path",
          "scale"
        ]
      },
      "PredictionResponse": {
        "properties": {
          "created_at": {
            "description": "ISO timestamp of when the request was created.",
            "format": "date-time",
            "type": "string"
          },
          "has_nsfw_contents": {
            "description": "Array of boolean values indicating NSFW detection for each output.",
            "items": {
              "type": "boolean"
            },
            "type": "array"
          },
          "id": {
            "description": "Unique identifier for the prediction.",
            "type": "string"
          },
          "model": {
            "description": "Model ID used for the prediction.",
            "type": "string"
          },
          "outputs": {
            "description": "Array of URLs to the generated content (empty when status is not completed).",
            "items": {
              "type": "object"
            },
            "type": "array"
          },
          "status": {
            "description": "Status of the task: created, processing, completed, or failed.",
            "type": "string"
          },
          "urls": {
            "description": "Object containing related API endpoints.",
            "type": "object"
          }
        },
        "type": "object"
      }
    }
  },
  "servers": [
    {
      "url": "https://api.atlascloud.ai"
    }
  ]
}

LLM 친화적 프롬프트 템플릿

# atlascloud/wan-2.2/image-to-video-lora

> Open and Advanced Large-Scale Video Generative Models.


## Overview

- **Submit endpoint (POST)**: `https://api.atlascloud.ai/api/v1/model/generateVideo` — start an async generation; returns a `prediction_id`
- **Poll endpoint (GET)**: `https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}` — poll this until the prediction finishes
- **Model ID**: `atlascloud/wan-2.2/image-to-video-lora`


## API Information

This model can be used via our HTTP API or more conveniently via our client libraries.
See the input and output schema below, as well as the usage examples.


### Input Schema

The API accepts the following input parameters:

- **`image`** (`string`, _required_):
  The first-frame image URL for generating the video.

- **`prompt`** (`string`, _required_):
  The positive prompt for the generation.

- **`negative_prompt`** (`string`, _optional_):
  The negative prompt to avoid certain content in the generation.
  - Default: `""`

- **`resolution`** (`string`, _optional_):
  The resolution of the generated video.
  - Default: `"480p"`
  - Options: "480p", "720p"

- **`duration`** (`integer`, _optional_):
  The duration of the generated video in seconds.
  - Default: `5`
  - Min: 3
  - Max: 10

- **`loras`** (`array`, _optional_):
  List of LoRAs to apply (max 3). Module is auto-inferred from the safetensors filename.
  - Max items: 3

- **`high_noise_loras`** (`array`, _optional_):
  List of high noise LoRAs to apply (max 3). Loaded into the transformer (high noise stage).
  - Max items: 3

- **`low_noise_loras`** (`array`, _optional_):
  List of low noise LoRAs to apply (max 3). Loaded into transformer_2 (low noise stage).
  - Max items: 3

- **`seed`** (`integer`, _optional_):
  The random seed to use for the generation. -1 means a random seed will be used.
  - Default: `-1`



**Required Parameters Example**:

```json
{
  "model": "atlascloud/wan-2.2/image-to-video-lora",
  "image": "",
  "prompt": ""
}
```


**Full Example**:

```json
{
  "model": "atlascloud/wan-2.2/image-to-video-lora",
  "image": "",
  "prompt": "",
  "negative_prompt": "",
  "resolution": "480p",
  "duration": 5,
  "loras": [],
  "high_noise_loras": [],
  "low_noise_loras": [],
  "seed": -1
}
```


### Output Schema

The API returns the following output format:


- **`created_at`** (`string`, _optional_):
  ISO timestamp of when the request was created.

- **`has_nsfw_contents`** (`array[boolean]`, _optional_):
  Array of boolean values indicating NSFW detection for each output.

- **`id`** (`string`, _optional_):
  Unique identifier for the prediction.

- **`model`** (`string`, _optional_):
  Model ID used for the prediction.

- **`outputs`** (`array[object]`, _optional_):
  Array of URLs to the generated content (empty when status is not completed).

- **`status`** (`string`, _optional_):
  Status of the task: created, processing, completed, or failed.

- **`urls`** (`object`, _optional_):
  Object containing related API endpoints.



**Example Response**:

```json
{
  "created_at": "",
  "has_nsfw_contents": [],
  "id": "",
  "model": "",
  "outputs": [],
  "status": "",
  "urls": {}
}
```


## Usage Examples

### cURL

```bash
# Step 1: Start generation (async)
curl -X POST "https://api.atlascloud.ai/api/v1/model/generateVideo" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "atlascloud/wan-2.2/image-to-video-lora",
  "image": "",
  "prompt": "",
  "negative_prompt": "",
  "resolution": "480p",
  "duration": 5,
  "loras": [],
  "high_noise_loras": [],
  "low_noise_loras": [],
  "seed": -1
}'

# Response will contain: {"code": 200, "data": {"id": "prediction_id", "status": "processing"}}

# Step 2: Poll for result (replace {prediction_id} with the id returned above)
curl -X GET "https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY"

# Keep polling until status is "completed", "succeeded" or "failed"
# When completed, outputs will contain the generated content URL(s)
```

## Additional Resources

### Documentation

- [Model Playground](https://www.atlascloud.ai/models/atlascloud/wan-2.2/image-to-video-lora)

A group of intelligent rats sits around a wooden table in a cozy room, intently playing a card game. The rats hold their cards carefully, some nibbling on a stick while others eye the table with focused expressions. The dim light from a nearby candle flickers, casting shadows across the walls adorned with old portraits. The sound of cards shuffling and soft squeaks fill the air as the rats eagerly plot their next move, creating an atmosphere of quiet tension and excitement.

로드 중...

Wan 2.2: Open and Advanced Large-Scale Video Generative Model by Alibaba Wanxiang

Model Card Overview

Field	Description
Model Name	Wan 2.2 Image-to-Video LoRA
Developed by	Alibaba Tongyi Wanxiang Lab
Model Type	Image-to-Video Generation with LoRA Support
Resolution	480p, 720p (via VSR upscaling)
Frame Rate	30 fps
Duration	3–10 seconds
Related Links	GitHub: https://github.com/Wan-Video/Wan2.2, Hugging Face: https://huggingface.co/Wan-AI/Wan2.2-I2V-A14B, Paper (arXiv): https://arxiv.org/abs/2503.20314

Introduction

Wan 2.2 is a significant upgrade to the Wan series of foundational video models, designed to push the boundaries of generative AI in video creation. This image-to-video LoRA variant takes a reference image as the first frame and generates a high-quality video, with full support for custom LoRA weights to fine-tune the generation style, motion characteristics, or subject identity.

The model generates videos at 480p natively and supports 720p output via Video Super Resolution (VSR) upscaling, delivering smooth 30 fps playback at both resolutions.

Key Features & Innovations

Effective MoE Architecture: Wan 2.2 integrates a Mixture-of-Experts (MoE) architecture into the video diffusion model. Specialized expert models handle different stages of the denoising process, increasing model capacity without raising computational costs. The model has 27B total parameters with only 14B active during any given step.
Cinematic-Level Aesthetics: Trained on a meticulously curated dataset with detailed labels for cinematic properties like lighting, composition, and color tone. This allows generation of videos with precise and controllable artistic styles, achieving a professional, cinematic look.
Complex Motion Generation: Trained on a vastly expanded dataset (+65.6% more images and +83.2% more videos compared to Wan 2.1), Wan 2.2 demonstrates superior ability to generate complex and realistic motion with enhanced generalization across motions, semantics, and aesthetics.
Custom LoRA Support: This variant supports user-provided LoRA weights for fine-grained style and motion control. Three separate LoRA input channels are available:
- high_noise_loras — Applied to the high-noise expert (transformer stage), influencing overall structure and layout.
- low_noise_loras — Applied to the low-noise expert (transformer_2 stage), influencing fine details and textures.
- loras — General-purpose LoRA input where the module is auto-inferred from the safetensors filename.
VSR-Enhanced Output: All output videos are delivered at 30 fps. When 720p resolution is selected, the model leverages Video Super Resolution to upscale from a 480p base generation, preserving fine details while achieving higher resolution output.

Model Architecture

The architecture is built upon the Diffusion Transformer (DiT) paradigm with a Mixture-of-Experts (MoE) framework:

High-Noise Expert: Activated during initial denoising stages, establishing overall structure and layout.
Low-Noise Expert: Activated in later stages, refining details, textures, and fine-grained motion.

The transition between experts is dynamically determined by the signal-to-noise ratio (SNR) during generation. Custom LoRA weights can be applied to each expert independently, enabling precise control over different aspects of the generation pipeline.

Intended Use & Applications

Stylized Video Production: Generating videos with custom visual styles by applying LoRA weights trained on specific aesthetic data.
Character & Subject Consistency: Using identity-preserving LoRAs to maintain consistent characters across multiple video generations.
Cinematic Video Production: Generating high-fidelity video clips from reference images for short films, advertisements, or social media content.
Creative Experimentation: Combining multiple LoRAs to explore novel visual effects and motion styles.
Academic Research: Serving as a powerful foundation model for researchers exploring LoRA-based fine-tuning techniques in video generation.

유사한 모델 탐색

NEW

참조를 비디오로

HappyHorse-1.1 Reference-to-video

Generates videos from one to nine reference images and a text prompt, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.1 Text-to-video

Generates videos from text prompts with HappyHorse 1.1, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.1 Image-to-video

Animates a first-frame image into video with optional prompt guidance, 720P or 1080P output, and durations from 3 to 15 seconds.

HappyHorse-1.0 Image-to-video

Animates a first-frame image into video with optional prompt guidance, 720P or 1080P output, and durations from 3 to 15 seconds.

HappyHorse-1.0 Text-to-video

Generates videos from text prompts with HappyHorse 1.0, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.0 Reference-to-video

Generates videos from one to nine reference images and a text prompt, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.0 Video-edit

Edits an input video with text instructions and optional reference images, supporting 720P or 1080P output.

Wan-2.7 Image-to-video

Animates images into videos with first-frame, first-and-last-frame, video continuation, and audio-driven modes.

Wan-2.7 Text-to-video

Generates videos from text prompts with multi-shot narrative, audio generation, and sound-image synchronization.

Wan-2.7 Video-edit

Edits videos using text instructions, reference images, and style transfer with multi-modal input support.

Wan-2.7 Reference-to-video

Generates character-driven videos from reference images and videos, with multi-subject and voice-cloning support.