Welcome to your central AI toolbox. Designed for effortless browsing and quick deployment, this page brings together our entire range of AI offerings, from powerful large language models (LLMs) to state-of-the-art image and video generators. Find and use the right engine for your workflow at a glance.

Generates videos from text prompts with HappyHorse 1.1, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

Animates a first-frame image into video with optional prompt guidance, 720P or 1080P output, and durations from 3 to 15 seconds.

Generates videos from one to nine reference images and a text prompt, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.
Agent-oriented model built for complex reasoning, tool use, and autonomous task execution.
Powerful coding model for programming, debugging, and AI developer workflows.

Kling V3.0 Turbo Image-to-Video transforms static images into dynamic cinematic videos using MVL technology. Supports first/last frame control and audio generation.

Kling V3.0 Turbo Text-to-Video generates dynamic cinematic videos from text prompts using MVL technology. Supports first/last frame control and audio generation.

Kling Omni Video O3 (4K) Image-to-Video transforms static images into dynamic cinematic videos using MVL technology. Supports first/last frame control and audio generation.

Kling Omni Video O3 (4K) is Kuaishou's advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Generates high-quality videos from text prompts with natural motion and audio generation support.

Microsoft's fast, cost-optimized text-to-image generation model, creating high-quality images at lower cost using the same diffusion-based architecture as MAI-Image-2.5.

Microsoft's flagship image-to-image editing model, enabling precise, controllable edits to existing images through natural language instructions.

Microsoft's flagship text-to-image generation model, designed to create high-quality, visually rich images from natural language prompts.

Midjourney automatically removes the background from an input image, returning one transparent-background result.

Midjourney retexture changes the artistic style of an input image while preserving its composition, returning four restyled results.

Midjourney V8.1 blends two to five input images into four fused results, with an optional guiding prompt and native 2K HD.

Midjourney V8.1 re-imagines an input image guided by a text prompt, returning four variations. Supports native 2K HD, style reference, and aspect-ratio / stylize / chaos / weird controls.

ByteDance Seed3D 2.0 — generates a textured, PBR-shaded 3D model (glb/obj/usd/usdz) from a single input image. Returns a downloadable .zip archive containing the 3D file.
Specialized coding model optimized for software development, code generation, debugging, refactoring, and developer workflows.
Advanced conversational AI model optimized for natural dialogue, knowledge exploration, reasoning, and interactive chat experiences.

Midjourney V8.1 animates an input image into four 5-second videos at 480p or 720p.

Midjourney V8.1 generates four images from a text prompt, with optional native 2K HD, a style reference, and aspect-ratio / stylize / chaos / weird controls.
Anthropic's most capable model, built for advanced reasoning, complex workflows, deep analysis, and high-quality content generation.
Fast and cost-efficient multimodal model designed for high-throughput applications, real-time interactions, and everyday AI tasks.
DeepSeek V4 Pro is a state-of-the-art large language model combining efficient sparse attention, strong reasoning, and integrated agent capabilities for robust long-context understanding and versatile AI applications.

xAI TTS v1 is a high-fidelity text-to-speech model that converts text into natural, expressive speech with sub-second latency, supporting 20 languages and 80+ voices with fine-grained delivery control.
DeepSeek V4 Flash is a state-of-the-art large language model combining efficient sparse attention, strong reasoning, and integrated agent capabilities for robust long-context understanding and versatile AI applications.
No description available.

Tencent Hunyuan 3D Rapid (Express) — fast lightweight 3D mesh generation from a single image, with optional PBR materials. Outputs GLB/OBJ/USDZ/FBX/STL/MP4.

Tencent Hunyuan 3D Rapid (Express) — fast lightweight 3D mesh generation from a text prompt, with optional PBR materials. Outputs GLB/OBJ/USDZ/FBX/STL/MP4.

Tencent Hunyuan 3D Pro (v3.1) — high-quality textured 3D mesh generation from a single image, with optional PBR materials and custom face count. Outputs GLB/OBJ/USDZ/FBX/STL.

Tencent Hunyuan 3D Pro (v3.1) — high-quality textured 3D mesh generation from a text prompt, with optional PBR materials and custom face count. Outputs GLB/OBJ/USDZ/FBX/STL.
Enhanced model for reasoning, coding, and productivity.
The latest Qwen reasoning model.
Versatile model for chat and productivity workflows.
Professional-grade model built for advanced workloads, complex analysis, and enterprise AI applications.
Developer-focused model specialized in coding agents, repository understanding, and software engineering.
Ultra-efficient model focused on lightweight AI tasks, rapid inference, and large-scale deployment.
Small yet capable model designed for edge scenarios, automation, and cost-sensitive services.
Next-generation assistant model with improved instruction following and deeper contextual understanding.
High-speed model engineered for instant responses, real-time interaction, and massive request workloads.
Versatile foundation model providing reliable conversation, knowledge understanding, and content creation.

Google's advanced AI-powered video-to-image generation model, designed to generate high-quality static images from video clips combined with text instructions.

Google's advanced AI-powered video-to-image generation model, designed to generate high-quality static images from video clips combined with text instructions.

xAI Grok Imagine Video v1.5 animates a starting frame image with natural-language motion prompts at 480p or 720p.

xAI Grok Imagine generates polished visuals from natural-language prompts at 1K or 2K resolution, with 14 aspect ratios.

xAI Grok Imagine edits one or more reference images with natural-language instructions at 1K or 2K resolution. Supports single image and multi-image (<IMAGE_0>, <IMAGE_1>) reference editing.

Generates videos from text prompts with HappyHorse 1.0, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

Animates a first-frame image into video with optional prompt guidance, 720P or 1080P output, and durations from 3 to 15 seconds.