GLM LLM Models

GLM is a cutting-edge LLM series by Z.ai (Zhipu AI) featuring GLM-5, GLM-4.7, and GLM-4.6. Engineered for complex systems and long-horizon agentic tasks, GLM-5 outperforms top-tier closed-source models in elite benchmarks like Humanity’s Last Exam and BrowseComp. While GLM-4.7 specializes in reasoning, coding, and real-world intelligent agents, the entire GLM suite is fast, smart, and reliable, making it the ultimate tool for building websites, analyzing data, and delivering instant, high-quality answers for any professional workflow.

주요 모델 탐색

Atlas Cloud는 업계 최고의 최신 크리에이티브 모델을 제공합니다.

NEW

HOT

GLM-5.1 is Z.AI’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while delivering more natural conversational experiences and superior front-end aesthetics.

LLM

GLM 5.1

GLM-5 Turbo is Z.AI’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while delivering more natural conversational experiences and superior front-end aesthetics.

LLM

TURBO

GLM 5 Turbo

GLM-5 is Z.AI’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while delivering more natural conversational experiences and superior front-end aesthetics.

LLM

GLM 5

GLM-4.7 is Z.AI’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while delivering more natural conversational experiences and superior front-end aesthetics.

LLM

GLM 4.7

357B-parameter efficient MoE model from Zhipu AI.

LLM

GLM 4.6

GLM LLM Models의 주요 특징

Atlas Cloud는 업계 최고의 최신 크리에이티브 모델을 제공합니다.

Advanced Reasoning

Tuned for strong logical reasoning, structured analysis, and multi-step problem solving.

Cost-Efficiency

Optimized architectures keep latency and costs under control.

Safety & Governance

Built-in content filters, auditing tools, and policy controls help teams deploy.

Enterprise Reliability

Production-ready SLAs, monitoring, and governance features help teams confidently ship applications.

Chinese–English Excellence

Native-strength Chinese and fluent English support enable high-quality bilingual chat, search, and generation.

Developer-Friendly Ecosystem

Clean APIs, SDKs, and tooling make it easy to integrate, fine-tune, and operate Z.ai across products and platforms.

최고 속도

최저 비용

Model	Description
GLM-5	GLM-5 is Z.ai's flagship LLM featuring a massive 202.75K context window optimized for complex systems and long-horizon agentic tasks. Outperforming elite closed-source models in benchmarks like Humanity’s Last Exam and BrowseComp, it provides robust programming and stable multi-step reasoning at highly competitive baseline pricing.
GLM-4.7	GLM-4.7 is a high-performance LLM with a 202.75K context window specifically engineered for real-world intelligent agents, advanced reasoning, and professional coding. Fast, smart, and reliable, it serves as the ideal engine for building complex websites and automating sophisticated professional workflows with precision.
GLM-4.6	GLM-4.6 is a powerful MoE LLM with a 202.75K context window designed for rapid data analysis and instant, high-fidelity answers. This dependable model excels at high-efficiency tasks like creating professional slides and web content, offering a smart balance of speed and enterprise-grade performance.

GLM LLM Models 새로운 기능 + 쇼케이스

고급 모델과 Atlas Cloud의 GPU 가속 플랫폼을 결합하여 이미지 및 비디오 생성에서 비할 데 없는 속도, 확장성 및 창의적 제어를 제공합니다.

대규모 744B MoE 아키텍처 및 범용 지식 베이스

GLM-5 모델은 오픈 소스 성능 한계를 재정의하기 위해 무려 28.5조 개의 토큰으로 훈련된 7,440억 매개변수의 MoE(Mixture-of-Experts) 아키텍처를 활용합니다. 400억 개의 활성 매개변수를 최적화하여 세계 지식 밀도와 검색 정밀도에서 비약적인 발전을 촉진합니다. 이는 대규모 인지 작업 및 복잡한 데이터 합성을 위한 최고의 기반입니다.

GLM-5를 활용한 획기적인 에이전틱 시스템 엔지니어링

GLM-5는 다단계 추론 환경 전반에 걸친 장기적이고 체계적인 작업 실행을 위해 설계된 고급 에이전트 기능을 도입합니다. 정교한 계획 로직을 핵심 아키텍처에 통합함으로써, 이 모델은 자동화된 소프트웨어 개발 및 전문 법률 초안 작성 중에 탁월한 안정성을 유지합니다. 이는 극한의 정밀도와 장기적인 일관성을 요구하는 자율 워크플로를 위한 결정적인 엔진 역할을 합니다。

Slime 비동기 강화 학습 및 논리 진화

GLM-5는 혁신적인 "Slime" 비동기 강화 학습 인프라를 활용하여 사후 학습(post-training) 효율성과 논리적 엄밀성을 혁신합니다. 이 획기적인 기술은 코드 생성 품질과 알고리즘 추론 능력을 크게 향상시켜 이전 벤치마크를 능가하며 최고 수준의 오픈 소스 모델로서의 입지를 확고히 했습니다. 이는 풀스택 개발 및 고차원적 구조적 문제 해결을 위한 궁극적인 솔루션입니다。

GLM LLM Models으로 할 수 있는 것

이 모델 패밀리로 구축할 수 있는 실용적인 사용 사례와 워크플로를 발견하세요 — 콘텐츠 제작과 자동화부터 프로덕션급 애플리케이션까지.

GLM-5를 활용한 포괄적인 리포지토리 인텔리전스

GLM-5 API는 개발자가 전체 코드베이스를 수집하여 심층 논리 분석 및 구조적 리팩토링을 수행할 수 있도록 지원합니다. 의존성 그래프를 매핑하고 복잡한 비동기 데이터 흐름을 추적함으로써 에지 케이스(edge-case)의 경쟁 상태(race condition)와 숨겨진 기술 부채를 식별합니다. 신속한 팀 온보딩, 자동화된 PR 검토, 확장 가능한 고성능 마이크로서비스 아키텍처 유지 관리에 완벽합니다.

GLM-5를 활용한 즉각적인 풀스택 프로토타이핑

바이브 주도 개발(Vibe-driven development)을 위해 GLM-5는 추상적인 시각적 목업과 단편적인 메모를 배포 가능한 React 또는 Next.js 컴포넌트로 변환합니다. 보일러플레이트 생성, Tailwind CSS 스타일링 및 상태 관리와 같은 번거로운 작업을 처리하는 동시에 페이지 간 일관성을 보장합니다. 1인 창업자, UX 실험가, 그리고 기능적인 MVP를 빛의 속도로 출시하려는 경우에 이상적입니다。

GLM-5를 활용한 자율 워크플로우 오케스트레이션

GLM-5는 다단계 추론과 실시간 도구 통합이 필요한 장기 연구 과제 관리에 탁월합니다. 맥락을 잃지 않으면서 다중 소스 시장 데이터를 독립적으로 종합하고, 규정을 준수하는 법률 요약을 작성하며, 복잡한 크로스 플랫폼 일정을 자동화할 수 있습니다. 이 사용 사례는 프로젝트 관리자, 법률 전문가 및 체계적인 운영을 위해 고신뢰성 디지털 에이전트가 필요한 모든 사람에게 적합합니다。

모델 비교

다양한 프로바이더의 모델 비교 — 성능, 가격, 고유한 강점을 비교하여 현명한 선택을 하세요.

Model	Context	Max Output	Input	Positioning
GLM-5	202.75K	202.75K	Text	Flagship Foundation Model
GLM-4.7	202.75K	202.75K	Text	Flagship Foundation Model
GLM-4.6	202.75K	202.75K	Text	Efficient MoE Model
DeepSeek V3.2	163.84K	163.84K	Text	Flagship General
MiniMax-M2.5	204.8K	196.6K	Text	SOTA Agentic Coding

Atlas Cloud에서 GLM LLM Models 사용하는 방법

몇 분 만에 시작하세요 — 간단한 단계를 따라 Atlas Cloud 플랫폼을 통해 모델을 통합하고 배포하세요.

Atlas Cloud 계정 생성

atlascloud.ai에서 가입하고 인증을 완료하세요. 신규 사용자는 플랫폼 탐색과 모델 테스트를 위한 무료 크레딧을 받습니다.

Atlas Cloud에서 GLM LLM Models을(를) 사용하는 이유

고급 GLM LLM Models 모델과 Atlas Cloud의 GPU 가속 플랫폼을 결합하여 비교할 수 없는 성능, 확장성 및 개발자 경험을 제공합니다.

성능 및 유연성

낮은 지연 시간:
실시간 추론을 위한 GPU 최적화 추론.

통합 API:
하나의 통합으로 GLM LLM Models, GPT, Gemini 및 DeepSeek를 실행합니다.

투명한 가격:
Serverless 옵션을 포함한 예측 가능한 token당 청구.

엔터프라이즈 및 확장

개발자 경험:
SDK, 분석, 파인튜닝 도구 및 템플릿.

신뢰성:
99.99% 가동 시간, RBAC 및 규정 준수 로깅.

보안 및 규정 준수:
SOC 2 Type II, HIPAA 준수, 미국 내 데이터 주권.

GLM LLM Models에 대한 자주 묻는 질문

28.5T 토큰의 훈련 데이터와 뛰어난 벤치마크 결과를 바탕으로 GLM-5는 널리 '오픈 소스의 한계점(ceiling)'으로 간주됩니다. 용량과 논리 면에서 세계 최고 수준의 상용 모델과 경쟁하거나 이를 능가하며, 글로벌 개발자 생태계를 위한 강력하고 고성능의 기반을 제공합니다。

HLE는 AI가 전문가 수준의 인간 지식과 추론 능력을 갖추었는지 테스트하기 위해 설계된 고난도 벤치마크입니다. GLM-5가 최고 점수를 획득한 것은 첨단 과학과 복잡한 논리에 대한 숙달도가 선도적인 폐쇄형(closed-source) 모델의 수준에 도달했거나 이를 능가했음을 의미합니다.

BrowseComp는 "에이전트(Agentic)" 역량에 대한 결정적인 리더보드로, 실제 웹 환경에서의 복잡한 작업 계획 및 실행에 중점을 둡니다. 가장 높은 점수는 브라우저를 자율적으로 탐색하고 페이지 간 정보를 통합하는 GLM-5의 능력을 나타내며, 이는 최고의 Web Agent 엔진임을 의미합니다.

이 아키텍처는 7,440억 개의 파라미터로 구성된 거대한 "지식 기반"을 제공하면서도, 추론(inference) 시에는 약 400억 개만 활성화합니다. 개발자에게 있어 이는 Llama-3 405B와 같은 덴스(dense) 모델을 능가하는 세계적 수준의 지식 밀도와 추론 깊이를 더 낮은 지연 시간(latency)과 비용으로 제공함을 의미합니다.

총 파라미터는 모델의 "지식 용량"을 나타내며, 744B는 방대한 세계적 사실과 전문가적 논리를 저장할 수 있게 합니다. 활성 파라미터는 추론(inference) 시 사용되는 "연산 능력"을 나타냅니다. MoE 아키텍처 덕분에 GLM-5는 40B의 연산량만으로 744B 수준의 지능을 제공하며, 거대한 지식 기반과 고속의 비용 효율적인 성능 간의 균형을 유지합니다.

사전 학습 데이터의 양은 모델의 "시야의 폭"을 결정합니다. 28.5T 토큰은 전 세계적으로 가장 큰 데이터셋 중 하나로(Llama-3의 약 2배), 희귀 언어, 전문 학술 논문 및 방대한 고품질 코드를 포함합니다. 이는 GLM-5가 복잡한 롱테일 쿼리, 다문화적 뉘앙스, 저수준 시스템 프로그래밍을 처리할 때 탁월한 정확성과 일반화 능력을 갖추도록 보장합니다.

더 많은 패밀리 탐색

Happy Horse 1.0

HappyHorse-1.0 is a unified multimodal AI video generation model that climbed to the top of the Artificial Analysis Video Arena blind-test leaderboard for both text-to-video and image-to-video generation. CNBC Alibaba Group confirmed ownership of HappyHorse, developed under its Alibaba Token Hub (ATH) business unit, where it leads benchmarks outperforming ByteDance's Seedance 2.0 and others. Caixin Global Led by Zhang Di — the former VP of Kuaishou who architected Kling AI — the 15-billion parameter model generates 1080p video with synchronized audio in a single pass using a unified transformer architecture that bypasses the multi-stage pipelines used by every major competitor.

패밀리 보기

Seedance 2.0 Models

Seedance 2.0（by Bytedance） is a multimodal video generation model that redefines "controllable creation," moving beyond the limitations of text or start/end frames. It supports quad-modal inputs—text, image, video, and audio—and introduces an industry-leading "Universal Reference" system. By precisely replicating the composition, camera movement, and character actions from reference assets, Seedance 2.0 solves critical issues with character consistency and physical coherence, empowering creators to act as true "directors" with deep control over their output.

패밀리 보기

GPT Image 2 Models

GPT Image 2 is a state-of-the-art multimodal foundation model engineered for exceptional text-to-image generation with unprecedented photorealism and creative versatility. Developed by OpenAI as the evolution of the DALL-E lineage, it transforms detailed natural language descriptions into hyper-realistic imagery at up to 4K resolution. With proprietary "Neural Rendering Engine" technology for precise visual control, GPT Image 2 delivers studio-quality results with accurate anatomy, lighting, and composition—making it the premier AI tool for professional creators, enterprises, and developers demanding production-ready visual assets.

패밀리 보기

Wan2.7 Models

Launching this March, Wan2.7 is the latest powerhouse in the Qwen ecosystem, delivering a massive upgrade in visual fidelity, audio synchronization, and motion consistency over version 2.6. This all-in-one AI video generator supports advanced features like first-and-last frame control, 3x3 grid synthesis, and instruction-based video editing. Outperforming competitors like Jimeng, Wan2.7 offers superior flexibility with support for real-person image inputs, up to five video references, and 1080P high-definition outputs spanning 2 to 15 seconds, making it the premier choice for professional digital storytelling and high-end content marketing.

패밀리 보기

Veo3.1 Models

Google DeepMind’s Veo 3.1 represents a paradigm shift in AI video generation, empowering creators with director-level narrative control and cinematic-grade audio quality that seamlessly integrates with its enhanced visual realism. By bridging the gap between imaginative concepts and photorealistic execution, this advanced model offers a transformative solution for a wide range of application scenarios, from professional filmmaking and high-end advertising to immersive digital content creation.

패밀리 보기

ERNIE Image Models

ERNIE-Image is an open-weight text-to-image model developed by the ERNIE-Image Team at Baidu, built on a single-stream Diffusion Transformer (DiT) with 8B parameters and paired with a lightweight Prompt Enhancer that rewrites short prompts into richer, more structured descriptions before passing them to the diffusion backbone. NYU Shanghai RITS Released on April 15, 2026 under the Apache 2.0 license, it transforms natural language descriptions into detailed imagery with particular strength in text rendering and structured layout generation. ERNIE-Image is designed not only for strong visual quality, but for controllability in practical generation scenarios where accurate content realization matters as much as aesthetics — making it well-suited for commercial posters, comics, multi-panel layouts, and other content creation tasks that require both visual quality and precise control.

패밀리 보기

GPT Image Models

The GPT Image Family is OpenAI's latest suite of multimodal image generation and editing models, built on the powerful GPT architecture. This family includes three tiers — GPT Image-1, GPT Image-1.5, and GPT Image-1 Mini — each available in both Text-to-Image and Image-to-Image variants. Combining GPT's world-class language understanding with DALL·E-class visual synthesis, these models deliver exceptional prompt adherence, photorealistic rendering, and creative versatility across illustration, photography, design, and visualization tasks. The series offers flexible pricing and quality tiers to match any workflow — from rapid prototyping and high-volume content production to professional-grade final deliverables. Whether you need ultra-fast iterations at minimal cost or maximum quality for brand campaigns, the GPT Image Family has a solution tailored to your needs.

패밀리 보기

Nano Banana2 Models

Nano Banana 2 (by Google), is a generative image model that perfectly balances lightning-fast rendering with exceptional visual quality. With an improved price-performance ratio, it achieves breakthrough micro-detail depiction, accurate native text rendering, and complex physical structure reconstruction. It serves as a highly efficient, commercial-grade visual production tool for developers, marketing teams, and content creators.

패밀리 보기

Seedream5.0 Models

Seedream 5.0, developed by ByteDance’s Jimeng AI, is a high-performance AI image generation model that integrates real-time search with intelligent reasoning. Purpose-built for time-sensitive content and complex visual logic, it excels at professional infographics, architectural design, and UI assistance. By blending live web insights with creative precision, Seedream 5.0 empowers commercial branding and marketing with a seamless, logic-driven workflow that turns sophisticated data into stunning, high-fidelity visuals.

패밀리 보기

Kling3.0 Models

Kuaishou’s flagship video generation suite, Kling 3.0, features two powerhouse models—Kling 3.0 (Upgraded from Kling 2.6) and Kling 3.0 Omni (Kling O3, Upgraded from Kling O1)—both offering high-fidelity native audio integration. While Kling 3.0 excels in intelligent cinematic storytelling, multilingual lip-syncing, and precision text rendering, Kling O3 sets a new standard for professional-grade subject consistency by supporting custom subjects and voice clones derived from video or image inputs. Together, these models provide a comprehensive solution tailored for cinematic narratives, global marketing campaigns, social media content, and digital skit production.

패밀리 보기

GLM LLM Models

패밀리 보기

Open AI Model Families

Explore OpenAI’s language and video models on Atlas Cloud: ChatGPT for advanced reasoning and interaction, and Sora-2 for physics-aware video generation.

패밀리 보기

Open AI Model Families

Explore OpenAI’s language and video models on Atlas Cloud: ChatGPT for advanced reasoning and interaction, and Sora-2 for physics-aware video generation.

패밀리 보기

300개 이상의 모델로 시작하세요,

모든 모델 탐색

GLM LLM Models

주요 모델 탐색

GLM 5.1

GLM 5 Turbo

GLM 5

GLM 4.7

GLM 4.6

GLM LLM Models의 주요 특징

Advanced Reasoning

Cost-Efficiency

Safety & Governance

Enterprise Reliability

Chinese–English Excellence

Developer-Friendly Ecosystem

최고 속도

GLM LLM Models 새로운 기능 + 쇼케이스

대규모 744B MoE 아키텍처 및 범용 지식 베이스

GLM-5를 활용한 획기적인 에이전틱 시스템 엔지니어링

Slime 비동기 강화 학습 및 논리 진화

GLM LLM Models으로 할 수 있는 것

GLM-5를 활용한 포괄적인 리포지토리 인텔리전스

GLM-5를 활용한 즉각적인 풀스택 프로토타이핑

GLM-5를 활용한 자율 워크플로우 오케스트레이션

모델 비교

Atlas Cloud에서 GLM LLM Models 사용하는 방법

Atlas Cloud 계정 생성

Atlas Cloud에서 GLM LLM Models을(를) 사용하는 이유

성능 및 유연성

엔터프라이즈 및 확장

GLM LLM Models에 대한 자주 묻는 질문

더 많은 패밀리 탐색

Happy Horse 1.0

Seedance 2.0 Models

GPT Image 2 Models

Wan2.7 Models

Veo3.1 Models

ERNIE Image Models

GPT Image Models

Nano Banana2 Models

Seedream5.0 Models

Kling3.0 Models

GLM LLM Models

Open AI Model Families

Happy Horse 1.0

Seedance 2.0 Models

GPT Image 2 Models

Wan2.7 Models

Veo3.1 Models

ERNIE Image Models

GPT Image Models

Nano Banana2 Models

Seedream5.0 Models

Kling3.0 Models

GLM LLM Models

Open AI Model Families

300개 이상의 모델로 시작하세요,

Join our Discord community