GLM 4.5 Air

GLM-4.5-Air is an efficient 106B-parameter model offering 128K context and hybrid reasoning for cost-conscious deployments.

GLM-4.5-Air is a highly efficient 106B total (12B active) parameter model that offers a long 128K context window and hybrid reasoning capabilities tuned for budget-sensitive deployments. Its architecture is optimized to preserve reasoning depth while aggressively controlling memory and compute footprint, making it attractive for large-scale API workloads and SaaS integrations. The extended context window allows it to operate over large document sets, multi-step business processes, or long-running agent sessions.

image of technology-driven data analysis scene

Explore other AI Models