The Decoder February 26, 2026 NEUTRAL

New Qwen 3.5 models push open AI toward lower-cost use

Alibaba has expanded the Qwen 3.5 series with four models: Qwen3.5-Flash, Qwen3.5-35B-A3B, Qwen3.5-122B-A10B, and Qwen3.5-27B. The models support text, images, and video as input, generate text as output, and are available under Apache License 2.0.

WTF Index NEUTRAL

◄ Terminator 1 Idiocracy 0 ►

This is mostly a routine open-model release focused on lower-cost deployment, with only a mild capability increase signal.

New Qwen 3.5 models push open AI toward lower-cost use

Alibaba has broadened its Qwen 3.5 model family with a new group of open models aimed at stronger performance and lower compute demands. The release adds four models to the series and puts particular attention on production use, multimodal input, and permissive licensing.

The lineup arrives in a market where developers and companies are weighing model quality against cost, deployment flexibility, and access. Based on the details provided by Alibaba, Qwen 3.5 is being positioned as an open alternative that can serve both hosted and self-directed AI use cases.

What Alibaba added to Qwen 3.5

The expanded Qwen 3.5 lineup includes four models: Qwen3.5-Flash, Qwen3.5-35B-A3B, Qwen3.5-122B-A10B, and Qwen3.5-27B. All four models accept text, images, and video as input, while generating text as output.

That input mix matters because many AI workflows no longer rely only on written prompts. A model that can take visual and video context alongside text can support broader tasks while still returning a text response that applications can use directly.

The Qwen 3.5 series began with the release of Qwen3.5-397B-A17B in mid-February. The new models extend that series with smaller and production-focused options, rather than presenting one single model as the only path forward.

Alibaba says the models deliver stronger performance while using less compute. The source does not provide benchmark tables or detailed test results, but the central claim is clear: the company is emphasizing efficiency as much as raw capability.

Smaller models are part of the story

One of the most notable details is the comparison between Qwen3.5-35B-A3B and the larger Qwen3-235B-A22B. According to the source, the smaller Qwen3.5-35B-A3B outperforms that much larger predecessor.

That point is important because it shifts attention away from model size alone. If a smaller system can do better than a larger earlier model, then architecture, data quality, and reinforcement learning become central parts of the discussion.

This does not mean model size no longer matters. The same lineup also includes Qwen3.5-122B-A10B and Qwen3.5-27B, which are described as variants designed to close the remaining gap to top-tier models, especially in complex agent scenarios.

In practical terms, the Qwen 3.5 expansion suggests a portfolio approach. Different models can serve different needs: a smaller model for efficiency, larger variants for more demanding work, and a hosted production model for teams that want API access.

Open access and commercial flexibility

The models are available on Hugging Face, ModelScope, and through Qwen Chat. That gives users several routes to access the Qwen 3.5 family, depending on whether they want to explore, test, or integrate the models into a workflow.

The series ships under Apache License 2.0. The source describes it as a permissive open-source license that allows commercial use, modification, and redistribution.

For developers and companies, that licensing detail is one of the release's most significant features. Open model access is useful, but the ability to use, modify, and redistribute under a permissive license is what can make a model more practical for commercial projects.

The source does not describe deployment requirements or hardware recommendations. Still, the licensing and availability information point toward a release intended to be used beyond simple demonstrations.

Qwen3.5-Flash targets hosted production use

Qwen3.5-Flash is described as the hosted production version of the lineup. It comes with a context length of one million tokens and built-in tools.

Its API pricing is also stated clearly: $0.10 per million input tokens and $0.40 per million output tokens. That makes cost a central part of the Qwen3.5-Flash positioning.

The source frames Alibaba's open Qwen 3.5 effort against GPT-5 mini and Claude Sonnet 4.5, with the cost angle playing a major role. It does not provide a detailed side-by-side table in the supplied article text, so the key supported takeaway is that Alibaba is aiming the model family at high-end competition while emphasizing a lower-cost path.

The one million token context length is another practical marker. Long context can matter for workflows that involve large documents, extended conversations, or complex task histories, though the source does not provide specific examples of those uses.

Why the release matters

The Qwen 3.5 expansion is not just a list of model names. It reflects several priorities that are now shaping AI model competition: stronger performance, lower compute use, multimodal input, commercial licensing, and accessible hosted pricing.

The most important facts can be summarized simply:

Alibaba has added Qwen3.5-Flash, Qwen3.5-35B-A3B, Qwen3.5-122B-A10B, and Qwen3.5-27B to the Qwen 3.5 series.
All four models take text, images, and video as input and generate text as output.
Qwen3.5-35B-A3B is described as outperforming Qwen3-235B-A22B despite being smaller.
The models are available on Hugging Face, ModelScope, and through Qwen Chat.
They are released under Apache License 2.0.
Qwen3.5-Flash offers a one million token context length, built-in tools, and API pricing of $0.10 per million input tokens and $0.40 per million output tokens.

For the AI market, the message is direct: open models are continuing to move into territory once defined mostly by large proprietary systems. Alibaba's Qwen 3.5 lineup is built around that pressure point, combining open availability with model variants aimed at different performance and deployment needs.