TechCrunch AI July 24, 2024 TERMINATOR

Why Mistral Large 2 raises the stakes for open AI models

Mistral released Large 2, a new flagship AI model aimed at competing with recent models from OpenAI and Meta. The company says it performs strongly in code generation, mathematics and reasoning, while using 123 billion parameters and offering a 128,000 token window.

WTF Index TERMINATOR

◄ Terminator 2 Idiocracy 0 ►

The story mildly leans Terminator because it highlights a more powerful and more accessible frontier-scale AI model, though mainly as a routine launch.

Why Mistral Large 2 raises the stakes for open AI models

Mistral has put another major model into the frontier AI race. Its new flagship, Large 2, is positioned as a direct answer to recent systems from OpenAI and Meta, with the company pointing to code generation, mathematics and reasoning as the areas where it wants to compete most clearly.

The release also sharpens a wider debate around open models. Mistral says Large 2 improves performance and cost for open models, but the model still sits in a more complicated category than traditional open source software.

A fast follow to Meta

Large 2 arrived on Wednesday, only one day after Meta released Llama 3.1 405B. That timing matters because both releases are aimed at the same high-end AI conversation: models that can handle complex coding, math and reasoning tasks while being more accessible than fully closed systems.

Mistral says Large 2 is on par with the latest cutting-edge models from OpenAI and Meta in those core areas. The company backs that claim with a set of benchmarks, including results where Large 2 appears to beat Llama 3.1 405B on code generation and math performance.

The parameter count is one of the headline contrasts. Large 2 has 123 billion parameters, which is under a third of Llama 3.1 405B. For developers and companies comparing AI models, that difference is important because model size affects how practical a system may be to run, evaluate or integrate.

Still, the source makes clear that raw access does not mean easy adoption. Even when a model is more open than GPT-4o, only a limited group has the expertise and infrastructure needed to implement a model this large. That challenge is even more obvious for Llama 3.1 405B.

Less guessing, more restraint

Mistral says reducing hallucination issues was a major training focus for Large 2. In plain terms, the company wants the model to be less willing to produce confident but unsupported answers.

According to Mistral, Large 2 was trained to be more selective when responding. The aim is for the model to acknowledge when it does not know something instead of inventing a plausible-sounding answer.

That is a practical point, not just a technical one. AI systems used for coding, research, business workflows or multilingual communication can create risk when they fill gaps with false information. A model that is more willing to admit uncertainty can be more useful in settings where accuracy matters.

Mistral also claims Large 2 produces more concise responses than leading AI models. That matters because some AI assistants can become verbose, making it harder for users to identify the actual answer or action item.

What Large 2 can handle

Large 2 includes a 128,000 token window. The source describes that as roughly equivalent to a 300-page book, which means the model can take in a large amount of material in one prompt.

A larger context window can make a model more useful for tasks that involve lengthy inputs. That could include long documents, extended codebases, detailed instructions or multi-part prompts, as long as the user has a reason to provide that much information at once.

The model also includes improved multilingual support. Mistral says Large 2 understands English, French, German, Spanish, Italian, Portuguese, Arabic, Hindi, Russian, Chinese, Japanese and Korean.

Its coding support is also broad. Large 2 works with 80 coding languages, placing software development near the center of the model’s intended use cases.

Core strengths: code generation, mathematics and reasoning.
Context window: 128,000 tokens, described as roughly a 300-page book.
Model size: 123 billion parameters.
Language support: English, French, German, Spanish, Italian, Portuguese, Arabic, Hindi, Russian, Chinese, Japanese and Korean.
Coding support: 80 coding languages.

The limits of openness

Mistral is often discussed in the context of open models, but Large 2 is not open source in the traditional sense. The source states that any commercial application of Mistral’s models requires a paid license.

That distinction is important for companies evaluating AI strategy. A model can be more open than a closed product while still carrying commercial restrictions. For teams planning production use, the license is part of the technical decision.

There is also an infrastructure barrier. Large models may be available, but availability does not automatically make them simple to deploy. The source emphasizes that only few groups have the expertise and infrastructure to implement models of this scale.

Mistral’s momentum is also backed by fresh funding. The Paris-based AI startup recently raised $640 million in a Series B funding round led by General Catalyst, at a $6 billion valuation. The company is still a newer entrant in artificial intelligence, but it is releasing models close to the cutting edge at a rapid pace.

Where Large 2 is available

Mistral Large 2 is available through several major AI platforms. The model can be used on Google Vertex AI, Amazon Bedrock, Azure AI Studio and IBM watsonx.ai.

It is also available on Mistral’s own La Plateforme under the name “mistral-large-2407”. Users can test it for free through Le Chat, Mistral’s ChatGPT competitor.

One notable absence is multimodal capability. The source says Large 2 does not process image and text simultaneously, and Meta’s Llama 3.1 release was also missing that capability. OpenAI is described as far ahead in multimodal AI systems, a feature some startups are increasingly looking to build with.

That leaves Large 2 with a clear profile. It is a text- and code-focused flagship model with a large context window, broad language support, strong benchmark claims and a licensing model that requires careful review for commercial use. For Mistral, it is another signal that the company wants to compete directly with the largest names in AI.