Editor: Editorial Department
The startup Mistral AI has once again dropped a magnet link: a 281GB file containing its latest 8x22B MoE model.
With a single magnet link, Mistral AI has quietly made waves again.
The 281.24GB file turns out to contain a brand-new 8x22B MoE model!
The new MoE model has 56 layers in total, 48 attention heads, and 8 experts, 2 of which are active per token.
Moreover, the context length is 65k.
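Concretely, "8 experts, 2 active" means a router picks the top 2 of 8 expert networks for each token and mixes their outputs by the normalized router scores, so only roughly a quarter of the expert parameters do work per token. A minimal sketch of that top-2 gating, with toy dimensions and random weights rather than Mistral's actual code:

```python
import numpy as np

rng = np.random.default_rng(0)

N_EXPERTS, TOP_K, D = 8, 2, 16   # 8 experts, 2 active per token; toy hidden size

# Toy parameters: a linear router and 8 tiny "experts" (one matrix each).
router_w = rng.normal(size=(D, N_EXPERTS))
experts = [rng.normal(size=(D, D)) for _ in range(N_EXPERTS)]

def moe_layer(x: np.ndarray) -> np.ndarray:
    """Route one token vector x (shape [D]) through its top-2 experts."""
    logits = x @ router_w                 # one router score per expert
    top = np.argsort(logits)[-TOP_K:]     # indices of the 2 highest-scoring experts
    weights = np.exp(logits[top])
    weights /= weights.sum()              # softmax over just the chosen 2
    # Weighted sum of the selected experts' outputs; the other 6 experts are
    # skipped entirely, which is what makes only 2 of 8 "active".
    return sum(w * (x @ experts[i]) for w, i in zip(weights, top))

token = rng.normal(size=D)
out = moe_layer(token)
print(out.shape)  # (16,)
```

The router and expert shapes here are invented for illustration; the real model applies this kind of routing inside each transformer block's feed-forward layer.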
Netizens remarked that Mistral AI has, as always, set the AI community abuzz with nothing but a magnet link.
Jia Yangqing also said he can't wait to see detailed comparisons against other SOTA models!
Taking the AI community by storm with a single magnet link
Last December, Mistral AI's first magnet-link release, the 8x7B MoE model, drew widespread praise.
In benchmarks, the eight small 7-billion-parameter experts together outperformed Llama 2, which has up to 70 billion parameters.
It handles 32k contexts well, supports English, French, Italian, German, and Spanish, and shows strong performance in code generation.
In February this year, Mistral AI launched its flagship model, Mistral Large, whose performance is directly comparable to GPT-4.
However, this version of the model is not open source.
Mistral Large has excellent logical reasoning capabilities and is capable of handling complex multilingual tasks including text understanding, conversion, and code generation.
Then, just half a month ago, at a Cerebral Valley hackathon, Mistral AI open-sourced the Mistral 7B v0.2 base model.
This model supports a 32k context, uses no sliding-window attention, and sets Rope Theta = 1e6.
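Those three settings map onto familiar model-config fields. A hypothetical fragment, with field names assumed from the usual Hugging Face `config.json` conventions and values taken from the announcement:

```python
# Hypothetical config fragment for Mistral 7B v0.2.
# Field names follow common Hugging Face config.json conventions (an
# assumption, not the official file); values are from the announcement.
mistral_7b_v02_config = {
    "max_position_embeddings": 32768,  # 32k context window
    "sliding_window": None,            # sliding-window attention disabled
    "rope_theta": 1e6,                 # RoPE base frequency
}

print(mistral_7b_v02_config["rope_theta"])  # 1000000.0
```

Raising `rope_theta` from the earlier default stretches the rotary position embeddings, which is one common way to support a longer context without sliding-window attention.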
Now, the latest 8x22B MoE model is also available on the Hugging Face platform, and community members can build their own applications on top of it.