
Catching up with GPT-3.5: the "European version of OpenAI" launches a new model and hits a $2 billion valuation in six months | At the forefront

Author: 36Kr

Text: Wang Yining

Edited by Tang Yongyi

In early December, French AI startup Mistral AI officially closed its highly anticipated Series A funding round. According to Bloomberg, the company raised 385 million euros (about $415 million) at a valuation of about $2 billion. Mistral AI also opened its commercial platform the same day.

On December 8, Mistral AI's official Twitter account posted only its third tweet since the account was created. There was no lavish press conference and no eye-catching promotional video, just an utterly ordinary magnet link. That link broke the calm of the AI developer community.


△ Source: Twitter

The magnet link leads to Mixtral 8x7B, the open-source MoE model that has recently shaken up the AI community.

As the name suggests, Mixtral 8x7B reveals its architecture: a combination of eight expert sub-networks of 7 billion parameters each, known as a Mixture of Experts (MoE). MoE divides a complex task into a series of smaller, easier subtasks, each handled by an "expert" specialized in a particular domain, which makes the overall model more capable. This is also the architecture reportedly adopted by GPT-4.
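The routing idea can be sketched in a few lines. This is a toy illustration of a top-2-of-8 MoE layer, not Mixtral's actual implementation (in the real model, the experts are the feed-forward blocks inside each transformer layer, and routing happens per token); the dimensions and random weights here are made up for the example.

```python
# Toy Mixture-of-Experts layer: a gate scores 8 experts per input,
# only the top 2 run, and their outputs are blended by softmax weight.
import math
import random

random.seed(0)

NUM_EXPERTS = 8
TOP_K = 2       # Mixtral activates 2 of its 8 experts per token
DIM = 4         # toy hidden size, not the real model's

# Each "expert" is just a random DIM x DIM linear map in this sketch.
experts = [[[random.uniform(-1, 1) for _ in range(DIM)] for _ in range(DIM)]
           for _ in range(NUM_EXPERTS)]
# The gate is another linear map producing one score per expert.
gate = [[random.uniform(-1, 1) for _ in range(DIM)] for _ in range(NUM_EXPERTS)]

def matvec(m, v):
    return [sum(w * x for w, x in zip(row, v)) for row in m]

def softmax(xs):
    mx = max(xs)
    exps = [math.exp(x - mx) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def moe_forward(token):
    scores = matvec(gate, token)                            # one score per expert
    top = sorted(range(NUM_EXPERTS), key=lambda i: scores[i])[-TOP_K:]
    weights = softmax([scores[i] for i in top])             # renormalize over top-k
    out = [0.0] * DIM
    for w, i in zip(weights, top):                          # only TOP_K experts run
        for d, y in enumerate(matvec(experts[i], token)):
            out[d] += w * y
    return out

print(moe_forward([1.0, 0.5, -0.5, 0.25]))
```

The key property is that only 2 of the 8 experts execute for any given input, which is why an MoE model can hold many parameters while spending far less compute per token.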

According to the official benchmarks, Mixtral 8x7B performs very well: it surpasses GPT-3.5 on several scores, beats Llama 2 70B on many benchmarks, and runs inference six times faster than the latter.


△ Source: Mistral AI

For output of the same quality, Mixtral 8x7B's inference cost is also significantly lower than that of comparable models, a clear efficiency advantage.


△ Source: Mistral AI

In addition, Mixtral 8x7B supports five languages: English, French, Italian, German, and Spanish, and shows strong code-generation ability.

Open source, strong benchmark scores, efficient: this combination of advantages gives Mixtral 8x7B real momentum to catch up with ChatGPT, which is genuinely exciting.

A developer has already fine-tuned the Mixtral MoE and released dolphin-2.5-mixtral-8x7b, a completely uncensored open-source model: it is not constrained by its developer's ethics filters and will not reply with "As an AI assistant, I can't ..."


△ Source: Twitter

Entering the public eye along with Mixtral 8x7B is its developer, Mistral AI, a large-model company from France that went from unknown to famous in one stroke, and it took them only half a year.

Founded just half a year ago, the "European version of OpenAI" is valued at $2 billion


Just six months ago, Mistral AI closed a €105 million (about $113 million) seed round, the largest in European history. With a pitch deck of just seven slides, Mistral AI attracted a number of established venture firms, including Redpoint and Index Ventures. This AI unicorn, favored by star investors in Europe and the United States, was officially founded in Paris in May this year.

Although the company is young, Mistral AI's three-person founding team carries real weight, with deep experience in multimodality and retrieval-augmented generation (RAG). CEO Arthur Mensch is a former DeepMind research scientist who co-led important papers such as Chinchilla, RETRO, and Flamingo, which rank among Google DeepMind's most significant work on LLMs, retrieval, and multimodality from 2020 to 2022.

Chief Scientist Guillaume Lample and CTO Timothée Lacroix are both former Meta research scientists from the Llama core team. Mistral AI currently has only 22 employees, keeping the team small but sharp.


△ Source: Mistral AI

Skipping the race to go big, betting on small models

In today's era of ever-larger language models, Mistral AI has done the opposite, focusing on "small models" from the day it was founded. In an interview with Silicon Valley investors Sarah Guo and Elad Gil, co-founder and CEO Arthur Mensch argued that making models smaller will certainly help the development and deployment of agents: small models cut usage costs and can run on more devices, giving more interesting applications a chance to be built.

Mistral AI's recently released chat models are a practical example of this philosophy.

Mistral AI has just launched its open platform, La plateforme, offering three instruction-following chat models for text generation, mistral-tiny, mistral-small, and mistral-medium, as well as an embedding model.
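As a rough sketch of what using the platform looks like: Mistral's chat endpoint follows an OpenAI-style request schema at `api.mistral.ai/v1/chat/completions`. The API key below is a placeholder, and the request is only constructed, not sent, so the example runs offline.

```python
# Sketch of a La plateforme chat request (OpenAI-style schema).
# API_KEY is a placeholder; the request is built but deliberately not sent.
import json
import urllib.request

API_KEY = "YOUR_MISTRAL_API_KEY"  # placeholder, not a real key

payload = {
    "model": "mistral-small",  # also available: mistral-tiny, mistral-medium
    "messages": [
        {"role": "user", "content": "Summarize the Mixture-of-Experts idea."}
    ],
}

req = urllib.request.Request(
    "https://api.mistral.ai/v1/chat/completions",
    data=json.dumps(payload).encode("utf-8"),
    headers={
        "Content-Type": "application/json",
        "Authorization": f"Bearer {API_KEY}",
    },
    method="POST",
)
# urllib.request.urlopen(req) would send the request and return a JSON
# completion; it is omitted here so the sketch needs no real key.
print(req.full_url, payload["model"])
```

Swapping the `model` field between the three tiers is the only change needed to trade quality against cost.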

Among them, the most closely watched is the mid-tier model mistral-medium, which is still in testing. As the strongest model Mistral AI has launched, mistral-medium scores 8.6 on MT-Bench, beating GPT-3.5 on mainstream evaluations.


△ Source: Mistral AI

Another distinctive feature of Mistral AI is its firm commitment to the open-source route.

In fact, before 2020, most large-model research was shared openly; it was only when some companies began accelerating commercialization that closed-source models became the norm. OpenAI, as we know it, was first founded as an open, non-profit organization before turning into a closed-source company, a shift that early backer Elon Musk has openly criticized.

Today, apart from Meta's Llama series, most leading large-model makers, such as OpenAI, Google, and Microsoft, have gone closed source, yet open-source models remain popular for their fast iteration and customizability. This is part of why Mistral is called the "European version of OpenAI", and at a time when big companies are closing their source, some call Mistral AI the hope of the open-source route.

Interestingly, a recent trend chart from the ARK Invest team forecasts how open-source and proprietary generative AI will develop in 2024. It predicts that open-source model performance will keep improving and the gap with proprietary models will keep shrinking. Yann LeCun, Meta's chief AI scientist and a Turing Award winner, retweeted it with the comment:

Open-source AI models are on the path to surpassing proprietary models.

△ Source: Twitter

Mixtral 8x7B is now available on many open-source model platforms. Whether open source can catch up with closed source will take time to verify.
