Feng Se, Xiao Xiao — from Aofei Temple
QbitAI | Official account QbitAI
Fudan large model MOSS, officially open source!
As the first ChatGPT-like product in China to open for public testing, MOSS shot up Zhihu's trending list as soon as its open-source address was released:
According to the official website, MOSS is an open-source conversational language model with 16 billion parameters.
It was pre-trained on roughly 700 billion Chinese, English, and code tokens, and at reduced precision it can even run on a single RTX 3090 GPU.
MOSS supports both Chinese and English, handles multi-turn dialogue, and can use multiple plug-ins, with "skill points" in search engines, text-to-image generation, a calculator, and equation solving.
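To illustrate how a release like this is typically used, here is a minimal sketch with Hugging Face `transformers`. The repo id `fnlp/moss-moon-003-sft` and the `<|Human|>`/`<|MOSS|>` turn markers follow the examples in the MOSS GitHub repo, but treat both as assumptions to verify against the repo's README before relying on them.

```python
# Minimal sketch of chatting with MOSS via Hugging Face transformers.
# ASSUMPTIONS: the repo id "fnlp/moss-moon-003-sft" and the
# <|Human|>/<|MOSS|> turn markers are taken from the MOSS GitHub
# examples -- verify against the repo before relying on them.

def build_prompt(history, query):
    """Format (user, assistant) turns plus a new query into a single
    prompt string, using the turn markers from the repo's examples."""
    prompt = ""
    for user_turn, moss_turn in history:
        prompt += f"<|Human|>: {user_turn}<eoh>\n<|MOSS|>: {moss_turn}<eom>\n"
    prompt += f"<|Human|>: {query}<eoh>\n<|MOSS|>:"
    return prompt

if __name__ == "__main__":
    # The heavy part: load the 16B model in 8-bit, so it fits in the
    # 24 GB of a single RTX 3090, as the article describes.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(
        "fnlp/moss-moon-003-sft", trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained(
        "fnlp/moss-moon-003-sft",
        trust_remote_code=True,
        load_in_8bit=True,   # reduced precision -> single-GPU footprint
        device_map="auto")

    prompt = build_prompt([], "Hi! Can you introduce yourself?")
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    outputs = model.generate(**inputs, max_new_tokens=256,
                             do_sample=True, temperature=0.7, top_p=0.8)
    print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:],
                           skip_special_tokens=True))
```

The 8-bit load is what makes the article's "single 3090" claim plausible: 16B parameters at one byte each is about 16 GB of weights, versus roughly 32 GB at fp16.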
At present, MOSS has 2.3k stars on GitHub, and the popularity continues to rise.
So, what exactly has the MOSS team open-sourced, and what can it do so far? Let's take a look.
It solves equations, and generates images too
This open-source release is the third iteration of MOSS, called MOSS 003.
Not all MOSS 003 models have been open-sourced yet; the release mainly covers the base pre-trained model, with the final models to be open-sourced in the near future:
Before this, MOSS went through two iterations: the internal test version OpenChat 001, and the invitation-based public beta MOSS 002.
MOSS 002 is the version behind the "first ChatGPT-style public test in China" that everyone witnessed in March.
How effective is this latest release?
The MOSS team released examples covering multi-turn dialogue, equation solving, code writing, image generation, and answering questions about ethics.
Here is the multi-turn dialogue demo, which can also cite the sources of its information:
Here are examples of solving math problems, including the classic "chickens and rabbits in the same cage" puzzle:
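For reference, the chickens-and-rabbits puzzle that the demo tackles is just a two-variable linear system: given the total number of heads and legs, solve for how many of each animal. A short sketch (my own illustration, not MOSS's output):

```python
def chickens_and_rabbits(heads, legs):
    """Solve the classic puzzle: chickens have 2 legs, rabbits have 4.
    From heads = c + r and legs = 2c + 4r, r = (legs - 2*heads) / 2."""
    rabbits = (legs - 2 * heads) // 2
    chickens = heads - rabbits
    # Reject inputs with no non-negative integer solution
    # (negative counts, or an odd leg surplus).
    if rabbits < 0 or chickens < 0 or 2 * chickens + 4 * rabbits != legs:
        raise ValueError("no non-negative integer solution")
    return chickens, rabbits

# The traditional version: 35 heads, 94 legs.
print(chickens_and_rabbits(35, 94))  # -> (23, 12): 23 chickens, 12 rabbits
```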
Also includes the ability to generate images:
And a question on AI ethics, a hot topic among netizens lately:
Of course, some netizens also showed up to heckle.
For example, in the official GitHub Q&A, one netizen was eager to know which has better Chinese ability, MOSS or ChatGLM-6B, prompting another to quip:
Are you here just to stir things up?
This is because the team had said during the MOSS 002 public test that MOSS's Chinese ability was not great, a limitation tied to its training data.
In addition, the team also open-sourced part of the training data this time.
As for the complete data, the Fudan MOSS team said it "will all be open-sourced in the near future".
China's first released ChatGPT-like product
The name MOSS comes from the self-aware AI in The Wandering Earth.
It was developed by Qiu Xipeng's team at Fudan University's Natural Language Processing Laboratory and officially released on February 20.
At the time, major domestic companies were still making high-profile announcements about building their own ChatGPT, and no one expected MOSS to quietly burst onto the scene first.
So even though MOSS has an order of magnitude fewer parameters than ChatGPT, users flocked to try it and promptly crashed the server.
A week after the release, Qiu Xipeng, a professor at Fudan University, said in a speech that if optimization went well, the team planned to open-source MOSS by the end of March.
In his view:
"Chokepoint" problems arise so easily because a healthy industry ecosystem is missing.
In ChatGPT's case, OpenAI only needs to focus on building the model; computing power, data, and systems are handled by other companies. In China, by contrast, a company building a large language model has to handle every link in the chain itself, and each company's data and compute are limited, so no single company can get very far. The high cost of training and online inference for large language models is another major obstacle along the way.
That is why, from the start, they chose to open-source MOSS rather than commercialize it.
With such a base in place, others would not need to redevelop the underlying technology and could build different vertical applications on top of it.
Now, almost a month later than planned, MOSS has delivered on that promise.
Have you tried Fudan's open-source MOSS model? How well does it work?