
Alibaba Cloud Releases Tongyi Qianwen Version 2.1, Improving Code Understanding and Generation Capabilities by 30%

Author: Wall Street Sights

On December 1, Alibaba Cloud held a press conference to release version 2.1 of its closed-source Tongyi Qianwen model, upgrade the visual understanding model Qwen-VL, and open-source the 72-billion-parameter model Qwen-72B. Compared with the previous version, Tongyi Qianwen 2.1 improves code understanding and generation by 30%, mathematical reasoning by 10%, Chinese and English encyclopedic knowledge by nearly 5%, and resistance to hallucination by 14%. Users can try the latest model for free in the Tongyi Qianwen app.

Tongyi Qianwen also open-sourced the 1.8-billion-parameter model Qwen-1.8B and, for the first time, the audio understanding model Qwen-Audio. Tongyi Qianwen has now open-sourced four large language models, with 1.8 billion, 7 billion, 14 billion, and 72 billion parameters, as well as two multimodal models for visual and audio understanding, achieving "full-size, full-modality" open source on a scale unmatched in the industry.

The Tongyi Qianwen foundation model continues to evolve, with industry-leading multimodal exploration

Since its launch in April this year, the Tongyi Qianwen foundation model has evolved continuously. At the end of October, Alibaba Cloud released Tongyi Qianwen 2.0 at the Apsara Conference; just one month later, the model was upgraded again, with code understanding and generation up 30%, mathematical reasoning up 10%, Chinese and English encyclopedic knowledge up nearly 5%, and resistance to hallucination up 14%. At the same time, the context window has been extended to 32k tokens to better handle long text inputs.
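In practice, a 32k window still means callers must check long inputs against the limit before sending them. A minimal sketch of that bookkeeping (the function and the whitespace-level "tokens" are illustrative assumptions, not Qwen's real tokenizer):

```python
def truncate_to_window(tokens: list, max_tokens: int = 32 * 1024,
                       reserve_for_output: int = 1024) -> list:
    """Keep the most recent tokens that fit the context window,
    leaving room in the budget for the model's reply."""
    budget = max_tokens - reserve_for_output
    return tokens[-budget:] if len(tokens) > budget else tokens

# A 40k-token transcript is trimmed to the most recent 31,744 tokens;
# shorter inputs pass through unchanged.
trimmed = truncate_to_window(["tok"] * 40_000)
```

Real applications would count tokens with the model's own tokenizer and often summarize rather than truncate, but the budget arithmetic is the same.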

Beyond LLMs, the Tongyi Qianwen team has also made industry-leading explorations in multimodality. In August it open-sourced Qwen-VL, a large visual understanding model that quickly became one of the best practices in the international open-source community. The conference also announced a major update to Qwen-VL that greatly improves its general OCR, visual reasoning, and Chinese text understanding. The model can now process images of various resolutions and aspect ratios, and can even solve problems presented as images. Both authoritative evaluations and hands-on experience show that Qwen-VL's Chinese text understanding surpasses GPT-4V.


Qwen-VL can write a program from a picture

The Tongyi models can both "see" and "hear". On the same day, Alibaba Cloud open-sourced Qwen-Audio, a large audio understanding model, for the first time. Qwen-Audio can perceive and understand a wide range of audio signals, including human speech, natural sounds, animal sounds, and music. Users can input a piece of audio and ask the model to interpret it, or even use the audio for literary creation, logical reasoning, and story continuation. Audio understanding brings large models closer to human hearing.

The industry's strongest open-source model, filling a gap in China's LLM open-source field

Qwen-72B achieved the best results among open-source models on 10 authoritative benchmarks, surpassing the open-source benchmark Llama 2-70B and most commercial closed-source models to become the industry's most powerful open-source model. Going forward, enterprise- and research-grade high-performance applications will also have an open-source large model to choose from.

Qwen-72B was trained on 3T tokens of high-quality data and continues the strong performance of the Tongyi Qianwen pre-trained models. On English tasks, Qwen-72B achieved the highest MMLU score among open-source models. On Chinese tasks, it topped the C-Eval, CMMLU, and GaokaoBench benchmarks, scoring better than GPT-4. In mathematical reasoning, it led other open-source models by a clear margin on the GSM8K and MATH evaluations. In code understanding, its performance on HumanEval, MBPP, and other evaluations improved substantially, a qualitative leap in coding ability.


On 10 authoritative benchmarks, Tongyi Qianwen's 72-billion-parameter model achieved the best scores among open-source models


Tongyi Qianwen's 72-billion-parameter open-source model surpasses the closed-source GPT-3.5, and GPT-4 on some benchmarks

Qwen-72B can handle up to 32k tokens of long-text input and surpasses ChatGPT-3.5-16k on the long-text comprehension test set LEval. The R&D team also optimized Qwen-72B's instruction following and tool use so that it integrates better with downstream applications. For example, Qwen-72B ships with a powerful system prompt capability: users can customize an AI assistant with a single prompt, directing the model to play a given role or perform a specific response task.
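Customizing an assistant with a system prompt typically reduces to prepending one fixed message to the conversation. A hedged sketch using the widely used chat-messages convention (the field names follow that common convention, not necessarily the exact schema of Qwen's API):

```python
def build_role_prompt(role_description: str, user_query: str) -> list:
    """Assemble a chat request in which a single system message
    fixes the assistant's persona and response constraints."""
    return [
        {"role": "system", "content": role_description},
        {"role": "user", "content": user_query},
    ]

# One system message is enough to cast the model in a role:
messages = build_role_prompt(
    "You are a meticulous code reviewer. Answer only with numbered findings.",
    "Review this function for off-by-one errors.",
)
```

The same `messages` list is then sent with every turn of the conversation, so the role persists across the whole session.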

Previously, China's large-model market had no high-quality open-source model comparable to Llama 2-70B. Qwen-72B fills that gap: with high performance, high controllability, and high cost-effectiveness, it offers an option no weaker than commercial closed-source large models. Large and medium-sized enterprises can build commercial applications on Qwen-72B, and universities and research institutes can use it for research such as AI for Science.

From 1.8B to 72B, Tongyi Qianwen is the first to achieve full-size open source

If Qwen-72B raises the ceiling of open-source model size and performance, Qwen-1.8B, the other model open-sourced at the press conference, lowers the floor: it is the smallest Chinese open-source model, needs only about 3 GB of GPU memory to run inference on 2K-token inputs, and can be deployed on consumer-grade devices.

From 1.8 billion to 7 billion, 14 billion, and 72 billion parameters, Tongyi Qianwen has become the industry's first "full-size open-source" large model family. Users can try the Qwen series directly in the ModelScope community, call the model APIs through Alibaba Cloud's DashScope (Lingji) platform, or build custom large-model applications on Alibaba Cloud's Bailian platform. Alibaba Cloud's AI platform PAI is deeply adapted to the full Tongyi Qianwen lineup, offering lightweight fine-tuning, full-parameter fine-tuning, distributed training, offline inference verification, and online service deployment.
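Calling a hosted model through a platform API generally amounts to posting a JSON body that names the model and carries the conversation. A minimal sketch of assembling such a request body (the model name and field layout here are illustrative assumptions, not the Lingji/DashScope platform's documented schema):

```python
import json

def build_chat_payload(model: str, prompt: str, system: str = "") -> str:
    """Serialize a minimal chat request body: target model name
    plus an ordered list of conversation messages."""
    messages = []
    if system:
        messages.append({"role": "system", "content": system})
    messages.append({"role": "user", "content": prompt})
    return json.dumps({"model": model, "input": {"messages": messages}})

# A hypothetical request selecting a Qwen chat model by name:
payload = build_chat_payload("qwen-72b-chat", "Summarize this article in one sentence.")
```

Because models of every size sit behind the same request shape, switching from the 1.8B to the 72B model is a one-field change in the payload.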


Alibaba Cloud was the first technology company in China to open-source a self-developed large model; since August it has open-sourced Qwen-7B, Qwen-14B, and the visual understanding model Qwen-VL. These models have been listed on Hugging Face and GitHub and are popular with small and medium-sized enterprises and individual developers, with more than 1.5 million cumulative downloads and more than 150 new models and applications derived from them. At the press conference, a number of developer partners shared how they used Qwen to build proprietary models and domain-specific applications.

Zhou Jingren, CTO of Alibaba Cloud, said that the open-source ecosystem is crucial to the technological progress and application of China's large models. Tongyi Qianwen will continue to invest in open source, aiming to become "the most open large model of the AI era" and to build the large-model ecosystem together with its partners.
