laitimes

SenseTime's "RiRixin 5.0" has been fully upgraded, saying that the performance of the large model surpasses that of GPT-4 Turbo

author:Thunder delivery
SenseTime's "RiRixin 5.0" has been fully upgraded, saying that the performance of the large model surpasses that of GPT-4 Turbo

Rakuten on April 23

In the field of large models, China is showing a situation of hundreds of schools of thought.

SenseTime held a technology exchange day in Shanghai today, releasing the industry's first "cloud, device, edge" full-stack large-scale model product matrix to meet the application needs of different scale scenarios, and newly upgrading the "SenseNova 5.0" large-scale model system.

SenseTime's "RiRixin 5.0" has been fully upgraded, saying that the performance of the large model surpasses that of GPT-4 Turbo

According to the evaluation, its comprehensive capabilities are fully benchmarked against GPT-4 Turbo, and the technology leads the acceleration of the comprehensive transition of generative AI to the industry, so as to realize the on-demand use of large models.

Under the principle of Scaling Law, the most basic law of AI development, SenseTime continues to seek the best data ratio and establish a data quality evaluation system to promote its own large model research and development, while also providing industry partners with large model training, fine-tuning, deployment and various generative AI capabilities and services.

Xu Li, Chairman and CEO of SenseTime, said: "Under the guidance of the law of scale, SenseTime will continue to explore the KRE three-layer architecture (knowledge-reasoning-execution) of large model capabilities and break through the boundaries of large model capabilities. ”

"日日新SenseNova5.0"性能超越GPT-4 Turbo

Since its first release in April 2023, SenseTime's "RiRixin SenseNova" large model system has officially launched five major iterations. Based on more than 10TB of tokens trained and covering a large amount of synthetic data, the new "Ririxin SenseNova 5.0" (abbreviation: Ririxin 5.0) adopts a hybrid expert architecture, and the context window can be effective to about 200K during inference.

SenseTime's "RiRixin 5.0" has been fully upgraded, saying that the performance of the large model surpasses that of GPT-4 Turbo

This update mainly focuses on enhancing knowledge, mathematics, reasoning and code capabilities, comprehensively benchmarking GPT-4 Turbo, and reaching or surpassing GPT-4 Turbo in mainstream objective evaluations.

According to reports, in terms of liberal arts ability, the creative writing ability, reasoning ability and summary ability of "Ririxin 5.0" have been greatly improved, and after the same Chinese knowledge is injected, you can get better understanding and summary and Q&A, providing strong assistance for vertical application scenarios such as education and content industry.

SenseTime's "RiRixin 5.0" has been fully upgraded, saying that the performance of the large model surpasses that of GPT-4 Turbo

"RiRixin 5.0" and GPT-4 answer interesting reasoning questions: "Mom made Yuanyuan a cup of coffee, and after Yuanyuan drank half a cup, she filled it with water, and then she drank another half cup, and then filled it with water, and finally drank it all." Ask Yuanyuan whether he drinks more coffee or more water?", "Ri Ri Xin 5.0" answered correctly.

In terms of science capabilities, the mathematical ability, code ability and reasoning ability of "Ririxin 5.0" have reached the leading level in the industry, providing a foundation for the implementation of scenarios such as finance and data analysis.

SenseTime's "RiRixin 5.0" has been fully upgraded, saying that the performance of the large model surpasses that of GPT-4 Turbo

SenseTime said that another core indicator of the "RiRixin 5.0" is multimodal capability, and SenseTime's multi-modal large model ranked first in the comprehensive score of MMBench, the authoritative comprehensive benchmark test of multi-modal large models, and achieved leading results in many well-known multi-modal lists MathVista, AI2D, ChartQA, TextVQA, DocVQA, and MMMU.

SenseTime's "RiRixin 5.0" has been fully upgraded, saying that the performance of the large model surpasses that of GPT-4 Turbo

"RiRixin SenseNova 5.0" also achieves more excellent multi-modal capabilities at the application product level, supports the analysis and understanding of high-definition long graphs, and the interactive generation of Wensheng diagrams, as well as complex cross-document knowledge extraction and summary Q&A display, and also has rich multi-modal interaction capabilities.

Completed the full-stack layout of "cloud edge": launched enterprise-level application all-in-one machines

In this technology exchange day, SenseTime launched a full-stack large-scale model product matrix of "cloud, device, and edge", including the "SenseTime end-side large model" applied to terminal equipment, and the edge product "SenseTime enterprise-level large-scale model all-in-one" for finance, code, medical, government affairs and other fields.

SenseTime's "RiRixin 5.0" has been fully upgraded, saying that the performance of the large model surpasses that of GPT-4 Turbo

SenseTime has also launched a device-cloud collaboration solution, which can give full play to the respective advantages of devices and clouds through intelligent judgment collaboration, and offload to the cloud for processing when it is necessary to search or process complex scenarios online, with end-side processing accounting for more than 80% of some scenarios, thereby reducing inference costs.

According to the official, the SenseTime Rixin device-side large language model can achieve an average generation speed of 18.3 words/s on the mid-end platform, and the flagship platform has reached 78.3 words/s.

The diffusion model can also achieve the fastest inference speed in the industry on the device side, and the inference speed of the end-side LDM-AI expansion technology is less than 1.5 seconds on a mainstream platform, which is 10 times faster than that of the cloud app of a competitor, and supports the output of high-definition images of 12 million pixels and above, and supports fast image editing functions such as equal ratio expansion on the device, free expansion and rotation expansion on the device.

SenseTime's "RiRixin 5.0" has been fully upgraded, saying that the performance of the large model surpasses that of GPT-4 Turbo

Starting today, SenseTime's end-to-end business SDK is officially released.

SenseTime's "RiRixin 5.0" has been fully upgraded, saying that the performance of the large model surpasses that of GPT-4 Turbo

In response to the growing demand for AI applications at the edge of key industries such as finance, code, healthcare, and government affairs, SenseTime officially launched an enterprise-level large model all-in-one machine, which can support both enterprise-level 100-billion-yuan model acceleration and knowledge retrieval hardware acceleration, realize localized deployment, and use it out-of-the-box, lowering the threshold for enterprise application large models. Compared with similar products in the industry, the inference cost is reduced by 80%, the retrieval acceleration is accelerated, and the CPU workload is 50%.

In-depth cooperation with Kingsoft Office, Haitong Securities, etc., commercialization is maturing

SenseTime is working with ecosystem partners to innovate product applications in the AI 2.0 era and create new quality productivity.

Since 2023, SenseTime has reached an in-depth cooperation with Kingsoft Office, based on the excellent code generation and tool invocation capabilities of the "RiRixin" large model, to help WPS 365 build a new office productivity platform that releases scene capabilities more efficiently, and build an exclusive "enterprise brain" for enterprises.

SenseTime's "RiRixin 5.0" has been fully upgraded, saying that the performance of the large model surpasses that of GPT-4 Turbo

Zhang Qingyuan, CEO of Kingsoft Office, said: "In the office application scenario, the performance of the SenseTime model is very good, which can help our users solve complex problems in the office and improve efficiency. ”

In the financial sector, Haitong Securities and SenseTime jointly released a multi-modal full-stack model of the financial industry, in which the two parties promoted business implementation in the fields of intelligent customer service, compliance and risk control, code assistance, and business office assistants, and jointly researched cutting-edge scenarios in the industry such as robo-advisors and public opinion monitoring, so as to open up the full-stack capabilities of the large-scale model landing in the securities industry.

SenseTime's "RiRixin 5.0" has been fully upgraded, saying that the performance of the large model surpasses that of GPT-4 Turbo

Mao Yuxing, Deputy General Manager and Chief Information Officer of Haitong Securities, said: "Through our cooperation with SenseTime, we have realized the digital and intelligent transformation of Haitong Securities by using large-scale model technology, and in the future, we will combine full-stack AI capabilities to carry out business processes, interactive transformation and digital intelligent business system reconstruction. ”

In the personal travel scenario, SenseTime's large model technology is applied to the smart cabin of the Xiaomi Auto SU7, which has recently become popular in the market.

SenseTime's "RiRixin 5.0" has been fully upgraded, saying that the performance of the large model surpasses that of GPT-4 Turbo

Wang Gang, general manager of Xiaomi Xiaoai, said in a dialogue with Wang Xiaogang, co-founder and chief scientist of SenseTime, "SenseTime's cloud-edge-end full-stack combination can well empower and adapt to Xiaomi's IoT ecosystem. We hope to work with SenseTime to create a more intelligent product experience for our users. ”

SenseTime also released a Ascend-based industry model today to jointly build an industry ecosystem for finance, healthcare, government affairs, code, and other industries. In terms of its own application, SenseTime's "Ririxin SenseNova 5.0" has been updated in Miaohua, Ronin, Gewu, Qiongyu, Dayi, Little Raccoon Family and other products.

SenseTime said that it is firmly moving towards the AGI era, and "Wensheng Video" is on the way

In the final session of this technical exchange day, Xu Li, chairman and CEO of SenseTime, also brought three videos generated entirely by large models, and emphasized the controllability of the Wensheng video platform for characters, actions and scenes.

In the future, a video can be generated by entering a text or a complete description, and the characters' clothing, hairstyles, and scenes can be pre-set to maintain the coherence and consistency of the video content.

SenseTime's "RiRixin 5.0" has been fully upgraded, saying that the performance of the large model surpasses that of GPT-4 Turbo
SenseTime's "RiRixin 5.0" has been fully upgraded, saying that the performance of the large model surpasses that of GPT-4 Turbo

Through the intelligent computing center built by SenseTime, it can continuously empower the training of large models, and at present, RiRixin's large model system has made innovations in natural language processing, video generation, and deep learning optimization.

On the one hand, the development of large models has entered the landing stage, and how to combine them with industries and application scenarios is a key part; on the other hand, the path of "law of scale" is gradually clear, and "emergence" is uncertain, and it is also the top priority to explore the most advanced large model technology in a forward-looking manner.

SenseTime said that its large-scale model technology and products have been applied in various industries such as medical care, education, law, and industry. For example, with the name of "new every day", SenseTime has always been firmly moving towards the goal of general artificial intelligence, breaking through the limitations of data and computing power, and leading the innovation and implementation of large models.

———————————————

Lei Di was founded by Lei Jianping, a media person, if it is reprinted, please indicate the source.

Read on