SenseTime fully upgrades "RiRixin 5.0", saying the large model's performance surpasses GPT-4 Turbo

Rakuten on April 23

In the field of large models, China is seeing a hundred schools of thought contend.

SenseTime held a technology exchange day in Shanghai today. It released what it calls the industry's first full-stack "cloud, device, edge" large model product matrix, designed to meet application needs at different scales, and unveiled the newly upgraded "SenseNova 5.0" large model system.

According to evaluations, its overall capabilities are on par with GPT-4 Turbo. SenseTime says the technology will accelerate the broad adoption of generative AI across industries and make large models available on demand.

Following the scaling law, which it describes as the most basic law of AI development, SenseTime continues to search for the optimal data mix and has built a data quality evaluation system to advance its own large model research and development. It also provides industry partners with large model training, fine-tuning, and deployment, along with a range of generative AI capabilities and services.

Xu Li, Chairman and CEO of SenseTime, said: "Guided by the scaling law, SenseTime will continue to explore the three-layer KRE (knowledge, reasoning, execution) architecture of large model capabilities and keep pushing the boundaries of what large models can do."

"RiRixin SenseNova 5.0" outperforms GPT-4 Turbo

Since its first release in April 2023, SenseTime's "RiRixin SenseNova" large model system has gone through five major iterations. Trained on more than 10TB of tokens, including a large amount of synthetic data, the new "RiRixin SenseNova 5.0" (RiRixin 5.0 for short) adopts a mixture-of-experts (MoE) architecture, and its effective context window reaches about 200K during inference.

This update focuses on strengthening knowledge, mathematics, reasoning, and coding capabilities. According to SenseTime, the model is benchmarked across the board against GPT-4 Turbo and matches or surpasses it in mainstream objective evaluations.

According to SenseTime, on the liberal arts side, the creative writing, reasoning, and summarization abilities of "RiRixin 5.0" have improved significantly. Given the same injected Chinese-language knowledge, it produces better comprehension, summaries, and Q&A, providing strong support for vertical scenarios such as education and the content industry.

"RiRixin 5.0" and GPT-4 were both given a playful reasoning question: "Mom made Yuanyuan a cup of coffee. Yuanyuan drank half the cup and topped it up with water, then drank half the cup again and topped it up with water once more, and finally drank it all. Did Yuanyuan drink more coffee or more water?" "RiRixin 5.0" answered correctly.

On the science side, the mathematical, coding, and reasoning abilities of "RiRixin 5.0" have reached an industry-leading level, according to SenseTime, laying a foundation for scenarios such as finance and data analysis.
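The article does not spell out the answer to the coffee puzzle quoted above, so the following is our own check rather than SenseTime's or the model's reasoning. It is a minimal sketch that tracks the cup with exact fractions, assuming the cup holds exactly one cup, each refill restores it to full, and the contents are well mixed. The function name `coffee_vs_water` is hypothetical.

```python
from fractions import Fraction


def coffee_vs_water(drinks):
    """Return (coffee_drunk, water_drunk) in cups.

    `drinks` lists how much of the full cup is drunk at each step. The cup
    is topped up with water after every step except the last one.
    Assumption: the contents are well mixed before each drink.
    """
    coffee_in_cup = Fraction(1)  # the cup starts full of coffee
    water_in_cup = Fraction(0)
    coffee_drunk = Fraction(0)
    water_drunk = Fraction(0)
    for i, amount in enumerate(drinks):
        volume = coffee_in_cup + water_in_cup  # one full cup before each drink
        share = amount / volume  # fraction of the contents removed
        coffee_out = coffee_in_cup * share
        water_out = water_in_cup * share
        coffee_drunk += coffee_out
        water_drunk += water_out
        coffee_in_cup -= coffee_out
        water_in_cup -= water_out
        if i < len(drinks) - 1:
            water_in_cup += amount  # top the cup back up with water
    return coffee_drunk, water_drunk


half = Fraction(1, 2)
coffee, water = coffee_vs_water([half, half, Fraction(1)])
print(f"coffee drunk: {coffee} cup, water drunk: {water} cup")
```

Under these assumptions the two totals come out equal: the single cup of coffee is eventually drunk in full, and the two half-cup refills add up to one cup of water.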

SenseTime said that multimodal capability is another core metric for "RiRixin 5.0". Its multimodal large model ranked first in overall score on MMBench, an authoritative comprehensive benchmark for multimodal large models, and achieved leading results on several well-known multimodal leaderboards: MathVista, AI2D, ChartQA, TextVQA, DocVQA, and MMMU.

"RiRixin SenseNova 5.0" also delivers stronger multimodal capabilities at the product level. It supports analysis and understanding of high-resolution long images, interactive text-to-image generation, and complex cross-document knowledge extraction with summarization and Q&A, along with rich multimodal interaction.

Full-stack "cloud, device, edge" lineup completed: enterprise-grade all-in-one machine launched

At the technology exchange day, SenseTime launched a full-stack "cloud, device, edge" large model product matrix. It includes the "SenseTime on-device large model" for terminal devices and the edge product "SenseTime enterprise-grade large model all-in-one machine" for fields such as finance, code, healthcare, and government affairs.

SenseTime has also launched a device-cloud collaboration solution that uses intelligent routing to play to the respective strengths of device and cloud. Requests are offloaded to the cloud when they require online search or involve complex scenarios. In some scenarios, on-device processing accounts for more than 80% of requests, which reduces inference costs.
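The article gives no detail on how the device-cloud decision is made, so the following is only an illustrative sketch of the general idea, not SenseTime's implementation. The `Request` fields, the `route` function, and the `COMPLEXITY_THRESHOLD` value are all hypothetical: a request goes to the cloud when it needs online search or is judged too complex, and stays on the device otherwise.

```python
from dataclasses import dataclass


@dataclass
class Request:
    prompt: str
    needs_web_search: bool = False  # hypothetical flag set by an upstream classifier
    complexity: float = 0.0         # hypothetical score in [0, 1]


COMPLEXITY_THRESHOLD = 0.7  # assumed cut-off, not a figure from SenseTime


def route(request: Request) -> str:
    """Return "cloud" for online-search or complex requests, else "device"."""
    if request.needs_web_search or request.complexity > COMPLEXITY_THRESHOLD:
        return "cloud"
    return "device"


def device_share(requests) -> float:
    """Fraction of requests that are handled on the device."""
    routes = [route(r) for r in requests]
    return routes.count("device") / len(routes)


# Made-up sample workload, purely for illustration.
sample = [
    Request("Set an alarm for 7 am", complexity=0.1),
    Request("Summarize this note", complexity=0.3),
    Request("Translate this sentence", complexity=0.2),
    Request("Rewrite this message politely", complexity=0.4),
    Request("What is in the news today?", needs_web_search=True),
]
print(f"handled on device: {device_share(sample):.0%}")
```

The higher the share of requests that stay on the device, the fewer cloud inference calls are made, which is the cost argument the article describes.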

According to SenseTime, the RiRixin on-device large language model reaches an average generation speed of 18.3 words/s on mid-range platforms and 78.3 words/s on flagship platforms.

SenseTime also says its diffusion model achieves the fastest on-device inference in the industry. On a mainstream platform, its on-device LDM-based AI image expansion completes inference in under 1.5 seconds, 10 times faster than a competitor's cloud-based app. It supports output of high-definition images of 12 megapixels and above, as well as fast on-device image editing functions such as proportional expansion, free expansion, and rotated expansion.

Starting today, SenseTime's end-to-end business SDK is officially released.

In response to growing demand for edge AI applications in key industries such as finance, code, healthcare, and government affairs, SenseTime officially launched an enterprise-grade large model all-in-one machine. It supports both acceleration of enterprise-grade 100-billion-parameter models and hardware acceleration for knowledge retrieval, allows on-premises deployment, and works out of the box, lowering the barrier for enterprises to apply large models. Compared with similar products in the industry, SenseTime says inference cost is reduced by 80%, retrieval is significantly faster, and CPU workload is reduced by 50%.

In-depth cooperation with Kingsoft Office, Haitong Securities, and others; commercialization is maturing

SenseTime is working with ecosystem partners to build new product applications for the AI 2.0 era and to create new quality productive forces.

Since 2023, SenseTime has been cooperating closely with Kingsoft Office. Drawing on the code generation and tool-calling capabilities of the "RiRixin" large model, it is helping WPS 365 build a new office productivity platform that delivers scenario capabilities more efficiently, and helping enterprises build their own dedicated "enterprise brain".

Zhang Qingyuan, CEO of Kingsoft Office, said: "In office application scenarios, the SenseTime model performs very well. It helps our users solve complex problems at work and improves efficiency."

In the financial sector, Haitong Securities and SenseTime jointly released a full-stack multimodal large model for the financial industry. The two parties are rolling out business applications in intelligent customer service, compliance and risk control, code assistance, and office assistants, and are jointly researching cutting-edge industry scenarios such as robo-advisory and public opinion monitoring, with the aim of building full-stack capabilities for deploying large models in the securities industry.

Mao Yuxing, Deputy General Manager and Chief Information Officer of Haitong Securities, said: "Through our cooperation with SenseTime, we have used large model technology to advance the digital and intelligent transformation of Haitong Securities. In the future, we will draw on full-stack AI capabilities to transform business processes and interactions and to rebuild our business systems for digital intelligence."

In personal mobility, SenseTime's large model technology is used in the smart cockpit of the Xiaomi SU7, which has recently become a market hit.

Wang Gang, general manager of Xiaomi Xiaoai, said in a dialogue with Wang Xiaogang, co-founder and chief scientist of SenseTime: "SenseTime's full-stack cloud, edge, and device combination can empower Xiaomi's IoT ecosystem and adapt to it well. We hope to work with SenseTime to create a more intelligent product experience for our users."

SenseTime also released an Ascend-based industry large model today, aiming to jointly build an ecosystem for industries such as finance, healthcare, government affairs, and code. As for its own applications, SenseTime's "RiRixin SenseNova 5.0" has been rolled out in products including Miaohua, Ruying, Gewu, Qiongyu, Dayi, and the Little Raccoon Family.

SenseTime says it is firmly moving toward the AGI era, with text-to-video on the way

In the final session of the technology exchange day, Xu Li, Chairman and CEO of SenseTime, presented three videos generated entirely by large models and emphasized how controllable the text-to-video platform is over characters, actions, and scenes.

In the future, a video can be generated from a passage of text or a full description, and the characters' clothing, hairstyles, and scenes can be preset to keep the video content coherent and consistent.

The intelligent computing center built by SenseTime provides ongoing support for large model training. According to SenseTime, the RiRixin large model system has so far delivered innovations in natural language processing, video generation, and deep learning optimization.

On the one hand, large model development has entered the deployment stage, and the key question is how to combine models with industries and application scenarios. On the other hand, while the scaling-law path is becoming clearer, "emergence" remains uncertain, so forward-looking exploration of the most advanced large model technology is also a top priority.

SenseTime said that its large model technology and products have been applied in sectors such as healthcare, education, law, and industry. True to the name "RiRixin" ("new every day"), SenseTime says it remains firmly committed to the goal of artificial general intelligence, working to break through the limits of data and computing power and to lead the innovation and deployment of large models.

———————————————

Leidi was founded by media veteran Lei Jianping. Please credit the source when reprinting.
