laitimes

SenseTime's "Daily Update" big model system has been comprehensively upgraded, and the "data flywheel" empowers the renewal of hundreds of industries

author:Wenhui.com
SenseTime's "Daily Update" big model system has been comprehensively upgraded, and the "data flywheel" empowers the renewal of hundreds of industries

At the 2023 World Artificial Intelligence Conference (WAIC) "Boundless Love, Evergreen" Artificial Intelligence Forum held today, SenseTime announced that its "SenseTime SenseNova" large model system has ushered in a comprehensive upgrade in multiple aspects, and the large model technology has been applied in the production practice of intelligent cockpit, transportation, finance, medical care, e-commerce, mobile terminals, industrial parks and other industries.

First released in April this year, SenseTime's model includes the Chinese language model "Consultation", the Wensheng diagram generation model "Second Painting", the AI digital human video generation platform "Ruying", the 3D background building generation platform "Qiongyu" and the 3D object generation platform "Gewu".

As a natural language processing model with hundreds of billions of parameters, SenseChat 2.0 breaks through the limitation of input length of large language models and launches model versions with different parameter magnitudes, which can perfectly adapt to the application requirements of different terminals and scenarios such as mobile and cloud, and reduce deployment costs. The model parameters of SenseMirage 3.0, SenseTime's self-developed large-scale model, SenseMirage 3.0, have increased from 1 billion since its first release in April this year to 7 billion, enabling professional photography-level image detailing.

Compared with version 1.0, SenseAvatar 2.0 digital human generation platform improves voice and lip shape fluency by more than 30%, realizes 4K high-definition video effects, and brings AIGC image generation and digital human singing functions. In addition, SenseTime Qiongyu SenseSpace 2.0 improves the space reconstruction efficiency by 20%, the rendering performance by 50%, and the mapping time per 100 square kilometers of scene can be completed in only 38 hours (1200 TFLOPS/sec computing power support). SenseThings 2.0 achieves millimeter-level fineness in texture and material restoration of small objects, and breaks through the problem of collecting highly reflective and specular objects.

Relying on the rapid iteration of the "SenseNova SenseNova" large model system in the underlying technology field, SenseTime is actively empowering industrial upgrading through the multi-modal capability combination of the large model, and bringing many new breakthroughs leading the industry. Xu Li, Chairman and CEO of SenseTime, said in the product launch session: "The breakthrough of large models has set off a new round of technological revolution in artificial intelligence, followed by explosive growth in industrial demand, and new application scenarios and application models are rapidly emerging. ”

SenseTime's "Daily Update" big model system has been comprehensively upgraded, and the "data flywheel" empowers the renewal of hundreds of industries

In the financial field, SenseTime cooperates with banks, insurance, securities firms and other customers, uses digital humans for intelligent customer service, smart marketing, etc., and provides new functions such as investment research analysis and research report writing by accessing the ability of large language models to achieve cost reduction and efficiency increase. In addition, after mounting the financial knowledge base, it can also output content Q&A based on the customer's product description 100%, and realize timely update of information.

In the medical scenario, SenseTime has built a Chinese medical language model "Big Doctor" based on massive medical knowledge and clinical data, providing multi-scenario and multi-round conversation capabilities such as guidance, consultation, health consultation, and decision-making assistance, and will soon support multi-modal comprehensive analysis of medical images, text, structured data, etc., and continuously improve medical language understanding and reasoning capabilities, and continue to empower the rate of hospital diagnosis and treatment and patient service improvement.

Combining the comprehensive capabilities of Discussion 2.0 and Miaohua 3.0, SenseTime also brings a variety of intelligent interaction solutions to mobile terminal customers, including Q&A interaction for information acquisition, knowledge interaction for life scenarios, and content interaction for language and image generation, etc., relying on the lightweight version of SenseTime's model, which can be easily deployed and operated on mobile terminals. In addition, in the immersive sci-fi experience space "Three-Body Beyond Gravity", created by SenseTime based on Liu Cixin's award-winning novel "The Three-Body Problem", SenseTime uses the ability of large models to break through the boundaries of imagination and create and present a futuristic sci-fi voyage.

SenseTime's "Daily Update" big model system has been comprehensively upgraded, and the "data flywheel" empowers the renewal of hundreds of industries

For offline scenarios, SenseTime uses large model capabilities to bring intelligent solutions such as long-tail fault identification and complex defect judgment to power grid inspection. Based on the spatial reconstruction of Qiongyu 2.0, SenseTime created a digital twin of the real space for the regional development of Mashan Town in Jinan, China Vision Park in Hefei, and Shanghai Ruijin Hospital to improve the efficiency of operation and management.

In the jewelry industry, SenseTime relies on Gewu 2.0 to reproduce jewelry for jewelry brands, meticulously display the characteristics of product craftsmanship, and enhance customer shopping experience. SenseTime has also reached channel strategic cooperation with a number of leading enterprises to build a "cloud + AIGC+ short video live broadcast" ecology, bringing more efficient, low-cost, convenient and easy-to-use AI video and marketing tools to the industry.

In the field of intelligent vehicles, SenseTime's intelligent cockpit, intelligent driving, vehicle-road coordination and other industry applications have also broken through the boundaries of innovation with the blessing of large models. In the intelligent cockpit, SenseTime perceives user needs in an all-round way through multi-modal integration such as vision and hearing, records user habits and preferences through tagged data, and provides exclusive personalized services. At the same time, SenseTime also uses the powerful environmental understanding, logical thinking and content generation capabilities of the big model to bring a more user-aware "cabin brain", as well as a digital human that can support rapid customization of image and voice for anthropomorphic interaction, bringing an intelligent cockpit experience that integrates safety, entertainment, education and efficiency.

Author: Shen Xiangsha

Image: Courtesy of SenseTime

Read on