Artificial intelligence large models have been launched one after another to accelerate the development of the empowerment industry

Recently, many domestic science and technology enterprises or scientific research institutions have officially opened self-developed artificial intelligence large models to the public, marking that domestic large models have begun to move from small-scale testing to large-scale application. How will the big model empower individuals and industries? What impact will this round of open services have on industry competition? What will be the future development trend of domestic large models?

The competition for artificial intelligence big models has entered a new stage

In early September, iFLYTEK announced that the iFLYTEK Spark Cognitive Model was open to the public, and so far, the first batch of more than ten AI models including iFLYTEK Spark, Baidu Wenxin Yiyan, SenseChat, etc. are all open to the public. In addition, the large models of some companies, including Tencent, have also been filed and will be opened to the public at the opportunity.

Artificial intelligence large models refer to the "big parameter" models trained using large-scale data and powerful computing power, which usually have a high degree of generalization and generalization ability, and can be applied to natural language processing, image recognition, speech recognition and other fields.

Previously, many enterprises and institutions in the mainland have released large-model products and conducted small-scale testing, and the large-scale model of the open service also means that they have passed the filing of relevant departments. On August 15, Mainland China officially implemented the Interim Measures for the Management of Generative Artificial Intelligence Services.

Yu Huan, director of Baidu's Science and Technology and Society Research Center, introduced that Wen Xin Yiyan has grown rapidly since its launch in March this year, mastering more than 200 creative genres, content richness is 1.6 times that of the early stage of release, the length of the thinking chain is 2.1 times that of the early stage, and the coverage of knowledge points has reached 8.3 times that of the initial stage. In terms of efficiency, the training speed of the Wenxin large model has reached 3 times and the inference speed has reached more than 30 times.

Liu Qingfeng, Chairman of iFLYTEK, introduced that since the first release of the iFLYTEK Spark cognitive big model on May 6, it has undergone two major iterative upgrades, and has continued to evolve in seven capabilities: text generation, language understanding, knowledge question and answer, logical reasoning, mathematical ability, code ability, and multimodal capability. At the latest iFLYTEK Spark V2.0 upgrade conference released on August 15, code capabilities and multimodal capabilities were upgraded.

According to the relevant person in charge of SenseTime, the characteristics and capabilities of the company's large model "Discussion SenseChat" include efficient processing of a large number of texts, powerful natural language dialogue, logical deduction capabilities, extensive knowledge reserves and information updates, support for Chinese Simplified, Chinese Traditional, English and various commonly used programming languages.

"With the successful filing, the various parties involved are eager to showcase the R&D progress and results since the beginning of the year. At the same time, the faster release of related services, you can get more user and industry feedback, so as to further accelerate the iteration and application of their respective large models, and gain as much competitive advantage as possible. Yu Hao, vice president of Legend Holdings Co., Ltd. and president of the Prospective Technology Research Institute, said.

The big model will empower individuals and industries

The domestic large model that has been launched has opened up a number of tools and is expected to become a "super assistant" for ordinary people to improve efficiency.

Liu Qingfeng said that since the launch of the test, users of the iFLYTEK Xinghuo APP have developed more than 20,000 personalized artificial intelligence assistants, each ordinary person can call a number of artificial intelligence assistants that have been put on the shelf, and can also develop their own exclusive assistants through a few simple steps to meet the needs of workplace, marketing, travel, life, official documents, customer service and other scenarios, and help solve various problems in work and life. According to the data since the launch test of iFLYTEK Xinghuo cognitive big model, the current scenarios that users just need to use are concentrated in scenarios such as knowledge Q&A, content generation, creative planning, life encyclopedia and programming assistance.

"As the automation of expert capabilities, large models can bring great convenience to ordinary people in all aspects, such as helping ordinary people learn knowledge, modify articles, generate solutions, and so on. Therefore, the centralized release of large models can allow ordinary people to quickly contact, use and understand large models, and the good experience of large models will enhance ordinary people's trust in large models. Sun Le, researcher at the Institute of Software of the Chinese Academy of Sciences and secretary general of the China Society for Chinese Information, said.

The relevant person in charge of Ailiai Think Tank believes that the centralized launch is expected to accelerate the commercialization of domestic large models and downstream applications. At present, the industry has the conditions for large-scale commercialization, from the supply side, the maturity of the existing technology can assist users in content production on a large scale, and the downstream application supply has blossomed at multiple points, involving painting, writing and video and other fields; From the demand side, driven by the fragmentation and lightweight of entertainment, the demand for user content consumption has increased exponentially, and traditional production methods have led to a sharp increase in cost pressure in the industry, and it is urgent to reduce costs and increase efficiency through artificial intelligence production content.

At present, a number of companies are exploring the in-depth application of large models in various industries. In early September, Tencent's hybrid model was officially unveiled and announced the opening of API access services through Tencent Cloud. According to the relevant person in charge of Tencent, Tencent Cloud, Tencent Advertising, Tencent Games, Tencent Fintech, Tencent Meeting, Tencent Docs, WeChat Souyi Search, QQ Browser and other Tencent internal businesses and products have been connected to the hybrid model and achieved preliminary results, and more services and applications are gradually being accessed.

"With large model generation technology as the core, artificial intelligence is becoming the key driving force for the next round of digital development, bringing new ideas to solve industrial pain points." Tang Daosheng, senior executive vice president of Tencent Group and CEO of the Cloud and Smart Industry Business Group, said.

Shenzhen Yuntian Lifei Technology Co., Ltd. first announced its large-scale model research and development in July this year. The relevant person in charge of the company introduced that based on the "Book of Heaven" large model, Yuntian Lifei and Shenzhen Longgang Political and Numerical Bureau jointly explored the application of the large model in the government consulting service system, and planned to gradually expand to other government agencies and public service fields in the future, such as legal, finance, education, medical care, transportation, etc.

For large model R&D companies, whoever can bring the best industry application the fastest will be able to obtain the most user feedback and data, further increasing the competitive advantage of their large model.

"When Wenxin opens up its services to hundreds of millions of Internet users on a large scale, it can get more human feedback in the real world, which will help further improve the basic model and iterate the product more quickly, creating a better user experience." Robin Li, founder, chairman and CEO of Baidu, said.

The "infancy" of the large model is still waiting to be "fed back"

Industry insiders believe that although the current domestic large-scale model research and development is progressing rapidly, bottlenecks such as computing power and talents still need to be further broken, and the business model needs to be further clarified.

According to Lin Dahua, a professor at the Shanghai Artificial Intelligence Laboratory, the biggest difficulty and challenge in the training of a language model with a scale of hundreds of billions of parameters is not the lack of data, but the high cost of trial and error. A large model with 100 billion parameters needs 3 months to train on a kcal cluster. Each update requires a lot of computing power, and the training cycle is very long.

Yu Hao believes that the large model as a "base technology" is still in the "infancy stage", and its growth requires extremely expensive and energy-consuming "computing power warming house", surrounded by a group of "expert nannies" specializing in artificial intelligence algorithm optimization, and a large number of high-quality "data ingredients". If you accidentally ingest "contaminated ingredients" or "do not strictly train" large models, it is very likely to cultivate "bear children" who are "nonsense", causing incalculable impact on large model enterprises.

"Considering future sustainable development, on the one hand, it is to build a large model infrastructure engine and build a solid foundation, on the other hand, cultivate a small model ecosystem, evolve a technology stack that adapts to the long-tail needs of diversified small scenarios of artificial intelligence, solve practical problems with small models, create value monetization, and then feed back large models." Yu Hao said.

When talking about the development trend of domestic large models, Sun Le believes that the first is that with the acquisition of a large number of high-value data and user feedback, the level of domestic large models will improve rapidly. Secondly, in terms of application, large models will be widely implemented, and typical application paradigms will gradually form in suitable scenarios. Finally, large models will still face some long-term scientific problems, such as reliable, controllable and credible problems, and will also discover new scientific problems in the large-scale implementation process, which need to be solved by governments, businesses and academia.

Shen Yang, a professor at Tsinghua University's School of Journalism and Communication, believes that the centralized launch of domestic large models will give rise to a series of development trends, including accelerated technology iteration, deepening industry applications, diversified business models, prominent data and ethical issues, and internationalization. These trends will not only affect the development of the model itself, but will also have a profound impact on socio-economic, legal norms and the international competitive landscape. Therefore, all parties need to pay close attention and make corresponding strategic planning and countermeasures.

Artificial intelligence large models have been launched one after another to accelerate the development of the empowerment industry

Read on

This year's college entrance examination essay was accused by netizens of discriminating against rural children, and artificial intelligence can't write at all!

There is no standard answer in life! Al artificial intelligence inter-question, in the problem of over-standardization!

Invest in AI and the next millionaire will be you! 600,000 millionaires in the United States were thus created

#头条创作挑战赛#Gai是现在人工智能追求的目标, which is also the essence of artificial intelligence now, the establishment of a knowledge base cannot be like an industry knowledge base

16 college entrance examination records! Use mathematical models to predict Tang Shangjun's 2024 college entrance examination scores!

The artificial intelligence of the essay questions in the new volume is just a pretense, and if you know too much, you may go off topic

Podcast Update|First Voter for MiniMax: MiniMax, GenAI Conference, and Big Model Playing Cards

WWDC24:苹果人工智能上线,Siri与ChatGPT整合惹怒马斯克

Artificial intelligence is a product of the progress of the times, is it the happiness or sorrow of human beings?

CVPR 2024|Only one language model is needed to generate high-quality 360-degree scenes from image diffusion models

This one is so cute! The latest electric smart robot dog in 2024 is not only realistic, but also has an eight-wheel design, which is flexible and free! Gifts that both boys and girls will love are parent-child interactions

The Digital Transformation Maturity Model and Assessment was released

In the restaurant, robots stir-fry and deliver food, replacing chefs and waiters; At home and on the road, robots sweep the floor, replacing housekeeping and cleaners; The automated production line in the factory replaces a large amount of turnover

The U.S. government has been revealed to be considering another move to restrict China's access to chip technology for artificial intelligence, which China has previously stated

3 types of children "will be abolished as soon as the test is taken", Dr. Tsinghua's iron triangle model will help you become a master of the exam

绝对新鲜实惠图源：archiminibricks#乐高 #乐高MOC #积木#模型#大人也要玩玩具

Development Trend of Large Models: Multimodal, Autonomous Intelligence, Edge Intelligence...

How important mathematics is, look at the official media to give you popular science. It is increasingly connected with aerospace, defense security, biomedicine, marine, artificial intelligence, information, energy, and advanced manufacturing.

The effect is benchmarked against Sora's domestic AI video application, and the large model of Kuaishou video generation can be unveiled

Carnegie Mellon University is researching how artificial intelligence can better provide justice for humanity

Altman talks about the opportunities, challenges and human self-reflection of AI: China will have a unique large language model

In-depth report on the artificial intelligence industry - after the "first year" - let's look at the commercialization progress of large model applications

Artificial intelligence iterates everything, and liberal arts students will be the "last survivors"

谷歌 Pixel 手机获功能更新：Pixel 8(a) 可用 Gemini Nano 模型

A purely money-spending thing has been done for 6 years in a row, and Ali is still happy to do it, and the ambition behind it can't be hidden! On June 13th, the finalists of the 2024 Ali Mathematics Competition were released

Dai Weijin, Executive Vice President and General Manager of the IP Business Unit of VeriSilicon: Large models are entering the edge and end side, and mobile phones, PCs and automobiles are the main forces. VeriSilicon's CPU, GPU, NPU, V

19 Best Large Language Models in 2024

2024: The most Fortune 500 states in the U.S. have headquarters in 226 cities in 37 states, from small towns in Ohio to Los Angeles

One of the top 10 models for data analysis: the funnel model

Who is the cockpit ceiling of new energy vehicles? The HarmonyOS cockpit is famous, but a new challenger has emerged! #智能座舱#6月12日, Great Wall Motor announced CoffeeO

Summary of today's bidding board (June 13) The structure of the 1-3rd daily line is under great pressure, and the bidding is flawed or suspected of being tempting, so they did not enter the market, but in the end they were all closed. The bid on the 4th is acceptable,