laitimes

AI audiobook "Intelligent Transportation" is online Robin Li generated 200,000 words of voice works in 300 sentences

On April 21, 2022, World Book Day approached, and the AI hyper-immersive audiobook "Intelligent Transportation" was launched on the Himalaya APP. This audio work is based on the book "Intelligent Transportation" written by Robin Li, chairman and CEO of Baidu, using about 300 sentences of public voice data by Robin Li, and created and generated by AIGC (AI Automatic Content Generation) technology. Its audio synthesis effect is almost the original sound, and ordinary users are basically unable to distinguish between real human voices and synthesized sounds.

"Intelligent Transportation" audiobook is divided into 86 episodes, 21 episodes on the first day, followed by 2 episodes per day, users can directly listen to "intelligent transportation audiobook" searching on Baidu. From 200,000 words of professional text to hyper-immersive audio works, behind the successful creation of the audiobook "Intelligent Transportation" is Baidu's leading edge in speech synthesis technology. At present, with only 9 sentences of material and 5 minutes of waiting, Baidu speech synthesis technology can realize the reproduction of the user's voice, and 300 sentences can generate audio content comparable to the professional sound library.

AI audiobook "Intelligent Transportation" is online Robin Li generated 200,000 words of voice works in 300 sentences

As the only company in the industry that can provide large-scale product-level personalized speech synthesis services within 10 sentences, Baidu-related technologies have long been applied to various products, such as attracting more than 600 celebrities to enter the voice square in Baidu Map, realizing personalized customized voice packages, with 200 million daily playback times.

The content of the audiobook "Intelligent Transportation" comes from the book "Intelligent Transportation: Major Changes Affecting Mankind in the Next 10-40 Years" by Robin Li, which is the first monograph in China to systematically expound the current situation and prospects of China's intelligent transportation development. The book believes that the mainland has a leading dividend of scientific and technological talents, has a wealth of artificial intelligence application scenarios, has an incomparable good policy environment, and the construction of intelligent transportation will surely be at the forefront of the world. In the future, the intelligent transportation system built by new technologies, new concepts and new models will be expected to reduce traffic safety accidents by 90%; within 10 years, relying on the improvement of traffic efficiency, the problem of urban congestion will be basically solved; with the popularity of shared unmanned vehicles, the demand for private cars will be greatly reduced.

AI audiobook "Intelligent Transportation" is online Robin Li generated 200,000 words of voice works in 300 sentences

In the digital age, the demand for content production continues to increase. AI automatically generates content represented by TTS technology (Text to speech) has become an emerging way of content production. This method specifically includes generating ideas (such as themes, ideas, etc.) through AI, generating materials (such as text, illustrations, dubbing, etc.), and finally producing content in a way that is automatically arranged and synthesized. In the future, it may only take a few seconds to generate content that used to take days of time and effort to create. This is not only a simple increase in efficiency, but also opens up more possibilities and stimulates creative ideas and creative cognition that humans have never had.

AI audiobook "Intelligent Transportation" is online Robin Li generated 200,000 words of voice works in 300 sentences

The application development of AIGC depends on the support of AI full-stack technology capabilities. As a leader in the field of domestic AI, Baidu has laid out artificial intelligence for more than ten years, built the world's largest knowledge graph, computer vision, speech, language and other core technical capabilities are industry-leading, Baidu Wenxin big model has higher learning efficiency and better versatility than the same industry big model, AIGC will achieve large-scale application with the help of the cross-modal comprehensive technical capabilities of the big model.

Upstream journalist Yang Ye intern Liu Jin

Read on