
Zhihu CTO Li Dahai: Multimodal Exploration of Intelligent Communities under the Trend of Video

The 2021 World Artificial Intelligence Conference (WAIC) was recently held in Shanghai. At the WAIC·AI Developer Forum on July 10th, Li Dahai, partner and CTO of Zhihu, delivered a keynote speech on Zhihu's exploration and practical application of multimodal technology, as an intelligent community, under the trend toward video.


As a Q&A community, Zhihu has developed over ten years, and its business has grown through four stages, from initial closed operation to openness, continuously expanding its user scenarios and user scale. Li Dahai said that AI technology is already widely used in every core link of Zhihu to build an intelligent community and improve community efficiency. As more and more users share their knowledge, experience, and insights through video on Zhihu, the company has come to see that video and text-and-image content each have their own advantages, disadvantages, and applicable scenarios, and that the community needs a media upgrade so that video becomes as important as text and images. Zhihu has therefore set a video intelligence technology strategy with multimodality at its core.

According to Li Dahai, Zhihu has built an image-text multimodal pre-training model using a two-stream contrastive learning framework. The model is already widely used in scenarios such as Zhihu's video production, search and distribution, and topic matching and ranking.
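A two-stream contrastive set-up pairs a text encoder with an image encoder and trains them so that matched text-image pairs score higher than mismatched ones. The sketch below illustrates the core objective with a CLIP-style symmetric InfoNCE loss; it is an assumption about the general technique, not Zhihu's actual model, and the embeddings stand in for real encoder outputs.

```python
import numpy as np

def l2_normalize(x):
    return x / np.linalg.norm(x, axis=-1, keepdims=True)

def contrastive_loss(text_emb, image_emb, temperature=0.07):
    """Symmetric InfoNCE loss over a batch of matched (text, image) pairs.

    Row i of `text_emb` is assumed to describe row i of `image_emb`;
    every other row in the batch serves as a negative example.
    """
    t = l2_normalize(np.asarray(text_emb, dtype=float))
    v = l2_normalize(np.asarray(image_emb, dtype=float))
    logits = t @ v.T / temperature          # pairwise cosine similarities
    n = logits.shape[0]

    def cross_entropy(lg):
        # numerically stable log-softmax; the target for row i is column i
        lg = lg - lg.max(axis=1, keepdims=True)
        log_probs = lg - np.log(np.exp(lg).sum(axis=1, keepdims=True))
        return -log_probs[np.arange(n), np.arange(n)].mean()

    # average the text->image and image->text directions
    return (cross_entropy(logits) + cross_entropy(logits.T)) / 2
```

Because the two streams are separate encoders, either side can be embedded offline and reused for retrieval, which is what makes this design practical for search, distribution, and matching scenarios.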


In October last year, Zhihu released a one-click tool for turning text posts into videos, known internally as the "PPT video creation tool", which lets text creators quickly generate a video from an answer or article. The conversion works by splitting the article into paragraphs or sentences, then using the pre-training model to score the relevance between each passage and the pictures, GIFs, and short videos in a material library, selecting the best-matching asset for each. Creators can also enter keywords directly to retrieve the images that best match those keywords from the library and assemble their own video stream.
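The matching step described above can be sketched as a nearest-neighbor lookup in the shared embedding space. The function and array names below are hypothetical; `paragraph_embs` and `asset_embs` stand in for the outputs of the text and image encoders.

```python
import numpy as np

def best_asset_per_paragraph(paragraph_embs, asset_embs):
    """Return the index of the highest-scoring library asset for each paragraph."""
    p = paragraph_embs / np.linalg.norm(paragraph_embs, axis=1, keepdims=True)
    a = asset_embs / np.linalg.norm(asset_embs, axis=1, keepdims=True)
    scores = p @ a.T              # cosine similarity, (paragraphs x assets)
    return scores.argmax(axis=1)  # one picture/GIF/clip per paragraph
```

The keyword-driven mode fits the same sketch: embedding a creator's keyword with the text encoder and passing it in place of a paragraph yields the best-matching images for that keyword.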

Li Dahai said that integrating video into the community helps Zhihu better fulfill its mission: "Let people better share their knowledge, experience, and insights, and find their own answers." Going forward, building on its accumulated text, image, and video data, Zhihu will work to build a large-scale pre-training model that unifies text, images, video, audio, and other media, and will open the results to more developers in academia and industry.
