
PIKA Officially Launches Lip Sync - Injecting New "Imagination" into AI Video

I was about to go to sleep... but before bed, I took a quick look at the latest updates from a few AI products...

And then...

After three months of silence, PIKA has finally shipped an update...

Instead of racing on the model itself, and instead of racing on controllability, they kept working toward the goal they set back at 1.0:

Dialogue.

One thing that sets PIKA's model apart from every other AI video product:

When you generate a character, there is a high probability it will speak; the lips move, simulating the feeling that the character is mid-conversation.

PIKA believes this effect comes closer to a real "short film"; after all, if a short film is going to be any good, dialogue between characters is essential.

Only in dialogue is there drama; only in conflict is there tension.

With traditional tools like Runway, the shots are all empty B-roll, and in many cases dialogue can only be delivered as voice-over narration. That way, the audience's sense of immersion drops sharply.

But when PIKA 1.0 launched last year, it hadn't actually wired up voice: you could only generate a mouth moving at random, with no sound.

You had to go to 11Labs or Magic Sound Workshop yourself to generate the voice, then use Jianying or some other editor to stitch the audio together with your AI clips.
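Incidentally, that manual stitching step can be scripted instead of done in an editor. A minimal sketch, assuming `ffmpeg` is installed; the file names are placeholders:

```python
# Sketch: replace the audio of an AI-generated clip with a separately
# generated TTS voice track via ffmpeg. File names are placeholders.
import subprocess

def build_mux_cmd(video_path: str, audio_path: str, out_path: str) -> list:
    """Build the ffmpeg command without running it."""
    return [
        "ffmpeg", "-y",
        "-i", video_path,   # the silent AI-generated clip
        "-i", audio_path,   # the voice track from 11Labs etc.
        "-map", "0:v:0",    # take the video stream from the first input
        "-map", "1:a:0",    # take the audio stream from the second input
        "-c:v", "copy",     # keep the video stream as-is, no re-encode
        "-shortest",        # cut at whichever stream ends first
        out_path,
    ]

# To actually run it:
# subprocess.run(build_mux_cmd("clip.mp4", "voice.mp3", "final.mp4"),
#                check=True)
```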

But there was a problem: the lip shapes and the pronunciation didn't match.

So it made complete sense for PIKA to build lip sync: they had to do it, and they were always going to do it.

And sure enough, here it is. Let's take a look at PIKA's new trailer first.

However, PIKA's trailers... you know how it is.

You'll have to try it yourself.

It just so happens that lip sync is also open to Super Collaborators, so I tested it.

After entering the PIKA homepage and uploading a video or a picture, you can see a feature like this:

This is the lip sync feature.

When you click on it, you will see the speech generation area.

PIKA has also partnered with 11Labs and plugged in 11Labs' TTS: in the upper area you can select a specific voice, then enter text and turn it into speech.

Of course, you can also upload your own audio.

I'm still used to generating the audio myself with 11Labs or Magic Sound Workshop; after all, those two are a bit more capable.
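If you do generate the audio yourself, the 11Labs (ElevenLabs) side is easy to script against their public text-to-speech REST endpoint. A minimal sketch; the API key and voice ID are placeholders, and current parameters should be checked against ElevenLabs' own docs:

```python
# Sketch: build a text-to-speech request against 11Labs' (ElevenLabs')
# REST API. The API key and voice ID below are placeholders, not real values.
import json
import urllib.request

API_KEY = "YOUR_API_KEY"    # placeholder: your ElevenLabs key
VOICE_ID = "YOUR_VOICE_ID"  # placeholder: a voice from your voice library

def build_tts_request(text: str) -> urllib.request.Request:
    """Build (but do not send) the POST request for one line of dialogue."""
    url = "https://api.elevenlabs.io/v1/text-to-speech/" + VOICE_ID
    payload = json.dumps({"text": text}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=payload,
        headers={"xi-api-key": API_KEY, "Content-Type": "application/json"},
        method="POST",
    )

# To actually fetch the MP3 (needs a real key and network access):
# with urllib.request.urlopen(build_tts_request("Hello there")) as resp:
#     open("line.mp3", "wb").write(resp.read())
```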

Once it's uploaded, you can start generating.

I've probably run dozens of cases: from 1/2-frontal to 1/5-frontal angles, from front face to profile, from realistic to 2D, from image to video... I tested the whole range.

Let's take a look at a few examples that I think work well:

But many more of them were bad cases.

On the whole, after my experiments, I recommend generating the video first from text or an image, and then running lip sync on the video, rather than lip-syncing a still image directly.

The results of the two are very similar, and if you drive from video you can also get big camera moves or changing backgrounds.

This move of PIKA's into lip sync opens up real imaginative room, because whether it's Heygen or Wonder Yuan, all they can do is make a static photo talk, which has clear limits: the background doesn't move.

For example, this Trump clip I made with Wonder Yuan a while back.

PIKA's own AI video, coupled with lip sync, will make possible some effects that used to be very complex to achieve but highly dramatic, bringing some new imagination to AI video.

But in terms of lip sync quality, frankly speaking, there is still some distance to go compared with Heygen and Wonder Yuan.

For example, only front faces are supported; side faces fall apart.

For example, when parts of the face are occluded, recognition goes wrong.

For example, faces in the background get recognized and lip-synced along with the subject.

For example, the lips often jitter and aren't stable enough.

For example, the lips sometimes smear into mush.

And so on.

But after all, this PIKA update is still a test build and hasn't been released to the public yet.

There's still a lot of room for optimization.

Think of MJ's V1 moment, right?

I'm looking forward to PIKA's follow-up optimization of lip sync injecting some new vitality into AI video.

Though I do have a feeling that...

11Labs, the company doing AI voices, is the biggest winner...
