GoMAvatar: An Efficient Human Digitization Method from Single-View Video with High-Fidelity Rendering and Deformation

Author: 3D Vision Workshop

Editor: Computer Vision Workshop

Title: GoMAvatar: Efficient Animatable Human Modeling from Monocular Video Using Gaussians-on-Mesh

Authors: Jing Wen et al

Homepage: https://wenj.github.io/GoMAvatar/

Paper: https://arxiv.org/pdf/2404.07991.pdf

1. Introduction

This article introduces GoMAvatar, a new method that quickly and efficiently reconstructs high-quality, animatable human avatars from monocular video. At the heart of the approach is the Gaussians-on-Mesh (GoM) representation, which combines the quality and speed of Gaussian splatting with the explicit geometry and compatibility of a deformable mesh. Specifically, GoM renders with Gaussians, which provides rich flexibility for modeling appearance and enables real-time performance. At the same time, GoM relies on a skeleton-driven deformable mesh to obtain a compact, topologically consistent digital avatar and to articulate it simply via forward kinematics. Crucially, to integrate the two representations, the Gaussians are attached to the mesh faces, which better regularizes how they deform under novel poses. In addition, to handle view-dependent effects, the final color is decomposed into a pseudo-albedo map rendered from the Gaussians and a pseudo-shading map derived from the normal map. The representation is learned from only a single input video. Experimental results show that GoMAvatar matches or outperforms current state-of-the-art monocular human modeling algorithms in rendering quality, while significantly surpassing them in computational efficiency (43 FPS) and remaining highly memory-efficient (3.63 MB per subject).
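
As a rough illustration of the color decomposition above, the sketch below (in PyTorch) multiplies a pseudo-albedo image by a pseudo-shading image computed from rendered normals. The Lambertian-style shading term and the `light_dir` and `ambient` parameters are assumptions made for illustration only; the paper's actual mapping from normals to shading may differ.

```python
import torch

def compose_color(pseudo_albedo: torch.Tensor,
                  normal_map: torch.Tensor,
                  light_dir: torch.Tensor,
                  ambient: float = 0.3) -> torch.Tensor:
    """Combine a pseudo-albedo map with a pseudo-shading map derived from normals.

    pseudo_albedo: (H, W, 3) colors splatted from the Gaussians, in [0, 1].
    normal_map:    (H, W, 3) unit normals rendered from the mesh.
    light_dir:     (3,) unit vector; a stand-in for whatever drives the shading.
    """
    # Cosine shading term from the normals, clamped to be non-negative.
    cos_term = (normal_map * light_dir.view(1, 1, 3)).sum(dim=-1, keepdim=True)
    pseudo_shading = ambient + (1.0 - ambient) * cos_term.clamp(min=0.0)
    # Final color = pseudo-albedo * pseudo-shading (element-wise).
    return (pseudo_albedo * pseudo_shading).clamp(0.0, 1.0)
```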

2. Method (GoM)

The core idea of GoMAvatar is to combine Gaussian splatting with a deformable-mesh representation. Specifically:

  1. Gaussian splatting for rendering: GoM renders with Gaussians, which keeps rendering fast. Each mesh face is associated with one Gaussian whose mean and covariance are derived from the face's vertex coordinates (see the sketch after this list).
  2. Deformable mesh for geometry: GoM uses a deformable mesh for articulation, providing explicit geometric information that adapts to different human poses. Each vertex carries linear blend skinning (LBS) weights used to deform the mesh.
  3. Forward-kinematic deformation: GoM deforms the mesh via forward kinematics, avoiding the ambiguities of backward mapping and yielding more accurate deformation.
  4. Compatible rendering and deformation: rendering and articulation operate on the same representation, so high-quality images can be produced efficiently.
  5. Efficient rendering: combining Gaussian splatting with the mesh makes rendering efficient.
  6. Explicit geometry: the mesh supplies explicit geometric information, which mitigates the overfitting problems of unconstrained Gaussians.
  7. Speed-quality balance: GoM balances rendering speed and quality, producing high-quality images quickly.
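
As referenced in item 1, here is a minimal sketch of the two building blocks described above, assuming a PyTorch setting: linear blend skinning with pre-composed per-joint transforms (forward kinematics), and one Gaussian per mesh face with its mean at the face barycenter and its covariance spanned by the face edges. The function names (`lbs_deform`, `gaussians_on_faces`) and the exact covariance parameterization are illustrative assumptions, not the paper's implementation.

```python
import torch

def lbs_deform(vertices: torch.Tensor,
               skin_weights: torch.Tensor,
               joint_transforms: torch.Tensor) -> torch.Tensor:
    """Linear blend skinning (forward kinematics), assuming `joint_transforms`
    are 4x4 bone transforms already composed along the kinematic chain.
    Shapes: vertices (V, 3), skin_weights (V, J), joint_transforms (J, 4, 4).
    """
    # Blend the per-joint transforms with the per-vertex skinning weights.
    blended = torch.einsum('vj,jab->vab', skin_weights, joint_transforms)  # (V, 4, 4)
    v_h = torch.cat([vertices, torch.ones_like(vertices[:, :1])], dim=-1)  # homogeneous
    return torch.einsum('vab,vb->va', blended, v_h)[:, :3]

def gaussians_on_faces(vertices: torch.Tensor, faces: torch.Tensor):
    """Hypothetical helper tying one Gaussian to each mesh face: the mean sits
    at the face barycenter and the covariance is built from the face edges.
    vertices: (V, 3), faces: (F, 3) long tensor of vertex indices.
    """
    tri = vertices[faces]                      # (F, 3, 3) triangle corners
    means = tri.mean(dim=1)                    # (F, 3) barycenters
    e1 = tri[:, 1] - tri[:, 0]                 # first edge
    e2 = tri[:, 2] - tri[:, 0]                 # second edge
    # Covariance spanned by the two edges (a flat Gaussian lying in the face plane).
    cov = (e1.unsqueeze(-1) @ e1.unsqueeze(-2) +
           e2.unsqueeze(-1) @ e2.unsqueeze(-2)) * 0.25
    return means, cov
```

In this sketch the Gaussians are recomputed from the deformed vertices, so articulating the mesh with `lbs_deform` automatically carries the Gaussians into the new pose.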

3. Experiments

The authors conducted an extensive experimental evaluation of the GoMAvatar method and compared it with other approaches to human digitization from single-view video. The evaluation covers:

  1. Datasets: experiments on the ZJU-MoCap dataset, the PeopleSnapshot dataset, and YouTube videos.
  2. Baselines: recent human digitization methods including NeuralBody, HumanNeRF, NeuMan, MonoHuman, Anim-NeRF, and InstantAvatar.
  3. Evaluation metrics: PSNR, SSIM, LPIPS, Chamfer distance (CD), normal consistency (NC), inference speed, and memory usage (a minimal PSNR sketch follows this list).
  4. Quantitative results: on ZJU-MoCap, GoMAvatar strikes a good balance among rendering quality, inference speed, and memory usage.
  5. Qualitative comparison: against NeuralBody, HumanNeRF, and MonoHuman, GoMAvatar shows advantages in detail, surface geometry, and handling of self-intersections.
  6. Failure cases: the authors show GoMAvatar's limitations in unobserved regions and under topological changes, while also demonstrating its flexibility in fitting garments with different topologies.
  7. Sensitivity analysis: the authors analyze sensitivity to pose estimation accuracy and show that the method is robust to pose estimation errors.
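
As a quick reference for the metrics in item 3, a minimal PSNR computation (the standard definition, not code from the paper) looks like this; SSIM and LPIPS are usually taken from existing packages such as the lpips library.

```python
import torch

def psnr(pred: torch.Tensor, target: torch.Tensor, max_val: float = 1.0) -> torch.Tensor:
    """Peak signal-to-noise ratio between a rendered image and the ground truth.

    Both tensors are (H, W, 3) with values in [0, max_val].
    """
    mse = torch.mean((pred - target) ** 2)
    return 10.0 * torch.log10(max_val ** 2 / mse)
```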

These results verify the efficiency and high rendering quality of GoMAvatar on human digitization tasks and provide a useful reference for further development in this area.

4. Summary

This paper proposes GoMAvatar, an efficient, high-quality approach to human digitization. The core idea is to combine Gaussian-splatting rendering with deformable-mesh articulation, improving both rendering speed and quality. Specifically, GoMAvatar renders with Gaussians, avoiding the dense sampling of volumetric rendering, and attaches the Gaussians to a deformable mesh so that they follow different human poses. It also deforms the mesh via forward kinematics, avoiding the ambiguities of backward mapping. Experimental results show that GoMAvatar achieves a good balance of rendering quality and inference speed and compares favorably with other recent methods. Finally, the authors provide qualitative and quantitative analyses that demonstrate GoMAvatar's advantages in detail rendering and in handling self-intersections, as well as its limitations in unobserved regions and under topological changes. Overall, GoMAvatar offers an efficient, high-quality option for human digitization tasks.

Here we also recommend "New SLAM Algorithms Based on NeRF/Gaussian 3D Reconstruction", a new course launched by the 3D Vision Workshop together with Gigi.

About the Speaker

Course outline

Course Highlights:

  • The course covers both theory and code implementation, taking you from scratch through the principles of NeRF/Gaussian-based SLAM, paper reading, and code walkthroughs.
  • On the theory side, it builds up from linear algebra and classical computer graphics to the theoretical foundations of modern 3D reconstruction.
  • On the code side, a series of exercises guides you through reproducing computer-graphics and NeRF-related work.

What you will gain

  • Get started with NeRF/Gaussian-based SLAM
  • Learn how to quickly grasp a paper's key points and innovations
  • Learn how to quickly run a paper's code and understand its ideas through the code
  • Parse NeRF code line by line, grasp every implementation detail, and reproduce and improve it yourself

Course requirements

  • System requirements: Linux
  • Programming language: Python
  • Prerequisites: basic knowledge of Python and PyTorch

Who this course is for

  • Beginners who have no idea how to get started with the open-source code of a new paper
  • Newcomers to SLAM localization and mapping, and to NeRF-based 3D reconstruction
  • Practitioners working on 3D reconstruction who want a reference
  • First-time readers of NeRF papers
  • Students who are interested in SLAM and NeRF

Start time

Saturday, February 24, 2024, at 8 p.m., with one chapter released per week.

Course Q&A

Q&A for this course takes place in the course's dedicated community group ("goose circle"), where students can post questions at any time during the course.
