
Only 3GB! 2ms! Two images can reconstruct the entire 3D Gaussian scene!

Author: 3D Vision Workshop

Source: 3D Vision Workshop

Add WeChat: dddvision, note: 3D GS, and you will be added to the group. Industry subgroups are listed at the end of the article.

0. Foreword

Today we recommend a new work in the 3D GS direction, pixelSplat, which reconstructs a 3D radiance field parameterized by 3D Gaussian primitives from just two images and performs novel view synthesis.

Let's read about this work together~

1. Paper information

Title: pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction

Authors: David Charatan, Sizhe Li, Andrea Tagliasacchi, Vincent Sitzmann

Institutions: Massachusetts Institute of Technology, Simon Fraser University, University of Toronto

Original link: https://arxiv.org/abs/2312.12337

Code link: https://github.com/dcharatan/pixelsplat

Official website: https://dcharatan.github.io/pixelsplat

2. Abstract

We introduce pixelSplat, a feed-forward model that learns to reconstruct 3D radiance fields parameterized by 3D Gaussian primitives from pairs of images. Our model features real-time and memory-efficient rendering for scalable training, as well as fast 3D reconstruction at inference time. To overcome the local minima inherent to sparse and locally supported representations, we predict a dense probability distribution over 3D space and sample Gaussian means from that distribution. We make this sampling operation differentiable via a reparameterization trick, allowing us to backpropagate gradients through the Gaussian splatting representation. We benchmark our method on wide-baseline novel view synthesis on the real-world RealEstate10k and ACID datasets, where we outperform state-of-the-art light field transformers and accelerate rendering by 2.5 orders of magnitude while reconstructing an interpretable and editable 3D radiance field.
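To make the "differentiable sampling" idea concrete, here is a minimal PyTorch sketch (a hypothetical helper, not the authors' code). It assumes uniformly spaced depth buckets between hypothetical near/far planes and omits the per-bucket depth offsets, but shows how setting the opacity to the probability of the sampled bucket lets gradients reach the depth distribution even though the discrete draw itself carries no gradient.

```python
import torch

def sample_depth_reparameterized(depth_logits, near=1.0, far=100.0):
    # Hypothetical sketch: sample a per-pixel depth from a discrete
    # distribution over depth buckets while keeping the result differentiable.
    # depth_logits: (N, Z) unnormalized per-pixel scores over Z depth buckets.
    probs = torch.softmax(depth_logits, dim=-1)           # (N, Z) per-pixel depth distribution
    idx = torch.multinomial(probs, num_samples=1)         # (N, 1) sampled bucket index (non-differentiable draw)
    opacity = torch.gather(probs, dim=-1, index=idx)      # (N, 1) probability of the sampled bucket;
                                                          # gradients flow to the logits through this value
    bucket_width = (far - near) / depth_logits.shape[-1]
    depth = near + (idx.float() + 0.5) * bucket_width     # (N, 1) bucket-center depth (offsets omitted)
    return depth.squeeze(-1), opacity.squeeze(-1)

# Usage sketch: losses on rendered opacity/color backpropagate into depth_logits.
depth, alpha = sample_depth_reparameterized(torch.randn(4, 64, requires_grad=True))
alpha.sum().backward()
```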

3. Results

Given a pair of input images, pixelSplat reconstructs a 3D radiance field parameterized by 3D Gaussian primitives. The result is an explicit 3D representation that renders in real time, remains editable, and is inexpensive to train.


Predicted 3D Gaussians (top) and the corresponding depth maps (bottom).


4. Baseline methods

The authors mainly compared with the following baselines:

The method of Du et al. (https://yilundu.github.io/wide_baseline/): a light field renderer designed for wide-baseline novel view synthesis.

GPNR: a light field transformer that only handles two input views.

pixelNeRF: a well-known NeRF-based approach that struggles on scene-scale datasets because it cannot handle scale ambiguity.

5. How does it work?

Probabilistic prediction of pixel-aligned Gaussians. For each pixel feature f[u] in the input feature map, a neural network predicts the Gaussian primitive's covariance Σ and spherical harmonics coefficients S. The Gaussian position μ and opacity α are not predicted directly, since doing so leads to local minima. Instead, the network predicts a per-pixel discrete probability distribution over depth, pφ(z), parameterized by φ; a depth sampled from this distribution determines the position of the Gaussian primitive, and the opacity of each Gaussian is set to the probability of the sampled depth bucket. The final set of Gaussian primitives can then be rendered from novel views using the splatting algorithm of Kerbl et al.
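For intuition, the sketch below (a hypothetical helper using a standard pinhole-camera convention, not code from the official repository) shows how a pixel and its sampled depth could be unprojected into the mean of a pixel-aligned Gaussian; the predicted covariance Σ, spherical harmonics S, and the opacity taken from the depth-bucket probability would then be attached to each mean before splatting.

```python
import torch

def unproject_to_gaussian_means(pixels, depths, K, cam_to_world):
    # Hypothetical sketch: turn per-pixel sampled depths into world-space
    # means of pixel-aligned Gaussians.
    # pixels:       (N, 2) pixel coordinates (u, v)
    # depths:       (N,)   depths sampled from the per-pixel distribution
    # K:            (3, 3) pinhole intrinsics
    # cam_to_world: (4, 4) camera-to-world transform
    ones = torch.ones(pixels.shape[0], 1)
    pix_h = torch.cat([pixels, ones], dim=-1)            # (N, 3) homogeneous pixel coordinates
    dirs_cam = pix_h @ torch.inverse(K).T                # (N, 3) camera-space ray directions
    pts_cam = dirs_cam * depths.unsqueeze(-1)            # (N, 3) points at the sampled depths
    pts_h = torch.cat([pts_cam, ones], dim=-1)           # (N, 4)
    means_world = (pts_h @ cam_to_world.T)[:, :3]        # (N, 3) world-space Gaussian means
    return means_world
```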


6. Comparison with other SOTA methods

Quantitative comparison. pixelSplat outperforms all baseline methods in PSNR, LPIPS, and SSIM for novel view synthesis on the real-world RealEstate10k and ACID datasets. In addition, pixelSplat requires less memory during inference and training, and renders images about 650 times faster than the second-fastest baseline. The Memory column reports memory usage for a single scene with 256 × 256 rays.


Qualitative comparison of novel views on the RealEstate10k (top) and ACID (bottom) test sets. Compared to the baselines, pixelSplat not only produces more accurate and visually appealing images, but also generalizes better to out-of-distribution examples.


7. Summary

This work introduces pixelSplat, a primitive-based parameterization for reconstructing the 3D radiance field of a scene from only two images. At inference time, pixelSplat produces an explicit 3D representation of the scene significantly faster than prior work on generalizable novel view synthesis. To address the local-minima problem in primitive-based function regression, the authors introduce a new way of parameterizing primitive positions via a dense probability distribution, together with a new reparameterization technique for backpropagating gradients into the distribution parameters.

Readers who are interested in more experimental results and details of the article can read the original paper~

Here is an introduction to the latest course from the 3D Vision Workshop, "New SLAM Algorithms Based on NeRF/Gaussian":

  • This course starts from both theory and code implementation, taking you from scratch through the principles of NeRF/Gaussian-based SLAM, reading the papers, and working through the code.
  • On the theory side, it starts from linear algebra and traditional computer graphics to explain the theoretical foundations and origins of modern 3D reconstruction.
  • On the code side, a series of hands-on exercises teaches you to reproduce computer graphics and NeRF-related work.
