GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing

  • 2024-04-25 18:50:07
  • Jing Wu, Jia-Wang Bian, Xinghui Li, Guangrun Wang, Ian Reid, Philip Torr, Victor Adrian Prisacariu
  • 0

Abstract

We propose GaussCtrl, a text-driven method to edit a 3D scene reconstructedby the 3D Gaussian Splatting (3DGS). Our method first renders a collection of images by using the 3DGS and editsthem by using a pre-trained 2D diffusion model (ControlNet) based on the inputprompt, which is then used to optimise the 3D model. Our key contribution is multi-view consistent editing, which enables editingall images together instead of iteratively editing one image while updating the3D model as in previous works. It leads to faster editing as well as higher visual quality. This is achieved by the two terms: (a) depth-conditioned editing that enforces geometric consistency acrossmulti-view images by leveraging naturally consistent depth maps. (b) attention-based latent code alignment that unifies the appearance ofedited images by conditioning their editing to several reference views throughself and cross-view attention between images' latent representations. Experiments demonstrate that our method achieves faster editing and bettervisual results than previous state-of-the-art methods.

 

Quick Read (beta)

loading the full paper ...