[R] VOID: Video Object and Interaction Deletion (physically-consistent video inpainting)
We present VOID, a model for video object removal that aims to handle *physical interactions*, not just appearance.

Most existing video inpainting / object removal methods can fill in pixels behind an object (e.g., removing shadows or reflections), but they often fail when the removed object affects the dynamics of the scene.

For example:

Current models typically remove the object but leave its effects unchanged, resulting in physically implausible outputs.

VOID addresses this by modeling counterfactual scene evolution:

Key ideas:

In a human preference study on real-world videos, VOID was selected 64.8% of the time over baselines such as Runway (Aleph), Generative Omnimatte, and ProPainter.

Project page: https://void-model.github.io/

Happy to answer questions!