arXiv:2511.14848v1 Announce Type: new 
Abstract: We present Gaussian See, Gaussian Do, a novel approach for semantic 3D motion transfer from multiview video. Our method enables rig-free, cross-category motion transfer between objects with semantically meaningful correspondence. Building on implicit motion transfer techniques, we extract motion embeddings from source videos via condition inversion, apply them to rendered frames of static target shapes, and use the resulting videos to supervise dynamic 3D Gaussian Splatting reconstruction. Our approach introduces an anchor-based view-aware motion embedding mechanism, ensuring cross-view consistency and accelerating convergence, along with a robust 4D reconstruction pipeline that consolidates noisy supervision videos. We establish the first benchmark for semantic 3D motion transfer and demonstrate superior motion fidelity and structural consistency compared to adapted baselines. Code and data for this paper available at https://gsgd-motiontransfer.github.io/

Gaussian See, Gaussian Do هي طريقة جديدة لنقل الحركة ثلاثية الأبعاد الدلالية من الفيديو متعدد الزوايا. تتيح هذه الطريقة نقل الحركة بدون الحاجة إلى هياكل ثابتة، بين كائنات لها تطابق دلالي ذي معنى. من خلال استخدام تقنيات نقل الحركة الضمنية، تستخرج الطريقة تجسيدات الحركة من مقاطع الفيديو المصدر وتطبقها على الأشكال الثابتة المستهدفة، مما يحسن من دقة الحركة والتناسق الهيكلي في إعادة البناء باستخدام Splatting غاوسي ثلاثي الأبعاد.

Gaussian See, Gaussian Do es un nuevo método para la transferencia de movimiento 3D semántico a partir de video multivista. Este enfoque permite la transferencia de movimiento sin necesidad de rig y entre objetos que tienen una correspondencia semántica significativa. Al utilizar técnicas de transferencia de movimiento implícitas, el método extrae incrustaciones de movimiento de videos fuente y las aplica a formas estáticas objetivo, mejorando así la fidelidad del movimiento y la consistencia estructural en la reconstrucción mediante Splatting Gaussiano 3D.

Gaussian See, Gaussian Do est une nouvelle méthode de transfert de mouvement 3D sémantique à partir de vidéos multivues. Cette approche permet un transfert de mouvement sans rig et entre des objets ayant une correspondance sémantique significative. En utilisant des techniques de transfert de mouvement implicite, la méthode extrait des embeddings de mouvement à partir de vidéos sources et les applique à des formes cibles statiques, améliorant ainsi la fidélité du mouvement et la cohérence structurelle dans la reconstruction par Splatting Gaussien 3D.

Gaussian See, Gaussian Do is a new method for semantic 3D motion transfer from multiview video. This approach allows for rig-free, cross-category motion transfer between objects that have semantically meaningful correspondence. By utilizing implicit motion transfer techniques, the method extracts motion embeddings from source videos and applies them to static target shapes, resulting in improved motion fidelity and structural consistency in 3D Gaussian Splatting reconstruction.

Gaussian See, Gaussian Do: Semantic 3D Motion Transfer from Multiview Video

Was this article worth reading? Share it

LucidQuery AI

SuperMotion

Postugc

Video Face Swap AI

SVGenius

Deptho.ai

Ready to build your own newsroom?