Add model card and pipeline tag

#2
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +34 -3
README.md CHANGED
@@ -1,3 +1,34 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ pipeline_tag: image-to-3d
4
+ ---
5
+
6
+ # Uni3R: Unified 3D Reconstruction and Semantic Understanding via Generalizable Gaussian Splatting from Unposed Multi-View Images
7
+
8
+ Uni3R is a novel feed-forward framework that jointly reconstructs a unified 3D scene representation enriched with open-vocabulary semantics, directly from unposed multi-view images. It leverages a Cross-View Transformer to integrate information across arbitrary multi-view inputs and regresses a set of 3D Gaussian primitives endowed with semantic feature fields.
9
+
10
+ [**Project Page**](https://horizonrobotics.github.io/robot_lab/uni3R/) | [**Paper (arXiv:2508.03643)**](https://arxiv.org/abs/2508.03643) | [**GitHub Code**](https://github.com/HorizonRobotics/Uni3R)
11
+
12
+ ## Key Features
13
+ - **Feed-forward Reconstruction**: Jointly handles 3D reconstruction and semantic interpretation without requiring costly per-scene optimization.
14
+ - **Unified Representation**: Facilitates high-fidelity novel view synthesis, open-vocabulary 3D semantic segmentation, and depth prediction in a single pass.
15
+ - **Unposed Inputs**: Robustly integrates information across arbitrary multi-view inputs without pre-defined camera poses.
16
+ - **Generalizable**: Establishes state-of-the-art performance on benchmarks like RE10K and ScanNet.
17
+
18
+ ## Usage
19
+ For detailed installation and usage instructions, please refer to the official [GitHub repository](https://github.com/HorizonRobotics/Uni3R). The repository provides scripts for training and evaluation on 2, 8, and 16 views.
20
+
21
+ ## Citation
22
+ If you find this work useful in your research, please consider citing:
23
+
24
+ ```bibtex
25
+ @misc{sun2025uni3runified3dreconstruction,
26
+ title={Uni3R: Unified 3D Reconstruction and Semantic Understanding via Generalizable Gaussian Splatting from Unposed Multi-View Images},
27
+ author={Xiangyu Sun and Haoyi Jiang and Liu Liu and Seungtae Nam and Gyeongjin Kang and Xinjie Wang and Wei Sui and Zhizhong Su and Wenyu Liu and Xinggang Wang and Eunbyung Park},
28
+ year={2025},
29
+ eprint={2508.03643},
30
+ archivePrefix={arXiv},
31
+ primaryClass={cs.CV},
32
+ url={https://arxiv.org/abs/2508.03643},
33
+ }
34
+ ```