|
7 | 7 |
|
8 | 8 | ## 🧠 Overview |
9 | 9 |
|
10 | | -**Lift to Match (L2M)** is a novel two-stage framework for **dense feature matching** that lifts 2D images into 3D space to enhance feature generalization and robustness. Unlike traditional methods that depend on multi-view image pairs, L2M is trained on large-scale, diverse single-view image collections. |
| 10 | +**Lift to Match (L2M)** is a two-stage framework for **dense feature matching** that lifts 2D images into 3D space to enhance feature generalization and robustness. Unlike traditional methods that depend on multi-view image pairs, L2M is trained on large-scale, diverse single-view image collections. |
11 | 11 |
|
12 | 12 | - **Stage 1:** Learn a **3D-aware ViT-based encoder** using multi-view image synthesis and 3D Gaussian feature representation. |
13 | 13 | - **Stage 2:** Learn a **feature decoder** through novel-view rendering and synthetic data, enabling robust matching across diverse scenarios. |
@@ -38,51 +38,21 @@ Below are feature comparison results on the Sacré-Cœur dataset: |
38 | 38 |
|
39 | 39 | --- |
40 | 40 |
|
41 | | -## 🏗️ Data Generation (WIP) |
42 | | - |
43 | | -We synthesize multi-view images and 3D-aware Gaussian features from single-view inputs. |
44 | | -Scripts for data generation will be released soon. |
45 | | - |
46 | | ---- |
47 | | - |
48 | | -## 🏋️♀️ Model Training (Stage 1) |
49 | | - |
50 | | -We provide pretrained weights for the 3D-aware ViT encoder. |
51 | | - |
52 | | -> 🔗 **[Download pretrained encoder weights](#)** (Coming soon) |
53 | | -
|
54 | | -You can visualize features using: |
55 | | - |
56 | | -```bash |
57 | | -python vis_feats.py --input ./assets/sacre_coeur_A.jpg --model vit_encoder.pth |
| 41 | +To get the results, make sure your checkpoints and image files are in the correct paths, then run: |
58 | 42 | ``` |
59 | | - |
60 | | ---- |
61 | | - |
62 | | -## 🚀 Inference & Stage 2 (Coming Soon) |
63 | | - |
64 | | -The second stage—feature decoding with novel-view rendering—is **under development**. Stay tuned! |
65 | | - |
66 | | ---- |
67 | | - |
68 | | -## 📌 Citation |
69 | | - |
70 | | -```bibtex |
71 | | -@article{liang2025lift2match, |
72 | | - title={Learning Dense Feature Matching via Lifting Single 2D Image to 3D Space}, |
73 | | - author={Liang, Yingping and Hu, Yutao and Shao, Wenqi and Fu, Ying}, |
74 | | - journal={ICCV}, |
75 | | - year={2025} |
76 | | -} |
| 43 | +python vis_feats.py \ |
| 44 | + --img_paths assets/sacre_coeur_A.jpg assets/sacre_coeur_B.jpg \ |
| 45 | + --ckpt_dino ckpts/dinov2.pth \ |
| 46 | + --ckpt_fit3d ckpts/fit3d.pth \ |
| 47 | + --ckpt_L2M ckpts/output_20250629/l2m_vit_base.pth \ |
| 48 | + --save_dir outputs_vis_feat |
77 | 49 | ``` |
78 | 50 |
|
79 | | ---- |
80 | | - |
81 | | -## 📋 License |
| 51 | +## 🏗️ Data Generation |
82 | 52 |
|
83 | | -This project is licensed under **CC BY 4.0**. |
| 53 | +We synthesize novel-view images with dense matching labels from single-view inputs for training. |
| 54 | +Scripts for data generation will be released soon. |
84 | 55 |
|
85 | | ---- |
86 | 56 |
|
87 | 57 | ## 🙋♂️ Acknowledgements |
88 | 58 |
|
|
0 commit comments