Skip to content

Commit 140e2a0

Browse files
committed
release lidar maskgit
1 parent 2420aa0 commit 140e2a0

19 files changed

Lines changed: 5551 additions & 46 deletions

README.md

Lines changed: 24 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -49,13 +49,15 @@ python -m pip install torch==2.5.1 torchvision==0.20.1
4949
Clone the repository, then install the dependencies.
5050

5151
```
52-
cd DWM
52+
cd OpenDWM
5353
git submodule update --init --recursive
54-
python -m pip install requirements.txt -r
54+
python -m pip install -r requirements.txt
5555
```
5656

5757
## Models
5858

59+
### Video Models
60+
5961
Our cross-view temporal SD (CTSD) pipeline support loading the pretrained SD 2.1, 3.0, 3.5, or the checkpoints we trained on the autonomous driving datasets.
6062

6163
| Base model | Text conditioned <br/> driving generation | Text and layout (box, map) <br/> conditioned driving generation |
@@ -64,6 +66,15 @@ Our cross-view temporal SD (CTSD) pipeline support loading the pretrained SD 2.1
6466
| [SD 3.0](https://huggingface.co/stabilityai/stable-diffusion-3-medium-diffusers) | | [UniMLVG Config](configs/ctsd/unimlvg/ctsd_unimlvg_stage3_tirda_bm_nwa.json), [Download](http://103.237.29.236:10030/ctsd_unimlvg_tirda_bm_nwa_60k.pth) |
6567
| [SD 3.5](https://huggingface.co/stabilityai/stable-diffusion-3.5-medium) | [Config](configs/ctsd/multi_datasets/ctsd_35_tirda_nwao.json), [Download](http://103.237.29.236:10030/ctsd_35_tirda_nwao_20k.pth) | [Config](configs/ctsd/multi_datasets/ctsd_35_tirda_bm_nwao.json), [Download](http://103.237.29.236:10030/ctsd_35_tirda_bm_nwao_40k.pth) |
6668

69+
### LiDAR Models
70+
71+
You can download our pre-trained tokenzier and generation model in the following link.
72+
73+
| Model Architecture | Configs | Checkpoint Download |
74+
| :-: | :-: | :-: |
75+
| VQVAE | [Config](configs/lidar/lidar_vqvae_nwa.json) | [checkpoint](http://103.237.29.236:10030/lidar_vqvae_nwa_60k.pth), [blank code ](http://103.237.29.236:10030/lidar_vqvae_nwa_60k_blank_code.pkl) |
76+
| MaskGIT | [Config](configs/lidar/lidar_maskgit_layout_ns.json)| [checkpoint](http://103.237.29.236:10030/lidar_maskgit_nusc_150k.pth) |
77+
| Temporal MaskGIT | | |
6778
## Examples
6879

6980
### T2I, T2V generation with CTSD pipeline
@@ -84,6 +95,17 @@ PYTHONPATH=src python examples/ctsd_generation_example.py -c examples/ctsd_35_6v
8495
PYTHONPATH=src python src/dwm/preview.py -c examples/ctsd_35_6views_video_generation_with_layout.json -o output/ctsd_35_6views_video_generation_with_layout
8596
```
8697

98+
### Layout conditioned LiDAR generation with MaskGIT pipeline
99+
100+
1. Download LiDAR VQVAE and LiDAR MaskGIT generation model checkpoint.
101+
2. Prepare the dataset ( [nuscenes_scene-0627_lidar_package.zip](http://103.237.29.236:10030/nuscenes_scene-0627_lidar_package.zip) ).
102+
3. Modify the values of `json_file`, `vq_point_cloud_ckpt_path`, `vq_blank_code_path` and `model_ckpt_path` to the paths of your dataset and checkpoints in the json file `examples/lidar_maskgit_preview.json` .
103+
4. Run the following command to visualize the LiDAR of the validation set and save the generated point cloud as `.bin` file.
104+
105+
```bash
106+
PYTHONPATH=src python src/dwm/preview.py -c examples/lidar_maskgit_preview.json -o output/test
107+
```
108+
87109
## Train
88110

89111
Preparation:

0 commit comments

Comments
 (0)