Skip to content

Commit cd4e3c7

Browse files
author
wuzehuan
committed
Merge branch 'wuzehuan/MinorFixOnDocAndConfig' into 'main'
Update README. See merge request vc-research/driving-world-models!40
2 parents 2b06475 + f8b37d9 commit cd4e3c7

2 files changed

Lines changed: 9 additions & 8 deletions

File tree

README.md

Lines changed: 5 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,10 @@
22

33
[[中文简介](README_intro_zh.md)]
44

5+
https://github.com/user-attachments/assets/e73d22b1-e856-4bee-8bc3-a1e9755e8c25
6+
7+
[Video link](https://youtu.be/j9RRj-xzOA4)
8+
59
Welcome to the OpenDWM project! This is an open-source initiative, focusing on autonomous driving video generation. Our mission is to provide a high-quality, controllable tool for generating autonomous driving videos using the latest technology. We aim to build a codebase that is both user-friendly and highly reusable, and hope to continuously improve the project through the collective wisdom of the community.
610

711
The driving world models generate multi-view images or videos of autonomous driving scenes based on text and road environment layout conditions. Whether it's the environment, weather conditions, vehicle type, or driving path, you can adjust them according to your needs.
@@ -16,10 +20,6 @@ The highlights are as follows:
1620

1721
Furthermore, our code modules are designed with high reusability in mind, for easy application in other projects.
1822

19-
https://github.com/user-attachments/assets/e73d22b1-e856-4bee-8bc3-a1e9755e8c25
20-
21-
[Video link](https://youtu.be/j9RRj-xzOA4)
22-
2323
Currently, the project has implemented the following papers:
2424

2525
> [UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous Driving](https://sensetime-fvg.github.io/UniMLVG)<br>
@@ -62,6 +62,7 @@ Our cross-view temporal SD (CTSD) pipeline support loading the pretrained SD 2.1
6262
| Base model | Text conditioned <br/> driving generation | Text and layout (box, map) <br/> conditioned driving generation |
6363
| :-: | :-: | :-: |
6464
| [SD 2.1](https://huggingface.co/stabilityai/stable-diffusion-2-1) | [Config](configs/ctsd/multi_datasets/ctsd_21_tirda_nwao.json), [Download](http://103.237.29.236:10030/ctsd_21_tirda_nwao_30k.pth) | [Config](configs/ctsd/multi_datasets/ctsd_21_tirda_bm_nwa.json), [Download](http://103.237.29.236:10030/ctsd_21_tirda_bm_nwa_30k.pth) |
65+
| [SD 3.0](https://huggingface.co/stabilityai/stable-diffusion-3-medium-diffusers) | | [UniMLVG Config](configs/ctsd/unimlvg/unimlvg_stage3_tirda_nwa.json), Released by 2025-2-1 |
6566
| [SD 3.5](https://huggingface.co/stabilityai/stable-diffusion-3.5-medium) | [Config](configs/ctsd/multi_datasets/ctsd_35_tirda_nwao.json), [Download](http://103.237.29.236:10030/ctsd_35_tirda_nwao_20k.pth) | [Config](configs/ctsd/multi_datasets/ctsd_35_tirda_bm_nwa.json), Released by 2025-2-1 |
6667

6768
## Examples

README_intro_zh.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,10 @@
22

33
[[English README](README.md)]
44

5+
https://github.com/user-attachments/assets/e73d22b1-e856-4bee-8bc3-a1e9755e8c25
6+
7+
[视频链接](https://youtu.be/j9RRj-xzOA4)
8+
59
欢迎来到 OpenDWM 项目!这是一个专注于自动驾驶视频生成的开源项目。我们的使命是提供一个高质量、可控的、使用最新技术的自动驾驶视频生成工具。我们的目标是构建一个既用户友好,又高度可复用的代码库,并希望通过聚集社区智慧,不断改进。
610

711
驾驶世界模型根据文本和道路环境布局条件,生成自动驾驶场景的多视角图像或视频。无论是环境、天气条件、车辆类型,还是驾驶路径,你都可以根据需求来调整。
@@ -16,10 +20,6 @@
1620

1721
此外,我们设计的代码模块考虑到了相当程度的可复用性,以便于在其他项目中应用。
1822

19-
https://github.com/user-attachments/assets/e73d22b1-e856-4bee-8bc3-a1e9755e8c25
20-
21-
[视频链接](https://youtu.be/j9RRj-xzOA4)
22-
2323
截止现在,本项目实现了以下论文中的技巧:
2424

2525
> [UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous Driving](https://sensetime-fvg.github.io/UniMLVG)<br>

0 commit comments

Comments
 (0)