Skip to content

Commit a5db5fb

Browse files
committed
fix cross-references, clarify model file, add demo video
1 parent 7d7ede8 commit a5db5fb

1 file changed

Lines changed: 19 additions & 4 deletions

File tree

tutorials/docs/depth_ai_integration.rst

Lines changed: 19 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -23,7 +23,7 @@ Traditional 3D navigation typically requires expensive hardware like LiDAR or St
2323
What is Depth Anything 3 AI model?
2424
----------------------------------
2525

26-
Depth Anything 3 (DA3) is an AI model that predicts spatially consistent geometry from an arbitrary number of visual inputs, with or without known camera poses. `For more details <https://arxiv.org/abs/2511.10647>`_. In this tutorial, we are using ROS 2 implementation of Depth Anything 3 `[2] <_2>`_, which provides the ROS 2 composable node to run the inference of the DA3 model. The attached image shows Rviz2 with two image views, one is showig the Color Image topic and other is showing Depth Image topic, published by DA3 ROS2 Node.
26+
Depth Anything 3 (DA3) is an AI model that predicts spatially consistent geometry from an arbitrary number of visual inputs, with or without known camera poses. `For more details <https://arxiv.org/abs/2511.10647>`_. In this tutorial, we are using ROS 2 implementation of Depth Anything 3 :ref:`[2] <repo_link>`, which provides the ROS 2 composable node to run the inference of the DA3 model. The attached image shows Rviz2 with two image views, one is showig the Color Image topic and other is showing Depth Image topic, published by DA3 ROS2 Node.
2727

2828
.. image:: images/depth_ai_integration/depth_ai_nav2_costmap.png
2929
:width: 80%
@@ -83,7 +83,7 @@ Run the following commands on your robot's main computer to install the foundati
8383
sudo apt install ros-$ROS_DISTRO-image-proc ros-$ROS_DISTRO-depth-image-proc ros-$ROS_DISTRO-usb-cam
8484
8585
86-
2. Build DAV3 package
86+
2. Build Depth Anything V3 package
8787
---------------------
8888

8989
Navigate to your workspace, clone the TensorRT Depth Anything stack, and build it:
@@ -111,6 +111,8 @@ You need the compiled ONNX model weights for the pipeline to compute depth maps.
111111
A. Download the ONNX file from `Huggingface <https://huggingface.co/TillBeemelmanns/Depth-Anything-V3-ONNX>`_
112112
B. Generate ONNX following the instruction `here <https://github.com/ika-rwth-aachen/ros2-depth-anything-v3-trt/blob/main/onnx/README.md>`_
113113

114+
You can place this file anywhere as you fit, e.g. `depth_anything_v3/models/`, but remember to modify this path in the ``nav2_depth_ai_params.yaml`` file as shown :ref:`below <param_yaml>`.
115+
114116
4. Configure Params
115117
-------------------
116118

@@ -160,12 +162,14 @@ Here we are using Depth Anything V3 model, and it is exported to ONNX with these
160162
width: 504
161163
height: 280
162164
163-
Params for DAV3
165+
Params for Depth Anything V3
164166
~~~~~~~~~~~~~~~
165167

166168
.. note::
167169
These params need to be configured for the Depth Anything V3 AI Model
168170

171+
.. _param_yaml:
172+
169173
.. code-block:: yaml
170174
171175
depth_anything_v3:
@@ -268,12 +272,23 @@ Finally, launch the entire pipeline using the provided launch file:
268272
.. note::
269273
This will start the camera node (if not using already), process the image topic using pipeline through the DA3 model, and publish the resulting PointCloud2 topic. This topic is subscribed by Nav2 to add voxel costmap layer for path planning and obstacle avoidance.
270274

275+
276+
If everything is configured and running correctly, your results in RViz2 should match the demonstration in the video below.
277+
278+
.. raw:: html
279+
280+
<h1 align="center">
281+
<div style="position: relative; padding-bottom: 0%; overflow: hidden; max-width: 100%; height: auto;">
282+
<iframe width="708" height="400" src="https://www.youtube.com/embed/QDN1uA71su4?autoplay=1&mute=1" frameborder="1" allowfullscreen></iframe>
283+
</div>
284+
</h1>
285+
271286
Acknowledgements
272287
================
273288

274289
This tutorial was developed in collaboration with the ROS 2 community. Special thanks to the contributors who provided insights and feedback during the development process.
275290

276-
.. _2:
291+
.. _repo_link:
277292

278293
1. `Depth Anything 3: Recovering the Visual Space from Any Views (arXiv:2511.10647) <https://arxiv.org/abs/2511.10647>`_
279294
2. `ika-rwth-aachen Depth Anything V3 TensorRT repository <https://github.com/ika-rwth-aachen/ros2-depth-anything-v3-trt>`_

0 commit comments

Comments
 (0)