You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: tutorials/docs/depth_ai_integration.rst
+19-4Lines changed: 19 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -23,7 +23,7 @@ Traditional 3D navigation typically requires expensive hardware like LiDAR or St
23
23
What is Depth Anything 3 AI model?
24
24
----------------------------------
25
25
26
-
Depth Anything 3 (DA3) is an AI model that predicts spatially consistent geometry from an arbitrary number of visual inputs, with or without known camera poses. `For more details <https://arxiv.org/abs/2511.10647>`_. In this tutorial, we are using ROS 2 implementation of Depth Anything 3 `[2] <_2>`_, which provides the ROS 2 composable node to run the inference of the DA3 model. The attached image shows Rviz2 with two image views, one is showig the Color Image topic and other is showing Depth Image topic, published by DA3 ROS2 Node.
26
+
Depth Anything 3 (DA3) is an AI model that predicts spatially consistent geometry from an arbitrary number of visual inputs, with or without known camera poses. `For more details <https://arxiv.org/abs/2511.10647>`_. In this tutorial, we are using ROS 2 implementation of Depth Anything 3 :ref:`[2] <repo_link>`, which provides the ROS 2 composable node to run the inference of the DA3 model. The attached image shows Rviz2 with two image views, one is showig the Color Image topic and other is showing Depth Image topic, published by DA3 ROS2 Node.
Navigate to your workspace, clone the TensorRT Depth Anything stack, and build it:
@@ -111,6 +111,8 @@ You need the compiled ONNX model weights for the pipeline to compute depth maps.
111
111
A. Download the ONNX file from `Huggingface <https://huggingface.co/TillBeemelmanns/Depth-Anything-V3-ONNX>`_
112
112
B. Generate ONNX following the instruction `here <https://github.com/ika-rwth-aachen/ros2-depth-anything-v3-trt/blob/main/onnx/README.md>`_
113
113
114
+
You can place this file anywhere as you fit, e.g. `depth_anything_v3/models/`, but remember to modify this path in the ``nav2_depth_ai_params.yaml`` file as shown :ref:`below <param_yaml>`.
115
+
114
116
4. Configure Params
115
117
-------------------
116
118
@@ -160,12 +162,14 @@ Here we are using Depth Anything V3 model, and it is exported to ONNX with these
160
162
width: 504
161
163
height: 280
162
164
163
-
Params for DAV3
165
+
Params for Depth Anything V3
164
166
~~~~~~~~~~~~~~~
165
167
166
168
.. note::
167
169
These params need to be configured for the Depth Anything V3 AI Model
168
170
171
+
.. _param_yaml:
172
+
169
173
.. code-block:: yaml
170
174
171
175
depth_anything_v3:
@@ -268,12 +272,23 @@ Finally, launch the entire pipeline using the provided launch file:
268
272
.. note::
269
273
This will start the camera node (if not using already), process the image topic using pipeline through the DA3 model, and publish the resulting PointCloud2 topic. This topic is subscribed by Nav2 to add voxel costmap layer for path planning and obstacle avoidance.
270
274
275
+
276
+
If everything is configured and running correctly, your results in RViz2 should match the demonstration in the video below.
This tutorial was developed in collaboration with the ROS 2 community. Special thanks to the contributors who provided insights and feedback during the development process.
275
290
276
-
.. _2:
291
+
.. _repo_link:
277
292
278
293
1. `Depth Anything 3: Recovering the Visual Space from Any Views (arXiv:2511.10647) <https://arxiv.org/abs/2511.10647>`_
0 commit comments