- Paper: real, arXiv
2412.00626v3, dated2025-05-10 - Venue claim: matches
IROS 2025 - Upstream repo: exists and matches the paper naming/configuration
- Dataset status: partial;
visdroneanduavdark135are on the shared volume,uavdtis not present yet - Reproducibility status:
VERIFIED WITH CAVEATS
- The upstream repo contains a hardcoded pretrained Mamba path.
- Nighttime datasets and released checkpoints are distributed via Baidu links, which is fragile for reproducibility.
- The README has setup inconsistencies, including
requirement.txtvsrequirements.txt. - No independent reproduction report was confirmed during the initial verification pass.
- Core method retained: Vision Mamba backbone plus center head with Adaptive Curriculum Learning.
- Adaptation path added:
YOLO26detector bootstrap before Mamba-based nighttime tracking. - Deployment target retained: dual compute execution on
MLX,CUDA, andCPUfallback.
- One-stream tracker structure.
mambar_small_patch16_224backbone profile.- ACL sampling scheduler and ADW loss formulation.
- Paper training recipe and benchmark targets.
- Direct dependence on the upstream absolute file layout.
- Baidu-only download automation.
- Full training or evaluation claims before the missing datasets and weights are staged.
- Detector-first pipeline with
YOLO26ROI bootstrap. - Internal 1.8M UAV dataset and shared-wave datasets.
- Clean local configuration, asset checks, and task decomposition for ANIMA.
- ASSETS.md
- prds/README.md
- tasks/INDEX.md
- initial code scaffold under src/anima_coonhound
- Land clean foundation/config/runtime checks.
- Port the ACL math and dataset mixture logic exactly.
- Implement the Vision Mamba tracker module behind a backend-agnostic adapter.
- Attach
YOLO26proposals to template/search crop generation. - Stage missing datasets and pretrained weights.
- Run reproduction on reference benchmarks before Shenzhen-specific fine-tuning.