Accessibility navigation


NimbleD: enhancing self-supervised monocular depth estimation with pseudo-labels and large-scale video pre-training

Luginov, A. and Shahzad, M. ORCID: https://orcid.org/0009-0002-9394-343X (2025) NimbleD: enhancing self-supervised monocular depth estimation with pseudo-labels and large-scale video pre-training. In: Del Bue, A., Canton, C., Pont-Tuset, J. and Tommasi, T. (eds.) Computer Vision – ECCV 2024 Workshops Proceedings, Part II. Lecture Notes in Computer Science (15624). Springer, Cham, pp. 235-251. ISBN 9783031923869

[thumbnail of NimbleD.pdf] Text - Accepted Version
· Restricted to Repository staff only until 12 May 2026.

5MB

It is advisable to refer to the publisher's version if you intend to cite from this work. See Guidance on citing.

To link to this item DOI: 10.1007/978-3-031-92387-6_18

Abstract/Summary

We introduce NimbleD, an efficient self-supervised monocular depth estimation learning framework that incorporates supervision from pseudo-labels generated by a large vision model. This framework does not require camera intrinsics, enabling large-scale pre-training on publicly available videos. Our straightforward yet effective learning strategy significantly enhances the performance of fast and lightweight models without introducing any overhead, allowing them to achieve performance comparable to state-of-the-art self-supervised monocular depth estimation models. This advancement is particularly beneficial for virtual and augmented reality applications requiring low latency inference. The source code, model weights, and acknowledgments are available at https://github.com/xapaxca/nimbled.

Item Type:Book or Report Section
Refereed:Yes
Divisions:Science > School of Mathematical, Physical and Computational Sciences > Department of Computer Science
ID Code:118622
Publisher:Springer

University Staff: Request a correction | Centaur Editors: Update this record

Page navigation