Number of items: 3.
Li, X. ORCID: https://orcid.org/0000-0002-9946-7000, Ding, J., Chen, Z. and Elhoseiny, M.
(2024)
Uni3DL: A unified model for 3D vision-language understanding.
In: ECCV 2024, 29 Sep — 4 Oct 2024, Milan, Italy, pp. 74-92.
doi: https://doi.org/10.1007/978-3-031-73337-6_5
Li, X. ORCID: https://orcid.org/0000-0002-9946-7000, Wen, C., Hu, Y., Yuan, Z. and Zhu, X. X.
(2024)
Vision-language models in remote sensing: current progress and future trends.
IEEE Geoscience and Remote Sensing Magazine, 12 (2).
pp. 32-66.
ISSN 2168-6831
doi: https://doi.org/10.1109/MGRS.2024.3383473
Li, X. ORCID: https://orcid.org/0000-0002-9946-7000, Wen, C., Hu, Y. and Zhou, N.
(2023)
RS-CLIP: zero shot remote sensing scene classification via contrastive vision-language supervision.
International Journal of Applied Earth Observation and Geoinformation, 124.
103497.
ISSN 1872-826X
doi: https://doi.org/10.1016/j.jag.2023.103497
This list was generated on Tue Jan 21 17:59:43 2025 UTC.