CLIP for Unsupervised and Fully Supervised Visual Grounding. This repository is the official Pytorch implementation for the paper CLIP-VG: Self-paced Curriculum Adapting of CLIP for Visual Grounding.
Abstract: Accurately determining the distance between a camera and an object is of paramount importance in a multitude of computer vision applications, such as those found in robotics and industrial ...
This repository is the Pytorch implementation of our work - Multi-Modal Fusion of Event and RGB for Monocular Depth Estimation Using Transformer. Testing is done in two steps. First, is to run test.py ...
Abstract: For a swarm of Unmanned Aerial Vehicle (UAV), long-distance visual mapping is advantageous for pre-planning navigation paths in unknown urban building environments. This work leverages two ...