Spatial reasoning is the ability to perceive, interpret, and act across spatial scales, from millimeter-sized components to distant aerial scenes. All-scale spatial reasoning is fundamental to ...
Abstract: We propose a novel pan-tilt-zoom (PTZ) camera configuration planning method to improve data collection for structure visual inspections. This method plans a set of configurations for PTZ ...
Abstract: Visual grounding for remote sensing images (RSVG) is a fundamental vision-language task, which aims to locate the objects referred to by the natural language expression from the RS images.