DepthVLM serves as a unified foundation model for both low-level dense geometry prediction and high-level multimodal understanding, while achieving substantially faster inference compared with ...
Abstract: Accurate and robust human motion estimation is essential for enabling effective electromyography (EMG) signal-driven neural-machine interfaces in daily activities. Variation in loading ...
Abstract: Monocular vision-based target motion estimation is a fundamental challenge in numerous applications. This work introduces a novel bearing-box approach that fully leverages modern 3-D ...