Abstract: Many recent multimodal trackers treat RGB as the dominant modality and other modalities as auxiliary, and are fine-tuned separately for each multimodal task. This imbalance ...
Abstract: Event camera-based visual tracking has drawn increasing attention in recent years due to its unique imaging principle and its advantages of low energy consumption, high dynamic range, and ...
🎉 Visit our Project Page | 💻 Try our model on our Demo Website! Capybara is a unified visual creation model, i.e., a powerful visual generation and editing framework designed for ...