Visual Basic Component Object Model Object

RingMamba: Remote Sensing Multisensor Pretraining With Visual State Space Model

Abstract: Previous studies on remote sensing foundation models have demonstrated the representational ability of convolutional neural networks (CNNs) and vision transformers (ViTs). However, these ...

GitHub

T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Note: This model has been trained for approximately 2.7M steps (batch size = 1) and is still in the training process. I have attached a .ipynb file in the repository. You can refer to it to know how ...

IEEE

MambaEVT: Event Stream based Visual Object Tracking using State Space Model

Abstract: Event camera-based visual tracking has drawn more and more attention in recent years due to the unique imaging principle and advantages of low energy consumption, high dynamic range, and ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

RingMamba: Remote Sensing Multisensor Pretraining With Visual State Space Model

T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

MambaEVT: Event Stream based Visual Object Tracking using State Space Model

Trending now