FNI V2.0 for Maritime Visual Tracking Dataset Mvtd: Authority (A:62), Popularity (P:55), Recency (R:88), Quality (Q:50). Semantic (S) is a query-time baseline scored live at search.
MVTD (Maritime Visual Tracking Dataset) is a large-scale benchmark dataset designed specifically for single-object visual tracking (VOT) in maritime environments. It addresses challenges unique to maritime scenes: such as water reflections, low-contrast objects, dynamic backgrounds, scale variation, and severe illumination changesβwhich are not adequately covered by generic tracking datasets.
The dataset contains 182 annotated video sequences with approximately 150,000 frames, spanning four maritime object categories:
Boat
Ship
Sailboat
Unmanned Surface Vehicle (USV)
MVTD is suitable for training, fine-tuning, and benchmarking visual object tracking algorithms under realistic maritime conditions.
Dataset Statistics
Total sequences: 182
Total annotated frames: 150,058
Frame rate: 30 FPS and 60 FPS
Resolution range:
Min: 1024 Γ 1024
Max: 1920 Γ 1440
Average sequence length: ~824 frames
Sequence length range: 82 β 4747 frames
Object categories: 4
Dataset Structure
The dataset follows the GOT-10k single-object tracking format, enabling easy integration with existing tracking pipelines.