UniFilter
A Unified Multimodal Data Quality Classifier for generating quality scores for both image-text caption data and interleaved document data
weizhiwang/UniFilter-Qwen3-0.6B • Image-Text-to-Text • 1B • Updated Oct 21 • 45
weizhiwang/OBELICS_HQ_5M_UniFilter • Viewer • Updated Oct 21 • 5.06M • 534
weizhiwang/UniFilter-Qwen2.5-1.5B • Image-Text-to-Text • 2B • Updated Oct 21 • 129
weizhiwang/unifilter_train_data • Updated Oct 21 • 59
MLM-Filter Model and Data
The collections of proposed MLM-Filter models based on different LLM backbones.
weizhiwang/mlm-filter-qwen2.5-1.5b-gpt4o • Text Generation • 2B • Updated Apr 14 • 19 • 3
weizhiwang/mlm-filter-llava-13b-gpt4v • Text Generation • Updated Apr 9, 2024 • 16 • 6
weizhiwang/mlm_filter_instructions • Updated Jul 2 • 34 • 5
Open-Qwen2VL
weizhiwang/Open-Qwen2VL • Image-Text-to-Text • Updated Apr 15 • 103 • 21
weizhiwang/Open-Qwen2VL-Data • Viewer • Updated Apr 16 • 13M • 8.45k • 22
weizhiwang/Open-Qwen2VL-base • Image-Text-to-Text • Updated Apr 3 • 32
weizhiwang/Open-Qwen2VL-Data-Interleaved • Viewer • Updated Mar 5 • 23.3M • 103 • 2
Large Language and Video Assistant
Video LLM based on Llama-3 and Llama-3.1
weizhiwang/LLaVA-Video-Llama-3.1-8B • 8B • Updated Oct 28, 2024 • 108 • 5