
Jiannan Wu PRO

wjn922

AI & ML interests

None yet

Recent Activity

updated a dataset 11 days ago
wjn922/ocr-vqa-200k_images
published a dataset 11 days ago
wjn922/ocr-vqa-200k_images
published a dataset 21 days ago
wjn922/Recap-Datacomp-1B_tars_part14

Organizations

OpenGVLab

authored 4 papers 4 months ago

VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks

Paper • 2406.08394 • Published Jun 12, 2024

Language as Queries for Referring Video Object Segmentation

Paper • 2201.00487 • Published Jan 3, 2022

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

Paper • 2312.14238 • Published Dec 21, 2023 • 20

VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks

Paper • 2305.11175 • Published May 18, 2023 • 3