1 9 15

YANGYIFEI

yangyfaker

plyfager

AI & ML interests

Generative Models

Recent Activity

upvoted a paper 4 days ago

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

upvoted a paper 5 months ago

POINTS-Reader: Distillation-Free Adaptation of Vision-Language Models for Document Conversion

upvoted a paper about 1 year ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

View all activity

Organizations

upvoted a paper 4 days ago

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Paper • 2601.09688 • Published 6 days ago • 116

upvoted a paper 5 months ago

POINTS-Reader: Distillation-Free Adaptation of Vision-Language Models for Document Conversion

Paper • 2509.01215 • Published Sep 1, 2025 • 50

upvoted 2 papers about 1 year ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 48

POINTS1.5: Building a Vision-Language Model towards Real World Applications

Paper • 2412.08443 • Published Dec 11, 2024 • 38

updated a model about 2 years ago

yangyfaker/textual_inversion_cat

Updated Jan 8, 2024

liked a model about 2 years ago

stabilityai/stable-zero123

Text-to-3D • Updated Jul 10, 2024 • 753

upvoted 2 papers about 2 years ago

DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing

Paper • 2312.07409 • Published Dec 12, 2023 • 23

FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition

Paper • 2312.07536 • Published Dec 12, 2023 • 18

liked 4 Spaces over 2 years ago

text-to-3D & image-to-3D

Animagine XL 3.0

🌍

485

updated a model over 2 years ago

yangyfaker/params_ti_debug

Updated Aug 17, 2023

liked 2 Spaces over 2 years ago

Open Object Detection Leaderboard

🏆

176

Request evaluation for a new model

FarmingGame

🚀

Play a farming game in your browser

upvoted 2 papers over 2 years ago

BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs

Paper • 2307.08581 • Published Jul 17, 2023 • 28

JourneyDB: A Benchmark for Generative Image Understanding

Paper • 2307.00716 • Published Jul 3, 2023 • 19

liked a dataset over 2 years ago

fusing/fill50k

Viewer • Updated Mar 10, 2023 • 50k • 291 • 38

liked 2 Spaces over 2 years ago

Zeroscope Text-To-Video

🐠

757

watermark-free Modelscope-based video generation

ReVersion

🐠

YANGYIFEI

AI & ML interests

Recent Activity

Organizations

yangyfaker's activity

Nougat

EditAnything

Shap-E

Animagine XL 3.0

Open Object Detection Leaderboard

FarmingGame

Zeroscope Text-To-Video

ReVersion