Sihan XU's picture

5 9 20

Sihan XU

sihanxu

·

https://sihanxu.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

liked a model 29 days ago

SixAILab/nepa-base-patch14-224-sft

updated a model about 1 month ago

SixAILab/nepa-large-patch14-224-sft

View all activity

Organizations

authored 3 papers about 1 month ago

Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation

Paper • 2504.16060 • Published Apr 22, 2025

4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time

Paper • 2506.18890 • Published Jun 23, 2025 • 6

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published Dec 18, 2025 • 84

authored a paper over 1 year ago

Multi-Object Hallucination in Vision-Language Models

Paper • 2407.06192 • Published Jul 8, 2024 • 12

authored 2 papers about 2 years ago

CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation

Paper • 2310.13165 • Published Oct 19, 2023

Inversion-Free Image Editing with Natural Language

Paper • 2312.04965 • Published Dec 7, 2023 • 2