Submitted by akhaliq 9 Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust · 4 authors 344 2
Submitted by akhaliq 7 HiFA: High-fidelity Text-to-3D with Advanced Diffusion Guidance · 2 authors 209 1
Submitted by akhaliq 5 StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation · 10 authors 507 2
Submitted by akhaliq 5 Real-World Image Variation by Aligning Diffusion Inversion Chain · 4 authors 153 1
Submitted by akhaliq 4 Grammar Prompting for Domain-Specific Language Generation with Large Language Models · 6 authors 75 4
Submitted by akhaliq 4 PaLI-X: On Scaling up a Multilingual Vision and Language Model · 43 authors 93
Submitted by akhaliq 4 Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation · 10 authors 188 1
Submitted by akhaliq 2 AlteredAvatar: Stylizing Dynamic 3D Avatars with Fast Style Adaptation · 5 authors
Submitted by akhaliq 2 LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images · 4 authors 32
Submitted by akhaliq 1 KAFA: Rethinking Image Ad Understanding with Knowledge-Augmented Feature Adaptation of Vision-Language Models · 7 authors