Daily Papers

by AK and the research community

May 31

Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust
Submitted by akhaliq · 4 authors

Faith and Fate: Limits of Transformers on Compositionality
Submitted by akhaliq · 16 authors

HiFA: High-fidelity Text-to-3D with Advanced Diffusion Guidance
Submitted by akhaliq · 2 authors

StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation
Submitted by akhaliq · 10 authors

LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus
Submitted by akhaliq · 10 authors

Real-World Image Variation by Aligning Diffusion Inversion Chain
Submitted by akhaliq · 4 authors

Grammar Prompting for Domain-Specific Language Generation with Large Language Models
Submitted by akhaliq · 6 authors

PaLI-X: On Scaling up a Multilingual Vision and Language Model
Submitted by akhaliq · 43 authors

Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation
Submitted by akhaliq · 10 authors

AlteredAvatar: Stylizing Dynamic 3D Avatars with Fast Style Adaptation
Submitted by akhaliq · 5 authors

LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images
Submitted by akhaliq · 4 authors

Geometric Algebra Transformers
Submitted by akhaliq · 4 authors

Nested Diffusion Processes for Anytime Image Generation
Submitted by akhaliq · 4 authors

KAFA: Rethinking Image Ad Understanding with Knowledge-Augmented Feature Adaptation of Vision-Language Models
Submitted by akhaliq · 7 authors