Deepti Ghadiyaram PRO

dghadiya

Recent Activity

authored a paper about 6 hours ago

$\textit{Revelio}$: Interpreting and leveraging semantic information in diffusion models

authored a paper about 6 hours ago

Mitigating stereotypical biases in text to image generative systems

authored a paper about 6 hours ago

ClusterFit: Improving Generalization of Visual Representations

authored 12 papers about 6 hours ago

$\textit{Revelio}$: Interpreting and leveraging semantic information in diffusion models

Paper • 2411.16725 • Published Nov 23, 2024 • 1

Mitigating stereotypical biases in text to image generative systems

Paper • 2310.06904 • Published Oct 10, 2023

ClusterFit: Improving Generalization of Visual Representations

Paper • 1912.03330 • Published Dec 6, 2019

What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization

Paper • 2503.06698 • Published Mar 9, 2025 • 4

Improving Physical Object State Representation in Text-to-Image Generative Systems

Paper • 2505.02236 • Published May 4, 2025

Right Side Up? Disentangling Orientation Understanding in MLLMs with Fine-grained Multi-axis Perception Tasks

Paper • 2505.21649 • Published May 27, 2025 • 3

Progressive Prompt Detailing for Improved Alignment in Text-to-Image Generative Models

Paper • 2503.17794 • Published Mar 22, 2025

Some Modalities are More Equal Than Others: Decoding and Architecting Multimodal Integration in MLLMs

Paper • 2511.22826 • Published Nov 28, 2025 • 8

Generative Action Tell-Tales: Assessing Human Motion in Synthesized Videos

Paper • 2512.01803 • Published Dec 1, 2025 • 5

DDiT: Dynamic Patch Scheduling for Efficient Diffusion Transformers

Paper • 2602.16968 • Published Feb 19 • 12

Semantic Richness or Geometric Reasoning? The Fragility of VLM's Visual Invariance

Paper • 2604.01848 • Published 11 days ago

A Systematic Study of Cross-Modal Typographic Attacks on Audio-Visual Reasoning

Paper • 2604.03995 • Published 9 days ago • 4

submitted a paper to Daily Papers 4 days ago

A Systematic Study of Cross-Modal Typographic Attacks on Audio-Visual Reasoning

Paper • 2604.03995 • Published 9 days ago • 4