Auden

community

AI & ML interests

Auden is an open research initiative for audio and multimodal understanding. We publish reproducible code, curated datasets, model checkpoints, and interactive demos to enable transparent evaluation and strong, reusable baselines.

Recent Activity

Cyril666 authored a paper 2 days ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Cyril666 authored a paper 2 days ago

What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness

Cyril666 authored a paper 2 days ago

MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

View all activity

Cyril666

authored 7 papers 2 days ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22, 2025 • 90

What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness

Paper • 2502.14914 • Published Feb 19, 2025

MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

Paper • 2509.21268 • Published Sep 25, 2025 • 104

N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models

Paper • 2512.16561 • Published Dec 18, 2025 • 20

Chain of Ideas: Revolutionizing Research in Novel Idea Development with LLM Agents

Paper • 2410.13185 • Published Oct 17, 2024 • 5

Focus on the Whole Character: Discriminative Character Modeling for Scene Text Recognition

Paper • 2407.05562 • Published Jul 8, 2024

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

Paper • 2501.00599 • Published Dec 31, 2024 • 46

yshao18

updated 5 models about 1 month ago

published 2 models about 1 month ago

AudenAI/TagSpeech-Alimeeting

Automatic Speech Recognition • Updated Jan 22 • 2

AudenAI/TagSpeech-AMI

Automatic Speech Recognition • Updated Jan 22 • 39 • 4

mingyue66

updated a model about 1 month ago

AudenAI/TagSpeech-Alimeeting

Automatic Speech Recognition • Updated Jan 22 • 2

yshao18

updated 2 models about 1 month ago

AudenAI/TagSpeech-AMI

Automatic Speech Recognition • Updated Jan 22 • 39 • 4

AudenAI/TagSpeech-Alimeeting

Automatic Speech Recognition • Updated Jan 22 • 2

mingyue66

updated a model about 1 month ago

AudenAI/TagSpeech-AMI

Automatic Speech Recognition • Updated Jan 22 • 39 • 4

mingyue66

updated a collection about 2 months ago

Auden-LLM

Collection

3 items • Updated Jan 19

AI & ML interests

Recent Activity

Team members 13

AudenAI's activity