Hiroaki OGASAWARA

xhiroga

AI & ML interests

None yet

Recent Activity

liked a model 4 days ago

deepseek-ai/DeepSeek-OCR-2

liked a Space 8 days ago

Qwen/Qwen3-TTS

updated a dataset about 1 month ago

xhiroga/data

View all activity

Organizations

liked a model 4 days ago

deepseek-ai/DeepSeek-OCR-2

Image-Text-to-Text • 3B • Updated 3 days ago • 103k • 616

liked a Space 8 days ago

Qwen3-TTS Demo

🎙

1.12k

Generate realistic speech from text with custom voices or voice cloning

updated a dataset about 1 month ago

xhiroga/data

Viewer • Updated about 1 month ago • 1 • 81 • 1

liked a dataset 2 months ago

Seed3D/Articulation-XL2.0

Updated Sep 19, 2025 • 170 • 29

liked a model 2 months ago

VAST-AI/UniRig

Updated Aug 1, 2025 • 71

liked a model 3 months ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 232k • 1.57k

liked a Space 3 months ago

Open ASR Leaderboard

🏆

1.21k

View and request speech models benchmark data

liked a model 3 months ago

nguyenvulebinh/AV-HuBERT-MuAViC-multilingual

Text Generation • 0.4B • Updated Mar 6, 2025 • 5 • 2

liked a model 4 months ago

meta-llama/Llama-3.2-3B

Text Generation • 3B • Updated Oct 24, 2024 • 797k • 690

upvoted a paper 4 months ago

Zero-AVSR: Zero-Shot Audio-Visual Speech Recognition with LLMs by Learning Language-Agnostic Speech Representations

Paper • 2503.06273 • Published Mar 8, 2025 • 6

liked a model 4 months ago

fierce-cats/beatrice-trainer

Audio-to-Audio • Updated Aug 30, 2025 • 37

updated a dataset 5 months ago

xhiroga/hiroga-speech

Updated Sep 14, 2025 • 31

published a dataset 5 months ago

xhiroga/hiroga-speech

Updated Sep 14, 2025 • 31

liked 3 models 6 months ago

liked 2 Spaces 6 months ago

LLM Embeddings Explained: A Visual and Intuitive Guide

🚀

320

How Language Models Turn Text into Meaning, From Traditional

Mitsua Likes Demo

🚀

Text-to-Image Diffusion Model trained on licensed/pd data