Caio Cesar Iglesias

caioiglesias

AI & ML interests

Reinforcement Learning

Recent Activity

liked a model 27 days ago

tarn59/book_flatten_and_crop_qwen_image_edit_2509

liked a Space 3 months ago

briaai/BRIA-RMBG-2.0

liked a Space 3 months ago

Stable-X/ReconViaGen

View all activity

Organizations

liked a model 27 days ago

tarn59/book_flatten_and_crop_qwen_image_edit_2509

Image-to-Image • Updated 28 days ago • 239 • • 37

liked 2 Spaces 3 months ago

BRIA RMBG 2.0

🐢

848

remove background from any image

ReconViaGen

🖥

155

High-fidelity 3D Geometry Generation from multi-view images

upvoted a collection 3 months ago

VibeVoice

Collection

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated 12 days ago • 171

liked a Space 6 months ago

Sparc3D

🏃

1.57k

Next-Gen High-Resolution 3D Model Generation

upvoted 2 papers 7 months ago

Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence

Paper • 2505.23747 • Published May 29 • 68

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Paper • 2505.17612 • Published May 23 • 81

liked a Space 7 months ago

TRELLIS - Multiple Imagen a 3D

🚀

Scalable and Versatile 3D Generation from images

liked a model 7 months ago

Comfy-Org/HiDream-I1_ComfyUI

Updated Aug 5 • 334k • 202

liked a Space 7 months ago

Qwen3 WebGPU

🚀

A hybrid reasoning model that runs locally in your browser.

liked 2 models 9 months ago

sesame/csm-1b

Text-to-Speech • Updated 15 days ago • 27.3k • 2.29k

docling-project/SmolDocling-256M-preview

Image-Text-to-Text • 0.3B • Updated Sep 17 • 102k • 1.6k

liked a Space 9 months ago

Gemini Image Edit

📚

273

Generate edited images with text prompts

upvoted an article 10 months ago

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25, 2024

•

191

upvoted an article 11 months ago

Article

SmolVLM Grows Smaller – Introducing the 256M & 500M Models!

Jan 23

•

186

liked a Space 11 months ago

Stable Point-Aware 3D

⚡

466

Generate 3D models from images

liked a model about 1 year ago

meta-llama/Llama-3.2-11B-Vision-Instruct

Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 134k • • 1.55k

upvoted a collection about 1 year ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 647

liked 2 Spaces over 1 year ago

Stable Fast 3D

🎮

1.13k

Generate a 3D mesh model from an image

OpenVoice

🤗

1.12k

Generate customized speech from text using a reference audio