Guilherme Bueno

gbaf

AI & ML interests

None yet

Recent Activity

liked a dataset 4 months ago

HuggingFaceFW/finepdfs

liked a model 8 months ago

deepseek-ai/DeepSeek-R1-0528

liked a model 10 months ago

meta-llama/Llama-4-Maverick-17B-128E-Instruct

View all activity

Organizations

None yet

liked a dataset 4 months ago

HuggingFaceFW/finepdfs

Viewer • Updated 23 days ago • 476M • 34.4k • 811

liked a model 8 months ago

deepseek-ai/DeepSeek-R1-0528

Text Generation • Updated May 29, 2025 • 475k • • 2.4k

liked 2 models 10 months ago

meta-llama/Llama-4-Maverick-17B-128E-Instruct

Image-to-Text • 402B • Updated May 22, 2025 • 28.9k • 455

meta-llama/Llama-4-Scout-17B-16E-Instruct

Image-to-Text • 109B • Updated May 22, 2025 • 201k • 1.21k

upvoted 16 papers 11 months ago

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published Mar 12, 2025 • 75

New Trends for Modern Machine Translation with Large Reasoning Models

Paper • 2503.10351 • Published Mar 13, 2025 • 25

Gemini Embedding: Generalizable Embeddings from Gemini

Paper • 2503.07891 • Published Mar 10, 2025 • 45

Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published Mar 12, 2025 • 38

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published Mar 10, 2025 • 47

World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning

Paper • 2503.10480 • Published Mar 13, 2025 • 55

SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories

Paper • 2503.08625 • Published Mar 11, 2025 • 27

Taking Notes Brings Focus? Towards Multi-Turn Multimodal Dialogue Learning

Paper • 2503.07002 • Published Mar 10, 2025 • 39

VisualPRM: An Effective Process Reward Model for Multimodal Reasoning

Paper • 2503.10291 • Published Mar 13, 2025 • 36

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7, 2025 • 123

Implicit Reasoning in Transformers is Reasoning through Shortcuts

Paper • 2503.07604 • Published Mar 10, 2025 • 23

GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training

Paper • 2503.08525 • Published Mar 11, 2025 • 17

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Paper • 2503.07536 • Published Mar 10, 2025 • 88

Guilherme Bueno

AI & ML interests

Recent Activity

Organizations

gbaf's activity