H company

Team

company

Verified

https://www.hcompany.ai/

hcompany_ai

hcompai

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

hamza-hcompany new activity 10 days ago

Hcompany/Holo3-35B-A3B:Is there any special design on agentical framwork? memory/planning? I only got 23% score on OSWorld

hamza-hcompany updated a collection 18 days ago

Holotron

hamza-hcompany published a model 18 days ago

Hcompany/Holotron-3-Nano

View all activity

Papers

Surfer 2: The Next Generation of Cross-Platform Computer Use Agents

View all Papers

Articles

posted an update 8 days ago

Post

1699

OpenEnv is growing fast in tutorials. If you're looking to get started with RL environments, check them out

> evaluate your agents using OpenEnv
> learn how rewards work via rubrics
> connect agents via MCP
> many moreeeee!

anything you think it's missing?

https://meta-pytorch.org/OpenEnv/tutorials/index.html

sergiopaniego

posted an update 9 days ago

Post

773

OpenEnv already ships 🚢 with a ready-to-deploy RLM environment on free HF Spaces

Drop "Attention Is All You Need", write code that spawns parallel LLM calls → ✅ correct answer, reward 1.0, in 4.2s

Run GRPO (TRL) → model learns to write that search strategy itself

test it yourself → sergiopaniego/repl-env
check out OpenEnv → https://github.com/meta-pytorch/OpenEnv

hamza-hcompany

in Hcompany/Holo3-35B-A3B 10 days ago

Is there any special design on agentical framwork? memory/planning? I only got 23% score on OSWorld

#5 opened 16 days ago by

Wenjin0421

hamza-hcompany

updated a collection 18 days ago

Holotron

Collection

2 items • Updated 18 days ago • 1

hamza-hcompany

published a model 18 days ago

Hcompany/Holotron-3-Nano

Image-Text-to-Text • 33B • Updated 18 days ago • 317 • 16

hamza-hcompany

updated a model 18 days ago

Hcompany/Holotron-3-Nano

Image-Text-to-Text • 33B • Updated 18 days ago • 317 • 16

h-aurelien-lac

updated a model 18 days ago

Hcompany/Holotron-3-Nano

Image-Text-to-Text • 33B • Updated 18 days ago • 317 • 16

sergiopaniego

posted an update about 1 month ago

Post

1363

Earlier this month, Apple introduced Simple Self-Distillation: a fine-tuning method that improves models on coding tasks just by sampling from the model and training on its own outputs with plain cross-entropy

And… it's already supported in TRL, built by Kashif Rasul. you can really feel the pace of development in the team 🐎

Paper by Ruixiang ZHANG, He Bai, Huangjie Zheng, Navdeep Jaitly, Ronan Collobert, Yizhe Zhang at Apple 🍎

How it works: the model generates completions at a training-time temperature (T_train) with top_k/top_p truncation, then fine-tunes on them with plain cross-entropy. no labels or verifier needed

You can try it right away with this ready-to-run example (Qwen3-4B on rStar-Coder):
https://github.com/huggingface/trl/blob/main/trl/experimental/ssd/ssd.py
or benchmark a checkpoint with the eval script:
https://github.com/huggingface/trl/blob/main/trl/experimental/ssd/ssd_eval.py

One neat insight from the paper: T_train and T_eval compose into an effective T_eff = T_train × T_eval, so a broad band of configs works well. even very noisy samples still help

Want to dig deeper?

Paper: Embarrassingly Simple Self-Distillation Improves Code Generation (2604.01193)
Trainer docs: https://huggingface.co/docs/trl/main/en/ssd_trainer

plcedoz38

published an article about 1 month ago

Article

Meet HoloTab by HCompany. Your AI browser companion.

Hcompany

•

about 1 month ago

• 24

sergiopaniego

posted an update about 1 month ago

Post

458

Great experience yesterday at PyTorch Conf Europe in Paris 🇫🇷

We (w/ @kashif ) talked about training LLMs through interaction, using trajectories across games, browsers, or simulators

Room was packed, a clear sign of interest in where RL post-training is heading.

sharing the slides! 🤓
https://drive.google.com/file/d/16k7YRnf5EJEo0XjXGlRJ_hVeLoFWKyNP/view?usp=sharing

hamza-hcompany

in Hcompany/Holo3-35B-A3B about 1 month ago

Rename README.md to README.mds

#3 opened about 1 month ago by

faizikhan1

sergiopaniego

posted an update about 1 month ago

Post

2863

Gemma 4 💎 is here and it’s strong!

to celebrate, we’re rolling out in TRL:

> support for multimodal tool responses for environments (OpenEnv)
> an example to train it in CARLA for autonomous driving with image-based tool calls

go check it out 🏎️🏎️

blog: https://huggingface.co/blog/gemma4
script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/carla_vlm_gemma.py

h-aleixcambray

in Hcompany/Holo3-35B-A3B about 1 month ago

Are there any plans regarding Holo3-9B?

#1 opened about 2 months ago by

Pevernow

hamza-hcompany

in Hcompany/Holo3-35B-A3B about 1 month ago

Quickstart guide link on model card leads to 404

#2 opened about 1 month ago by

TomSchelsen

plcedoz38

in Hcompany/Holo3-35B-A3B about 1 month ago

Are there any plans regarding Holo3-9B?

#1 opened about 2 months ago by

Pevernow

Quickstart guide link on model card leads to 404

#2 opened about 1 month ago by

TomSchelsen

plcedoz38

updated a model about 1 month ago

Hcompany/Holo3-35B-A3B

Image-Text-to-Text • 35B • Updated Apr 2 • 71.6k • 331

ramzidecoster

published an article about 1 month ago

Article

Holo3: Breaking the Computer Use Frontier

Hcompany

•

Apr 1

• 46

emricksini-h

updated a model about 1 month ago

Hcompany/Holo3-35B-A3B

Image-Text-to-Text • 35B • Updated Apr 2 • 71.6k • 331

sergiopaniego

posted an update about 2 months ago

Post

2081

TRL is officially an adult 🥳

excited to announce TRL v1.0❗️

head to the blog to see how we got here and what’s next for this post-training library, designed to keep pace with the field

https://huggingface.co/blog/trl-v1

2 replies

AI & ML interests

Recent Activity

Papers

Articles

Meet HoloTab by HCompany. Your AI browser companion.

Holo3: Breaking the Computer Use Frontier

Holotron-12B - High Throughput Computer Use Agent

H Company's new Holo2 model takes the lead in UI Localization

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

Team members 37

Hcompany's activity

Is there any special design on agentical framwork? memory/planning? I only got 23% score on OSWorld

Meet HoloTab by HCompany. Your AI browser companion.

Rename README.md to README.mds

Are there any plans regarding Holo3-9B?

Quickstart guide link on model card leads to 404

Are there any plans regarding Holo3-9B?

Quickstart guide link on model card leads to 404

Holo3: Breaking the Computer Use Frontier