Riya
tai-tai-sama
AI & ML interests
Large Language Models, Applied ML, AI Agents, ReAct, Reflexion, Function Calling Models, Model Fine-Tuning, LoRA, QLoRA, RAG Systems, Semantic Search, Code Understanding Models, AST-Based Chunking, Model Evaluation, Generative AI, Production ML, ML Infrastructure, Cost Optimization, Token Efficiency, MLOps, Transformers, PyTorch, Llama Models
Recent Activity
upvoted an article 4 days ago
🪆 Introduction to Matryoshka Embedding Models
liked a model 4 days ago
jinaai/jina-embeddings-v5-text-small
reacted to sergiopaniego's post with 🔥 3 months ago
TRL now includes agent training support for GRPO‼️
Train 🕵️ agents with 🔧 tools, enabling interaction with external functions and APIs.
And of course, a new notebook and scripts to get you up to speed.
Notebook tutorial: https://github.com/huggingface/trl/blob/main/examples/notebooks/grpo_agent.ipynb
Script examples: https://github.com/huggingface/trl/blob/main/examples/scripts/grpo_agent.py
📦 TRL v0.26.0 release: https://github.com/huggingface/trl/releases/tag/v0.26.0