Gibran Iqbal

Jibbscript

1 550 434

AI & ML interests

None yet

Recent Activity

upvoted an article about 8 hours ago

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

liked a model about 8 hours ago

lightonai/LateOn

upvoted an article about 8 hours ago

After the party comes the free lunch: regularizing ColBERT models to enhance pooling capabilities and reduce index footprint

View all activity

Organizations

upvoted 2 articles about 8 hours ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 170

Article

After the party comes the free lunch: regularizing ColBERT models to enhance pooling capabilities and reduce index footprint

lightonai

•

7 days ago

• 13

upvoted a paper 3 days ago

RuleChef: Grounding LLM Task Knowledge in Human-Editable Rules

Paper • 2607.01293 • Published 12 days ago • 3

upvoted an article 4 days ago

Article

Native-speed vLLM transformers modeling backend

hmellor, lysandre

•

5 days ago

• 29

upvoted a collection 7 days ago

AI2 Safety Toolkit

Collection

Safety data, moderation tools and safe LLMs. • 6 items • Updated Dec 23, 2025 • 11

upvoted a paper 7 days ago

WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Paper • 2406.18495 • Published Jun 26, 2024 • 15

upvoted 9 papers 9 days ago

Logit-Contribution Scoring Identifies Non-Literal Retrieval Heads

Paper • 2607.01002 • Published 12 days ago • 18

Breaking Failure Cascades: Step-Aware Reinforcement Learning for Medical Multimodal Reasoning

Paper • 2606.31825 • Published 13 days ago • 28

SkillCoach: Self-Evolving Rubrics for Evaluating and Enhancing Agentic Skill-Use

Paper • 2607.01874 • Published 11 days ago • 18

WorldDirector: Building Controllable World Simulators with Persistent Dynamic Memory

Paper • 2607.02517 • Published 11 days ago • 32

AgenticDataBench: A Comprehensive Benchmark for Data Agents

Paper • 2607.01647 • Published 11 days ago • 35

Morphing into Hybrid Attention Models

Paper • 2606.30562 • Published 14 days ago • 48

EvoPolicyGym: Evaluating Autonomous Policy Evolution in Interactive Environments

Paper • 2607.02440 • Published 11 days ago • 50

AgenticSTS: A Bounded-Memory Testbed for Long-Horizon LLM Agents

Paper • 2607.02255 • Published 11 days ago • 61

Program-as-Weights: A Programming Paradigm for Fuzzy Functions

Paper • 2607.02512 • Published 11 days ago • 118

upvoted 4 papers 10 days ago

upvoted a paper 11 days ago

Dockerless: Environment-Free Program Verifier for Coding Agents

Paper • 2606.28436 • Published 17 days ago • 109

Gibran Iqbal

AI & ML interests

Recent Activity

Organizations

Jibbscript's activity

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

After the party comes the free lunch: regularizing ColBERT models to enhance pooling capabilities and reduce index footprint

Native-speed vLLM transformers modeling backend