1 18 6

Abdulhakeem Adefioye

kokolamba

AI & ML interests

None yet

Recent Activity

upvoted a paper 26 days ago

Target Policy Optimization

updated a dataset 4 months ago

kokolamba/keen_popqa_gpt2xl_generations

upvoted a paper 4 months ago

Estimating Knowledge in Large Language Models Without Generating a Single Token

View all activity

Organizations

upvoted a paper 26 days ago

Target Policy Optimization

Paper • 2604.06159 • Published Apr 7 • 23

upvoted a paper 4 months ago

Estimating Knowledge in Large Language Models Without Generating a Single Token

Paper • 2406.12673 • Published Jun 18, 2024 • 9

upvoted a collection 6 months ago

LMEnt

Collection

14 items • Updated Sep 14, 2025 • 6

upvoted 2 papers 7 months ago

ReplaceMe: Network Simplification via Depth Pruning and Transformer Block Linearization

Paper • 2505.02819 • Published Feb 19 • 26

Share Your Attention: Transformer Weight Sharing via Matrix-based Dictionary Learning

Paper • 2508.04581 • Published Aug 6, 2025 • 6

upvoted an article 7 months ago

Article

Sparse Mixture of Experts Language Model from Scratch: Extending makeMoE with Expert Capacity

AviSoori1x

•

Mar 18, 2024

• 14

upvoted a paper 10 months ago

DyVo: Dynamic Vocabularies for Learned Sparse Retrieval with Entities

Paper • 2410.07722 • Published Oct 10, 2024 • 15

upvoted an article 10 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 773

upvoted 2 articles 11 months ago

Article

Training and Finetuning Sparse Embedding Models with Sentence Transformers

tomaarsen, arthurbresnu

•

Jul 1, 2025

• 138

Article

Train 400x faster Static Embedding Models with Sentence Transformers

tomaarsen

•

Jan 15, 2025

• 229

upvoted a paper 11 months ago

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30, 2025 • 74

upvoted an article 12 months ago

Article

Training and Finetuning Reranker Models with Sentence Transformers

tomaarsen

•

Mar 26, 2025

• 193

upvoted 3 papers 12 months ago

Rank-K: Test-Time Reasoning for Listwise Reranking

Paper • 2505.14432 • Published May 20, 2025 • 5

Fixing Data That Hurts Performance: Cascading LLMs to Relabel Hard Negatives for Robust Information Retrieval

Paper • 2505.16967 • Published May 22, 2025 • 24

BERT has a Mouth, and It Must Speak: BERT as a Markov Random Field Language Model

Paper • 1902.04094 • Published Feb 11, 2019 • 1

upvoted an article about 1 year ago

Article

Unlocking Longer Generation with Key-Value Cache Quantization

RaushanTurganbay

•

May 16, 2024

• 56

upvoted an article over 1 year ago

Article

Deriving DPO's Loss

hba123

•

Dec 24, 2024

• 30

upvoted a paper over 1 year ago

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 38

Abdulhakeem Adefioye

AI & ML interests

Recent Activity

Organizations

kokolamba's activity

Sparse Mixture of Experts Language Model from Scratch: Extending makeMoE with Expert Capacity

SmolLM3: smol, multilingual, long-context reasoner

Training and Finetuning Sparse Embedding Models with Sentence Transformers

Train 400x faster Static Embedding Models with Sentence Transformers

Training and Finetuning Reranker Models with Sentence Transformers

Unlocking Longer Generation with Key-Value Cache Quantization

Deriving DPO's Loss