Shoaib
shoaibmohd
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
5 days ago
MinerU: An Open-Source Solution for Precise Document Content Extraction
updated
a collection
11 days ago
Datasets
updated
a collection
11 days ago
OCR
Organizations
NBA/Recommenders
-
FinTRec: Transformer Based Unified Contextual Ads Targeting and Personalization for Financial Applications
Paper • 2511.14865 • Published • 3 -
TradingAgents: Multi-Agents LLM Financial Trading Framework
Paper • 2412.20138 • Published • 14 -
MemOS: A Memory OS for AI System
Paper • 2507.03724 • Published • 157
Tab models
Learning from examples - training/inference
-
ExGRPO: Learning to Reason from Experience
Paper • 2510.02245 • Published • 80 -
A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning
Paper • 2510.01132 • Published • 5 -
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models
Paper • 2510.04618 • Published • 124 -
MixReasoning: Switching Modes to Think
Paper • 2510.06052 • Published • 21
Data Analysis Papers
-
Scaling Generalist Data-Analytic Agents
Paper • 2509.25084 • Published • 18 -
PARROT: A Benchmark for Evaluating LLMs in Cross-System SQL Translation
Paper • 2509.23338 • Published • 4 -
LLMSQL: Upgrading WikiSQL for the LLM Era of Text-to-SQL
Paper • 2510.02350 • Published • 3 -
RAG-Anything: All-in-One RAG Framework
Paper • 2510.12323 • Published • 49
Memory
-
MemOS: A Memory OS for AI System
Paper • 2507.03724 • Published • 157 -
Zep: A Temporal Knowledge Graph Architecture for Agent Memory
Paper • 2501.13956 • Published • 8 -
MIRIX: Multi-Agent Memory System for LLM-Based Agents
Paper • 2507.07957 • Published • 79 -
Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory
Paper • 2504.19413 • Published • 34
Voice models
-
Step-Audio-R1 Technical Report
Paper • 2511.15848 • Published • 51 -
OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation
Paper • 2410.17799 • Published • 6 -
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents
Paper • 2507.04009 • Published • 51
Computer Use Agent
OCR
-
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing
Paper • 2509.22186 • Published • 136 -
CommonForms: A Large, Diverse Dataset for Form Field Detection
Paper • 2509.16506 • Published • 19 -
Automated Structured Radiology Report Generation with Rich Clinical Context
Paper • 2510.00428 • Published • 7 -
Extract-0: A Specialized Language Model for Document Information Extraction
Paper • 2509.22906 • Published
Datasets
Memory
-
MemOS: A Memory OS for AI System
Paper • 2507.03724 • Published • 157 -
Zep: A Temporal Knowledge Graph Architecture for Agent Memory
Paper • 2501.13956 • Published • 8 -
MIRIX: Multi-Agent Memory System for LLM-Based Agents
Paper • 2507.07957 • Published • 79 -
Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory
Paper • 2504.19413 • Published • 34
NBA/Recommenders
-
FinTRec: Transformer Based Unified Contextual Ads Targeting and Personalization for Financial Applications
Paper • 2511.14865 • Published • 3 -
TradingAgents: Multi-Agents LLM Financial Trading Framework
Paper • 2412.20138 • Published • 14 -
MemOS: A Memory OS for AI System
Paper • 2507.03724 • Published • 157
Voice models
-
Step-Audio-R1 Technical Report
Paper • 2511.15848 • Published • 51 -
OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation
Paper • 2410.17799 • Published • 6 -
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents
Paper • 2507.04009 • Published • 51
Tab models
Computer Use Agent
Learning from examples - training/inference
-
ExGRPO: Learning to Reason from Experience
Paper • 2510.02245 • Published • 80 -
A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning
Paper • 2510.01132 • Published • 5 -
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models
Paper • 2510.04618 • Published • 124 -
MixReasoning: Switching Modes to Think
Paper • 2510.06052 • Published • 21
OCR
-
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing
Paper • 2509.22186 • Published • 136 -
CommonForms: A Large, Diverse Dataset for Form Field Detection
Paper • 2509.16506 • Published • 19 -
Automated Structured Radiology Report Generation with Rich Clinical Context
Paper • 2510.00428 • Published • 7 -
Extract-0: A Specialized Language Model for Document Information Extraction
Paper • 2509.22906 • Published
Data Analysis Papers
-
Scaling Generalist Data-Analytic Agents
Paper • 2509.25084 • Published • 18 -
PARROT: A Benchmark for Evaluating LLMs in Cross-System SQL Translation
Paper • 2509.23338 • Published • 4 -
LLMSQL: Upgrading WikiSQL for the LLM Era of Text-to-SQL
Paper • 2510.02350 • Published • 3 -
RAG-Anything: All-in-One RAG Framework
Paper • 2510.12323 • Published • 49