OnePiece123 's Collections
Unlocking Continual Learning Abilities in Language Models
Paper
• 2406.17245
• Published
• 30
A Closer Look into Mixture-of-Experts in Large Language Models
Paper
• 2406.18219
• Published
• 17
Symbolic Learning Enables Self-Evolving Agents
Paper
• 2406.18532
• Published
• 12
Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of
LLMs
Paper
• 2406.18629
• Published
• 42
AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for
Retrieval-Augmented Generation
Paper
• 2406.19251
• Published
• 10
LiteSearch: Efficacious Tree Search for LLM
Paper
• 2407.00320
• Published
• 40
Chain-of-Knowledge: Integrating Knowledge Reasoning into Large Language
Models by Learning from Knowledge Graphs
Paper
• 2407.00653
• Published
• 13
We-Math: Does Your Large Multimodal Model Achieve Human-like
Mathematical Reasoning?
Paper
• 2407.01284
• Published
• 81
Agentless: Demystifying LLM-based Software Engineering Agents
Paper
• 2407.01489
• Published
• 65
Planetarium: A Rigorous Benchmark for Translating Text to Structured
Planning Languages
Paper
• 2407.03321
• Published
• 20
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for
LLM Agents
Paper
• 2407.04363
• Published
• 34
DotaMath: Decomposition of Thought with Code Assistance and
Self-correction for Mathematical Reasoning
Paper
• 2407.04078
• Published
• 21
AgentInstruct: Toward Generative Teaching with Agentic Flows
Paper
• 2407.03502
• Published
• 51
Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large
Language Models -- The Story Goes On
Paper
• 2407.08348
• Published
• 52
Towards Building Specialized Generalist AI with System 1 and System 2
Fusion
Paper
• 2407.08642
• Published
• 11
MUSCLE: A Model Update Strategy for Compatible LLM Evolution
Paper
• 2407.09435
• Published
• 23
Paper
• 2407.10671
• Published
• 168
Sibyl: Simple yet Effective Agent Framework for Complex Real-world
Reasoning
Paper
• 2407.10718
• Published
• 19