- Inference-Time Computations for LLM Reasoning and Planning: A Benchmark and Insights
  Paper • 2502.12521 • Published
- Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching
  Paper • 2503.05179 • Published • 46
- Chain of Draft: Thinking Faster by Writing Less
  Paper • 2502.18600 • Published • 50
- SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs
  Paper • 2502.12134 • Published • 3
ztgx
AI & ML interests
None yet
Recent Activity
- liked a dataset 9 days ago: llamaindex/ParseBench
- upvoted an article 9 months ago: LLM Inference at scale with TGI
- upvoted an article 9 months ago: Prefill and Decode for Concurrent Requests - Optimizing LLM Performance
Organizations
None yet