Training Domain Draft Models for Speculative Decoding: Best Practices and Insights Paper • 2503.07807 • Published Mar 10
On the Tool Manipulation Capability of Open-source Large Language Models Paper • 2305.16504 • Published May 25, 2023 • 2
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models Paper • 2510.04618 • Published Oct 6 • 123
DataComp-LM: In search of the next generation of training sets for language models Paper • 2406.11794 • Published Jun 17, 2024 • 54
SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts Paper • 2405.07518 • Published May 13, 2024 • 27
SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts Paper • 2405.07518 • Published May 13, 2024 • 27
CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling Paper • 2210.07661 • Published Oct 14, 2022
Unrestricted Adversarial Examples via Semantic Manipulation Paper • 1904.06347 • Published Apr 12, 2019
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation Paper • 2112.02721 • Published Dec 6, 2021