CoRNStack: High-Quality Contrastive Data for Better Code Ranking Paper • 2412.01007 • Published Dec 1, 2024 • 1
Training Sparse Mixture Of Experts Text Embedding Models Paper • 2502.07972 • Published Feb 11, 2025 • 9
Nomic Embed: Training a Reproducible Long Context Text Embedder Paper • 2402.01613 • Published Feb 2, 2024 • 15
GPT4All: An Ecosystem of Open Source Compressed Language Models Paper • 2311.04931 • Published Nov 6, 2023 • 22