Article How to Use Multiple GPUs in Hugging Face Transformers: Device Map vs Tensor Parallelism Feb 12 • 19
sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity • 22.7M • Updated Mar 6, 2025 • 204M • 4.71k
Learning Rate Matters: Vanilla LoRA May Suffice for LLM Fine-tuning Paper • 2602.04998 • Published Feb 4 • 6
sentence-transformers/all-mpnet-base-v2 Sentence Similarity • 0.1B • Updated Aug 19, 2025 • 33.4M • 1.28k
Compound AI Systems Optimization: A Survey of Methods, Challenges, and Future Directions Paper • 2506.08234 • Published Jun 9, 2025 • 9