Sara Han Díaz
sdiazlor
AI & ML interests
Data curation and generation, RLHF, RAG, Prompt Engineering
Recent Activity
posted an update 11 days ago
More OSS than ever with the latest pruna 0.3.2 release. It extends existing algorithm families, such as compilers, kernels, and pruners, and adds new ones, including decoders, distillers, enhancers, and recoverers. But it's not only a collection of algorithms; instead, you can easily combine them to get the biggest efficiency win.
Read the full blog here: https://huggingface.co/blog/PrunaAI/pruna-0-3-2-open-source-optimization-algorithms upvoted an article 11 days ago
KV Caching Explained: Optimizing Transformer Inference Efficiency published an article 11 days ago
Pruna 0.3.2: More OSS Algos, More Ways to Optimize