Quantized AI Models
These models are specifically quantized for CPU optimization you can use this on a docker space up to the speed of 9 token per second
This collection has no items.
These models are specifically quantized for CPU optimization you can use this on a docker space up to the speed of 9 token per second