Verified models. Tested with vLLM.
LLMs quantized with LLM Compressor to NVFP4. Check the JSON files in the model directories for more information.
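Since the description points to the JSON files, here is a minimal sketch of inspecting a checkpoint's quantization settings from Python. The repo id is a placeholder (the NVFP4 model names are not listed on this page); the quantization details typically live in the quantization_config section of config.json.

# Minimal sketch: download a checkpoint's config.json and print its
# quantization section. The repo id is a placeholder; substitute a model
# from the NVFP4 collection.
import json
from huggingface_hub import hf_hub_download

repo_id = "kaitchup/SomeModel-NVFP4"  # placeholder repo id
config_path = hf_hub_download(repo_id, "config.json")
with open(config_path) as f:
    config = json.load(f)
print(json.dumps(config.get("quantization_config", {}), indent=2))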
- kaitchup/QwQ-32B-bnb-4bit
  Text Generation • 34B • Updated • 13 • 1
- kaitchup/QwQ-32B-AutoRoundGPTQ-8bit
  Text Generation • 33B • Updated • 64 • 2
- kaitchup/QwQ-32B-AutoRoundGPTQ-4bit
  Text Generation • 33B • Updated • 11 • 1
- kaitchup/QwQ-32B-AutoRoundGPTQ-3bit
  Text Generation • 31B • Updated • 12
- kaitchup/OLMo-2-1124-7B-Instruct-AutoRound-GPTQ-4bit
  Text Generation • 7B • Updated • 12 • 1
- kaitchup/Llama-3.1-Tulu-3-70B-AutoRound-GPTQ-4bit
  Text Generation • 71B • Updated • 9
- kaitchup/Llama-3.1-Tulu-3-8B-AutoRound-GPTQ-4bit
  Text Generation • 8B • Updated • 4
- kaitchup/OLMo-2-1124-13B-Instruct-AutoRound-GPTQ-4bit
  Text Generation • 14B • Updated • 10
Selected OPUS language pairs formatted so that the source and target sentences form a single sequence, intended to facilitate fine-tuning of causal LLMs.
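A rough sketch of what such a single-sequence format can look like is shown below; the separator is an assumption for illustration, so check each dataset card for the exact formatting used.

# Sketch of formatting an OPUS source/target pair as one sequence for
# causal LM fine-tuning. The "###>" separator is an assumed convention,
# not necessarily the one used in these datasets.
def format_pair(source: str, target: str, sep: str = "###>") -> str:
    return f"{source} {sep} {target}"

print(format_pair("Bonjour, comment allez-vous ?", "Hello, how are you?"))
# Bonjour, comment allez-vous ? ###> Hello, how are you?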
Llama 2 7B and 13B, Llama 3 8B, and Mistral 7B quantized with GPTQ in 2-bit, 3-bit, 4-bit, and 8-bit.
- kaitchup/Llama-2-7b-hf-gptq-8bit
  Text Generation • 7B • Updated • 5
- kaitchup/Llama-2-7b-hf-gptq-3bit
  Text Generation • 6B • Updated • 879 • 1
- kaitchup/Llama-2-7b-hf-gptq-2bit
  Text Generation • 7B • Updated • 868
- kaitchup/Llama-2-7b-hf-gptq-4bit
  Text Generation • 7B • Updated • 7
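As a rough sketch, a prequantized GPTQ checkpoint like the ones listed above can typically be loaded directly with transformers, assuming the optimum package and a GPTQ backend (gptqmodel or auto-gptq) are installed; the prompt below is only illustrative.

# Sketch: loading a prequantized GPTQ checkpoint from the list above with
# transformers. Requires optimum plus gptqmodel (or auto-gptq) to be installed.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "kaitchup/Llama-2-7b-hf-gptq-4bit"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

inputs = tokenizer("The capital of France is", return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))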
Contaminated Mistral 7B and TinyLlama adapters, and the datasets used for contamination.
- kaitchup/Mistral-NeMo-Minitron-8B-Base-AutoRound-GPTQ-sym-4bit
  Text Generation • 8B • Updated • 5
- kaitchup/Mistral-NeMo-Minitron-8B-Base-AutoRound-GPTQ-asym-4bit
  Text Generation • 8B • Updated • 5
- kaitchup/Mistral-Nemo-Base-2407-AutoRound-GPTQ-asym-4bit
  Text Generation • 12B • Updated • 11
- kaitchup/Llama-3.1-Minitron-4B-Width-Base-AutoRound-GPTQ-asym-4bit
  Text Generation • 5B • Updated • 6
Verified models.
Verified models. All compatible with vLLM for very fast inference. Use the 3.1 models as they are more recent.
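A minimal sketch of loading one of these checkpoints with vLLM for offline inference is given below; the model id and sampling settings are illustrative, and memory settings should be adjusted to your GPU.

# Sketch: offline inference with vLLM on one of the quantized checkpoints
# listed on this page. Adjust max_model_len and GPU settings as needed.
from vllm import LLM, SamplingParams

llm = LLM(
    model="kaitchup/Meta-Llama-3.1-8B-Instruct-autoround-gptq-4bit-sym",
    max_model_len=4096,
)
params = SamplingParams(temperature=0.7, max_tokens=128)
outputs = llm.generate(["Explain GPTQ quantization in one paragraph."], params)
print(outputs[0].outputs[0].text)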
- kaitchup/Qwen2.5-72B-Instruct-autoround-2bit-32g-4096-gptq
  73B • Updated • 8
- kaitchup/Qwen2.5-72B-Instruct-autoround-2bit-128g-4096-gptq
  73B • Updated • 7
- kaitchup/Qwen2.5-72B-Instruct-autoround-2bit-128g-2048-gptq
  73B • Updated • 6
- kaitchup/Qwen2.5-72B-Instruct-autoround-2bit-64g-4096-gptq
  73B • Updated • 6
- kaitchup/Phi-4-mini-instruct-AutoRoundGPTQ-8bit
  Text Generation • 4B • Updated • 7
- kaitchup/Phi-4-mini-instruct-AutoRoundGPTQ-3bit
  Text Generation • 4B • Updated • 9
- kaitchup/Phi-4-mini-instruct-AutoRoundGPTQ-4bit
  Text Generation • 4B • Updated • 10
- kaitchup/Phi-4-mini-instruct-AutoRoundGPTQ-2bit
  Text Generation • 4B • Updated • 8
- kaitchup/DeepSeek-R1-Distill-Llama-8B-AutoRound-GPTQ-4bit
  Text Generation • 8B • Updated • 382 • 1
- kaitchup/DeepSeek-R1-Distill-Qwen-14B-AutoRound-GPTQ-4bit
  Text Generation • 15B • Updated • 28 • 7
- kaitchup/DeepSeek-R1-Distill-Qwen-7B-AutoRound-GPTQ-4bit
  Text Generation • 8B • Updated • 26 • 2
- kaitchup/Falcon3-10B-Instruct-AutoRound-GPTQ-4bit
  Text Generation • 10B • Updated • 8
- kaitchup/Falcon3-10B-Base-AutoRound-GPTQ-4bit
  Text Generation • 10B • Updated • 11
- kaitchup/Falcon3-7B-Base-AutoRound-GPTQ-4bit
  Text Generation • 7B • Updated • 10
- kaitchup/Falcon3-7B-Instruct-AutoRound-GPTQ-4bit
  Text Generation • 7B • Updated • 6
- kaitchup/Qwen2.5-1.5B-AutoRound-GPTQ-asym-4bit
  Text Generation • 2B • Updated • 7
- kaitchup/Qwen2.5-7B-AutoRound-GPTQ-asym-4bit
  Text Generation • 8B • Updated • 4
- kaitchup/Qwen2.5-1.5B-Instruct-AutoRound-GPTQ-asym-4bit
  Text Generation • 2B • Updated • 7
- kaitchup/Qwen2.5-7B-Instruct-AutoRound-GPTQ-asym-4bit
  Text Generation • 8B • Updated • 7
Machine translation adapters for Llama 2 7B.
- kaitchup/Llama-2-7b-mt-French-to-English
  Translation • Updated • 135 • 3
- kaitchup/Llama-2-7b-mt-Italian-to-English
  Translation • Updated • 7
- kaitchup/Llama-2-7b-mt-Indonesian-to-English
  Translation • Updated • 12
- kaitchup/Llama-2-7b-mt-Vietnamese-to-English
  Translation • Updated • 9
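A hedged sketch of applying one of the adapters above with PEFT follows. It assumes they are LoRA adapters trained on top of meta-llama/Llama-2-7b-hf, and the prompt format shown is only a guess; check each adapter card for the exact base model and prompt template.

# Sketch: loading a translation adapter from the list above on top of its
# assumed Llama 2 7B base model with PEFT.
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base_id = "meta-llama/Llama-2-7b-hf"  # assumed base model
adapter_id = "kaitchup/Llama-2-7b-mt-French-to-English"

tokenizer = AutoTokenizer.from_pretrained(base_id)
base = AutoModelForCausalLM.from_pretrained(base_id, device_map="auto")
model = PeftModel.from_pretrained(base, adapter_id)

prompt = "Bonjour, comment allez-vous ? ###>"  # hypothetical prompt format
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))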
Quantized and fine-tuned versions of the Yi models.
A collection of 7B models made with mergekit.
- kaitchup/Phi-3-mini-4k-instruct-gptq-4bit
  Text Generation • 4B • Updated • 733k • 2
- kaitchup/Phi-3-medium-128k-instruct-awq-4bit
  Text Generation • 14B • Updated
- kaitchup/Phi-3-mini-4k-instruct-bnb-4bit
  Text Generation • 4B • Updated • 7
- kaitchup/Phi-3-medium-4k-instruct-awq-4bit
  Text Generation • 14B • Updated
- kaitchup/Meta-Llama-3.1-8B-Instruct-autoround-gptq-4bit-sym
  Text Generation • 8B • Updated • 8 • 1
- kaitchup/Meta-Llama-3.1-8B-Instruct-awq-4bit
  Text Generation • 8B • Updated • 14 • 1
- kaitchup/Meta-Llama-3.1-8B-awq-4bit
  Text Generation • 8B • Updated • 7
- kaitchup/Meta-Llama-3.1-8B-Instruct-gptq-4bit
  Text Generation • 8B • Updated • 7
- kaitchup/Mistral-NeMo-Minitron-8B-Base-Minivoc-32k-v0.1a
  Text Generation • 8B • Updated • 2
- kaitchup/Llama-3.1-8B-Minivoc-32k-v0.1a
  Text Generation • 7B • Updated • 1
- kaitchup/Qwen2-1.5B-Minivoc-32k-v0.1a
  Text Generation • 1B • Updated • 12 • 2
- kaitchup/Qwen2.5-1.5B-Minivoc-32k-v0.1a-AutoRound-GPTQ-asym-4bit
  Text Generation • 1B • Updated • 6