Document Question Answering
Transformers
Safetensors
chatglm
feature-extraction
text-generation-inference
custom_code
4-bit precision
bitsandbytes
Instructions to use nikravan/glm-4vq with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use nikravan/glm-4vq with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("document-question-answering", model="nikravan/glm-4vq", trust_remote_code=True)# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("nikravan/glm-4vq", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
ValueError: too many values to unpack (expected 2)
#1
by vivasvan100 - opened
I am stuck on this error. Any idea what can i do to fix this?
Traceback (most recent call last):
File "<stdin>", line 2, in <module>
File "/home/ubuntu/.local/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return func(*args, **kwargs)
File "/home/ubuntu/.local/lib/python3.10/site-packages/transformers/generation/utils.py", line 1914, in generate
result = self._sample(
File "/home/ubuntu/.local/lib/python3.10/site-packages/transformers/generation/utils.py", line 2651, in _sample
outputs = self(
File "/home/ubuntu/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1532, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "/home/ubuntu/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1541, in _call_impl
return forward_call(*args, **kwargs)
File "/home/ubuntu/.local/lib/python3.10/site-packages/accelerate/hooks.py", line 169, in new_forward
output = module._old_forward(*args, **kwargs)
File "/home/ubuntu/.cache/huggingface/modules/transformers_modules/nikravan/glm-4vq/e441477369dc88ad0ab225d9cd69db0291e2dc7b/modeling_chatglm.py", line 1017, in forward
transformer_outputs = self.transformer(
File "/home/ubuntu/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1532, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "/home/ubuntu/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1541, in _call_impl
return forward_call(*args, **kwargs)
File "/home/ubuntu/.local/lib/python3.10/site-packages/accelerate/hooks.py", line 169, in new_forward
output = module._old_forward(*args, **kwargs)
File "/home/ubuntu/.cache/huggingface/modules/transformers_modules/nikravan/glm-4vq/e441477369dc88ad0ab225d9cd69db0291e2dc7b/modeling_chatglm.py", line 906, in forward
hidden_states, presents, all_hidden_states, all_self_attentions = self.encoder(
File "/home/ubuntu/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1532, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "/home/ubuntu/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1541, in _call_impl
return forward_call(*args, **kwargs)
File "/home/ubuntu/.local/lib/python3.10/site-packages/accelerate/hooks.py", line 169, in new_forward
output = module._old_forward(*args, **kwargs)
File "/home/ubuntu/.cache/huggingface/modules/transformers_modules/nikravan/glm-4vq/e441477369dc88ad0ab225d9cd69db0291e2dc7b/modeling_chatglm.py", line 664, in forward
layer_ret = layer(
File "/home/ubuntu/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1532, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "/home/ubuntu/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1541, in _call_impl
return forward_call(*args, **kwargs)
File "/home/ubuntu/.local/lib/python3.10/site-packages/accelerate/hooks.py", line 169, in new_forward
output = module._old_forward(*args, **kwargs)
File "/home/ubuntu/.cache/huggingface/modules/transformers_modules/nikravan/glm-4vq/e441477369dc88ad0ab225d9cd69db0291e2dc7b/modeling_chatglm.py", line 567, in forward
attention_output, kv_cache = self.self_attention(
File "/home/ubuntu/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1532, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "/home/ubuntu/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1541, in _call_impl
return forward_call(*args, **kwargs)
File "/home/ubuntu/.local/lib/python3.10/site-packages/accelerate/hooks.py", line 169, in new_forward
output = module._old_forward(*args, **kwargs)
File "/home/ubuntu/.cache/huggingface/modules/transformers_modules/nikravan/glm-4vq/e441477369dc88ad0ab225d9cd69db0291e2dc7b/modeling_chatglm.py", line 436, in forward
cache_k, cache_v = kv_cache
ValueError: too many values to unpack (expected 2)
It might be due to your transformers or accelerate version library
pip install transformers==4.41.2
pip install git+https://github.com/huggingface/accelerate.git