Instructions to use google/flan-t5-xxl with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use google/flan-t5-xxl with Transformers:
# Load model directly from transformers import AutoTokenizer, AutoModelForSeq2SeqLM tokenizer = AutoTokenizer.from_pretrained("google/flan-t5-xxl") model = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-xxl") - Notebooks
- Google Colab
- Kaggle
Config does not align with original paper
#57
by cbock90 - opened
Hey there,
I see that the config significantly differs from the one on the paper:
- dmodel: Config: 4096, Paper: 1024
- #heads: Config: 64, Paper: 128
- d_ff: Config: 10240, Paper: 65536
- dkv: Config: 64, Paper: 128
Are there any insights why there is this difference in the checkpoint?