This model how load colab with chat ui ?

by Xhub1880 - opened 2 days ago

Discussion

Xhub1880

2 days ago

Give me the code

juni3227

about 10 hours ago

I think you should run it with huggingface transformer for a while. This is because this model's architecture is not a usual multiheaded attention. nor well adopted MAMBA or other quasi-linear scaling models. And because there seems to be an unique quark with flash attention dependency, doubt any OSS llm serving project like vllm would support this novel model in short time.

If you need chat ui, just use gradio. I am pretty sure any commercial ai chatbot would make one in no time.
And also be noted that, this is not an instruct (chat bot) weight.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment