Audio-Text-to-Text
Transformers
Safetensors
English
Chinese
qwen2
text-generation
speech-language-model
streaming
audio
multimodal
qwen2.5-omni
text-generation-inference
- Libraries
- Transformers
How to use zhifeixie/AudioInteraction with Transformers:
    # Load model directly
    from transformers import AutoTokenizer, AutoModelForMultimodalLM

    tokenizer = AutoTokenizer.from_pretrained("zhifeixie/AudioInteraction")
    model = AutoModelForMultimodalLM.from_pretrained("zhifeixie/AudioInteraction")
- Notebooks
- Google Colab
- Kaggle
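The Transformers loading snippet above can be wrapped in a small helper so that a missing dependency fails with a clear message. This is a minimal sketch, not the repository's official Quick Start: `load_audio_interaction` and `transformers_available` are hypothetical names introduced here for illustration, and the sketch assumes the installed `transformers` version provides `AutoModelForMultimodalLM`, as the snippet on this page implies.

```python
import importlib.util

# Repository id taken from this page.
MODEL_ID = "zhifeixie/AudioInteraction"


def transformers_available() -> bool:
    """Return True if the transformers package can be found."""
    return importlib.util.find_spec("transformers") is not None


def load_audio_interaction(model_id: str = MODEL_ID):
    """Hypothetical helper: load the tokenizer and model for model_id.

    Uses the same two classes as the snippet on this page. The first
    call downloads the weights from the Hub, so it needs network access.
    """
    if not transformers_available():
        raise RuntimeError(
            "transformers is not installed; try: pip install transformers"
        )
    # Imported lazily so this module can be imported without transformers.
    from transformers import AutoModelForMultimodalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForMultimodalLM.from_pretrained(model_id)
    return tokenizer, model
```

Calling `tokenizer, model = load_audio_interaction()` performs the same two `from_pretrained` calls as the original snippet. Audio preprocessing and generation are left out on purpose, since this page does not describe them.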
Update model card with official links, citation, and paper information
#1
by nielsr HF Staff - opened
This PR improves the model card by:
- Replacing TODO placeholders with verified links to the GitHub repository and StreamAudio-2M dataset.
- Adding a link to the official Project Page.
- Linking the model to its original paper.
- Updating the citation with the correct BibTeX entry from the paper.
- Refining the "Quick Start" section with accurate installation and usage instructions found in the codebase.