Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Dalision
/
Omni2Sound
like
5
Text-to-Audio
English
custom
audio-generation
video-to-audio
diffusion-transformer
multimodal
arxiv:
2601.02731
License:
cc-by-nc-4.0
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
Omni2Sound
Ctrl+K
Ctrl+K
1 contributor
History:
7 commits
Dalision
Update README.md
766f5bf
verified
about 2 months ago
vt2a-24-v55vt35-oa15-mq-td15
Add files using upload-large-folder tool
about 2 months ago
.gitattributes
Safe
1.52 kB
initial commit
about 2 months ago
README.md
5.05 kB
Update README.md
about 2 months ago
config.json
Safe
142 Bytes
Create config.json
about 2 months ago
oob_vae_16k_224410.ckpt
pickle
Detected Pickle imports (3)
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
What is a pickle import?
657 MB
xet
Add files using upload-large-folder tool
about 2 months ago
synchformer_state_dict.pth
Safe
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
What is a pickle import?
950 MB
xet
Add files using upload-large-folder tool
about 2 months ago