Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
alanayu lee
alanayu
Follow
AI & ML interests
None yet
Organizations
None yet
alanayu
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
Qwen/Qwen3-Next-80B-A3B-Instruct
2 months ago
请问一下,使用megatron微调Qwen3-Next时,设置--target_modules为"all-linear"能否训练到Qwen3NextGatedDeltaNet部分?
👀
2
#41 opened 2 months ago by
alanayu
New activity in
meituan-longcat/LongCat-Flash-Chat
5 months ago
这个模型是不是还不能用VLLM推理?
🚀
1
#9 opened 5 months ago by
alanayu
New activity in
Qwen/Qwen3-30B-A3B
8 months ago
How to train the Qwen3-30B-A3B using Reinforcement Learning?
#34 opened 8 months ago by
alanayu
New activity in
unsloth/Qwen3-30B-A3B-GGUF
9 months ago
Not compatible with transformers library
4
#8 opened 10 months ago by
Xeenxavier007