CaoZS

Elpmis

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 4 days ago

DLLM-Searcher: Adapting Diffusion Large Language Model for Search Agents

Paper • 2602.07035 • Published 12 days ago • 30

upvoted a paper 27 days ago

When Personalization Misleads: Understanding and Mitigating Hallucinations in Personalized LLMs

Paper • 2601.11000 • Published 30 days ago • 26

upvoted a paper 30 days ago

MatchTIR: Fine-Grained Supervision for Tool-Integrated Reasoning via Bipartite Matching

Paper • 2601.10712 • Published about 1 month ago • 24

updated a model 3 months ago

Elpmis/Qwen3-0.6B-soft-thinking-last-token-naive-top-all

Feature Extraction • 0.6B • Updated Nov 11, 2025

published a model 3 months ago

Elpmis/Qwen3-0.6B-soft-thinking-last-token-naive-top-all

Feature Extraction • 0.6B • Updated Nov 11, 2025

updated a model 3 months ago

Elpmis/Qwen3-0.6B-soft-thinking-last-token-naive-topk30

Feature Extraction • 0.6B • Updated Nov 11, 2025

published a model 3 months ago

Elpmis/Qwen3-0.6B-soft-thinking-last-token-naive-topk30

Feature Extraction • 0.6B • Updated Nov 11, 2025

updated a model 3 months ago

Elpmis/Qwen3-0.6B-direct-embedding-last-token-naive

Feature Extraction • 0.6B • Updated Nov 11, 2025

published a model 3 months ago

Elpmis/Qwen3-0.6B-direct-embedding-last-token-naive

Feature Extraction • 0.6B • Updated Nov 11, 2025

updated a model 3 months ago

Elpmis/Qwen3-0.6B-soft-thinking-last-token-PRL

Feature Extraction • 0.6B • Updated Nov 11, 2025

published a model 3 months ago

Elpmis/Qwen3-0.6B-soft-thinking-last-token-PRL

Feature Extraction • 0.6B • Updated Nov 11, 2025

updated a model 3 months ago

Elpmis/Qwen3-0.6B-soft-thinking-concept-mean-naive

Feature Extraction • 0.6B • Updated Nov 11, 2025

published a model 3 months ago

Elpmis/Qwen3-0.6B-soft-thinking-concept-mean-naive

Feature Extraction • 0.6B • Updated Nov 11, 2025

updated a model 3 months ago

Elpmis/Qwen3-0.6B-soft-thinking-concept-mean-ICR

Feature Extraction • 0.6B • Updated Nov 11, 2025

published a model 3 months ago

Elpmis/Qwen3-0.6B-soft-thinking-concept-mean-ICR

Feature Extraction • 0.6B • Updated Nov 11, 2025

updated a model 3 months ago

Elpmis/Qwen3-0.6B-soft-thinking-concept-mean-ERL

Feature Extraction • 0.6B • Updated Nov 11, 2025

published a model 3 months ago

Elpmis/Qwen3-0.6B-soft-thinking-concept-mean-ERL

Feature Extraction • 0.6B • Updated Nov 11, 2025

updated a model 3 months ago

Elpmis/Qwen3-0.6B-vanilla

Feature Extraction • 0.6B • Updated Nov 11, 2025

published a model 3 months ago

Elpmis/Qwen3-0.6B-vanilla

Feature Extraction • 0.6B • Updated Nov 11, 2025

updated a model 3 months ago

Elpmis/Qwen3-0.6B-soft-thinking-last-token-naive

0.6B • Updated Nov 11, 2025