Jeongjae Park

jjp97

AI & ML interests

I’m interested in the latest NLP and AI technologies, such as uncertainty, retrieval, agentic approaches, and long-context models!

Recent Activity

upvoted a paper about 19 hours ago

Why Language Models Hallucinate

upvoted a paper about 19 hours ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

upvoted a paper about 19 hours ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

View all activity

Organizations

None yet

upvoted 3 papers about 19 hours ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 196

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published Dec 8, 2025 • 78

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 104

upvoted a paper 3 days ago

THINKSAFE: Self-Generated Safety Alignment for Reasoning Models

Paper • 2601.23143 • Published 5 days ago • 38

upvoted a paper 4 days ago

Self-Improving Pretraining: using post-trained models to pretrain better models

Paper • 2601.21343 • Published 7 days ago • 14

upvoted a paper 6 days ago

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published 17 days ago • 190

upvoted a paper 7 days ago

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

Paper • 2601.02151 • Published about 1 month ago • 106

upvoted 3 papers 8 days ago

upvoted a paper 13 days ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Paper • 2601.06021 • Published 26 days ago • 43

upvoted 3 papers 14 days ago

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published 22 days ago • 149

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published 22 days ago • 146

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 296

upvoted a paper 16 days ago

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

Paper • 2601.08808 • Published 22 days ago • 38

upvoted a paper 18 days ago

Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits

Paper • 2512.20578 • Published Dec 23, 2025 • 85

upvoted a paper 20 days ago

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Paper • 2601.07372 • Published 24 days ago • 40

upvoted 3 papers 21 days ago

Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

Paper • 2601.04720 • Published 28 days ago • 52

EpiCaR: Knowing What You Don't Know Matters for Better Reasoning in LLMs

Paper • 2601.06786 • Published 25 days ago • 6

The Confidence Dichotomy: Analyzing and Mitigating Miscalibration in Tool-Use Agents

Paper • 2601.07264 • Published 24 days ago • 24

Jeongjae Park

AI & ML interests

Recent Activity

Organizations

jjp97's activity