Jeongjae Park

jjp97

AI & ML interests

I’m interested in the latest NLP and AI technologies, such as uncertainty, retrieval, agentic approaches, and long-context models!

Recent Activity

upvoted a paper 1 day ago

THINKSAFE: Self-Generated Safety Alignment for Reasoning Models

upvoted a paper 3 days ago

Self-Improving Pretraining: using post-trained models to pretrain better models

upvoted a paper 4 days ago

Agentic Reasoning for Large Language Models

View all activity

Organizations

None yet

upvoted a paper 1 day ago

THINKSAFE: Self-Generated Safety Alignment for Reasoning Models

Paper • 2601.23143 • Published 4 days ago • 38

upvoted a paper 3 days ago

Self-Improving Pretraining: using post-trained models to pretrain better models

Paper • 2601.21343 • Published 6 days ago • 14

upvoted a paper 4 days ago

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published 16 days ago • 187

upvoted a paper 6 days ago

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

Paper • 2601.02151 • Published 29 days ago • 105

upvoted 3 papers 7 days ago

upvoted a paper 12 days ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Paper • 2601.06021 • Published 25 days ago • 43

upvoted 3 papers 13 days ago

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published 21 days ago • 148

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published 21 days ago • 145

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 295

upvoted a paper 15 days ago

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

Paper • 2601.08808 • Published 21 days ago • 38

upvoted a paper 16 days ago

Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits

Paper • 2512.20578 • Published Dec 23, 2025 • 85

upvoted a paper 18 days ago

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Paper • 2601.07372 • Published 22 days ago • 40

upvoted 4 papers 20 days ago

Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

Paper • 2601.04720 • Published 27 days ago • 52

EpiCaR: Knowing What You Don't Know Matters for Better Reasoning in LLMs

Paper • 2601.06786 • Published 24 days ago • 6

The Confidence Dichotomy: Analyzing and Mitigating Miscalibration in Tool-Use Agents

Paper • 2601.07264 • Published 23 days ago • 24

Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

Paper • 2512.24615 • Published Dec 31, 2025 • 119

upvoted 2 papers 21 days ago

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Paper • 2601.05593 • Published 26 days ago • 82

Solar Open Technical Report

Paper • 2601.07022 • Published 23 days ago • 65

Jeongjae Park

AI & ML interests

Recent Activity

Organizations

jjp97's activity