Jinki Jeong PRO

Anserwise

1 14 155

AI & ML interests

None yet

Recent Activity

liked a model about 8 hours ago

FINAL-Bench/metacog-adapter-JGOS-31B-Citizen

liked a Space 1 day ago

VIDraft/vkae

upvoted an article 1 day ago

Does Your LLM Know *When It's About to Be Wrong*?

View all activity

Organizations

None yet

upvoted an article 1 day ago

Article

Does Your LLM Know When It's About to Be Wrong?

ginigen-ai

•

1 day ago

• 16

upvoted a collection 2 days ago

Metacognition Adapters

Collection

Per-model metacognition adapters from VIDRAFT Darwin/Chimera platform + AETHER metacognition-emergence technology. • 11 items • Updated 2 days ago • 17

upvoted an article 3 days ago

Article

Chitos: From Detection to Proof — An Autonomous Security AI That Actually Exploits

FINAL-Bench

•

3 days ago

• 19

upvoted an article 18 days ago

Article

FINAL-Bench Quantum: An Open, Neutral Benchmark for Quantum-Computing Methods

FINAL-Bench

•

18 days ago

• 17

upvoted an article about 2 months ago

Article

Training-Free Reasoning at 88.89% on GPQA Diamond: How Darwin Family Hit Frontier Scores Without a Single Gradient Step

FINAL-Bench

•

May 15

• 18

upvoted a paper about 2 months ago

Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning

Paper • 2605.14386 • Published May 14 • 63

upvoted 3 articles 3 months ago

Article

Darwin-TTS: We Gave a TTS Model 3% of an LLM's Brain — It Started Showing Emotion

FINAL-Bench

•

Apr 15

• 13

Article

"The Child That Surpassed Both Parents Through MRI-Guided Evolutionary Merge"

FINAL-Bench

•

Mar 31

• 14

Article

Introducing WM Bench: A Benchmark for Cognitive Intelligence in World Models

FINAL-Bench

•

Mar 29

• 13

upvoted 5 articles 4 months ago

Article

🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do

FINAL-Bench

•

Mar 10

• 38

Article

MARL: Runtime Middleware That Reduces LLM Hallucination Without Fine-Tuning

FINAL-Bench

•

Mar 9

• 16

Article

Structural Problems in AI Benchmarking and the Case for a Unified Evaluation Framework

FINAL-Bench

•

Mar 8

• 12

Article

Do Bubbles Form When Tens of Thousands of AIs Simulate Capitalism?

FINAL-Bench

•

Feb 24

• 17

Article

FINAL Bench: The Real Bottleneck to AGI Is Self-Correction

FINAL-Bench

•

Feb 21

• 20

Jinki Jeong PRO

AI & ML interests

Recent Activity

Organizations

Anserwise's activity

Does Your LLM Know *When It's About to Be Wrong*?

Chitos: From Detection to Proof — An Autonomous Security AI That Actually Exploits

FINAL-Bench Quantum: An Open, Neutral Benchmark for Quantum-Computing Methods

Training-Free Reasoning at 88.89% on GPQA Diamond: How Darwin Family Hit Frontier Scores Without a Single Gradient Step

Darwin-TTS: We Gave a TTS Model 3% of an LLM's Brain — It Started Showing Emotion

"The Child That Surpassed Both Parents Through MRI-Guided Evolutionary Merge"

Introducing WM Bench: A Benchmark for Cognitive Intelligence in World Models

🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do

MARL: Runtime Middleware That Reduces LLM Hallucination Without Fine-Tuning

Structural Problems in AI Benchmarking and the Case for a Unified Evaluation Framework

Do Bubbles Form When Tens of Thousands of AIs Simulate Capitalism?

FINAL Bench: The Real Bottleneck to AGI Is Self-Correction

Does Your LLM Know When It's About to Be Wrong?