Jongwon Lim

Jongwondd

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

Where Should Diffusion Enter a Language Model? Geometry-Guided Hidden-State Replacement

submitted a paper 16 days ago

KL for a KL: On-Policy Distillation with Control Variate Baseline

commented on a paper 16 days ago

KL for a KL: On-Policy Distillation with Control Variate Baseline

Organizations

upvoted a paper 12 days ago

Where Should Diffusion Enter a Language Model? Geometry-Guided Hidden-State Replacement

Paper • 2605.14368 • Published 17 days ago • 14

submitted a paper to Daily Papers 16 days ago

KL for a KL: On-Policy Distillation with Control Variate Baseline

Paper • 2605.07865 • Published 23 days ago • 20

commented on a paper 16 days ago

KL for a KL: On-Policy Distillation with Control Variate Baseline

Paper • 2605.07865 • Published 23 days ago • 20

submitted a paper to Daily Papers 17 days ago

Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States

Paper • 2605.07579 • Published 23 days ago • 16

authored 3 papers 17 days ago

DAHL: Domain-specific Automated Hallucination Evaluation of Long-Form Text through a Benchmark Dataset in Biomedicine

Paper • 2411.09255 • Published Nov 14, 2024

Learning to Retrieve User History and Generate User Profiles for Personalized Persuasiveness Prediction

Paper • 2601.05654 • Published Apr 19

Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States

Paper • 2605.07579 • Published 23 days ago • 16

upvoted 2 papers 19 days ago

KL for a KL: On-Policy Distillation with Control Variate Baseline

Paper • 2605.07865 • Published 23 days ago • 20

Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States

Paper • 2605.07579 • Published 23 days ago • 16

updated a model 29 days ago

Jongwondd/GRESO_step_90

4B • Updated 29 days ago • 14

published 2 models 29 days ago

Jongwondd/GRESO_step_90

4B • Updated 29 days ago • 14

Jongwondd/Qwen3-4B_GRESO_batch_256

Updated 29 days ago

updated a model about 1 month ago

Jongwondd/convai_hw1

Updated Apr 25

published a model about 1 month ago

Jongwondd/convai_hw1

Updated Apr 25

updated a dataset about 1 month ago

Jongwondd/convai_hw1

Updated Apr 25 • 16

published a dataset about 1 month ago

Jongwondd/convai_hw1

Updated Apr 25 • 16

upvoted a paper about 1 month ago

ThinkBrake: Efficient Reasoning via Log-Probability Margin Guided Decoding

Paper • 2510.00546 • Published Apr 20 • 14
