OSU NLP Group

university

https://twitter.com/osunlp

osunlp

OSU-NLP-Group

Activity Feed

AI & ML interests

Natural language processing, language models, language agents

Recent Activity

nnnyt new activity about 5 hours ago

osunlp/SkillHarm:Add task_categories to metadata

nnnyt submitted a paper 1 day ago

SkillHarm: Lifecycle-Aware Skill-Based Attacks via Automated Construction

Lzy01241010 updated a Space 1 day ago

osunlp/QUEST

View all activity

Papers

AgentCL: Toward Rigorous Evaluation of Continual Learning in Language Agents

SkillHarm: Lifecycle-Aware Skill-Based Attacks via Automated Construction

View all Papers

osunlp 's collections 12

QUEST

Running

Agents

8

QUEST

🔎

8

Generate comprehensive answers via multi‑source web research
QUEST: Training Frontier Deep Research Agents with Fully Synthetic Tasks

Paper • 2605.24218 • Published 21 days ago • 42
osunlp/QUEST-35B-RL

Text Generation • 35B • Updated 16 days ago • 395 • 4
osunlp/QUEST-RL-Data

Viewer • Updated 16 days ago • 1.13k • 1.2k

ACuRL

Autonomous Continual Learning of Computer-Use Agents for Environment Adaptation

osunlp/ACuRL_UI-TARS-1.5-7B_libreoffice_impress

8B • Updated Feb 10 • 4
osunlp/ACuRL_UI-TARS-1.5-7B_libreoffice_calc

8B • Updated Feb 10 • 5
osunlp/ACuRL_UI-TARS-1.5-7B_libreoffice_writer

8B • Updated Feb 10 • 2
osunlp/ACuRL_UI-TARS-1.5-7B_thunderbird

8B • Updated Feb 10 • 1

GUI-Drag

Beyond Clicking: A step towards generalist grounding via text dragging

osunlp/GUI-Drag-3B

4B • Updated Oct 16, 2025 • 5 • 1
osunlp/GUI-Drag-7B

Image-Text-to-Text • 8B • Updated Jan 19 • 9 • 2
osunlp/GUI-Drag-dataset

Preview • Updated Mar 6 • 81 • 4
Beyond Clicking:A Step Towards Generalist GUI Grounding via Text Dragging

Paper • 2601.06031 • Published Nov 7, 2025

WebDreamer

Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents

osunlp/Dreamer-V1-Data

Viewer • Updated Apr 9, 2025 • 3.12M • 1.08k • 4
osunlp/Dreamer-7B

Image-Text-to-Text • 8B • Updated Apr 9, 2025 • 230 • 5
osunlp/Dreamer-72B

Image-Text-to-Text • 73B • Updated Apr 9, 2025 • 5 • 2
osunlp/Dreamer-7B-Shopping

Image-Text-to-Text • 8B • Updated Apr 9, 2025 • 6 • 1

SAEV

SAEs for vision models like CLIP or DINOv2

osunlp/SAE_CLIP_24K_ViT-B-16_IN1K

Updated Feb 11, 2025 • 8 • 2
osunlp/SAE_DINOv2_24K_ViT-B-14_IN1K

Updated Feb 11, 2025 • 5 • 2
osunlp/SAE_BioCLIP_24K_ViT-B-16_iNat21

Updated Apr 23, 2025 • 5 • 1
osunlp/SAE_DINOv3_ViT-S-16_IN1K

Updated Feb 26

AmpleGCG

Generative models to produce GCG-like adversarial suffixes

osunlp/AmpleGCG-llama2-sourced-llama2-7b-chat

Text Generation • 7B • Updated Nov 3, 2024 • 81 • 4
osunlp/AmpleGCG-llama2-sourced-vicuna-7b

Text Generation • 7B • Updated Nov 3, 2024 • 1
osunlp/AmpleGCG-llama2-sourced-vicuna-7b13b-guanaco-7b13b

Text Generation • 7B • Updated Nov 3, 2024 • 5 • 1
osunlp/AmpleGCG-plus-llama2-sourced-llama2-7b-chat

Text Generation • 7B • Updated Nov 3, 2024 • 1 • 2

D3-Gym

Constructing Verifiable Environments for Data-Driven Discovery

osunlp/D3-Gym

Viewer • Updated May 5 • 565 • 56
osunlp/D3-Gym-Trajectories

Viewer • Updated May 5 • 6.37k • 179
osunlp/D3-Gym-8B-rft-self

8B • Updated Apr 29 • 2
osunlp/D3-Gym-4B-rft-self

4B • Updated Apr 30 • 4

AutoElicit

When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents

osunlp/AutoElicit-Seed

Viewer • Updated Feb 10 • 361 • 35 • 1
osunlp/AutoElicit-Bench

Viewer • Updated Feb 10 • 117 • 151 • 1
osunlp/AutoElicit-Exec

Viewer • Updated Feb 10 • 132 • 72 • 1
When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents

Paper • 2602.08235 • Published Feb 9 • 1

Mind2Web 2

Evaluating Agentic Search with Agent-as-a-Judge

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Paper • 2506.21506 • Published Jun 26, 2025 • 52
osunlp/Mind2Web-2

Viewer • Updated Dec 14, 2025 • 130 • 127 • 16

Mind2Web

Towards Generalist Agents for the Web (NeurIPS'23 Spotlight)

osunlp/Mind2Web

Viewer • Updated Oct 19, 2025 • 253 • 7.94k • 126
osunlp/Multimodal-Mind2Web

Viewer • Updated Jun 5, 2024 • 14.2k • 5.8k • 95
osunlp/Online-Mind2Web

Viewer • Updated 19 days ago • 300 • 1.81k • 25
Mind2Web: Towards a Generalist Agent for the Web

Paper • 2306.06070 • Published Jun 9, 2023 • 21

UGround

Navigating GUIs as Humans Do: Universal Visual Grounding for GUI Agents (ICLR'25 Oral)

osunlp/UGround-V1-Data

Viewer • Updated May 2, 2025 • 1.23M • 690 • 24
osunlp/UGround-V1-Data-Box

Viewer • Updated May 2, 2025 • 488k • 39 • 10
osunlp/UGround-V1-2B

Image-Text-to-Text • 2B • Updated Feb 16, 2025 • 459 • 10
osunlp/UGround-V1-7B

Image-Text-to-Text • 8B • Updated Apr 16, 2025 • 649 • 20

LlaSMol

LLMs tuned on the SMolInstruct dataset for chemistry tasks.

osunlp/LlaSMol-Llama2-7B

Updated May 6, 2024 • 1
osunlp/LlaSMol-Galactica-6.7B

Updated May 6, 2024 • 1
osunlp/LlaSMol-CodeLlama-7B

Updated May 6, 2024 • 1
osunlp/LlaSMol-Mistral-7B

Updated May 6, 2024 • 20

QUEST

Running

Agents

8

QUEST

🔎

8

Generate comprehensive answers via multi‑source web research
QUEST: Training Frontier Deep Research Agents with Fully Synthetic Tasks

Paper • 2605.24218 • Published 21 days ago • 42
osunlp/QUEST-35B-RL

Text Generation • 35B • Updated 16 days ago • 395 • 4
osunlp/QUEST-RL-Data

Viewer • Updated 16 days ago • 1.13k • 1.2k

D3-Gym

Constructing Verifiable Environments for Data-Driven Discovery

osunlp/D3-Gym

Viewer • Updated May 5 • 565 • 56
osunlp/D3-Gym-Trajectories

Viewer • Updated May 5 • 6.37k • 179
osunlp/D3-Gym-8B-rft-self

8B • Updated Apr 29 • 2
osunlp/D3-Gym-4B-rft-self

4B • Updated Apr 30 • 4

ACuRL

Autonomous Continual Learning of Computer-Use Agents for Environment Adaptation

osunlp/ACuRL_UI-TARS-1.5-7B_libreoffice_impress

8B • Updated Feb 10 • 4
osunlp/ACuRL_UI-TARS-1.5-7B_libreoffice_calc

8B • Updated Feb 10 • 5
osunlp/ACuRL_UI-TARS-1.5-7B_libreoffice_writer

8B • Updated Feb 10 • 2
osunlp/ACuRL_UI-TARS-1.5-7B_thunderbird

8B • Updated Feb 10 • 1

AutoElicit

When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents

osunlp/AutoElicit-Seed

Viewer • Updated Feb 10 • 361 • 35 • 1
osunlp/AutoElicit-Bench

Viewer • Updated Feb 10 • 117 • 151 • 1
osunlp/AutoElicit-Exec

Viewer • Updated Feb 10 • 132 • 72 • 1
When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents

Paper • 2602.08235 • Published Feb 9 • 1

GUI-Drag

Beyond Clicking: A step towards generalist grounding via text dragging

osunlp/GUI-Drag-3B

4B • Updated Oct 16, 2025 • 5 • 1
osunlp/GUI-Drag-7B

Image-Text-to-Text • 8B • Updated Jan 19 • 9 • 2
osunlp/GUI-Drag-dataset

Preview • Updated Mar 6 • 81 • 4
Beyond Clicking:A Step Towards Generalist GUI Grounding via Text Dragging

Paper • 2601.06031 • Published Nov 7, 2025

Mind2Web 2

Evaluating Agentic Search with Agent-as-a-Judge

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Paper • 2506.21506 • Published Jun 26, 2025 • 52
osunlp/Mind2Web-2

Viewer • Updated Dec 14, 2025 • 130 • 127 • 16

WebDreamer

Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents

osunlp/Dreamer-V1-Data

Viewer • Updated Apr 9, 2025 • 3.12M • 1.08k • 4
osunlp/Dreamer-7B

Image-Text-to-Text • 8B • Updated Apr 9, 2025 • 230 • 5
osunlp/Dreamer-72B

Image-Text-to-Text • 73B • Updated Apr 9, 2025 • 5 • 2
osunlp/Dreamer-7B-Shopping

Image-Text-to-Text • 8B • Updated Apr 9, 2025 • 6 • 1

Mind2Web

Towards Generalist Agents for the Web (NeurIPS'23 Spotlight)

osunlp/Mind2Web

Viewer • Updated Oct 19, 2025 • 253 • 7.94k • 126
osunlp/Multimodal-Mind2Web

Viewer • Updated Jun 5, 2024 • 14.2k • 5.8k • 95
osunlp/Online-Mind2Web

Viewer • Updated 19 days ago • 300 • 1.81k • 25
Mind2Web: Towards a Generalist Agent for the Web

Paper • 2306.06070 • Published Jun 9, 2023 • 21

SAEV

SAEs for vision models like CLIP or DINOv2

osunlp/SAE_CLIP_24K_ViT-B-16_IN1K

Updated Feb 11, 2025 • 8 • 2
osunlp/SAE_DINOv2_24K_ViT-B-14_IN1K

Updated Feb 11, 2025 • 5 • 2
osunlp/SAE_BioCLIP_24K_ViT-B-16_iNat21

Updated Apr 23, 2025 • 5 • 1
osunlp/SAE_DINOv3_ViT-S-16_IN1K

Updated Feb 26

UGround

Navigating GUIs as Humans Do: Universal Visual Grounding for GUI Agents (ICLR'25 Oral)

osunlp/UGround-V1-Data

Viewer • Updated May 2, 2025 • 1.23M • 690 • 24
osunlp/UGround-V1-Data-Box

Viewer • Updated May 2, 2025 • 488k • 39 • 10
osunlp/UGround-V1-2B

Image-Text-to-Text • 2B • Updated Feb 16, 2025 • 459 • 10
osunlp/UGround-V1-7B

Image-Text-to-Text • 8B • Updated Apr 16, 2025 • 649 • 20

AmpleGCG

Generative models to produce GCG-like adversarial suffixes

osunlp/AmpleGCG-llama2-sourced-llama2-7b-chat

Text Generation • 7B • Updated Nov 3, 2024 • 81 • 4
osunlp/AmpleGCG-llama2-sourced-vicuna-7b

Text Generation • 7B • Updated Nov 3, 2024 • 1
osunlp/AmpleGCG-llama2-sourced-vicuna-7b13b-guanaco-7b13b

Text Generation • 7B • Updated Nov 3, 2024 • 5 • 1
osunlp/AmpleGCG-plus-llama2-sourced-llama2-7b-chat

Text Generation • 7B • Updated Nov 3, 2024 • 1 • 2

LlaSMol

LLMs tuned on the SMolInstruct dataset for chemistry tasks.

osunlp/LlaSMol-Llama2-7B

Updated May 6, 2024 • 1
osunlp/LlaSMol-Galactica-6.7B

Updated May 6, 2024 • 1
osunlp/LlaSMol-CodeLlama-7B

Updated May 6, 2024 • 1
osunlp/LlaSMol-Mistral-7B

Updated May 6, 2024 • 20

AI & ML interests

Recent Activity

Papers

Team members 29

osunlp 's collections 12

QUEST

QUEST