MMComposition

university

https://hanghuacs.github.io/MMComposition/

AI & ML interests

None defined yet.

Recent Activity

zengziyun updated a dataset 4 days ago

MMComposition/MMComposition

zengziyun published a dataset 4 days ago

MMComposition/MMComposition

hhua2 authored a paper 4 days ago

MemoBench: Benchmarking World Modeling in Dynamically Changing Environments

View all activity

updated a dataset 4 days ago

MMComposition/MMComposition

Viewer • Updated 4 days ago • 4.12k • 737

published a dataset 4 days ago

MMComposition/MMComposition

Viewer • Updated 4 days ago • 4.12k • 737

authored a paper 4 days ago

MemoBench: Benchmarking World Modeling in Dynamically Changing Environments

Paper • 2606.27537 • Published 11 days ago • 6

submitted a paper to Daily Papers 19 days ago

Aligning Quantum Operators with Large Language Models

Paper • 2606.13811 • Published 25 days ago • 4

authored 2 papers about 1 month ago

AutoLab: Can Frontier Models Solve Long-Horizon Auto Research and Engineering Tasks?

Paper • 2606.05080 • Published Jun 3 • 30

Agent Skills Should Go Beyond Text: The Case for Visual Skills

Paper • 2606.01414 • Published May 31 • 10

submitted a paper to Daily Papers about 1 month ago

Agent Skills Should Go Beyond Text: The Case for Visual Skills

Paper • 2606.01414 • Published May 31 • 10

authored 2 papers about 2 months ago

Aurora: Unified Video Editing with a Tool-Using Agent

Paper • 2605.18748 • Published May 18 • 30

MementoGUI: Learning Agentic Multimodal Memory Control for Long-Horizon GUI Agents

Paper • 2605.18652 • Published May 18 • 8

submitted a paper to Daily Papers about 2 months ago

MementoGUI: Learning Agentic Multimodal Memory Control for Long-Horizon GUI Agents

Paper • 2605.18652 • Published May 18 • 8

authored a paper about 2 months ago

Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty?

Paper • 2605.12684 • Published May 12 • 11

authored a paper 3 months ago

ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding

Paper • 2603.27064 • Published Mar 28 • 29

submitted a paper to Daily Papers 3 months ago

ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding

Paper • 2603.27064 • Published Mar 28 • 29

authored a paper 5 months ago

SPARC: Separating Perception And Reasoning Circuits for Test-time Scaling of VLMs

Paper • 2602.06566 • Published Feb 6 • 3

submitted a paper to Daily Papers 5 months ago

SPARC: Separating Perception And Reasoning Circuits for Test-time Scaling of VLMs

Paper • 2602.06566 • Published Feb 6 • 3

authored a paper 7 months ago

MIRA: Multimodal Iterative Reasoning Agent for Image Editing

Paper • 2511.21087 • Published Nov 26, 2025 • 10

authored a paper 7 months ago

Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination

Paper • 2511.17490 • Published Nov 21, 2025 • 22

authored a paper 8 months ago

Latent Chain-of-Thought for Visual Reasoning

Paper • 2510.23925 • Published Oct 27, 2025 • 10

authored a paper 9 months ago

Building a Foundational Guardrail for General Agentic Systems via Synthetic Data

Paper • 2510.09781 • Published Oct 10, 2025 • 27

authored a paper 9 months ago

VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation

Paper • 2308.14710 • Published Aug 28, 2023