GlobalSplat: Efficient Feed-Forward 3D Gaussian Splatting via Global Scene Tokens Paper • 2604.15284 • Published 9 days ago • 24
ScheMatiQ: From Research Question to Structured Data through Interactive Schema Discovery Paper • 2604.09237 • Published 15 days ago • 9
Advancing Speech Understanding in Speech-Aware Language Models with GRPO Paper • 2509.16990 • Published Sep 21, 2025 • 22
Alterbute: Editing Intrinsic Attributes of Objects in Images Paper • 2601.10714 • Published Jan 15, 2026 • 31
Beyond Transcription: Mechanistic Interpretability in ASR Paper • 2508.15882 • Published Aug 21, 2025 • 89
Auto-Regressive vs Flow-Matching: a Comparative Study of Modeling Paradigms for Text-to-Music Generation Paper • 2506.08570 • Published Jun 10, 2025 • 33
StressTest: Can YOUR Speech LM Handle the Stress? Paper • 2505.22765 • Published May 28, 2025 • 17
CHIMERA: A Knowledge Base of Idea Recombination in Scientific Literature Paper • 2505.20779 • Published May 27, 2025 • 15
Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning Paper • 2505.17813 • Published May 23, 2025 • 58
WHISTRESS: Enriching Transcriptions with Sentence Stress Detection Paper • 2505.19103 • Published May 25, 2025 • 13
Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published Feb 19, 2025 • 69