Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence Paper • 2604.18292 • Published 18 days ago • 83
Article How to Ground a Korean AI Agent in Real Demographics with Synthetic Personas 17 days ago • 25
The PokeAgent Challenge: Competitive and Long-Context Learning at Scale Paper • 2603.15563 • Published Mar 16 • 10
WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation Paper • 2603.16871 • Published Mar 17 • 60
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders Paper • 2603.06569 • Published Mar 6 • 119
DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos Paper • 2602.06949 • Published Feb 6 • 37
Judging What We Cannot Solve: A Consequence-Based Approach for Oracle-Free Evaluation of Research-Level Math Paper • 2602.06291 • Published Feb 6 • 24
Physical AI Collection Collection of open, commercial-grade datasets for physical AI developers • 35 items • Updated about 4 hours ago • 148
What Users Leave Unsaid: Under-Specified Queries Limit Vision-Language Models Paper • 2601.06165 • Published Jan 7 • 16
KORMo: Korean Open Reasoning Model for Everyone Paper • 2510.09426 • Published Oct 10, 2025 • 87
Exploring Fine-Tuning of Large Audio Language Models for Spoken Language Understanding under Limited Speech data Paper • 2509.15389 • Published Sep 18, 2025 • 3
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI Paper • 2510.05684 • Published Oct 7, 2025 • 146
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6, 2025 • 514
Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs Paper • 2507.07990 • Published Jul 10, 2025 • 45
Don't Look Only Once: Towards Multimodal Interactive Reasoning with Selective Visual Revisitation Paper • 2505.18842 • Published May 24, 2025 • 36