MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Jan 1, 2025
Yi-Fan Zhang, Huanyu Zhang, Haochen Tian, Chaoyou Fu, Shuangqing Zhang, Junfei Wu, Feng Li, Kun Wang, Qingsong Wen, Zhang Zhang, Others
Type
Publication
The Thirteenth International Conference on Learning Representations