MMToM-QA
API
a multimodal question-answering benchmark designed to evaluate AI models' cognitive ability to understand human beliefs and goals.
0
Very Poor0 reviews
Robot Dance Score™
0.0
Performance
25%
0.0
Reliability
20%
0.0
Ease of Use
15%
0.0
Value
15%
0.0
Trust
15%
0.0
Delight
10%
Reviews (0)
Write a Review →No reviews yet
Be the first to review →