OpenAI's new IndQA benchmark, which evaluates AI reasoning on everyday life and culture, is a massive dataset comprising ...
OpenAI’s IndQA comprises 2,278 culturally grounded, reasoning-heavy questions across 12 Indian languages and 10 cultural domains, developed in partnership with 261 domain experts.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results