Chinese social networking company Weibo's AI division recently released its open source VibeThinker-1.5B —a 1.5 billion ...
Tech Xplore on MSN
AI evaluates texts without bias—until the source is revealed
Large language models (LLMs) are increasingly used not only to generate content but also to evaluate it. They are asked to ...
Last week, an unlisted private American company that recorded a $12bn (£9.6bn) loss in its last quarter asked the US ...
The benchmark currently comprises 2,278 questions spanning 11 Indian languages (Hindi, Hinglish, Gujarati, Punjabi, Kannada, ...
OpenAI has introduced IndQA, a new benchmark to evaluate how well AI models understand and reason about Indian languages and ...
Fireship on MSN
Is Elon’s Grok 3 the New King of AI?
In this video, we take a first look at Elon Musk's Grok 3, the latest deep-thinking AI model from xAI. How does it stack up ...
OpenAI, a San Francisco-based AI research and deployment firm that created ChatGPT, has introduced IndQA, a new benchmark for ...
OpenAI has launched IndQA, a new benchmark aimed at evaluating how effectively AI systems comprehend questions based on India ...
OpenAI’s IndQA comprises 2,278 culturally grounded, reasoning-heavy questions across 12 Indian languages and 10 cultural domains, developed in partnership with 261 domain experts.
OpenAI launches a new benchmark developed with 261 Indian experts to evaluate how effectively AI systems understand and ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results