Chinese social networking company Weibo's AI division recently released its open-source VibeThinker-1.5B, a 1.5 billion ...
Large language models (LLMs) are increasingly used not only to generate content but also to evaluate it. They are asked to ...
Last week, an unlisted private American company that recorded a $12bn (£9.6bn) loss in its last quarter asked the US ...
The benchmark currently comprises 2,278 questions spanning 11 Indian languages (Hindi, Hinglish, Gujarati, Punjabi, Kannada, ...
OpenAI has introduced IndQA, a new benchmark to evaluate how well AI models understand and reason about Indian languages and ...
Some leading AIs sabotage shutdown instructions in controlled tests, echoing concerns from experts on future safety risks, ...
In this video, we take a first look at Elon Musk's Grok 3, the latest deep-thinking AI model from xAI. How does it stack up ...
OpenAI, a San Francisco-based AI research and deployment firm that created ChatGPT, has introduced IndQA, a new benchmark for ...
OpenAI has launched IndQA, a new benchmark aimed at evaluating how effectively AI systems comprehend questions based on India ...
OpenAI launches a new benchmark developed with 261 Indian experts to evaluate how effectively AI systems understand and ...