AI Tools AI Agents Blog Pricing

Flawed AI benchmarks put enterprise budgets at risk

November 4, 2025

Ryan Daws

AI News

38 views

A new academic review suggests AI benchmarks are flawed, potentially leading an enterprise to make high-stakes decisions on “misleading” data. Enterprise leaders are committing budgets of eight or nine figures to generative AI programmes. These procurement and development decisions often rely on public leaderboards and benchmarks to compare model capabilities. A large-scale study, ‘Measuring what […] The post Flawed AI benchmarks put enterprise budgets at risk appeared first on AI News.

<p>A new academic review suggests AI benchmarks are flawed, potentially leading an enterprise to make high-stakes decisions on “misleading” data. Enterprise leaders are committing budgets of eight or nine figures to generative AI programmes. These procurement and development decisions often rely on public leaderboards and benchmarks to compare model capabilities. A large-scale study, ‘Measuring what […]</p>

<p>The post <a href="https://www.artificialintelligence-news.com/news/flawed-ai-benchmarks-enterprise-budgets-at-risk/">Flawed AI benchmarks put enterprise budgets at risk</a> appeared first on <a href="https://www.artificialintelligence-news.com">AI News</a>.</p>

Read Full Article

Similar Articles

ChatGPT users are about to get hit with targeted ads

**Exploring the Shift: Targeted Ads Coming to ChatGPT** As ChatGPT evolves, users are soon going to notice a significant change. OpenAI is introducing targeted ads to the platform. This new move aims to personalize the user experience, but what does that really mean for you? **Understanding Targeted Ads** Targeted ads are not just random advertisements. They are curated based on your interactions and preferences. OpenAI assures users that they will have some control over what kind of ads they see. This is an important aspect, as it means you can customize your experience. It’s all about making the ads more relevant and useful for you. **Why This Matters** For many users, the integration of ads can change how they interact with the platform. It’s essential to consider how these ads might enhance or disrupt your experience. With targeted ads, there's a chance you may discover new products or services that genuinely interest you. However, it can also lead to concerns about privacy and the data being collected. Understanding these aspects is vital as you navigate this new landscape. **What You Can

TechCrunchJan 16

The AI healthcare gold rush is here

**Exploring the Surge of AI in Healthcare** The intersection of artificial intelligence and healthcare is becoming a hotbed of innovation and investment. As companies race to develop tools that enhance patient care, understanding this trend is vital for anyone interested in the future of health. **Opportunities and Innovations** In recent weeks, we have seen major shifts in the healthcare landscape. Notable players like OpenAI and Anthropic are diving deep into this sector. OpenAI has acquired a health startup called Torch, while Anthropic is rolling out its new AI, Claude, specifically for healthcare applications. These moves highlight a growing confidence in AI's potential to transform how we approach medical challenges. **Why This Matters** This surge in activity means that healthcare is not just a field for traditional medicine anymore. It is becoming a playground for technology and innovation. With substantial investments flowing in, new products and services are emerging that aim to improve diagnostics, patient management, and treatment effectiveness. For instance, AI tools can analyze vast amounts of medical data quickly, helping doctors make more informed decisions. **What You Can Learn** For

TechCrunchJan 16