Flawed AI benchmarks put enterprise budgets at risk
A new academic review suggests AI benchmarks are flawed, potentially leading an enterprise to make high-stakes decisions on “misleading” data. Enterprise leaders are committing budgets of eight or nine figures to generative AI programmes. These procurement and development decisions often rely on public leaderboards and benchmarks to compare model capabilities. A large-scale study, ‘Measuring what […] The post Flawed AI benchmarks put enterprise budgets at risk appeared first on AI News.
<p>A new academic review suggests AI benchmarks are flawed, potentially leading an enterprise to make high-stakes decisions on “misleading” data. Enterprise leaders are committing budgets of eight or nine figures to generative AI programmes. These procurement and development decisions often rely on public leaderboards and benchmarks to compare model capabilities. A large-scale study, ‘Measuring what […]</p>
<p>The post <a href="https://www.artificialintelligence-news.com/news/flawed-ai-benchmarks-enterprise-budgets-at-risk/">Flawed AI benchmarks put enterprise budgets at risk</a> appeared first on <a href="https://www.artificialintelligence-news.com">AI News</a>.</p>