AI keeps getting more powerful, making it harder to judge how smart models actually are

  • August 1, 2025
  • Roubens Andy King

How do you judge an AI model when it’s already starting to perform better than human beings? That’s the challenge faced by researchers like Russell Wald, executive director of the Stanford Institute for Human-Centered Artificial Intelligence (HAI). 

“As of 2024, there are very few task categories where human ability surpasses AI, and even in these areas, the performance gap between AI and humans is shrinking rapidly,” Wald said last week in a presentation hosted at the Fortune Brainstorm AI Singapore conference. “AI is exceeding human capabilities and it’s becoming increasingly harder for us to benchmark.”

Each year, HAI releases the AI Index, which aims to provide a comprehensive, data-driven snapshot of where AI is today. At Fortune Brainstorm AI Singapore, Wald shared a few highlights from the 2025 edition of the AI Index, such as the increasing power of today’s models, the growing dominance of industry on the AI frontier, and how China is poised to overtake the U.S.


The following transcript has been lightly edited for conciseness and clarity.

I’m Russell Wald, the executive director of the Stanford Institute for Human-Centered Artificial Intelligence, or what we call “HAI”. 

We are Stanford University’s globally recognized interdisciplinary research institute at the forefront of shaping AI development for the public good. HAI was established in 2019 with the goal of advancing AI research, education, policy and practice. And, through our convening role and rigorous study of AI, we have become the trusted partner on AI governance for decision makers in industry, government and civil society. 

I’m going to talk about what we produce at HAI, which is the AI Index, an annual data-driven analysis of trends in AI that tracks research, development, deployment and the socio-economic impact of AI across academia, government and industry.

We see AI performance consistently improve year over year. We use Midjourney, a text-to-image generator, asking for a hyper-realistic image of Harry Potter. And from February 2022 to July 2024, we see rapidly increasing quality in these generated images. 

In 2022, the model produced cartoonish, inaccurate renderings of Harry Potter, but by 2024, it could create startlingly realistic depictions. We have gone from what mirrors a Picasso painting to an uncanny rendering of Daniel Radcliffe, the actor who played Harry Potter in the movies. 

Because of this consistent performance growth, we are increasingly challenged when it comes to benchmarking these models. As of 2024, there are very few task categories where human ability surpasses AI, and even in these areas, the performance gap between AI and humans is shrinking rapidly. From image recognition to competition-level mathematics to PhD-level science questions, AI is exceeding human capabilities and it’s becoming increasingly harder for us to benchmark.

From healthcare to transportation, AI is rapidly moving from the lab to our daily life. In 2023, the U.S. Food and Drug Administration approved 223 AI-enabled medical devices, up from just six in 2015. 

On the roads, self-driving cars are no longer experimental. For example, Waymo, which I regularly take while living in San Francisco, is one of the largest U.S. operators and provides over 150,000 autonomous rides each week, while Baidu’s affordable Apollo Go robotaxi now has a fleet that serves numerous cities across China.

Business use of AI increased significantly after stagnating from 2017 to 2023. The latest McKinsey report reveals that 78% of surveyed respondents say their organizations have begun to use AI in at least one business function, marking a significant increase from 55% in 2023. 

Driven by increasingly capable small models, the inference cost for a system performing at the level of [GPT-3.5] dropped over 280-fold between November 2022 and October 2024. Hardware costs have declined 30% annually, while energy efficiency has improved by 40% each year.
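Editor's illustration, not part of Wald's remarks: to get a feel for what a 280-fold drop means month to month, the short Python sketch below converts it into an implied steady monthly decline and a halving time. It assumes the drop happened at a constant exponential rate and that November 2022 to October 2024 spans 23 months. Both are simplifications made here for illustration, and the function names are hypothetical.

```python
import math


def monthly_decline(fold_drop: float, months: int) -> float:
    """Constant monthly fractional decline that compounds to `fold_drop`.

    Assumes a steady exponential decline, which is a simplification.
    """
    return 1 - fold_drop ** (-1 / months)


def halving_time_months(fold_drop: float, months: int) -> float:
    """Months needed for the cost to halve under the same assumption."""
    return months * math.log(2) / math.log(fold_drop)


# 280-fold drop, assumed to span 23 months (Nov 2022 to Oct 2024).
decline = monthly_decline(280, 23)
halving = halving_time_months(280, 23)
print(f"Implied decline: {decline:.1%} per month")
print(f"Implied halving time: {halving:.1f} months")
```

Under these assumptions the cost falls by a little over a fifth each month, which is the same as halving roughly every three months.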

Open-weight models are also closing the gap with closed models, reducing the performance [gap] from 8% to just 1.7% on some benchmarks in a single year. Together, these trends are rapidly lowering the barriers to advanced AI. 

However, even with inference and hardware costs going down, training costs remain out of reach for academia and most small players. Nearly 90% of notable AI models in 2024 came from industry, which is up from 60% in 2023. And while academia remains a top source of highly cited research, it does struggle at this point to stay as advanced at the frontier level. 

Model scale continues to grow rapidly. Training compute doubles every five months, datasets every eight, and power use annually. Yet performance gaps are shrinking. The score difference between the top and 10th ranked models fell from 11.9% to 5.4% in a year, and the top two models are now separated by just 0.7%. The frontier is increasingly competitive and increasingly crowded. 
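Editor's illustration, not part of Wald's remarks: doubling times are easier to compare once converted to yearly growth. The Python sketch below does that conversion for the three figures quoted above, assuming each quantity grows at a steady exponential rate. The function name and dictionary are hypothetical, introduced only for this example.

```python
def annual_growth_factor(doubling_months: float) -> float:
    """Factor by which a quantity grows over 12 months, given its doubling time."""
    return 2 ** (12 / doubling_months)


# Doubling times in months, as quoted in the talk.
doubling_times = {
    "training compute": 5,
    "dataset size": 8,
    "power use": 12,
}

for name, months in doubling_times.items():
    print(f"{name}: about {annual_growth_factor(months):.1f}x per year")
```

On these figures, training compute grows a little over fivefold per year, dataset size just under threefold, and power use twofold.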

In recent years, AI model performance at the frontier has converged, with multiple providers now offering highly capable models. This marks a shift from late 2022, when ChatGPT’s launch, widely seen as AI’s breakthrough into the public consciousness, coincided with a landscape dominated by just two players: OpenAI and Google.

One of the most important things to note is that the transformer model cost $930 for Google to train in 2017—and that is the T in GPT, the baseline level of architecture—and now today we’re at $200 million to train Gemini Ultra. 

Last year’s AI Index was among the first publications to highlight the lack of standard benchmarks for AI safety and responsibility evaluations. The index has also been analyzing global public opinion. If you are from a non-Western industrialized nation, you are more likely to view AI positively than not. China has an 83% positive view, Indonesia 80%, and Thailand 77%, whereas Canada is at 40%, the U.S. 39%, and the Netherlands 36%.

I’ll close with the geopolitical situation. The U.S. still maintains a lead in AI, followed closely by China. However, this gap is tightening. My intention is not to exacerbate the idea of an AI arms race between China and the U.S., but instead to highlight the different approaches between the most advanced frontier AI model developers. 

Over the last several years, the U.S. has relied on a few proprietary model providers. Meanwhile, China has deeply invested in its talent base, and more importantly, an open-source environment. If this trend continues at this rate, China would surpass the U.S. in terms of model performance by the time I appear next year.
