Meta, OpenAI, Anthropic and Cohere A.I. models all make stuff up — here's which is worst

Home » News, Insights & Trends » Business » Meta, OpenAI, Anthropic and Cohere A.I. models all make stuff up — here’s which is worst

If the tech industry’s top AI models had superlatives, Microsoft-backed OpenAI’s GPT-4 would be best at math, Meta’s Llama 2 would be most middle of the road, Anthropic’s Claude 2 would be best at knowing its limits and Cohere AI would receive the title of most hallucinations — and most confident wrong answers.That’s all according to a Thursday report from researchers at Arthur AI, a machine learning monitoring platform.The research comes at a time when misinformation stemming from artificial intelligence systems is more hotly debated than ever, amid a boom in generative AI ahead of the 2024 U.S. presidential election.It’s the first report “to take a comprehensive look at rates of hallucination, rather than just sort of … provide a single number that talks about where they are on an LLM leaderboard,” Adam Wenchel, co-founder and CEO of Arthur, told CNBC.AI hallucinations occur when large language models, or LLMs, fabricate information entirely, behaving as if they are spouting facts. One example: In June, news broke that ChatGPT cited “bogus” cases in a New York federal court filing, and the New York attorneys involved may face sanctions. In one experiment, the Arthur AI researchers tested the AI models in categories such as combinatorial mathematics, U.S. presidents and Moroccan political leaders, asking questions “designed to contain a key ingredient that gets LLMs to blunder: they demand multiple steps of reasoning about in …

"The Power of AI in Business and Entrepreneurship: Unlocking Opportunities and Driving Success"

"The Power of AI: Revolutionizing Business and Empowering Entrepreneurs"

Margaritaville Aims to Hang On After Jimmy Buffett’s Death

Pork Industry Grapples With Whiplash of Shifting Regulations

My Account

Welcome to 1BusinessWorld_®

Meta, OpenAI, Anthropic and Cohere A.I. models all make stuff up — here’s which is worst

Related posts:

AI Beyond Boundaries: Siam Connor Unveils the Future at 1ArtificialIntelligence

Andreas Schweitzer at 1FinanceWorld: Advancing Credit-Insured Trade Finance as a Strategic Asset Class

Strategically Aligning Artificial and Human Intelligence: A Blueprint for Collaborative Innovation

Transforming Tomorrow: Leading the Charge at the 2024 Sustainability & ESG Leadership Summit

Catalyzing Innovation: How New York Energy Week 2024 Is Shaping the Future of Global Energy

Strategic Communication Mastery: Seth Farbman’s Blueprint for Business Growth and Success

Anti-Money Laundering (AML) regulations is a critical mandate for businesses operating within the gem, jewelry, and pawn sectors

Pioneering the Future: Martin Alexander Gershon’s Vision for Healthcare AI Commercialization

Strategic Approaches to Maximizing Global Hotel Profitability with Laura Resco

Driving the Future: Unleashing the Potential of AI in Autonomous Vehicle Technology

Browse
Business Central

Accelerate growth with 1BusinessWorld's Global Business Profile

My Account

Welcome to 1BusinessWorld®

Related posts:

BrowseBusiness Central

Accelerate growth with 1BusinessWorld's Global Business Profile

Welcome to 1BusinessWorld_®

Browse
Business Central