DeepSeek's Chatbot Falls Short: A 17% Accuracy Rate Sparks Concerns in AI Race
Chinese AI startup DeepSeek's chatbot scored a mere 17% accuracy in NewsGuard's audit, ranking 10th out of 11 against its Western competitors. The chatbot repeated false claims 30% of the time, highlighting technology gaps and raising questions about its claimed cost-effectiveness versus mainstream models such as OpenAI's.
Chinese AI startup DeepSeek's chatbot has achieved disappointing results in a NewsGuard audit, managing only 17% accuracy in news delivery. The report, which placed it tenth among eleven major AI models, revealed that the chatbot repeated false claims 30% of the time and provided vague answers in 53% of responses.
Taken together, that amounts to a fail rate of 83%, compared with an average of 62% among its Western counterparts, raising questions about DeepSeek's claim of parity with OpenAI's models at significantly reduced costs. Within days of its launch, the chatbot became the most downloaded app on Apple's App Store, sparking discussions about the United States' competitive edge in AI development.
DeepSeek, which did not respond to requests for comment, was evaluated by NewsGuard using 300 prompts, including 30 based on false claims. Notably, in three instances the chatbot echoed the Chinese government's stance on issues even when they were unrelated to the prompts, drawing further scrutiny of its responses.
(With inputs from agencies.)