{"id":930,"date":"2025-03-14T02:18:37","date_gmt":"2025-03-14T02:18:37","guid":{"rendered":"https:\/\/www.batteryone.co\/blog\/?p=930"},"modified":"2025-03-14T02:18:37","modified_gmt":"2025-03-14T02:18:37","slug":"ai-search-engines-provide-incorrect-answers-over-60-now","status":"publish","type":"post","link":"https:\/\/www.batteryone.co\/blog\/archives\/930","title":{"rendered":"AI search engines provide incorrect answers over 60% of the time"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\">A recent study by Columbia Journalism Review\u2019s Tow Center for Digital Journalism reveals serious accuracy issues with generative AI models used for news searches. The research tested eight AI-powered search tools with live search capabilities and found that over 60% of responses about news content were incorrect.<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img decoding=\"async\" src=\"https:\/\/cdn.builtin.com\/cdn-cgi\/image\/f=auto,fit=cover,w=1200,h=635,q=80\/https:\/\/builtin.com\/sites\/www.builtin.com\/files\/2024-06\/AI%20search%20engine.jpg\" alt=\"\"\/><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">The report, written by researchers Klaudia Ja\u017awi\u0144ska and Aisvarya Chandrasekar, highlights that about one in four Americans now use AI models instead of traditional search engines. Given the high error rate uncovered, this raises serious concerns about the reliability of AI-driven searches.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The error rates varied widely among the platforms. Perplexity returned incorrect information 37% of the time, while ChatGPT Search was wrong 67% of the time (134 out of 200 queries). 
Grok 3 had the highest error rate at 94%.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">For the study, the researchers provided AI models with direct excerpts from actual news articles and asked them to identify the headline, publisher, publication date, and URL. They ran 1,600 queries in total, 200 for each of the eight AI search tools.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">A major issue identified was that the AI models often did not refuse to answer when they lacked reliable information. Instead, they frequently offered plausible but incorrect or speculative answers, a behavior consistent across all models tested.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Interestingly, premium versions of some AI tools performed worse in certain aspects. Perplexity Pro ($20\/month) and Grok 3\u2019s premium service ($40\/month) were more likely to provide incorrect answers than their free versions. Although the premium models answered more prompts correctly, their reluctance to decline questions they could not answer reliably increased overall error rates.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The study also uncovered problems related to citations and publisher control. Some AI tools appeared to ignore the Robots Exclusion Protocol, which publishers use to tell crawlers which content they may not access. 
For instance, Perplexity\u2019s free version correctly identified all 10 excerpts from paywalled National Geographic content, even though the publisher had blocked Perplexity&#8217;s crawlers.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Moreover, when these AI tools cited sources, they often directed users to syndicated versions of articles on platforms like Yahoo News, rather than linking back to the original publisher&#8217;s site\u2014even when there were formal licensing agreements in place.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Another significant problem was URL fabrication. More than half of citations from Google\u2019s Gemini and Grok 3 led to fabricated or broken URLs. Of 200 citations from Grok 3, 154 resulted in error pages.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">These issues create major challenges for publishers, who face difficult decisions: blocking AI crawlers could result in no attribution, while allowing access could lead to widespread reuse without driving traffic to their own websites.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Mark Howard, COO of&nbsp;<em>Time<\/em>&nbsp;magazine, expressed concern about ensuring transparency and control over how Time\u2019s content appears in AI-generated searches. 
Despite the challenges, he sees potential for improvement, stating, \u201cToday is the worst that the product will ever be,\u201d pointing to ongoing investments and engineering aimed at refining these tools.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">However, Howard also criticized users for not being skeptical of free AI tools, saying, &#8220;If anybody as a consumer is right now believing that any of these free products are going to be 100 percent accurate, then shame on them.&#8221;<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Both OpenAI and Microsoft acknowledged the study\u2019s findings but did not specifically address the issues raised. OpenAI emphasized its commitment to supporting publishers by driving traffic through summaries, quotes, and clear attribution. Microsoft stated that it follows the Robots Exclusion Protocol and publisher directives.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">This report builds on findings from a previous Tow Center study in November 2024, which identified similar accuracy issues with ChatGPT\u2019s handling of news content. For more details, check out Columbia Journalism Review&#8217;s website.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>A recent study by Columbia Journalism Review\u2019s Tow Center for Digital Journalism reveals serious accuracy issues with generative AI models used for news searches. The research tested eight AI-powered search tools with live search capabilities and found that over 60% of responses about news content were incorrect. 
The report, written by researchers Klaudia Ja\u017awi\u0144ska and [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[68],"class_list":["post-930","post","type-post","status-publish","format-standard","hentry","category-news","tag-ai"],"_links":{"self":[{"href":"https:\/\/www.batteryone.co\/blog\/wp-json\/wp\/v2\/posts\/930","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.batteryone.co\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.batteryone.co\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.batteryone.co\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.batteryone.co\/blog\/wp-json\/wp\/v2\/comments?post=930"}],"version-history":[{"count":1,"href":"https:\/\/www.batteryone.co\/blog\/wp-json\/wp\/v2\/posts\/930\/revisions"}],"predecessor-version":[{"id":931,"href":"https:\/\/www.batteryone.co\/blog\/wp-json\/wp\/v2\/posts\/930\/revisions\/931"}],"wp:attachment":[{"href":"https:\/\/www.batteryone.co\/blog\/wp-json\/wp\/v2\/media?parent=930"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.batteryone.co\/blog\/wp-json\/wp\/v2\/categories?post=930"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.batteryone.co\/blog\/wp-json\/wp\/v2\/tags?post=930"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}