The China Mail - AI's blind spot: tools fail to detect their own fakes


AI's blind spot: tools fail to detect their own fakes / Photo: © AFP

When outraged Filipinos turned to an AI-powered chatbot to verify a viral photograph of a lawmaker embroiled in a corruption scandal, the tool failed to detect it was fabricated -- even though it had generated the image itself.


Internet users are increasingly turning to chatbots to verify images in real time, but the tools often fail, raising questions about their visual debunking capabilities at a time when major tech platforms are scaling back human fact-checking.

In many cases, the tools wrongly identify images as real even when the images were produced by the very same generative models, further muddying an online information landscape awash with AI-generated fakes.

Among them is a fabricated image circulating on social media of Elizaldy Co, a former Philippine lawmaker charged by prosecutors in a multibillion-dollar flood-control corruption scam that sparked massive protests in the disaster-prone country.

The image of Co, whose whereabouts have been unknown since the official probe began, appeared to show him in Portugal.

When online sleuths tracking him asked Google's new AI mode whether the image was real, it incorrectly said it was authentic.

AFP's fact-checkers tracked down its creator and determined that the image was generated using Google AI.

"These models are trained primarily on language patterns and lack the specialized visual understanding needed to accurately identify AI-generated or manipulated imagery," Alon Yamin, chief executive of AI content detection platform Copyleaks, told AFP.

"With AI chatbots, even when an image originates from a similar generative model, the chatbot often provides inconsistent or overly generalized assessments, making them unreliable for tasks like fact-checking or verifying authenticity."

Google did not respond to AFP’s request for comment.

- 'Distinguishable from reality' -

AFP found similar examples of AI tools failing to verify their own creations.

During last month's deadly protests over lucrative benefits for senior officials in Pakistan-administered Kashmir, social media users shared a fabricated image purportedly showing men marching with flags and torches.

An AFP analysis found it was created using Google's Gemini AI model.

But Gemini and Microsoft's Copilot falsely identified it as a genuine image of the protest.

"This inability to correctly identify AI images stems from the fact that they (AI models) are programmed only to mimic well," Rossine Fallorina, from the nonprofit Sigla Research Center, told AFP.

"In a sense, they can only generate things to resemble. They cannot ascertain whether the resemblance is actually distinguishable from reality."

Earlier this year, Columbia University's Tow Center for Digital Journalism tested the ability of seven AI chatbots -- including ChatGPT, Perplexity, Grok, and Gemini -- to verify 10 images from photojournalists of news events.

All seven models failed to correctly identify the provenance of the photos, the study said.

- 'Shocked' -

AFP tracked down the source of the Co image, which garnered over a million views across social media: a middle-aged web developer in the Philippines, who said he created it "for fun" using Nano Banana, Gemini's AI image generator.

"Sadly, a lot of people believed it," he told AFP, requesting anonymity to avoid a backlash.

"I edited my post -- and added 'AI generated' to stop the spread -- because I was shocked at how many shares it got."

Such cases show how AI-generated photos flooding social platforms can look virtually identical to real imagery.

The trend has fueled concerns, as surveys show online users are increasingly shifting from traditional search engines to AI tools for gathering and verifying information.

The shift comes as Meta announced earlier this year it was ending its third-party fact-checking program in the United States, turning over the task of debunking falsehoods to ordinary users under a model known as "Community Notes."

Human fact-checking has long been a flashpoint in hyperpolarized societies, where conservative advocates accuse professional fact-checkers of liberal bias, a charge they reject.

AFP currently works in 26 languages with Meta's fact-checking program, including in Asia, Latin America, and the European Union.

Researchers say AI models can be useful to professional fact-checkers, helping to quickly geolocate images and spot visual clues to establish authenticity. But they caution that the tools cannot replace the work of trained human fact-checkers.

"We can't rely on AI tools to combat AI in the long run," Fallorina said.

burs-ac/sla/sms

E.Lau--ThChM