The China Mail - Grok shows 'flaws' in fact-checking Israel-Iran war: study

USD -
AED 3.673101
AFN 63.505345
ALL 81.708441
AMD 368.210155
ANG 1.790403
AOA 917.517817
ARS 1436.776103
AUD 1.413887
AWG 1.8
AZN 1.698937
BAM 1.685177
BBD 2.015096
BDT 122.817901
BGN 1.69088
BHD 0.377095
BIF 2991
BMD 1
BND 1.281762
BOB 6.938712
BRL 5.099903
BSD 1.000526
BTN 94.560525
BWP 13.406112
BYN 2.76997
BYR 19600
BZD 2.012252
CAD 1.39941
CDF 2320.999973
CHF 0.793035
CLF 0.022503
CLP 885.670416
CNY 6.75745
CNH 6.75723
COP 3450.08
CRC 455.716489
CUC 1
CUP 26.5
CVE 95.00853
CZK 20.80395
DJF 177.720348
DKK 6.437795
DOP 58.694285
DZD 133.002981
EGP 50.126095
ERN 15
ETB 161.303992
EUR 0.861198
FJD 2.21195
FKP 0.744874
GBP 0.744645
GEL 2.645001
GGP 0.744874
GHS 11.255482
GIP 0.744874
GMD 72.503383
GNF 8763.721587
GTQ 7.626359
GYD 209.290102
HKD 7.833302
HNL 26.754265
HRK 6.488706
HTG 130.666299
HUF 300.775499
IDR 17741.6
ILS 2.915702
IMP 0.744874
INR 94.489649
IQD 1310.701361
IRR 1375752.50281
ISK 124.360019
JEP 0.744874
JMD 158.238482
JOD 0.70903
JPY 160.439499
KES 129.420123
KGS 87.450262
KHR 4017.784058
KMF 425.000171
KPW 900.00035
KRW 1509.215034
KWD 0.30814
KYD 0.8338
KZT 487.920041
LAK 22016.388216
LBP 89596.067517
LKR 335.185855
LRD 182.097037
LSL 16.148994
LTL 2.95274
LVL 0.60489
LYD 6.374399
MAD 9.250461
MDL 17.459223
MGA 4157.368235
MKD 53.069114
MMK 2099.401411
MNT 3576.563972
MOP 8.072446
MRU 39.93262
MUR 47.240348
MVR 15.450203
MWK 1734.893459
MXN 17.21198
MYR 4.068602
MZN 63.90009
NAD 16.148855
NGN 1357.570315
NIO 36.629735
NOK 9.479955
NPR 151.295881
NZD 1.71305
OMR 0.384508
PAB 1.000526
PEN 3.408382
PGK 4.383153
PHP 60.268495
PKR 278.370642
PLN 3.64972
PYG 6105.515298
QAR 3.657654
RON 4.502801
RSD 101.093034
RUB 72.50098
RWF 1483.728104
SAR 3.752094
SBD 8.065041
SCR 14.70031
SDG 600.500752
SEK 9.36225
SGD 1.282045
SHP 0.746601
SLE 24.749767
SLL 20969.503664
SOS 571.773221
SRD 37.332017
STD 20697.981008
STN 21.109953
SVC 8.754244
SYP 110.532098
SZL 16.145959
THB 32.486006
TJS 9.274765
TMT 3.5
TND 2.928683
TOP 2.40776
TRY 46.292899
TTD 6.796543
TWD 31.512496
TZS 2620.003039
UAH 44.808889
UGX 3701.565583
UYU 40.393596
UZS 12016.40559
VES 591.77565
VND 26300
VUV 118.866954
WST 2.741216
XAF 565.192704
XAG 0.014237
XAU 0.00023
XCD 2.70255
XCG 1.803205
XDR 0.703697
XOF 565.197574
XPF 102.758965
YER 238.596617
ZAR 16.18575
ZMK 9001.199446
ZMW 17.684109
ZWL 321.999592
  • RBGPF

    2.1500

    62.87

    +3.42%

  • CMSC

    0.0250

    22.365

    +0.11%

  • GSK

    -0.0100

    52.22

    -0.02%

  • BTI

    0.3200

    61.38

    +0.52%

  • RELX

    -0.0400

    32.8

    -0.12%

  • RIO

    -0.1500

    105.74

    -0.14%

  • RYCEF

    0.4300

    18.63

    +2.31%

  • NGG

    0.7100

    82.28

    +0.86%

  • AZN

    1.4400

    178.71

    +0.81%

  • CMSD

    -0.0600

    22.26

    -0.27%

  • VOD

    -0.1100

    14.89

    -0.74%

  • BP

    -0.4400

    41.15

    -1.07%

  • JRI

    0.0300

    12.81

    +0.23%

  • BCC

    -0.0300

    71.56

    -0.04%

  • BCE

    -0.2200

    23.82

    -0.92%

Grok shows 'flaws' in fact-checking Israel-Iran war: study
Grok shows 'flaws' in fact-checking Israel-Iran war: study / Photo: © AFP

Grok shows 'flaws' in fact-checking Israel-Iran war: study

Elon Musk's AI chatbot Grok produced inaccurate and contradictory responses when users sought to fact-check the Israel-Iran conflict, a study said Tuesday, raising fresh doubts about its reliability as a debunking tool.

Text size:

With tech platforms reducing their reliance on human fact-checkers, users are increasingly utilizing AI-powered chatbots -- including xAI's Grok -- in search of reliable information, but their responses are often themselves prone to misinformation.

"The investigation into Grok's performance during the first days of the Israel-Iran conflict exposes significant flaws and limitations in the AI chatbot's ability to provide accurate, reliable, and consistent information during times of crisis," said the study from the Digital Forensic Research Lab (DFRLab) of the Atlantic Council, an American think tank.

"Grok demonstrated that it struggles with verifying already-confirmed facts, analyzing fake visuals, and avoiding unsubstantiated claims."

The DFRLab analyzed around 130,000 posts in various languages on the platform X, where the AI assistant is built in, to find that Grok was "struggling to authenticate AI-generated media."

Following Iran's retaliatory strikes on Israel, Grok offered vastly different responses to similar prompts about an AI-generated video of a destroyed airport that amassed millions of views on X, the study found.

It oscillated -- sometimes within the same minute -- between denying the airport's destruction and confirming it had been damaged by strikes, the study said.

In some responses, Grok cited the a missile launched by Yemeni rebels as the source of the damage. In others, it wrongly identified the AI-generated airport as one in Beirut, Gaza, or Tehran.

When users shared another AI-generated video depicting buildings collapsing after an alleged Iranian strike on Tel Aviv, Grok responded that it appeared to be real, the study said.

The Israel-Iran conflict, which led to US air strikes against Tehran's nuclear program over the weekend, has churned out an avalanche of online misinformation including AI-generated videos and war visuals recycled from other conflicts.

AI chatbots also amplified falsehoods.

As the Israel-Iran war intensified, false claims spread across social media that China had dispatched military cargo planes to Tehran to offer its support.

When users asked the AI-operated X accounts of AI companies Perplexity and Grok about its validity, both wrongly responded that the claims were true, according to disinformation watchdog NewsGuard.

Researchers say Grok has previously made errors verifying information related to crises such as the recent India-Pakistan conflict and anti-immigration protests in Los Angeles.

Last month, Grok was under renewed scrutiny for inserting "white genocide" in South Africa, a far-right conspiracy theory, into unrelated queries.

Musk's startup xAI blamed an "unauthorized modification" for the unsolicited response.

Musk, a South African-born billionaire, has previously peddled the unfounded claim that South Africa's leaders were "openly pushing for genocide" of white people.

Musk himself blasted Grok after it cited Media Matters -- a liberal media watchdog he has targeted in multiple lawsuits -- as a source in some of its responses about misinformation.

"Shame on you, Grok," Musk wrote on X. "Your sourcing is terrible."

N.Wan--ThChM