The China Mail - Inner workings of AI an enigma - even to its creators

USD -
AED 3.672497
AFN 66.000178
ALL 83.301632
AMD 382.090126
ANG 1.79009
AOA 917.000038
ARS 1408.504788
AUD 1.523926
AWG 1.8025
AZN 1.701647
BAM 1.68937
BBD 2.014244
BDT 122.111228
BGN 1.687895
BHD 0.377039
BIF 2950
BMD 1
BND 1.30343
BOB 6.910223
BRL 5.293013
BSD 1.000082
BTN 88.671219
BWP 14.25758
BYN 3.410338
BYR 19600
BZD 2.011289
CAD 1.400385
CDF 2137.498421
CHF 0.798505
CLF 0.023707
CLP 930.020286
CNY 7.11275
CNH 7.10856
COP 3706.75
CRC 502.36889
CUC 1
CUP 26.5
CVE 95.374985
CZK 20.911798
DJF 177.720262
DKK 6.442445
DOP 64.393572
DZD 130.368338
EGP 47.218042
ERN 15
ETB 153.616746
EUR 0.86271
FJD 2.27485
FKP 0.75922
GBP 0.761755
GEL 2.678687
GGP 0.75922
GHS 10.965008
GIP 0.75922
GMD 73.497564
GNF 8685.000016
GTQ 7.664334
GYD 209.232018
HKD 7.769905
HNL 26.309778
HRK 6.501703
HTG 130.904411
HUF 331.872007
IDR 16727
ILS 3.20022
IMP 0.75922
INR 88.60295
IQD 1310
IRR 42112.502706
ISK 126.819918
JEP 0.75922
JMD 160.817476
JOD 0.708972
JPY 154.654502
KES 129.250306
KGS 87.449519
KHR 4019.999827
KMF 427.501669
KPW 899.988373
KRW 1466.969881
KWD 0.30712
KYD 0.833377
KZT 524.809647
LAK 21695.000308
LBP 89572.717427
LKR 304.582734
LRD 183.250413
LSL 17.244982
LTL 2.95274
LVL 0.60489
LYD 5.46899
MAD 9.272502
MDL 16.941349
MGA 4499.999414
MKD 53.084556
MMK 2099.257186
MNT 3579.013865
MOP 8.005511
MRU 39.849815
MUR 45.869883
MVR 15.404958
MWK 1736.000383
MXN 18.299302
MYR 4.130221
MZN 63.960066
NAD 17.244979
NGN 1440.999746
NIO 36.770281
NOK 10.078775
NPR 141.874295
NZD 1.76719
OMR 0.3845
PAB 1.000073
PEN 3.37875
PGK 4.120431
PHP 59.11052
PKR 280.850005
PLN 3.65245
PYG 7057.035009
QAR 3.640901
RON 4.386099
RSD 101.086041
RUB 81.271535
RWF 1450
SAR 3.750458
SBD 8.237372
SCR 15.081574
SDG 600.497601
SEK 9.44998
SGD 1.301009
SHP 0.750259
SLE 23.207442
SLL 20969.500812
SOS 571.500118
SRD 38.556498
STD 20697.981008
STN 21.38
SVC 8.750858
SYP 11056.952587
SZL 17.244982
THB 32.360332
TJS 9.260569
TMT 3.5
TND 2.950498
TOP 2.40776
TRY 42.251198
TTD 6.781462
TWD 31.089299
TZS 2439.999528
UAH 42.073999
UGX 3625.244555
UYU 39.767991
UZS 12005.000197
VES 233.26555
VND 26330
VUV 122.202554
WST 2.815308
XAF 566.596269
XAG 0.01866
XAU 0.000238
XCD 2.70255
XCG 1.802343
XDR 0.704774
XOF 565.00039
XPF 103.296152
YER 238.502561
ZAR 17.084015
ZMK 9001.18816
ZMW 22.426266
ZWL 321.999592
  • SCS

    0.0000

    15.75

    0%

  • VOD

    -0.3000

    12.37

    -2.43%

  • RELX

    -1.1200

    41.36

    -2.71%

  • RBGPF

    -0.0500

    78.47

    -0.06%

  • NGG

    0.7200

    78.03

    +0.92%

  • AZN

    -1.4100

    87.68

    -1.61%

  • CMSC

    0.1100

    24.08

    +0.46%

  • GSK

    -0.3400

    48.07

    -0.71%

  • RYCEF

    -0.0700

    14.96

    -0.47%

  • RIO

    0.7900

    71.11

    +1.11%

  • CMSD

    0.2300

    24.55

    +0.94%

  • JRI

    0.0500

    13.87

    +0.36%

  • BCC

    0.6500

    70.28

    +0.92%

  • BCE

    -0.6400

    22.77

    -2.81%

  • BTI

    0.0600

    55.82

    +0.11%

  • BP

    -0.4900

    36.86

    -1.33%

Inner workings of AI an enigma - even to its creators
Inner workings of AI an enigma - even to its creators / Photo: © AFP

Inner workings of AI an enigma - even to its creators

Even the greatest human minds building generative artificial intelligence that is poised to change the world admit they do not comprehend how digital minds think.

Text size:

"People outside the field are often surprised and alarmed to learn that we do not understand how our own AI creations work," Anthropic co-founder Dario Amodei wrote in an essay posted online in April.

"This lack of understanding is essentially unprecedented in the history of technology."

Unlike traditional software programs that follow pre-ordained paths of logic dictated by programmers, generative AI (gen AI) models are trained to find their own way to success once prompted.

In a recent podcast Chris Olah, who was part of ChatGPT-maker OpenAI before joining Anthropic, described gen AI as "scaffolding" on which circuits grow.

Olah is considered an authority in so-called mechanistic interpretability, a method of reverse engineering AI models to figure out how they work.

This science, born about a decade ago, seeks to determine exactly how AI gets from a query to an answer.

"Grasping the entirety of a large language model is an incredibly ambitious task," said Neel Nanda, a senior research scientist at the Google DeepMind AI lab.

It was "somewhat analogous to trying to fully understand the human brain," Nanda added to AFP, noting neuroscientists have yet to succeed on that front.

Delving into digital minds to understand their inner workings has gone from a little-known field just a few years ago to being a hot area of academic study.

"Students are very much attracted to it because they perceive the impact that it can have," said Boston University computer science professor Mark Crovella.

The area of study is also gaining traction due to its potential to make gen AI even more powerful, and because peering into digital brains can be intellectually exciting, the professor added.

- Keeping AI honest -

Mechanistic interpretability involves studying not just results served up by gen AI but scrutinizing calculations performed while the technology mulls queries, according to Crovella.

"You could look into the model...observe the computations that are being performed and try to understand those," the professor explained.

Startup Goodfire uses AI software capable of representing data in the form of reasoning steps to better understand gen AI processing and correct errors.

The tool is also intended to prevent gen AI models from being used maliciously or from deciding on their own to deceive humans about what they are up to.

"It does feel like a race against time to get there before we implement extremely intelligent AI models into the world with no understanding of how they work," said Goodfire chief executive Eric Ho.

In his essay, Amodei said recent progress has made him optimistic that the key to fully deciphering AI will be found within two years.

"I agree that by 2027, we could have interpretability that reliably detects model biases and harmful intentions," said Auburn University associate professor Anh Nguyen.

According to Boston University's Crovella, researchers can already access representations of every digital neuron in AI brains.

"Unlike the human brain, we actually have the equivalent of every neuron instrumented inside these models", the academic said. "Everything that happens inside the model is fully known to us. It's a question of discovering the right way to interrogate that."

Harnessing the inner workings of gen AI minds could clear the way for its adoption in areas where tiny errors can have dramatic consequences, like national security, Amodei said.

For Nanda, better understanding what gen AI is doing could also catapult human discoveries, much like DeepMind's chess-playing AI, AlphaZero, revealed entirely new chess moves that none of the grand masters had ever thought about.

Properly understood, a gen AI model with a stamp of reliability would grab competitive advantage in the market.

Such a breakthrough by a US company would also be a win for the nation in its technology rivalry with China.

"Powerful AI will shape humanity's destiny," Amodei wrote.

"We deserve to understand our own creations before they radically transform our economy, our lives, and our future."

K.Lam--ThChM