The China Mail - Inner workings of AI an enigma - even to its creators

USD -
AED 3.672498
AFN 68.693178
ALL 83.231009
AMD 382.053333
ANG 1.789783
AOA 916.999904
ARS 1292.733497
AUD 1.540165
AWG 1.8005
AZN 1.698941
BAM 1.673519
BBD 2.019466
BDT 121.522237
BGN 1.674005
BHD 0.376974
BIF 2983.036345
BMD 1
BND 1.283248
BOB 6.936001
BRL 5.445401
BSD 1.000193
BTN 87.076873
BWP 13.953289
BYN 3.352172
BYR 19600
BZD 2.00901
CAD 1.38117
CDF 2894.999736
CHF 0.805375
CLF 0.024569
CLP 963.830461
CNY 7.184895
CNH 7.18346
COP 4014.85
CRC 505.439875
CUC 1
CUP 26.5
CVE 94.35044
CZK 20.936102
DJF 178.106162
DKK 6.389199
DOP 61.608232
DZD 129.729018
EGP 48.444802
ERN 15
ETB 141.169263
EUR 0.85595
FJD 2.257396
FKP 0.739708
GBP 0.739555
GEL 2.689906
GGP 0.739708
GHS 10.901997
GIP 0.739708
GMD 72.000004
GNF 8672.21426
GTQ 7.665946
GYD 209.252279
HKD 7.80134
HNL 26.194622
HRK 6.447991
HTG 130.951719
HUF 337.090034
IDR 16255.3
ILS 3.380575
IMP 0.739708
INR 87.077969
IQD 1310.201724
IRR 42112.485453
ISK 122.72992
JEP 0.739708
JMD 160.138619
JOD 0.708995
JPY 147.781499
KES 129.219861
KGS 87.449708
KHR 4008.796249
KMF 420.49797
KPW 899.979822
KRW 1388.675005
KWD 0.30562
KYD 0.833501
KZT 538.378933
LAK 21659.386404
LBP 89777.570517
LKR 301.751984
LRD 200.533078
LSL 17.598391
LTL 2.95274
LVL 0.60489
LYD 5.414679
MAD 9.013028
MDL 16.712801
MGA 4393.82725
MKD 52.657925
MMK 2098.533403
MNT 3597.063411
MOP 8.037957
MRU 39.886196
MUR 45.680341
MVR 15.409699
MWK 1734.256878
MXN 18.799405
MYR 4.223498
MZN 63.897491
NAD 17.598391
NGN 1534.340077
NIO 36.802362
NOK 10.209302
NPR 139.323593
NZD 1.68726
OMR 0.384497
PAB 1.000184
PEN 3.543158
PGK 4.225298
PHP 57.147028
PKR 283.798575
PLN 3.63359
PYG 7226.987828
QAR 3.635313
RON 4.329302
RSD 100.27402
RUB 80.772941
RWF 1447.695487
SAR 3.752488
SBD 8.223773
SCR 14.741788
SDG 600.49947
SEK 9.54981
SGD 1.28303
SHP 0.785843
SLE 23.258728
SLL 20969.49797
SOS 571.587482
SRD 37.719736
STD 20697.981008
STN 20.963912
SVC 8.751792
SYP 13001.624023
SZL 17.580593
THB 32.499496
TJS 9.296517
TMT 3.51
TND 2.923311
TOP 2.342098
TRY 40.8887
TTD 6.778559
TWD 30.094502
TZS 2515.00031
UAH 41.389658
UGX 3565.576401
UYU 40.071021
UZS 12499.625644
VES 135.47035
VND 26320
VUV 119.390828
WST 2.678368
XAF 561.280248
XAG 0.026269
XAU 0.0003
XCD 2.70255
XCG 1.802554
XDR 0.697125
XOF 561.268241
XPF 102.04719
YER 240.27503
ZAR 17.61465
ZMK 9001.203383
ZMW 23.279156
ZWL 321.999592
  • SCS

    0.1000

    16.25

    +0.62%

  • RYCEF

    0.0500

    14.76

    +0.34%

  • GSK

    0.2800

    39.35

    +0.71%

  • CMSC

    0.0000

    23.15

    0%

  • NGG

    0.2300

    70.93

    +0.32%

  • RIO

    -0.2400

    61

    -0.39%

  • RELX

    -0.1150

    47.705

    -0.24%

  • VOD

    0.0950

    11.795

    +0.81%

  • BCE

    0.1500

    25.72

    +0.58%

  • BTI

    -0.3200

    57.4

    -0.56%

  • BCC

    0.3100

    86.3

    +0.36%

  • AZN

    0.5950

    79.715

    +0.75%

  • CMSD

    0.0000

    23.35

    0%

  • BP

    0.1500

    34.2

    +0.44%

  • JRI

    0.0360

    13.356

    +0.27%

  • RBGPF

    0.0000

    75.92

    0%

Inner workings of AI an enigma - even to its creators
Inner workings of AI an enigma - even to its creators / Photo: © AFP

Inner workings of AI an enigma - even to its creators

Even the greatest human minds building generative artificial intelligence that is poised to change the world admit they do not comprehend how digital minds think.

Text size:

"People outside the field are often surprised and alarmed to learn that we do not understand how our own AI creations work," Anthropic co-founder Dario Amodei wrote in an essay posted online in April.

"This lack of understanding is essentially unprecedented in the history of technology."

Unlike traditional software programs that follow pre-ordained paths of logic dictated by programmers, generative AI (gen AI) models are trained to find their own way to success once prompted.

In a recent podcast Chris Olah, who was part of ChatGPT-maker OpenAI before joining Anthropic, described gen AI as "scaffolding" on which circuits grow.

Olah is considered an authority in so-called mechanistic interpretability, a method of reverse engineering AI models to figure out how they work.

This science, born about a decade ago, seeks to determine exactly how AI gets from a query to an answer.

"Grasping the entirety of a large language model is an incredibly ambitious task," said Neel Nanda, a senior research scientist at the Google DeepMind AI lab.

It was "somewhat analogous to trying to fully understand the human brain," Nanda added to AFP, noting neuroscientists have yet to succeed on that front.

Delving into digital minds to understand their inner workings has gone from a little-known field just a few years ago to being a hot area of academic study.

"Students are very much attracted to it because they perceive the impact that it can have," said Boston University computer science professor Mark Crovella.

The area of study is also gaining traction due to its potential to make gen AI even more powerful, and because peering into digital brains can be intellectually exciting, the professor added.

- Keeping AI honest -

Mechanistic interpretability involves studying not just results served up by gen AI but scrutinizing calculations performed while the technology mulls queries, according to Crovella.

"You could look into the model...observe the computations that are being performed and try to understand those," the professor explained.

Startup Goodfire uses AI software capable of representing data in the form of reasoning steps to better understand gen AI processing and correct errors.

The tool is also intended to prevent gen AI models from being used maliciously or from deciding on their own to deceive humans about what they are up to.

"It does feel like a race against time to get there before we implement extremely intelligent AI models into the world with no understanding of how they work," said Goodfire chief executive Eric Ho.

In his essay, Amodei said recent progress has made him optimistic that the key to fully deciphering AI will be found within two years.

"I agree that by 2027, we could have interpretability that reliably detects model biases and harmful intentions," said Auburn University associate professor Anh Nguyen.

According to Boston University's Crovella, researchers can already access representations of every digital neuron in AI brains.

"Unlike the human brain, we actually have the equivalent of every neuron instrumented inside these models", the academic said. "Everything that happens inside the model is fully known to us. It's a question of discovering the right way to interrogate that."

Harnessing the inner workings of gen AI minds could clear the way for its adoption in areas where tiny errors can have dramatic consequences, like national security, Amodei said.

For Nanda, better understanding what gen AI is doing could also catapult human discoveries, much like DeepMind's chess-playing AI, AlphaZero, revealed entirely new chess moves that none of the grand masters had ever thought about.

Properly understood, a gen AI model with a stamp of reliability would grab competitive advantage in the market.

Such a breakthrough by a US company would also be a win for the nation in its technology rivalry with China.

"Powerful AI will shape humanity's destiny," Amodei wrote.

"We deserve to understand our own creations before they radically transform our economy, our lives, and our future."

K.Lam--ThChM