The China Mail - Inner workings of AI an enigma - even to its creators

USD -
AED 3.672984
AFN 70.095814
ALL 88.322167
AMD 387.5784
ANG 1.790262
AOA 916.499154
ARS 1130.5001
AUD 1.566235
AWG 1.8025
AZN 1.697632
BAM 1.761205
BBD 2.014516
BDT 121.225765
BGN 1.761027
BHD 0.376922
BIF 2968.446077
BMD 1
BND 1.304481
BOB 6.91953
BRL 5.6718
BSD 0.997767
BTN 84.753058
BWP 13.621137
BYN 3.265225
BYR 19600
BZD 2.00416
CAD 1.397305
CDF 2870.000215
CHF 0.844085
CLF 0.024662
CLP 946.389917
CNY 7.203295
CNH 7.18786
COP 4224.75
CRC 506.720097
CUC 1
CUP 26.5
CVE 99.294452
CZK 22.506979
DJF 177.670917
DKK 6.720705
DOP 58.686598
DZD 133.816983
EGP 50.503203
ERN 15
ETB 135.040411
EUR 0.900875
FJD 2.278504
FKP 0.751765
GBP 0.75872
GEL 2.744984
GGP 0.751765
GHS 12.920539
GIP 0.751765
GMD 71.498872
GNF 8641.230448
GTQ 7.674124
GYD 208.747569
HKD 7.79215
HNL 25.920439
HRK 6.786197
HTG 130.502125
HUF 364.931496
IDR 16612.3
ILS 3.566625
IMP 0.751765
INR 84.80025
IQD 1306.990608
IRR 42100.000459
ISK 132.159776
JEP 0.751765
JMD 158.598084
JOD 0.709298
JPY 147.9715
KES 129.009947
KGS 87.449484
KHR 3992.867949
KMF 436.500135
KPW 899.999819
KRW 1418.960086
KWD 0.30734
KYD 0.831435
KZT 510.387307
LAK 21572.459005
LBP 89397.112986
LKR 298.19269
LRD 199.552448
LSL 18.288863
LTL 2.95274
LVL 0.60489
LYD 5.467906
MAD 9.310028
MDL 17.260849
MGA 4484.547223
MKD 55.412226
MMK 2099.691958
MNT 3573.956258
MOP 8.008447
MRU 39.541638
MUR 45.709919
MVR 15.404623
MWK 1730.152727
MXN 19.633797
MYR 4.329859
MZN 63.898555
NAD 18.288863
NGN 1601.795628
NIO 36.714019
NOK 10.44969
NPR 135.605934
NZD 1.703825
OMR 0.384993
PAB 0.997767
PEN 3.644697
PGK 4.141452
PHP 55.683499
PKR 280.865031
PLN 3.821136
PYG 7972.156435
QAR 3.640752
RON 4.598206
RSD 105.548001
RUB 81.000086
RWF 1428.301275
SAR 3.75067
SBD 8.350849
SCR 14.212403
SDG 600.49767
SEK 9.81055
SGD 1.304465
SHP 0.785843
SLE 22.750131
SLL 20969.500376
SOS 570.203876
SRD 36.199497
STD 20697.981008
SVC 8.73038
SYP 13001.862587
SZL 18.285786
THB 33.405503
TJS 10.396448
TMT 3.5
TND 3.035881
TOP 2.342101
TRY 38.804203
TTD 6.772686
TWD 30.422501
TZS 2694.99943
UAH 41.449643
UGX 3651.574094
UYU 41.702499
UZS 12851.083756
VES 92.71499
VND 25946
VUV 121.003465
WST 2.778524
XAF 590.696816
XAG 0.030464
XAU 0.000309
XCD 2.70255
XDR 0.734637
XOF 590.696816
XPF 107.394033
YER 244.449736
ZAR 18.279099
ZMK 9001.232815
ZMW 26.270385
ZWL 321.999592
  • RBGPF

    2.2700

    65.27

    +3.48%

  • NGG

    -3.1600

    67.53

    -4.68%

  • CMSC

    0.0200

    22.08

    +0.09%

  • GSK

    0.7500

    37.37

    +2.01%

  • RIO

    1.4300

    61.41

    +2.33%

  • BTI

    -0.6600

    40.98

    -1.61%

  • SCS

    0.3600

    10.82

    +3.33%

  • RELX

    -2.0200

    51.83

    -3.9%

  • RYCEF

    -0.1200

    10.38

    -1.16%

  • VOD

    -0.2300

    9.07

    -2.54%

  • CMSD

    -0.0400

    22.3

    -0.18%

  • AZN

    1.3800

    68.95

    +2%

  • BCE

    -0.1500

    22.56

    -0.66%

  • BP

    0.4200

    30.19

    +1.39%

  • BCC

    4.4800

    93.1

    +4.81%

  • JRI

    0.0300

    13.01

    +0.23%

Inner workings of AI an enigma - even to its creators
Inner workings of AI an enigma - even to its creators / Photo: © AFP

Inner workings of AI an enigma - even to its creators

Even the greatest human minds building generative artificial intelligence that is poised to change the world admit they do not comprehend how digital minds think.

Text size:

"People outside the field are often surprised and alarmed to learn that we do not understand how our own AI creations work," Anthropic co-founder Dario Amodei wrote in an essay posted online in April.

"This lack of understanding is essentially unprecedented in the history of technology."

Unlike traditional software programs that follow pre-ordained paths of logic dictated by programmers, generative AI (gen AI) models are trained to find their own way to success once prompted.

In a recent podcast Chris Olah, who was part of ChatGPT-maker OpenAI before joining Anthropic, described gen AI as "scaffolding" on which circuits grow.

Olah is considered an authority in so-called mechanistic interpretability, a method of reverse engineering AI models to figure out how they work.

This science, born about a decade ago, seeks to determine exactly how AI gets from a query to an answer.

"Grasping the entirety of a large language model is an incredibly ambitious task," said Neel Nanda, a senior research scientist at the Google DeepMind AI lab.

It was "somewhat analogous to trying to fully understand the human brain," Nanda added to AFP, noting neuroscientists have yet to succeed on that front.

Delving into digital minds to understand their inner workings has gone from a little-known field just a few years ago to being a hot area of academic study.

"Students are very much attracted to it because they perceive the impact that it can have," said Boston University computer science professor Mark Crovella.

The area of study is also gaining traction due to its potential to make gen AI even more powerful, and because peering into digital brains can be intellectually exciting, the professor added.

- Keeping AI honest -

Mechanistic interpretability involves studying not just results served up by gen AI but scrutinizing calculations performed while the technology mulls queries, according to Crovella.

"You could look into the model...observe the computations that are being performed and try to understand those," the professor explained.

Startup Goodfire uses AI software capable of representing data in the form of reasoning steps to better understand gen AI processing and correct errors.

The tool is also intended to prevent gen AI models from being used maliciously or from deciding on their own to deceive humans about what they are up to.

"It does feel like a race against time to get there before we implement extremely intelligent AI models into the world with no understanding of how they work," said Goodfire chief executive Eric Ho.

In his essay, Amodei said recent progress has made him optimistic that the key to fully deciphering AI will be found within two years.

"I agree that by 2027, we could have interpretability that reliably detects model biases and harmful intentions," said Auburn University associate professor Anh Nguyen.

According to Boston University's Crovella, researchers can already access representations of every digital neuron in AI brains.

"Unlike the human brain, we actually have the equivalent of every neuron instrumented inside these models", the academic said. "Everything that happens inside the model is fully known to us. It's a question of discovering the right way to interrogate that."

Harnessing the inner workings of gen AI minds could clear the way for its adoption in areas where tiny errors can have dramatic consequences, like national security, Amodei said.

For Nanda, better understanding what gen AI is doing could also catapult human discoveries, much like DeepMind's chess-playing AI, AlphaZero, revealed entirely new chess moves that none of the grand masters had ever thought about.

Properly understood, a gen AI model with a stamp of reliability would grab competitive advantage in the market.

Such a breakthrough by a US company would also be a win for the nation in its technology rivalry with China.

"Powerful AI will shape humanity's destiny," Amodei wrote.

"We deserve to understand our own creations before they radically transform our economy, our lives, and our future."

K.Lam--ThChM