The China Mail - Anthropic's Claude AI gets smarter -- and mischievous

Anthropic's Claude AI gets smarter -- and mischievous / Photo: © AFP

Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new standards for reasoning but also building in safeguards against rogue behavior.

"Claude Opus 4 is our most powerful model yet, and the best coding model in the world," Anthropic chief executive Dario Amodei said at the San Francisco-based startup's first developers conference.

Opus 4 and Sonnet 4 were described as "hybrid" models capable of quick responses as well as more thoughtful results that take a little time to get things right.

Founded by former OpenAI engineers, Anthropic is currently concentrating its efforts on cutting-edge models that are particularly adept at generating lines of code, and used mainly by businesses and professionals.

Unlike ChatGPT and Google's Gemini, its Claude chatbot does not generate images, and is very limited when it comes to multimodal functions (understanding and generating different media, such as sound or video).

The startup, which counts Amazon as a significant backer, is valued at over $61 billion and promotes the responsible and competitive development of generative AI.

Under that dual mantra, Anthropic's commitment to transparency is rare in Silicon Valley.

On Thursday, the company published a report on the safety tests carried out on Claude 4, including the conclusions of an independent research institute, which had recommended against deploying an early version of the model.

“We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers’ intentions,” the Apollo Research team warned.

“All these attempts would likely not have been effective in practice,” it added.

Anthropic says in the report that it implemented “safeguards” and “additional monitoring of harmful behavior” in the version that it released.

Still, Claude Opus 4 “sometimes takes extremely harmful actions like attempting to (…) blackmail people it believes are trying to shut it down.”

It also has the potential to report law-breaking users to the police.

The scheming misbehavior was rare and took effort to trigger, but was more common than in earlier versions of Claude, according to the company.

- AI future -

Since OpenAI's ChatGPT burst onto the scene in late 2022, various GenAI models have been vying for supremacy.

Anthropic's gathering came on the heels of annual developer conferences from Google and Microsoft at which the tech giants showcased their latest AI innovations.

GenAI tools answer questions or tend to tasks based on simple, conversational prompts.

The current craze in Silicon Valley is for AI "agents" tailored to independently handle computer or online tasks.

"We're going to focus on agents beyond the hype," said Anthropic chief product officer Mike Krieger, a recent hire and co-founder of Instagram.

Anthropic is no stranger to hyping up the prospects of AI.

In 2023, Dario Amodei predicted that so-called “artificial general intelligence” (capable of human-level thinking) would arrive within 2-3 years. At the end of 2024, he extended this horizon to 2026 or 2027.

He also estimated that AI will soon be writing most, if not all, computer code, making possible one-person tech startups with digital agents cranking out the software.

At Anthropic, already "something like over 70 percent of (suggested modifications in the code) are now Claude Code written", Krieger told journalists.

"In the long term, we're all going to have to contend with the idea that everything humans do is eventually going to be done by AI systems," Amodei added.

"This will happen."

GenAI fulfilling its potential could lead to strong economic growth and a “huge amount of inequality,” Amodei reasoned, leaving it up to society to decide how evenly wealth is distributed.

E.Lau--ThChM