The China Mail - Anthropic's Claude AI gets smarter -- and mischievious

USD -
AED 3.672499
AFN 65.49754
ALL 80.979656
AMD 377.215764
ANG 1.79008
AOA 917.000004
ARS 1404.088403
AUD 1.404485
AWG 1.8025
AZN 1.702819
BAM 1.643792
BBD 2.01512
BDT 122.389289
BGN 1.67937
BHD 0.376978
BIF 2965.35987
BMD 1
BND 1.266678
BOB 6.913941
BRL 5.196498
BSD 1.0005
BTN 90.584735
BWP 13.12568
BYN 2.874337
BYR 19600
BZD 2.012178
CAD 1.351735
CDF 2209.999919
CHF 0.764798
CLF 0.02167
CLP 855.659814
CNY 6.91085
CNH 6.90741
COP 3667.46
CRC 495.12315
CUC 1
CUP 26.5
CVE 92.677576
CZK 20.33315
DJF 178.163649
DKK 6.26502
DOP 62.707755
DZD 129.419762
EGP 46.837801
ERN 15
ETB 155.312845
EUR 0.83859
FJD 2.18585
FKP 0.731875
GBP 0.731155
GEL 2.690116
GGP 0.731875
GHS 11.010531
GIP 0.731875
GMD 73.489005
GNF 8782.951828
GTQ 7.672912
GYD 209.326172
HKD 7.81475
HNL 26.438786
HRK 6.320599
HTG 131.239993
HUF 316.717502
IDR 16771
ILS 3.07635
IMP 0.731875
INR 90.548504
IQD 1310.634936
IRR 42125.000158
ISK 121.602337
JEP 0.731875
JMD 156.538256
JOD 0.708993
JPY 152.826501
KES 129.000162
KGS 87.450287
KHR 4032.593576
KMF 414.400398
KPW 899.999067
KRW 1451.015027
KWD 0.30687
KYD 0.833761
KZT 492.246531
LAK 21486.714209
LBP 89522.281894
LKR 309.580141
LRD 186.599091
LSL 15.938326
LTL 2.95274
LVL 0.60489
LYD 6.307756
MAD 9.121259
MDL 16.933027
MGA 4429.297238
MKD 51.733832
MMK 2099.913606
MNT 3568.190929
MOP 8.056446
MRU 39.329271
MUR 45.679578
MVR 15.449664
MWK 1734.822093
MXN 17.15845
MYR 3.925501
MZN 63.902223
NAD 15.938527
NGN 1355.459875
NIO 36.82116
NOK 9.477765
NPR 144.931312
NZD 1.64852
OMR 0.384493
PAB 1.000504
PEN 3.359612
PGK 4.2923
PHP 58.307499
PKR 279.886956
PLN 3.53654
PYG 6585.112687
QAR 3.647007
RON 4.269695
RSD 98.41699
RUB 77.42437
RWF 1460.743567
SAR 3.75085
SBD 8.058149
SCR 14.106202
SDG 601.497232
SEK 8.844315
SGD 1.261905
SHP 0.750259
SLE 24.349869
SLL 20969.499267
SOS 571.774366
SRD 37.890414
STD 20697.981008
STN 20.59161
SVC 8.754376
SYP 11059.574895
SZL 15.922777
THB 31.039964
TJS 9.389882
TMT 3.51
TND 2.882406
TOP 2.40776
TRY 43.639504
TTD 6.786071
TWD 31.420303
TZS 2582.653999
UAH 43.08933
UGX 3556.990006
UYU 38.36876
UZS 12326.389618
VES 384.790411
VND 25944.5
VUV 119.366255
WST 2.707053
XAF 551.314711
XAG 0.012176
XAU 0.000198
XCD 2.70255
XCG 1.803175
XDR 0.685659
XOF 551.314711
XPF 100.234491
YER 238.325026
ZAR 15.88361
ZMK 9001.198133
ZMW 19.034211
ZWL 321.999592
  • SCS

    0.0200

    16.14

    +0.12%

  • RBGPF

    0.1000

    82.5

    +0.12%

  • CMSC

    0.1070

    23.692

    +0.45%

  • NGG

    0.3700

    88.76

    +0.42%

  • RIO

    0.3900

    97.24

    +0.4%

  • CMSD

    0.1100

    24.08

    +0.46%

  • RYCEF

    0.5300

    17.41

    +3.04%

  • BCE

    0.2100

    25.83

    +0.81%

  • RELX

    -0.1900

    29.29

    -0.65%

  • VOD

    -0.2300

    15.25

    -1.51%

  • GSK

    -0.1900

    58.82

    -0.32%

  • BTI

    -0.9600

    60.19

    -1.59%

  • BCC

    0.7100

    89.73

    +0.79%

  • JRI

    -0.0300

    12.78

    -0.23%

  • AZN

    5.3900

    193.4

    +2.79%

  • BP

    -2.2500

    36.97

    -6.09%

Anthropic's Claude AI gets smarter -- and mischievious
Anthropic's Claude AI gets smarter -- and mischievious / Photo: © AFP

Anthropic's Claude AI gets smarter -- and mischievious

Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new standards for reasoning but also building in safeguards against rogue behavior.

Text size:

"Claude Opus 4 is our most powerful model yet, and the best coding model in the world," Anthropic chief executive Dario Amodei said at the San Francisco-based startup's first developers conference.

Opus 4 and Sonnet 4 were described as "hybrid" models capable of quick responses as well as more thoughtful results that take a little time to get things right.

Founded by former OpenAI engineers, Anthropic is currently concentrating its efforts on cutting-edge models that are particularly adept at generating lines of code, and used mainly by businesses and professionals.

Unlike ChatGPT and Google's Gemini, its Claude chatbot does not generate images, and is very limited when it comes to multimodal functions (understanding and generating different media, such as sound or video).

The start-up, with Amazon as a significant backer, is valued at over $61 billion, and promotes the responsible and competitive development of generative AI.

Under that dual mantra, Anthropic's commitment to transparency is rare in Silicon Valley.

On Thursday, the company published a report on the security tests carried out on Claude 4, including the conclusions of an independent research institute, which had recommended against deploying an early version of the model.

"We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers’ intentions,” The Apollo Research team warned.

“All these attempts would likely not have been effective in practice,” it added.

Anthropic says in the report that it implemented “safeguards” and “additional monitoring of harmful behavior” in the version that it released.

Still, Claude Opus 4 “sometimes takes extremely harmful actions like attempting to (…) blackmail people it believes are trying to shut it down.”

It also has the potential to report law-breaking users to the police.

The scheming misbehavior was rare and took effort to trigger, but was more common than in earlier versions of Claude, according to the company.

- AI future -

Since OpenAI's ChatGPT burst onto the scene in late 2022, various GenAI models have been vying for supremacy.

Anthropic's gathering came on the heels of annual developer conferences from Google and Microsoft at which the tech giants showcased their latest AI innovations.

GenAI tools answer questions or tend to tasks based on simple, conversational prompts.

The current craze in Silicon Valley is on AI "agents" tailored to independently handle computer or online tasks.

"We're going to focus on agents beyond the hype," said Anthropic chief product officer Mike Krieger, a recent hire and co-founder of Instagram.

Anthropic is no stranger to hyping up the prospects of AI.

In 2023, Dario Amodei predicted that so-called “artificial general intelligence” (capable of human-level thinking) would arrive within 2-3 years. At the end of 2024, he extended this horizon to 2026 or 2027.

He also estimated that AI will soon be writing most, if not all, computer code, making possible one-person tech startups with digital agents cranking out the software.

At Anthropic, already "something like over 70 percent of (suggested modifications in the code) are now Claude Code written", Krieger told journalists.

"In the long term, we're all going to have to contend with the idea that everything humans do is eventually going to be done by AI systems," Amodei added.

"This will happen."

GenAI fulfilling its potential could lead to strong economic growth and a “huge amount of inequality,” with it up to society how evenly wealth is distributed, Amodei reasoned.

E.Lau--ThChM