The China Mail - Anthropic's Claude AI gets smarter -- and mischievious

USD -
AED 3.672498
AFN 65.999546
ALL 83.886299
AMD 382.569343
ANG 1.789982
AOA 916.999667
ARS 1450.724895
AUD 1.535992
AWG 1.8025
AZN 1.703625
BAM 1.701894
BBD 2.013462
BDT 121.860805
BGN 1.698675
BHD 0.376969
BIF 2951
BMD 1
BND 1.306514
BOB 6.907654
BRL 5.340706
BSD 0.999682
BTN 88.718716
BWP 13.495075
BYN 3.407518
BYR 19600
BZD 2.010599
CAD 1.40972
CDF 2221.000107
CHF 0.8083
CLF 0.024025
CLP 942.260127
CNY 7.12675
CNH 7.124335
COP 3834.5
CRC 501.842642
CUC 1
CUP 26.5
CVE 96.374981
CZK 21.130974
DJF 177.719889
DKK 6.481435
DOP 64.297733
DZD 130.702957
EGP 47.350598
ERN 15
ETB 153.125026
EUR 0.868055
FJD 2.281097
FKP 0.766404
GBP 0.765345
GEL 2.714973
GGP 0.766404
GHS 10.924959
GIP 0.766404
GMD 73.496433
GNF 8691.000207
GTQ 7.661048
GYD 209.152772
HKD 7.774794
HNL 26.359887
HRK 6.537806
HTG 130.911876
HUF 335.451502
IDR 16695.1
ILS 3.253855
IMP 0.766404
INR 88.641051
IQD 1310
IRR 42112.439107
ISK 127.05977
JEP 0.766404
JMD 160.956848
JOD 0.709027
JPY 153.633017
KES 129.201234
KGS 87.449557
KHR 4027.000211
KMF 427.999878
KPW 900.033283
KRW 1447.48028
KWD 0.30713
KYD 0.83313
KZT 525.140102
LAK 21712.500514
LBP 89549.999727
LKR 304.599802
LRD 182.625016
LSL 17.379986
LTL 2.95274
LVL 0.60489
LYD 5.455014
MAD 9.301979
MDL 17.135125
MGA 4500.000656
MKD 53.533982
MMK 2099.044592
MNT 3585.031206
MOP 8.006805
MRU 38.249781
MUR 45.999702
MVR 15.404977
MWK 1736.000423
MXN 18.58737
MYR 4.18301
MZN 63.960022
NAD 17.380215
NGN 1440.729964
NIO 36.770288
NOK 10.170899
NPR 141.949154
NZD 1.7668
OMR 0.384495
PAB 0.999687
PEN 3.376505
PGK 4.216027
PHP 58.845981
PKR 280.85006
PLN 3.69242
PYG 7077.158694
QAR 3.640957
RON 4.414195
RSD 101.74198
RUB 81.125016
RWF 1450
SAR 3.750543
SBD 8.223823
SCR 13.740948
SDG 600.503506
SEK 9.536655
SGD 1.304925
SHP 0.750259
SLE 23.200677
SLL 20969.499529
SOS 571.507056
SRD 38.558019
STD 20697.981008
STN 21.45
SVC 8.747031
SYP 11056.895466
SZL 17.38022
THB 32.350333
TJS 9.257197
TMT 3.5
TND 2.960056
TOP 2.342104
TRY 42.11875
TTD 6.775354
TWD 30.898017
TZS 2459.806973
UAH 42.064759
UGX 3491.230589
UYU 39.758439
UZS 11987.497487
VES 227.27225
VND 26315
VUV 122.169446
WST 2.82328
XAF 570.814334
XAG 0.020533
XAU 0.000249
XCD 2.70255
XCG 1.801656
XDR 0.70875
XOF 570.495888
XPF 104.149691
YER 238.497406
ZAR 17.363401
ZMK 9001.204121
ZMW 22.392878
ZWL 321.999592
  • RBGPF

    0.0000

    76

    0%

  • RYCEF

    0.1500

    15.1

    +0.99%

  • CMSC

    0.2400

    23.83

    +1.01%

  • BTI

    0.9000

    53.88

    +1.67%

  • SCS

    0.0600

    15.93

    +0.38%

  • VOD

    0.0700

    11.27

    +0.62%

  • BCC

    0.9700

    71.38

    +1.36%

  • RIO

    1.1700

    69.06

    +1.69%

  • NGG

    0.2300

    75.37

    +0.31%

  • GSK

    -0.1300

    46.69

    -0.28%

  • RELX

    0.2800

    44.58

    +0.63%

  • JRI

    0.0700

    13.77

    +0.51%

  • CMSD

    0.1900

    24.01

    +0.79%

  • BCE

    0.1000

    22.39

    +0.45%

  • BP

    0.5600

    35.68

    +1.57%

  • AZN

    -0.8800

    81.15

    -1.08%

Anthropic's Claude AI gets smarter -- and mischievious
Anthropic's Claude AI gets smarter -- and mischievious / Photo: © AFP

Anthropic's Claude AI gets smarter -- and mischievious

Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new standards for reasoning but also building in safeguards against rogue behavior.

Text size:

"Claude Opus 4 is our most powerful model yet, and the best coding model in the world," Anthropic chief executive Dario Amodei said at the San Francisco-based startup's first developers conference.

Opus 4 and Sonnet 4 were described as "hybrid" models capable of quick responses as well as more thoughtful results that take a little time to get things right.

Founded by former OpenAI engineers, Anthropic is currently concentrating its efforts on cutting-edge models that are particularly adept at generating lines of code, and used mainly by businesses and professionals.

Unlike ChatGPT and Google's Gemini, its Claude chatbot does not generate images, and is very limited when it comes to multimodal functions (understanding and generating different media, such as sound or video).

The start-up, with Amazon as a significant backer, is valued at over $61 billion, and promotes the responsible and competitive development of generative AI.

Under that dual mantra, Anthropic's commitment to transparency is rare in Silicon Valley.

On Thursday, the company published a report on the security tests carried out on Claude 4, including the conclusions of an independent research institute, which had recommended against deploying an early version of the model.

"We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers’ intentions,” The Apollo Research team warned.

“All these attempts would likely not have been effective in practice,” it added.

Anthropic says in the report that it implemented “safeguards” and “additional monitoring of harmful behavior” in the version that it released.

Still, Claude Opus 4 “sometimes takes extremely harmful actions like attempting to (…) blackmail people it believes are trying to shut it down.”

It also has the potential to report law-breaking users to the police.

The scheming misbehavior was rare and took effort to trigger, but was more common than in earlier versions of Claude, according to the company.

- AI future -

Since OpenAI's ChatGPT burst onto the scene in late 2022, various GenAI models have been vying for supremacy.

Anthropic's gathering came on the heels of annual developer conferences from Google and Microsoft at which the tech giants showcased their latest AI innovations.

GenAI tools answer questions or tend to tasks based on simple, conversational prompts.

The current craze in Silicon Valley is on AI "agents" tailored to independently handle computer or online tasks.

"We're going to focus on agents beyond the hype," said Anthropic chief product officer Mike Krieger, a recent hire and co-founder of Instagram.

Anthropic is no stranger to hyping up the prospects of AI.

In 2023, Dario Amodei predicted that so-called “artificial general intelligence” (capable of human-level thinking) would arrive within 2-3 years. At the end of 2024, he extended this horizon to 2026 or 2027.

He also estimated that AI will soon be writing most, if not all, computer code, making possible one-person tech startups with digital agents cranking out the software.

At Anthropic, already "something like over 70 percent of (suggested modifications in the code) are now Claude Code written", Krieger told journalists.

"In the long term, we're all going to have to contend with the idea that everything humans do is eventually going to be done by AI systems," Amodei added.

"This will happen."

GenAI fulfilling its potential could lead to strong economic growth and a “huge amount of inequality,” with it up to society how evenly wealth is distributed, Amodei reasoned.

E.Lau--ThChM