Anthropic's Claude AI gets smarter -- and mischievious

The China Mail - Anthropic's Claude AI gets smarter -- and mischievious

Beijing 20°C

USD -

AED 3.672504

AFN 64.000368

ALL 82.087167

AMD 368.450607

ANG 1.790403

AOA 918.000367

ARS 1428.330353

AUD 1.418842

AWG 1.801525

AZN 1.70397

BAM 1.689603

BBD 2.013822

BDT 122.983888

BGN 1.69088

BHD 0.37683

BIF 2970.152477

BMD 1

BND 1.283746

BOB 6.909421

BRL 5.061504

BSD 0.99987

BTN 95.052482

BWP 13.460326

BYN 2.766446

BYR 19600

BZD 2.010971

CAD 1.39945

CDF 2295.000362

CHF 0.796927

CLF 0.022916

CLP 904.902596

CNY 6.771504

CNH 6.76346

COP 3492.894475

CRC 454.839964

CUC 1

CUP 26.5

CVE 95.257224

CZK 20.874704

DJF 178.057103

DKK 6.461104

DOP 58.710207

DZD 133.120816

EGP 51.846573

ERN 15

ETB 157.556391

EUR 0.863904

FJD 2.215904

FKP 0.745521

GBP 0.745768

GEL 2.65504

GGP 0.745521

GHS 11.098441

GIP 0.745521

GMD 73.000355

GNF 8759.016889

GTQ 7.622133

GYD 209.191828

HKD 7.83605

HNL 26.736642

HRK 6.513804

HTG 130.733014

HUF 304.250388

IDR 17779.3

ILS 2.92082

IMP 0.745521

INR 95.110504

IQD 1309.835428

IRR 1375877.503816

ISK 124.650386

JEP 0.745521

JMD 158.489914

JOD 0.70904

JPY 160.22904

KES 129.480368

KGS 87.450384

KHR 4017.105093

KMF 426.00035

KPW 900.00035

KRW 1518.230383

KWD 0.30848

KYD 0.833312

KZT 488.937843

LAK 22017.191482

LBP 89543.518639

LKR 335.207982

LRD 181.97918

LSL 16.286467

LTL 2.95274

LVL 0.60489

LYD 6.372943

MAD 9.260766

MDL 17.462745

MGA 4172.605935

MKD 53.254719

MMK 2099.254457

MNT 3578.100965

MOP 8.070062

MRU 39.65617

MUR 47.250378

MVR 15.460378

MWK 1733.834392

MXN 17.222904

MYR 4.057604

MZN 63.903729

NAD 16.286467

NGN 1360.503725

NIO 36.793227

NOK 9.513504

NPR 152.084143

NZD 1.714972

OMR 0.384251

PAB 0.99987

PEN 3.400458

PGK 4.378213

PHP 60.771038

PKR 278.191957

PLN 3.66995

PYG 6122.413719

QAR 3.65522

RON 4.526104

RSD 101.386549

RUB 72.4589

RWF 1468.359898

SAR 3.753804

SBD 8.045573

SCR 14.065224

SDG 600.503676

SEK 9.47869

SGD 1.284504

SHP 0.746601

SLE 24.650371

SLL 20969.503664

SOS 571.465595

SRD 37.509504

STD 20697.981008

STN 21.165392

SVC 8.74865

SYP 110.532098

SZL 16.273163

THB 32.873038

TJS 9.318906

TMT 3.51

TND 2.933437

TOP 2.40776

TRY 46.232504

TTD 6.791931

TWD 31.621504

TZS 2624.681439

UAH 44.803507

UGX 3749.298086

UYU 40.387024

UZS 11975.292644

VES 581.95784

VND 26310

VUV 119.415431

WST 2.743477

XAF 566.677033

XAG 0.014699

XAU 0.000237

XCD 2.70255

XCG 1.801996

XDR 0.704764

XOF 566.677033

XPF 103.027947

YER 238.603589

ZAR 16.313845

ZMK 9001.203584

ZMW 17.467928

ZWL 321.999592

CMSC

-0.0200

22.33

-0.09%
RELX

0.6300

33.74

+1.87%
NGG

0.3200

81.84

+0.39%
RBGPF

0.0000

60.72

0%
BCE

0.0200

24.59

+0.08%
GSK

0.1800

53.04

+0.34%
BCC

0.4800

71.14

+0.67%
RIO

1.7100

105.35

+1.62%
RYCEF

0.4600

17.5

+2.63%
BTI

0.9300

62.32

+1.49%
CMSD

-0.0400

22.26

-0.18%
JRI

-0.0300

12.8

-0.23%
VOD

0.2700

15.53

+1.74%
BP

0.1000

42.78

+0.23%
AZN

-3.5300

178.75

-1.97%

Anthropic's Claude AI gets smarter -- and mischievious

TECHNOLOGY 23.05.2025

Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new standards for reasoning but also building in safeguards against rogue behavior.

Text size:

"Claude Opus 4 is our most powerful model yet, and the best coding model in the world," Anthropic chief executive Dario Amodei said at the San Francisco-based startup's first developers conference.

Opus 4 and Sonnet 4 were described as "hybrid" models capable of quick responses as well as more thoughtful results that take a little time to get things right.

Founded by former OpenAI engineers, Anthropic is currently concentrating its efforts on cutting-edge models that are particularly adept at generating lines of code, and used mainly by businesses and professionals.

Unlike ChatGPT and Google's Gemini, its Claude chatbot does not generate images, and is very limited when it comes to multimodal functions (understanding and generating different media, such as sound or video).

The start-up, with Amazon as a significant backer, is valued at over $61 billion, and promotes the responsible and competitive development of generative AI.

Under that dual mantra, Anthropic's commitment to transparency is rare in Silicon Valley.

On Thursday, the company published a report on the security tests carried out on Claude 4, including the conclusions of an independent research institute, which had recommended against deploying an early version of the model.

"We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers’ intentions,” The Apollo Research team warned.

“All these attempts would likely not have been effective in practice,” it added.

Anthropic says in the report that it implemented “safeguards” and “additional monitoring of harmful behavior” in the version that it released.

Still, Claude Opus 4 “sometimes takes extremely harmful actions like attempting to (…) blackmail people it believes are trying to shut it down.”

It also has the potential to report law-breaking users to the police.

The scheming misbehavior was rare and took effort to trigger, but was more common than in earlier versions of Claude, according to the company.

- AI future -

Since OpenAI's ChatGPT burst onto the scene in late 2022, various GenAI models have been vying for supremacy.

Anthropic's gathering came on the heels of annual developer conferences from Google and Microsoft at which the tech giants showcased their latest AI innovations.

GenAI tools answer questions or tend to tasks based on simple, conversational prompts.

The current craze in Silicon Valley is on AI "agents" tailored to independently handle computer or online tasks.

"We're going to focus on agents beyond the hype," said Anthropic chief product officer Mike Krieger, a recent hire and co-founder of Instagram.

Anthropic is no stranger to hyping up the prospects of AI.

In 2023, Dario Amodei predicted that so-called “artificial general intelligence” (capable of human-level thinking) would arrive within 2-3 years. At the end of 2024, he extended this horizon to 2026 or 2027.

He also estimated that AI will soon be writing most, if not all, computer code, making possible one-person tech startups with digital agents cranking out the software.

At Anthropic, already "something like over 70 percent of (suggested modifications in the code) are now Claude Code written", Krieger told journalists.

"In the long term, we're all going to have to contend with the idea that everything humans do is eventually going to be done by AI systems," Amodei added.

"This will happen."

GenAI fulfilling its potential could lead to strong economic growth and a “huge amount of inequality,” with it up to society how evenly wealth is distributed, Amodei reasoned.

E.Lau--ThChM

The China Mail - Anthropic's Claude AI gets smarter -- and mischievious

Anthropic's Claude AI gets smarter -- and mischievious

Featured

SpaceX: Five key moments, from first launch to Starship megarocket

Musk becomes world's first trillionaire as SpaceX shares soar

SpaceX lifts off in record Wall Street debut

SpaceX to make historic IPO that could make Musk a trillionaire