The China Mail - China's DeepSeek releases long-awaited new AI model

China's DeepSeek releases long-awaited new AI model / Photo: © AFP/File

Chinese startup DeepSeek released a new artificial intelligence model with "drastically reduced" costs Friday, more than a year after it stunned the world with a low-cost reasoning model that matched the capabilities of US rivals.

The AI race has intensified the rivalry between China and the United States, and the White House on Thursday accused Chinese entities of a massive effort to steal artificial intelligence technology.

Hangzhou-based DeepSeek burst onto the scene in January last year with a generative AI chatbot, powered by its R1 reasoning model, that upended assumptions of US dominance in the strategic sector.

DeepSeek-V4 "features an ultra-long context", the company said in a statement on social media platform WeChat, hailing it as "world-leading... with drastically reduced compute (and) memory costs" in a separate announcement on X.

V4 supports a context length of one million "tokens" -- small components of text including words or punctuation -- putting it on par with Google's Gemini.

Context length determines how much input a model is able to absorb to help it complete tasks.
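
As a rough illustration of what a context limit means in practice, the sketch below checks whether a piece of text fits inside a one-million-token window. The word-and-punctuation splitter is a stand-in assumption: real models use their own subword tokenizers, so actual counts will differ. The function names are hypothetical.

```python
import re

CONTEXT_LIMIT = 1_000_000  # tokens, the context length reported for V4


def rough_tokens(text: str) -> list[str]:
    """Split text into words and punctuation marks (a crude approximation).

    Assumption: real tokenizers work on subword units, not whole words.
    """
    return re.findall(r"\w+|[^\w\s]", text)


def fits_in_context(text: str, limit: int = CONTEXT_LIMIT) -> bool:
    """Return True if the rough token count does not exceed the limit."""
    return len(rough_tokens(text)) <= limit


sample = "Context length determines how much input a model can absorb."
print(len(rough_tokens(sample)), fits_in_context(sample))  # prints: 11 True
```

Anything beyond the limit has to be cut, summarised or split before the model can use it, which is why a longer window matters for large documents.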

The new V4 is released as two versions, DeepSeek-V4-Pro and DeepSeek-V4-Flash, with the latter being "a more efficient and economical choice" because it has fewer parameters.

In terms of "world knowledge", a benchmark for reasoning, V4-Pro trails only the latest Gemini model, DeepSeek said.

A "preview version" of the open source model is now available, the company said, without indicating when a final version would be released.

- 'Inflection point' -

Experts say V4's arrival marks an "inflection point" in terms of hardware and cost.

"This addresses the long-standing issues of slower performance and higher costs associated with long context lengths, marking a genuine inflection point for the industry," Zhang Yi, the founder of tech research firm iiMedia, told AFP.

"For end users, this will bring widespread, accessible benefits. For instance, if ultra-long context support becomes a standard feature, long-text processing is expected to move beyond high-end research labs and enter mainstream commercial applications," he said.

V4-Pro has 1.6 trillion parameters while V4-Flash has 284 billion. Parameters are the internal values a model adjusts during training, and they shape its decision-making ability.
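
For a sense of scale, here is a back-of-the-envelope calculation using only the two reported parameter counts. The storage estimate assumes one byte per parameter, a purely illustrative figure and not a published detail of either model. The helper names are hypothetical.

```python
PRO_PARAMS = 1_600_000_000_000   # 1.6 trillion, as reported
FLASH_PARAMS = 284_000_000_000   # 284 billion, as reported

# Hypothetical precision: 1 byte per parameter (8-bit weights).
# This is an assumption for illustration, not a documented value.
BYTES_PER_PARAM = 1


def size_ratio(larger: int, smaller: int) -> float:
    """How many times bigger the larger model is, by parameter count."""
    return larger / smaller


def weight_storage_gb(params: int, bytes_per_param: int) -> float:
    """Approximate storage for the weights alone, in gigabytes (10^9 bytes)."""
    return params * bytes_per_param / 1e9


print(f"Pro is about {size_ratio(PRO_PARAMS, FLASH_PARAMS):.1f}x the size of Flash")
print(f"Pro weights: ~{weight_storage_gb(PRO_PARAMS, BYTES_PER_PARAM):,.0f} GB")
print(f"Flash weights: ~{weight_storage_gb(FLASH_PARAMS, BYTES_PER_PARAM):,.0f} GB")
```

On these numbers the Pro version is roughly 5.6 times the size of Flash, which is consistent with Flash being pitched as the cheaper option.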

The model has also been "optimised" for popular AI agent products such as Claude Code, OpenClaw, OpenCode and CodeBuddy, the DeepSeek statement said.

DeepSeek's latest release is a "milestone" for Chinese firms, said veteran AI industry analyst Max Liu.

"It's a good thing for the entire domestic AI industry. It can provide better models for domestic users and we can now expect a lot more things -- more products (and a) more competitive market," he told AFP.

"This is no less shocking than when DeepSeek first came out" if its new model indeed matches the performance of leading models from Western labs, he added.

- 'Sputnik moment' -

Last year's so-called "DeepSeek shock" sparked a sell-off of AI-related shares and a reckoning on business strategy in what was also described as a "Sputnik moment" for the industry.

The chatbot performed at a similar level to ChatGPT and other top American offerings, but the company said it had taken significantly less computing power to develop.

However, its sudden popularity raised questions over data privacy and censorship, with the chatbot often refusing to answer questions on sensitive topics such as the 1989 Tiananmen crackdown.

At home, DeepSeek's AI tools have been widely adopted by Chinese municipalities and healthcare institutions as well as the financial sector and other businesses.

This has been partly driven by DeepSeek's decision to make its systems open source, with their inner workings public -- in contrast to the proprietary models sold by OpenAI and other Western rivals.

But the White House has accused Chinese firms of vying to "steal" American technology, ahead of an expected summit between Donald Trump and Xi Jinping in Beijing next month.

"The US has evidence that foreign entities, primarily in China, are running industrial-scale distillation campaigns to steal American AI," Trump's chief science and technology advisor Michael Kratsios said in a post on X.

Distillation is a common practice within AI development, often used by companies to create cheaper, smaller versions of their own models.
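
To make the term concrete, the sketch below shows one common textbook formulation of distillation: a smaller "student" model is trained to match the softened output probabilities of a larger "teacher". It is a generic illustration, not a description of any company's method, and the logit values and function names are made up for the example.

```python
import math


def softmax(logits, temperature=1.0):
    """Turn raw scores into probabilities; higher temperature flattens them."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL divergence from the teacher's softened distribution to the student's.

    The loss is zero when the student reproduces the teacher exactly and
    grows as their predictions diverge.
    """
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


teacher = [4.0, 1.0, 0.5]    # hypothetical scores from a large model
student = [2.5, 1.5, 1.0]    # hypothetical scores from a small model
print(distillation_loss(teacher, student))
```

Training nudges the student's scores to shrink this loss, so the smaller model ends up imitating the larger one's behaviour at lower cost.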

DeepSeek's Friday announcement also came as Meta said it planned to cut a tenth of its staff as it looks for productivity gains from the rest of the workforce while investing heavily in artificial intelligence. Reports said Microsoft was also looking to trim its ranks.

Q.Yam--ThChM