The China Mail - OpenAI releases reasoning AI with eye on safety, accuracy

USD -
AED 3.673021
AFN 64.500085
ALL 81.04013
AMD 377.570287
ANG 1.79008
AOA 916.999994
ARS 1397.0363
AUD 1.411761
AWG 1.8025
AZN 1.703608
BAM 1.646095
BBD 2.014569
BDT 122.333554
BGN 1.67937
BHD 0.37706
BIF 2955
BMD 1
BND 1.261126
BOB 6.911847
BRL 5.198602
BSD 1.000215
BTN 90.656892
BWP 13.115002
BYN 2.867495
BYR 19600
BZD 2.011792
CAD 1.36276
CDF 2239.999889
CHF 0.769655
CLF 0.021703
CLP 856.959793
CNY 6.90065
CNH 6.899875
COP 3671.49
CRC 487.566753
CUC 1
CUP 26.5
CVE 94.875032
CZK 20.441008
DJF 177.720273
DKK 6.297485
DOP 62.625011
DZD 129.608487
EGP 46.847504
ERN 15
ETB 155.049838
EUR 0.84285
FJD 2.190198
FKP 0.732521
GBP 0.734545
GEL 2.689918
GGP 0.732521
GHS 11.005031
GIP 0.732521
GMD 73.505048
GNF 8779.999507
GTQ 7.671623
GYD 209.274433
HKD 7.816415
HNL 26.505018
HRK 6.350102
HTG 130.97728
HUF 319.496499
IDR 16823
ILS 3.063925
IMP 0.732521
INR 90.598499
IQD 1310.5
IRR 42125.000158
ISK 122.380273
JEP 0.732521
JMD 156.251973
JOD 0.709007
JPY 152.839791
KES 129.000569
KGS 87.450047
KHR 4030.0002
KMF 414.99991
KPW 899.988812
KRW 1440.150231
KWD 0.30671
KYD 0.833596
KZT 494.926752
LAK 21445.000487
LBP 85549.999541
LKR 309.456576
LRD 186.550345
LSL 15.859536
LTL 2.95274
LVL 0.60489
LYD 6.302627
MAD 9.138749
MDL 16.94968
MGA 4429.99998
MKD 51.932021
MMK 2100.304757
MNT 3579.516219
MOP 8.054945
MRU 39.900536
MUR 45.90247
MVR 15.460255
MWK 1736.501794
MXN 17.240225
MYR 3.902498
MZN 63.910195
NAD 15.960282
NGN 1352.320109
NIO 36.715003
NOK 9.53845
NPR 145.04947
NZD 1.658005
OMR 0.384491
PAB 1.000332
PEN 3.35497
PGK 4.298499
PHP 58.090162
PKR 279.600947
PLN 3.553435
PYG 6585.896503
QAR 3.64125
RON 4.291018
RSD 98.911047
RUB 77.223079
RWF 1452.5
SAR 3.750374
SBD 8.048395
SCR 13.452269
SDG 601.510014
SEK 8.92871
SGD 1.26305
SHP 0.750259
SLE 24.250324
SLL 20969.499267
SOS 571.501199
SRD 37.77903
STD 20697.981008
STN 20.95
SVC 8.752299
SYP 11059.574895
SZL 15.859726
THB 31.093501
TJS 9.417602
TMT 3.5
TND 2.83525
TOP 2.40776
TRY 43.647699
TTD 6.776109
TWD 31.448974
TZS 2599.999875
UAH 43.023284
UGX 3540.813621
UYU 38.353905
UZS 12299.999861
VES 388.253525
VND 25960
VUV 119.359605
WST 2.711523
XAF 552.10356
XAG 0.013313
XAU 0.000203
XCD 2.70255
XCG 1.802726
XDR 0.686599
XOF 549.501968
XPF 100.750245
YER 238.403969
ZAR 15.973595
ZMK 9001.200595
ZMW 18.555599
ZWL 321.999592
  • RBGPF

    0.1000

    82.5

    +0.12%

  • RYCEF

    -0.0600

    16.87

    -0.36%

  • VOD

    -0.0600

    15.62

    -0.38%

  • CMSC

    0.0000

    23.7

    0%

  • RIO

    -1.6100

    97.91

    -1.64%

  • GSK

    0.0500

    58.54

    +0.09%

  • NGG

    0.5800

    91.22

    +0.64%

  • AZN

    -0.2400

    204.52

    -0.12%

  • BTI

    0.2800

    60.61

    +0.46%

  • BP

    -1.3600

    37.19

    -3.66%

  • RELX

    1.0800

    28.81

    +3.75%

  • BCC

    -1.3500

    88.06

    -1.53%

  • CMSD

    -0.1280

    23.942

    -0.53%

  • JRI

    0.0300

    13.16

    +0.23%

  • BCE

    0.1800

    25.83

    +0.7%

OpenAI releases reasoning AI with eye on safety, accuracy
OpenAI releases reasoning AI with eye on safety, accuracy / Photo: © AFP/File

OpenAI releases reasoning AI with eye on safety, accuracy

ChatGPT creator OpenAI on Thursday released a new series of artificial intelligence models designed to spend more time thinking -- in hopes that generative AI chatbots provide more accurate and beneficial responses.

Text size:

The new models, known as OpenAI o1-Preview, are designed to tackle complex tasks and solve more challenging problems in science, coding and mathematics -- something that earlier models have been criticized for failing to provide consistently.

Unlike their predecessors, these models have been trained to refine their thinking processes, try different methods and recognize mistakes before they deploy a final answer.

The new release comes as OpenAI is raising funds that could see it valued around $150 billion, which would make it one of the world's most valuable private companies, according to US media.

Investors include Microsoft and Nvidia, and could also include a $7 billion investment from MGX, a United Arab Emirates-backed investment fund, The Information reported.

OpenAI CEO Sam Altman hailed the models as "a new paradigm: AI that can do general-purpose complex reasoning."

However, he cautioned that the technology "is still flawed, still limited, and it still seems more impressive on first use than it does after you spend more time with it."

OpenAI's push to improve "thinking" in its model is a response to the persistent problem of "hallucinations" in AI chatbots.

This refers to their tendency to generate persuasive but incorrect content that has somewhat cooled the excitement over ChatGPT-style AI features among business customers

"We have noticed that this model hallucinates less," OpenAI researcher Jerry Tworek told The Verge.

But "we can't say we solved hallucinations," he added.

The Microsoft-backed company said that in tests, the models performed comparably to PhD students on difficult tasks in physics, chemistry and biology.

They also excelled in mathematics and coding, achieving an 83 percent success rate on a qualifying exam for the International Mathematics Olympiad, compared to 13 percent for GPT-4o, its most advanced general use model.

OpenAI said that the new reasoning capabilities could be used for healthcare researchers to annotate cell sequencing data, physicists to generate complex formulas, or computer developers to build and execute multistep designs.

The company also said that the models survived rigorous jailbreaking tests and could better withstand attempts to circumvent its guardrails.

OpenAI said its strengthened safety measures also included recent agreements with the US and UK AI Safety Institutes, which were granted early access to the models for evaluation and testing.

V.Fan--ThChM