The China Mail - Graid Technology Launches Agentic AI Storage Portfolio to Eliminate KV Cache Bottlenecks

USD -
AED 3.672504
AFN 63.000368
ALL 82.732897
AMD 367.370222
ANG 1.790403
AOA 917.000367
ARS 1478.086972
AUD 1.450326
AWG 1.80125
AZN 1.70397
BAM 1.716442
BBD 2.015885
BDT 123.112028
BGN 1.69088
BHD 0.377375
BIF 2972.662249
BMD 1
BND 1.295099
BOB 6.916495
BRL 5.177041
BSD 1.000921
BTN 93.946202
BWP 13.602176
BYN 2.902892
BYR 19600
BZD 2.012989
CAD 1.41895
CDF 2267.50392
CHF 0.80956
CLF 0.023471
CLP 922.497696
CNY 6.79815
CNH 6.804685
COP 3438.325508
CRC 454.429769
CUC 1
CUP 26.5
CVE 96.770372
CZK 21.30904
DJF 178.235113
DKK 6.565804
DOP 58.809075
DZD 133.424898
EGP 49.530036
ERN 15
ETB 161.36601
EUR 0.877704
FJD 2.266104
FKP 0.756395
GBP 0.757518
GEL 2.64504
GGP 0.756395
GHS 11.285269
GIP 0.756395
GMD 73.000355
GNF 8770.020624
GTQ 7.63614
GYD 209.469481
HKD 7.84255
HNL 26.780464
HRK 6.617804
HTG 130.8175
HUF 310.850388
IDR 17860.6
ILS 3.00205
IMP 0.756395
INR 94.360504
IQD 1311.158892
IRR 1375250.000352
ISK 126.490386
JEP 0.756395
JMD 157.637457
JOD 0.70904
JPY 161.75504
KES 129.518627
KGS 87.450384
KHR 4017.727851
KMF 434.00035
KPW 900.00035
KRW 1535.290383
KWD 0.30961
KYD 0.834087
KZT 485.637808
LAK 21969.371188
LBP 89630.523498
LKR 336.443021
LRD 182.31603
LSL 16.452675
LTL 2.95274
LVL 0.60489
LYD 6.42503
MAD 9.385493
MDL 17.746281
MGA 4233.621484
MKD 54.091886
MMK 2099.386013
MNT 3578.909161
MOP 8.085217
MRU 39.945588
MUR 47.250378
MVR 15.450378
MWK 1735.574181
MXN 17.504204
MYR 4.088039
MZN 63.903729
NAD 16.452675
NGN 1376.130377
NIO 36.83356
NOK 9.933039
NPR 150.313748
NZD 1.771166
OMR 0.384504
PAB 1.000921
PEN 3.41305
PGK 4.39247
PHP 61.312038
PKR 278.550353
PLN 3.76695
PYG 6109.087718
QAR 3.648427
RON 4.603104
RSD 103.014612
RUB 78.910966
RWF 1465.794901
SAR 3.758743
SBD 8.051953
SCR 14.057835
SDG 600.000339
SEK 9.73761
SGD 1.294204
SHP 0.746601
SLE 24.803667
SLL 20969.503664
SOS 572.030366
SRD 37.483038
STD 20697.981008
STN 21.501602
SVC 8.757734
SYP 110.532098
SZL 16.443021
THB 33.378038
TJS 9.263329
TMT 3.5
TND 2.966607
TOP 2.40776
TRY 46.553304
TTD 6.802405
TWD 31.859804
TZS 2632.322612
UAH 44.926675
UGX 3673.702225
UYU 40.177279
UZS 12022.46698
VES 620.752985
VND 26300
VUV 119.628449
WST 2.780038
XAF 575.678617
XAG 0.017058
XAU 0.000246
XCD 2.70255
XCG 1.803853
XDR 0.715959
XOF 575.678617
XPF 104.664531
YER 238.625037
ZAR 16.987795
ZMK 9001.203584
ZMW 18.029751
ZWL 321.999592
  • CMSC

    -0.1160

    21.93

    -0.53%

  • CMSD

    -0.1600

    21.77

    -0.73%

  • BCC

    1.2600

    81.02

    +1.56%

  • NGG

    -0.4100

    83.01

    -0.49%

  • RYCEF

    0.3900

    18.39

    +2.12%

  • VOD

    0.0300

    13.89

    +0.22%

  • BCE

    -0.2800

    22.92

    -1.22%

  • JRI

    0.2100

    12.79

    +1.64%

  • RIO

    -1.3700

    93.74

    -1.46%

  • RBGPF

    3.7000

    65

    +5.69%

  • RELX

    0.4200

    31.34

    +1.34%

  • GSK

    0.6100

    52.5

    +1.16%

  • AZN

    2.7300

    188.41

    +1.45%

  • BTI

    0.2800

    62.76

    +0.45%

  • BP

    -0.5900

    37.13

    -1.59%

Graid Technology Launches Agentic AI Storage Portfolio to Eliminate KV Cache Bottlenecks
Graid Technology Launches Agentic AI Storage Portfolio to Eliminate KV Cache Bottlenecks

Graid Technology Launches Agentic AI Storage Portfolio to Eliminate KV Cache Bottlenecks

From edge inference to NVIDIA STX, purpose-built KV cache infrastructure for consistent performance at scale.

Text size:

SUNNYVALE, CA / ACCESS Newswire / April 21, 2026 / Graid Technology, the pioneer in GPU-accelerated NVMe storage, today announced its Agentic AI Storage Portfolio: a purpose-built family of KV cache solutions designed to eliminate the storage bottleneck that stalls "always-on" production AI. The portfolio spans three deployment tiers: KV Cache Server, KV Cache Rack, and KV Cache Platform, all built on SupremeRAID technology. KV Cache Platform, the portfolio's highest tier, is purpose-aligned to NVIDIA's STX reference architecture, with native BlueField-4 DPU execution on the roadmap for H2 2026.

As agentic AI moves from experimentation to production, the infrastructure assumptions that underpinned single-shot inference have broken down. Models running continuous multi-step tasks and maintaining context across hours of operation generate KV cache demands that overwhelm GPU HBM. The result: latency spikes up to 18x, GPU utilization as low as 50%, and model-level failures, including hallucinations and reasoning degradation, that are difficult to detect and costly to recover from.

SupremeRAID addresses this directly, aggregating up to 32 NVMe drives into a single 280 GB/s virtual pool, bypassing the CPU via GPU Direct Storage, and delivering KV cache reads at 1.3ms- 77x faster than standard NVMe. The three portfolio tiers bring this capability to every deployment scale:

KV Cache Server - single-node NVMe acceleration for individual inference servers and edge AI deployments. Available now.

KV Cache Rack - rack-scale, partner-validated solutions co-engineered with leading server OEM partners for enterprise multi-GPU clusters. Available now.

KV Cache Platform - Purpose-built for NVIDIA's STX reference architecture, with native BlueField-4 DPU execution and rack-scale storage expansion on the roadmap.

"A year ago, at GTC 2025, Jensen Huang predicted that storage would become GPU-accelerated for the first time. This year, NVIDIA turned that concept into an architecture with STX and CMX," said Leander Yu, CEO of Graid Technology. "Our KV Cache Portfolio is built for precisely this moment, delivering the storage performance that agentic AI demands, at storage-tier economics."

For enterprises and infrastructure teams evaluating agentic AI deployments, the full deployment architecture, technical specifications, and NVIDIA STX compatibility details are available in the solution brief: Graid Technology Agentic AI Storage Portfolio: Purpose-built KV Cache Solutions for Inference at Scale

To learn more about Graid Technology's AI offerings, visit graidtech.com/ai

Media Inquiries:
Andrea Eaken, Sr. Director of Marketing, Americas & EMEA
[email protected]

____________________________________

About Graid Technology
Graid Technology is building the storage backbone for the future of AI, enterprise, and high-performance computing. As the creator of SupremeRAID, the world's first and only GPU-based RAID, and the global steward of Intel® Virtual RAID on CPU (Intel® VROC), Graid Technology delivers flexible RAID solutions that maximize NVMe performance while ensuring resilient, scalable data protection for modern data infrastructure. Headquartered in Silicon Valley with global operations and R&D in Taiwan, Graid Technology is advancing RAID innovation for the next generation of data-intensive workloads. To learn more, visit graidtech.com.

SOURCE: Graid Technology Inc.



View the original press release on ACCESS Newswire

A.Sun--ThChM