TheRegister
AMD puts out new slottable GPU for AI-curious enterprises
AMD hopes to win over enterprise AI customers with a more affordable datacenter GPU that can drop into conventional air-cooled servers. Announced on Thursday, the MI350P is the House of Zen’s first PCIe-based Instinct accelerator since the MI210 debuted all the way back in 2022. Until now, AMD’s best GPUs have only been available in packs of eight and used socketed OAM modules that weren’t compatible with most server platforms. By comparison, The MI350P can slot into just about any 19-inch pizza box design that offers enough power and airflow, making it a much easier sell for enterprises dipping their toes into on-prem AI for the first time. The 600-watt, dual-slot card is essentially a MI350X that’s been cut in half. That means the CNDA-based GPU is packing 4.6 petaFLOPS of FP4 compute and 144 GB of VRAM spread across four HBM3e stacks delivering a respectable 4 TB/s of memory bandwidth. AMD supports configurations ranging from one to eight MI350Ps, though a lack of high-speed interconnects on these cards means it’ll be limited to PCIe 5.0 speeds (128 GB/s) for chip-to-chip communications, potentially limiting its potential in larger models. AMD hasn’t shared pricing for the cards just yet, but at least on paper, the MI350P is well positioned to compete with either Nvidia’s H200 NVL or RTX Pro 6000 Blackwell PCIe cards. Compared to the 141 GB H200, the MI350P promises about 38 percent higher peak performance at FP8, while eking out a narrow VRAM capacity advantage. But the H200 does pull ahead when it comes to memory bandwidth. With six HBM3e stacks to the MI350P’s four, the nearly two-year-old card’s memory is still about 20 percent faster. Nvidia's H200 also supports high-speed chip-to-chip communications over NVLink, while the MI350P doesn’t use AMD’s equivalent Infinity Fabric interconnect. However, all this assumes you can still find H200 NVLs in the wild. Since last summer, Nvidia has been pushing its RTX Pro 6000 Server cards on enterprise customers. As of writing, the card is Nvidia’s most powerful Blackwell-based accelerator offered in a PCIe formfactor. Compared to the RTX Pro 6000, the MI350P’s price starts becoming a bigger factor than performance. Workstation versions of the RTX Pro, which ditch the passive cooler for an active one, routinely sell for between $8,000 to $10,000 apiece, making it one of Nvidia’s more affordable datacenter-class GPUs. Depending on how pricing shakes out, AMD may have to push hard to be competitive. Having said that, the MI350P is still the better-specced part, delivering 2.3x higher peak flops, 2.5x the memory bandwidth, and 50 percent more vRAN of the RTX Pro. Now, this all assumes peak FLOPS and memory bandwidth, which is rarely realistic. The tensors used by AI workloads are rarely the ideal shape for squeezing the maximum number of FLOPS out of a chip. This is why we run for Maximum Achievable MatMul FLOPS (MAMF) and Babel Stream memory bandwidth benchmarks as part of our AI test suite. AMD seems to understand that peak FLOPS don’t really translate cleanly into real-world performance, and in the marketing materials shared with El Reg prior to publication, compared the MI350P’s theoretical performance against its real-world delivered performance. It’d be nice to see Nvidia and others adopt similar practices regarding accelerator performance claims, though we suspect getting everyone to agree on the best way to measure this might not be easy. The MI350P’s launch comes as AMD prepares to address a very different and likely more lucrative segment with its first rack-scale compute platform, codenamed Helios. That system is due out in the second half of the year, and is aimed primarily at large hyperscale and neocloud deployments. The system packs 72 of its all-new MI455X GPUs into a single double-wide OCP rack that behaves like an enormous accelerator. The platform will be AMD’s first crack at Nvidia’s NVL72 racks, which launched alongside its Blackwell generation nearly two years ago. ®
Categories: Linux fréttir
Hungarian cops cuff suspected swatter after two-year FBI probe
20-year-old fessed up after investigators found video of crime in progress
Categories: Linux fréttir
EU hits snooze on AI Act rules after industry backlash
Brussels says it's simplification, critics may call it retreat
Categories: Linux fréttir
NHS code clampdown draws open source backlash
Plus a petition for the UK Civil Service to go FOSS by default
Categories: Linux fréttir
The network password was a key plot point in one of the most famous movies of all time
Fortunately, it was a legit contractor who guessed it
Categories: Linux fréttir
Chrome silently installs a 4 GB local LLM on your computer
You did remember to opt out of AI, didn't you?
Categories: Linux fréttir
Home Office seeks three CTOs to keep borders, passports, and core IT ticking
Roles span eGates, passports, visas, asylum applications, and enterprise services – yours for up to £105K
Categories: Linux fréttir
Minister gives Palantir's NHS platform a clean bill of health
£330M contract defended as value for money despite concerns over IP and lock-in
Categories: Linux fréttir
Neocloud IREN buys OpenStack champion Mirantis
Former bitcoin miner plans to build an easier cloudy AI on ramp while remaining a friend to FOSS
Categories: Linux fréttir
Datacenter to become Arm’s biggest business ‘soon’
Someone other than Meta is buying $1bn of its new AGI chips
Categories: Linux fréttir
Using AI to click around on a website burns 45x as many tokens as just using APIs
For AI agents, seeing is expensive
Categories: Linux fréttir
Young evil genius forces hamster to run on wheel to power his gadgets
Okay, the rodent was a willing participant - after all, who turns down treats for a spin that charges a phone?
Categories: Linux fréttir
Claude hitches ride on SpaceX's datacenter capacity
Compute from Colossus leads to relaxed limits
Categories: Linux fréttir
Musk has never built a wafer fab, but he wants to burn $119B on one anyway
Initial phases of SpaceX's Terafab project in rural Texas are expected to cost about 1.25 Twitters
Categories: Linux fréttir
Arctic Wolf kicks 250 employees out of the pack to save money for AI
Cuts appear to hit sales, product, and marketing, accounting for under 10% of staff
Categories: Linux fréttir
1 in 8 employees totally cool with selling work credentials
13% say they’ve sold logins or know someone who has, survey suggests
Categories: Linux fréttir
We've only gone and done it: Changed what you're used to
A new coat of paint
Categories: Linux fréttir
DRAM drought to dog AMD's chips this year
Commercial PC demand expected to cushion broader slowdown
Categories: Linux fréttir
Iran cybersnoops still LARPing as ransomware crooks in espionage ops
MOIS-linked cyber outfit puts on a ransomware show to disguise the wide-open backdoor behind the scenes
Categories: Linux fréttir
