TheRegister

Subscribe to TheRegister feed
Articles from www.theregister.com
Updated: 33 min 5 sec ago

Open Compute urges local government to bask in the warm glow of excess datacenter heat

18 hours 4 min ago
The Open Compute Project plans to deliver more guidance to local governments on how excess heat from datacenters can benefit their communities. The project develops open-source and energy-efficient hardware for datacenter operators. Meta, Microsoft, and Google are all top-tier Platinum members, and are also all building datacenters as fast as they can, to house AI infrastructure. Those builds have become controversial. Residents in communities flagged as sites for new bit barns have protested the quantity of water and energy they will consume, their potential to drive up prices for both, and the noise they emit. Some may be aware that datacenters create urban heat islands. Protests about new datacenters have aleady turned violent. Governments have sometimes acknowledged concerns by implementing a moratorium on big builds, but on other occasions have indicated they might fast-track developments and brush aside red tape. Into that febrile environment strode David Gardiner, Otto Van Geet, Jaime Comella, and Bharath Ramakrishnan, all of whom have participated in the OCP’s heat reuse group, with a Wednesday post extolling the virtues of datacenters when local governments are smart enough to tap excess heat that bit barns produce. “Reusing datacenter waste heat presents a significant opportunity to provide carbon-free heating across a wide array of sectors, delivering substantial environmental, economic, and social benefits,” the post states. They’re right. El Reg has reported on heat reuse helping to heat homes and grow vegetables. A swimming pool used during the Paris Olympics relied on heat from a nearby Equinix datacenter to keep its waters warm. The OCP post laments the fact that local governments lack awareness about how they can tap excess datacenter heat. The authors also point to “a lack of connections between datacenters and nearby heat users [and] supranational, national, and sub-national policy to incentivize these projects,” and suggest that smart local governments will make heat reuse a requirement before greenlighting datacenter builds. The post also, however, admits cost justification of datacenter heat reuse projects can be a challenge. The heat reuse group’s Wiki includes form letters and other material it hopes activists will send to regulators to encourage them to consider use of heat recovery systems. Providing that sort of material is a very common lobbying tactic. That OCP feels a need to highlight the availability of the resources at a time its members face opposition to their datacenter building plans is therefore more remarkable than the advice itself. ®
Categories: Linux fréttir

SpaceX pitches itself as integrated interplanetary proto-monopolist in IPO filing

20 hours 59 min ago
SpaceX has filed paperwork for its long-awaited initial public offering and made the argument that its unique combination of activities and “focus on extreme vertical integration” make it worthy of investors’ cash despite hefty losses. The company’s filing offers the usual Muskian tropes about the company’s purpose being to ensure human consciousness can reach the stars, a voyage that’s only possible with cheap reusable rockets, powerful AI operating in space where energy is abundant, and efficient manufacturing at astronomical scale. Only SpaceX, the document argues, can do it all. The company says it became the world’s premier launch outfit by operating vertically integrated design and in-house manufacturing that means it can crank out the parts needed to build and improve its rockets faster than anyone else – and without being dependent on slow-moving supply chains. That manufacturing capability means it can also build satellites faster than anyone else, as it has demonstrated when creating its Starlink space broadband network. In the datacenter, SpaceX's strategies let it create high-density rack-scale systems and associated technologies that allow it to build and deploy infrastructure faster than anyone else. The company will combine and re-use its satellite and compute infrastructure smarts to build orbiting datacenters that use Starlink to communicate with Earth. “Our high-throughput manufacturing capabilities – combined with our launch capabilities – enable us to produce and deploy thousands of satellites per year, an uneconomic proposition for those lacking an ability to deliver substantial mass into space,” the filing argues. “This capability accelerates our deployment timelines and allows us to commercialize entire constellations with capital efficiency that we believe is difficult to replicate.“ We have the benefit of being founded and led by Elon Musk, one of the great visionaries of our generation The company will keep its operations efficient using AI it also builds and runs, fuelled by data gathered from social network X. “With approximately 350 million daily posts, X enables freshness, relevance, and contextual awareness for Grok that we believe is a competitive differentiator,” the filing states. SpaceX isn’t done combining its businesses. “We intend to further extend our vertical integration to chip design and manufacturing to alleviate potential future chip shortages at SpaceX, optimize compute performance, and potentially reduce overall compute costs,” the filing states. The filing describes SpaceX as using “Business Models that Are Incredibly Difficult to Replicate.” Those business models are also struggling to produce a profit as the filing reveals SpaceX’s most recent full-year revenue (FY 2025) was $18.674 billion, which produced a $4.9 billion loss. The first quarter of FY 2026 saw the company lose $4.3 billion on $4.7 billion of revenue. As with all such documents, SpaceX’s IPO filing spells out risks the company faces – including many admissions its ideas may not work or take longer to realize than hoped. The company has proven that many times after Elon Musk promised exciting new products and technologies are imminent, then delivered them years later than his initial deadline. But the overall tone of the filing is extreme optimism as SpaceX claims it has a total addressable market (TAM) of $28.5 trillion – not far short of the USA’s gross national product. The filing breaks down that TAM into “$370 billion in Space from space-enabled solutions; $1.6 trillion in Connectivity across $870 billion in Starlink Broadband and $740 billion in Starlink Mobile as well as additional opportunities in enterprise and government; $26.5 trillion in AI across $2.4 trillion in AI infrastructure, $760 billion in consumer subscriptions, $600 billion in digital advertising, and $22.7 trillion in enterprise applications.” That last figure is vastly larger than the entire IT business today. The filing also states “We have the benefit of being founded and led by Elon Musk, one of the great visionaries of our generation,” and reveals that after the IPO he will serve as “Chief Executive Officer, Chief Technical Officer, and Chairman of our board and control the election of our directors.” The shareholding structure proposed by the IPO “concentrates voting control with Mr. Musk and other holders of our Class B common stock. This will limit or preclude your ability to influence corporate matters and the election of our directors.” That leaves would-be investors betting on Musk, and SpaceX getting an awful lot right. But the filing gives plenty of reasons to believe the company can fly, by citing its many past successes and firsts. However, it filing appeared just a few hours before X admitted it has not explained to Australian regulators how it stops publication of child sexual exploitation material on the platform, earning it a AUD$650,000 fine ($462,000). Perhaps a bit more integration from the ethics and legal teams are needed before IPO liftoff? ®
Categories: Linux fréttir

Nvidia on track to be worlds leading CPU supplier claims CFO

23 hours 20 min ago
Already the planet's largest supplier of GPUs, Nvidia now intends to conquer the CPU market. “We have visibility to nearly $20 billion in total CPU revenue this year, setting us up to become the world’s leading CPU supplier,” Nvidia CFO Colette Kress said during the company’s Q1 2027 earnings call on Wednesday. Nvidia is no stranger to CPUs having announced its first Arm datacenter chip, codenamed Grace, back in 2021. However until recently the company integrated most of these parts into GPU systems that users almost always deployed in AI datacenters and supercomputers. That changed in February when Nvidia revealed Meta was among the first hyperscalers now deploying standalone Grace CPU Superchips in its datacenters to power a variety of workloads including the Social Network’s AI agents. At its GTC conference in March, Nvidia officially expanded its CPU line up to include a standalone Vera CPU system. Each chip features 88 custom Olympus Arm cores with support for simultaneous multi-threading (SMT) — that’s Hyperthreading in Intel speak — along with confidential computing capabilities. Nvidia can equi[ each chip with up to 1.5 TB of LPDDR5x SOCAMM memory, which offers higher memory bandwidth at up to 1.2 TB/s and uses little power (which is why it's often used in laptops). “Vera will deliver up to 1.5x faster performance per core, 2x performance per watt, and 4x density per rack compared to x86-based alternatives,” Kress claimed. Nvidia’s reference designs pack up to two Vera CPUs onto a single board and via high-speed NVLink interconnects. Nvidia’s Vera is also paired in a 2:1 ratio of Rubin GPUs to CPUs in its most powerful rack-scale AI compute platforms. Since the chip was detailed this spring, Kress claims nearly every major hyperscaler and system builder plans to deploy the chips. This week, several top AI labs and hyperscalers, including Anthropic, OpenAI, Oracle, and SpaceX took delivery of Nvidia’s first Vera-based systems. “Vera CPU opens a brand new $200 billion TAM for Nvidia, a market we have never addressed before,” she said. While Nvidia is expanding its addressable market to include standalone CPUs, it should be noted that much like the company’s Ethernet networking products, they’re designed primarily with AI and HPC applications in mind. The chips can’t replace x86 processors in every application, yet. Kress’ comments come as Nvidia caps off a strong end to the first quarter of its 2027 fiscal year. The GPU giant raked in $58.3 billion in profits on $81.6 billion in revenue for the quarter, the latter of which grew 85 percent YoY and 20 percent from the prior quarter. Kress attributed the sequential jump to an “inflection in inference demand.” The quarter saw Nvidia change how it breaks out revenues. The company’s business units have now been organized into a datacenter group which includes cloud, hyperscale, neocloud and enterprise sales, plus an edge group, which serves as a catchall for gaming, robotics, automotive, and vRAN products. Datacenter revenues accounted for the vast majority of revenues, at $75.2 billion. Of that $38 billion came from hyperscaler and public cloud customers, while neocloud, industrial, and enterprise customers paid the remaining $37 billion. Edge sales accounted for a mere $6.4 billion, with the company citing demand for Blackwell-based workstation gear as a key driver. Looking ahead to Q2, Nvidia forecast revenue will hit $91 billion plus or minus two percent. That prediction assumed no datacenter sales in China. Nvidia has been trying for months to reignite its GPU business in the Middle Kingdom since Uncle Sam gave the company the green light to sell its aging H200 processors to Chinese customers for the first time ever back in December. Despite receiving approval from the Trump administration and receiving billions of dollars worth of orders, shipments remain stuck in Beijing’s red tape. ®
Categories: Linux fréttir

AMD says its $4K Ryzen AI Halo workstation practically pays for itself

23 hours 43 min ago
AMD’s answer to Nvidia’s DGX Spark AI workstations, codenamed the Ryzen AI Halo, will be available for pre-order later next month for anyone with $3,999 burning a hole in their pocket. That might sound like a lot for an AI mini PC, but don’t worry. Compared to cloud APIs, it practically pays for itself. Or, well, that’s AMD’s sales pitch. The House of Zen argues that if you spend eight hours a day vibe coding, the system could save you $750 a month. Whether this helps you justify paying for hardware that less than a year ago could be found for between $2,200 and $2,999 or not, it’s (probably) not AMD being greedy here; the RAMpocalypse has been hard on everyone. Much like the DGX Spark, which now retails for $4,699, up from $3,999 when we reviewed it last fall, AMD’s rendition aims to provide a curated developer environment for running local models and agentic AI frameworks. This is really the core value proposition behind both of these devices. They aren’t the most powerful or the fastest AI systems, but they’re able to run models that a few years ago would have cost $20K or more. A little box of TOPS The diminutive system measures in at 5.9 x 5.9 x 1.7 inches (150 x 150 x 43 mm) and is powered by a 120 watt Ryzen AI Max+ 395 APU, better known by its codename Strix Halo. The chip is backed by 128 GB of LPDDR5x 8000 MT/s memory, which feeds both its 16 Zen 5 cores and 40 RDNA 3.5 GPU compute units, providing up to 256 GB/s of bandwidth, more than a Ryzen 9000 Threadripper (non-Pro) system. For local AI enthusiasts, that’s enough to run models up to 200 billion parameters in size at 4-bit precision — just like the more expensive Spark. The bulk of the Ryzen AI Halo’s compute comes from its integrated graphics, which are capable of delivering roughly 56 teraFLOPS at 16-bit precision. While impressive for onboard graphics, that’s still between 55 and 88 percent slower than what the DGX Spark advertises. Unlike the Spark’s Blackwell-based GB10 APU, Strix Halo doesn’t support FP8 or FP4 data types in hardware. At BF16, the Spark delivers 125, at FP8 250, and FP4 500 teraFLOPS. Double those figures if you happen to find a workload that can leverage Nvidia’s 4:2 sparsity. That performance discrepancy won’t necessarily be obvious in every workload. In fact, in LLM inference, AMD claims the AI Halo generates tokens 4-14 percent faster than the Spark. The lower end of that roughly matches what we saw when we pitted the Spark against a similarly equipped HP Z2 Mini G1a back in December. The G1a packs the same silicon as AI Halo, and in Llama.cpp with the Vulkan backend, eked out a small but meaningful lead over the Spark in tokens per second generated. However, the speed any GPU can generate tokens at is largely dictated by effective memory bandwidth, not floating point performance. GPU compute has a much bigger impact on things like prompt processing time. In our testing, the Spark’s more capable tensor cores gave it a 2x to 3x lead in prompt processing. For shorter prompts, this isn’t all that noticeable, usually the difference between waiting 100 ms versus 200 ms or 300 ms, but for longer prompts, it did become more pronounced. We saw the Spark take similar leads in our image generation and fine tuning benchmarks, but it’s worth noting that AMD’s software stack has matured greatly since our initial review and the performance gap has likely closed somewhat since then. AMD’s AI Halo does have two things going for it that can’t be said of the Spark. Alongside the GPU is an XDNA 2-based neural processing unit (NPU) that AMD rates for 50 TOPS. What good that’ll do you depends heavily on the application in question. Many content creation apps have now been updated to take advantage of it, but the number of generative AI inference engines that could properly harness it was quite limited the last time we looked. The second thing AMD’s Ryzen AI Halo has going for it is that it’s a standard x86 box at its heart, and you can run Windows or your preferred flavor of Linux on it if that’s more your style. On the Spark, you’re stuck with a lightly customized version of Ubuntu 24.04. Beyond that, you’re coloring outside the lines. Particularly for developers building for Microsoft’s NPU-accelerated AI PC ecosystem, this is an obvious advantage. In terms of networking, AMD’s Spark-clone falls a bit flat. One of the hallmark features of Nvidia’s AI workstation is a 200 Gbps ConnectX-7 NIC, which allows for clustering of up to two and eventually four systems. AMD’s AI Halo has a single 10 Gbps NIC, which should help with downloading large model files in a timely manner. In theory, the system should be able to achieve high-speed networking over USB-4, but it’s not clear whether this is actually a supported use case. That said, Apple has already demonstrated just this using RDMA over Thunderbolt, so it should work so long as AMD has a playbook for configuring RDMA on its systems. AMD’s own AI lab As we mentioned earlier, much of the Ryzen AI Halo’s value proposition comes from being validated hardware with well documented playbooks for common use cases and known good software. Finding the right combination of device drivers, ROCm, HIP, SYCL, CUDA, PyTorch, TensorFlow, and JAX has long plagued the AI/ML devs, regardless of which ecosystem you opt for. Having validated environments for workloads, whether it be vLLM, Llama.cpp, Ollama, ComfyUI, or something else ensures users spend more time doing something productive than debugging mismatched dependencies. At launch, AMD says the Ryzen AI Halo will ship with five preinstalled playbooks, with another 10 available online and additional playbooks to be added monthly. Additionally, customers will gain access to AMD’s developer program, cloud credits, and exclusive playbooks. More memory on the way The 128 GB Ryzen AI Halo will be available for pre-order next month starting at $3,999, but if that isn’t enough for you, AMD is already prepping a higher capacity version of the system with 192 GB of memory on board. That system will feature a refreshed Ryzen APU in the AI Max+ 495, which just like the rest of AMD’s 400-series lineup gets a modest clock bump to the CPU, GPU, and NPU, and not a whole lot else. Still, 192 GB of unified memory opens the door to even larger, more capable models, if you can stomach the presumably higher asking price. ®
Categories: Linux fréttir

Intuit axes 3,000 – without blaming AI

Wed, 2026-05-20 23:35
Intuit has cut its full time workforce by 17 percent and is considering closing offices in some markets “to become “faster, leaner, and more focused,” company CEO Sasan Goodarzi told investors during a Wednesday earnings call. “This was not about AI,” Goodarzi said, before explaining that over the last year company management has studied the question "beyond the tools that we are putting in place across the company, what is actually the biggest blocker and what is getting in our way?" One of the answers was that Intuit had too many layers of management. Goodarzi said doing so will "reduce the complexity of information flow of ... so we can push decision making to our frontline folks that are the builders.” The CEO said Intuit also decided to cut in what he described as “coordination-heavy” roles such as project managers and business operations jobs that have become less necessary due to the speed at which the remaining teams can build products. He said Intuit also merged TurboTax and Credit Karma as a business unit so some of the cuts resulted from overlaps within that group. The move to fire over 3,000 employees comes as the company said it spent $3.4 billion in stock repurchases during the previous nine months ended April 30. Intuit’s board of directors also mandated the company lean in to share buybacks as it authorized an additional $8 billion to be spent on Intuit stock at the discretion of management and the board. The job cuts are expected to cost the company about $340 million in restructuring charges, with much of that coming in the form of severance payments, according to SEC filings Intuit published Wednesday afternoon. “A big chunk of this, you can count on it to go to margin expansion and EPS growth, and a smaller part is going to be scaling the growth engines because we feel good that the growth engines are funded quite well, just because of the productivity we see internally,” Goodarzi told investors. Several victims of the cuts posted their story to LinkedIn and many of them were, as Goodarzi said, in product, or project management positions. “I would like to thank all my teammates, both past and present, who have made the job such a fulfilling experience. I’d like to thank each and every one of the managers who have helped guide me along my path from entry-level technical support to Staff Engineer,” wrote one employee who said he had been with the company for 13 years. The cuts also included a senior sales and productivity analyst who has been with Intuit for 25 years, a software engineer with nine years at the company, and an engineering team leader with two years in that role, according to LinkedIn posts. In addition to reducing the number of managers, Goodarzi the cuts would also serve to “rightsize” staffing for Intuit’s email marketing product, Mailchimp. One customer experience leader in that organization wrote on LinkedIn that after more than eight years it was hard to believe he would no longer work for Mailchimp. “Mailchimp... it's ridiculous to get emotional over a piece of software, but if folks understood what this company meant to the city of Atlanta. It was our brand right alongside Coca-Cola, Delta, and The Home Depot ... it was OUR tech company,” he wrote. “It was also a company willing to take a risk on folks … I'm still processing that Mailchimp and Intuit will no longer be my home. Keep an eye out for me if you don't mind.” Goodarzi said the cuts were not in response to any underperformance, but were deliberate actions to scale its “growth engine and strengthen its core.” “We are at an important inflection point,” he said. “To fully capitalize on this opportunity, we must operate with greater velocity, urgency, and discipline. These deliberate actions are about scaling our growth engine and strengthening our core. We’re sharpening our cost structure to deliver durable long-term growth and margin expansion.” ®
Categories: Linux fréttir

AI code accelerates production failures and spending, study finds

Wed, 2026-05-20 22:43
The rapid adoption of AI-generated code is driving production failures and higher costs for enterprise customers. Eighty-one percent of enterprise technology leaders among more than 200 surveyed reported an increase in production issues linked to AI-generated code, according to a study published by enterprise software delivery biz CloudBees. Sunil Gottumukkala, CEO of Averlon, an agentic vulnerability remediation biz, told The Register in an email that these issues tend to refer to functionality bugs, performance issues, availability problems, and security vulnerabilities rather than CI/CD failures. "These are issues that surface after code has already been deployed to production, which means the code passed every review and deployment gate and still broke things," said Gottumukkala. "When failures happen post-deployment, it signals that the validation process itself isn’t keeping pace with what AI is producing." Yet 92 percent of respondents expressed confidence that their code was production-ready before it shipped. Jacob Krell, senior director of secure AI solutions and cybersecurity at Suzu Labs, told The Register in an email that the report does not isolate what specifically failed at these organizations. "It spans functional defects, security vulnerabilities, and compliance violations that reach production because governance and validation have not scaled with output," he said. "The same study found 69 percent citing security vulnerabilities and 63 percent citing compliance issues introduced by AI generated code specifically." Krell said what ties them together is the verification gap. "AI generates code faster than teams can validate it," he said. "Seventy percent of respondents now say test suite maintenance is a larger burden than writing code itself. These are not system crashes in the traditional sense. They are the full spectrum of what reaches production when volume outpaces the capacity to verify quality, security, and compliance before deployment." Respondents said 61 percent of their organizations' code has been generated by AI or has come into being with AI assistance. And 64 percent of the engineering organizations involved say AI is widely or fully integrated into their workflows. The result is that more than half (52 percent) of those surveyed report an uptick in software development output. And while 68 percent of organizations appear to be convinced AI is delivering business value, only 31 percent of AI-related spending can be linked to specific business results. In 36 percent of organizations, AI spending is tracked without measuring the return on investment or isn't tracked at all. With more code comes more cost from infrastructure spending, in the form of increased CI/CD, testing, and security scanning. Some 54 percent of respondents said CI/CD infrastructure spending has risen significantly in the past 12 months, and 53 percent flagged rising testing, security, and deployment costs. Only 45 percent of respondents say these costs are predictable quarter to quarter. Yet relatively few organizations have taken steps to control AI spending: 27 percent report quotas or limits on token usage, while just 18 percent have automated spending controls. And this is a problem without ownership. Just 12 percent of organizations have dedicated AI governance. For 46 percent, the buck stops with the CTO or VP of engineering when there's a production failure. For 32 percent, blame falls on the engineering lead or team associated with the tool that messed up the code. For 7 percent, the developer who shipped the pull request takes the heat. It may be tempting to take comfort in the fact that 93 percent of respondents say their organization has a formal process for reviewing and releasing AI-generated code. But keep in mind that only 56 percent of survey takers say those processes are always enforced. ®
Categories: Linux fréttir

Even Claude agrees: hole in its sandbox was real and dangerous

Wed, 2026-05-20 20:34
Two now-patched bypass bugs in Claude Code’s network sandbox put users at risk, and one of these allows baddies to send anything inside the sandbox - credentials, source code, other private data - to any server on the internet, according to a researcher who found and reported both flaws to Anthropic. Aonan Guan, who leads cloud and AI security at Wyze Labs and has hunted down bugs in pretty much every AI system out there, told The Register that this is the second time in five months Anthropic has silently fixed a sandbox bypass vulnerability in Clade Code without issuing a CVE or security advisory specific to the agentic coding tool. The latest issue was a SOCKS5 hostname null-byte injection that can be exploited to trick the sandbox allowlist filter into approving connections it should block. It’s especially dangerous when combined with prompt injection, which Guan previously detailed in his earlier comment and control research. When paired with prompt injection, the new flaw can be abused to force Claude to read hidden instructions and then run attacker-controlled code in the sandbox, allowing miscreants to exfiltrate anything the sandbox could reach. This includes cloud and GitHub credentials, the GitHub token Claude authenticated with, cloud metadata and internal APIs. “For anyone who ran Claude Code with a wildcard allowlist on a credential-bearing system, the network boundary did not exist for the 5.5 months from sandbox GA to v2.1.90,” Guan wrote in research published Wednesday. “Treat that window as a potential exfiltration event.” Anthropic says it found and fixed the latest flaw before receiving Guan’s report. The fix, according to a spokesperson, is a public commit in the sandbox-runtime repository, which shipped in Claude Code 2.1.88 on March 31. “Anyone can view” the commit, they told us. Guan filed his bug bounty report with HackerOne on April 3. “Because the report described a vulnerability Anthropic had already caught and patched, it was closed as a duplicate of an internal finding,” the spokesperson said. “We appreciate the researcher’s time on this report.” Guan says he doesn’t dispute the timeline. “That is not the core issue,” he told The Register. “The core issue is that this was a bypass of a user-configured network sandbox, and there's still no advisory CVE, and no changelog note," he said. "Shipping a sandbox with a hole is worse than not shipping one. The user with no sandbox knows they have no boundary. The user with a broken sandbox thinks they do.” Claude, for its part, seems to side with Guan. When he showed Claude its own hole, the bot responded “This is a real bypass of the network sandbox filter,” according to a screenshot published in his research. The earlier bug, which Guan reported and detailed in December 2025, was ultimately assigned a CVE tracker - CVE-2025-66479 - and patched in v0.0.16. But the CVE only applies to Anthropic's sandbox-runtime, an upstream package, and not specifically to Claude Code, which Guan says means users have no way to know if their AI coding assistant is reading “allow nothing” as “allow everything.” He requested a CVE for Claude Code, and Anthropic said no because “The root cause is in the library.” Guan told us he’s glad Anthropic ultimately addressed the security holes. But the entire disclosure process illustrates another problem that researchers and The Reg vultures have reported with how AI vendors often handle vulnerabilities in their products: no CVEs issued, and if the flaw is fixed, it usually happens silently, with no public advisories. More often than not, the burden of securing AI agents and other systems gets pushed to the end users. “Some vendors issue CVEs and some do not,” Guan said. "I think either approach can be reasonable, but the advisory is a must. The users need to know the risk is real, and in many cases, they may never know. What the public often does not see is that vendors may reward researchers and silently patch the software, while end users never learn from release notes or public advisories that the risk existed.” According to Guan, this shows why users need their own protections, either from a security company or user-controlled runtime isolation. But he said he does hope big tech “takes on the burden of clearly communicating” security issues with users. “Because of that, I think companies should treat AI agents more like employees than ordinary software tools,” he told us. “Before hiring an employee, companies do background checks. Before giving them access to systems, they define permissions. The same discipline should apply to AI agents.” ®
Categories: Linux fréttir

Intel's CEO reveals early hiring challenges as bankruptcy concerns deterred top talent

Wed, 2026-05-20 20:02
When Lip-Bu Tan took over as Intel CEO, the company's balance sheet was so dire that potential recruits turned him down flat, worried they'd be joining a chipmaker on the verge of going bust. Speaking at the JP Morgan Global Technology, Media and Communications Conference on Tuesday, he said: "I tried to recruit some talent. They said 'It's almost a bankrupt company, why should I join you?'" Fixing that became his first priority. That effort has since paid off as Lip-Bu secured equity investment from the Trump administration, which converted funds from the CHIPS program in exchange for a stake. He also drew on long-standing personal relationships, with Nvidia CEO Jenson Huang committing $5 billion and Softbank's Masayoshi Son - a former Intel board member - signing on as a backer. "So far, knock on wood, I made money for them, and they're quite happy," Lip-Bu said. The stronger balance sheet has since enabled Intel to buy back a stake it had sold to Apollo, reducing earnings-per-share dilution in the process, he added. Now, a year into the job, Lip-Bu is betting on agentic AI, inference workloads and a bold chipmaking roadmap to complete Intel's revival - looking beyond the upcoming 14A process node to future 10A and 7A chipmaking technology. When asked for a progress report on the process technology Intel uses to manufacture its products, Lip-Bu said the recently introduced 18A is seeing a 7 percent per month yield improvement, and the next-gen 14A node is "ahead of schedule" compared to the end of the year target. "And now I'm starting to look at the 10A, 7A, the roadmap," Lip-Bu said. "People don't go to you just for one node. They're looking for the roadmap for the future. So we want to build a long-term business. And then we can drive the efficiency, the defect density, and then we can go to that Rule of 45, how to drive the operating efficiency, the profitability, cash generation." Intel's 18A and 14A processes refer to 18 and 14 angstroms, and as there are 10 angstroms in 1nm, this could imply that the company is working toward a sub-nanometer process technology with 7A. The Register asked Intel if it was willing to disclose when these process nodes would likely come into use, and we will update this story if we get a response. However, Lip-Bu did mention that for 14A, "my risk production is 2028 and volume production in 2029 is about the same time as A14 for TSMC," implying that 10A and 7A are unlikely to be used to make chips before 2030. On Intel's efforts to reinvent its internal chipmaking operation as a foundry service for third-party customers, he confirmed it is engaged with multiple clients, but declined to identify any, saying disclosure was up to the customer. Manufacturing used to be Intel's strength, but it lost its way, and has never really been in the foundry service business, Lip-Bu said. For this reason, he recently poached Shawn Han, a veteran with three decades' experience at Samsung Foundry, to operate as SVP of Foundry Services. Customers are also asking about Intel's 18AP process node, an enhanced, performance-focused variant of 18A, he claimed. Previously, Intel's foundry biz planned to offer the 14A node as its first mainstream commercial offering. To show how keen potential customers are, Lip-Bu claimed they are even willing to help Intel with pre-payments on wafer substrate materials. "Some of the substrate material is very [short], they're all asking us to prepay the substrate commitment. And we ask our customers, if you are serious to use our EMIB-T [packaging technology], can you help me on the substrate prepay? They jump on it," he claimed. "So they show the commitment, they really want our technology. And this is not a few million, it's billions in the next few years." Lip-Bu expects AI to be Intel's route to recovery after a disastrous few business years, as agentic AI and inference workloads look set to favor CPUs rather than the GPUs that have been making Nvidia's fortune. "It used to be that training is 1 CPU to 8 GPUs. And now in the agentic AI with all the agents, startups all tell me, Lip-Bu, CPU actually is more useful, even single-threaded," he said. "So I can start to see not just my wishful thinking, customers have said to me, Lip-Bu, more like 1:1. And now even some of them tell me it's 4:1. So 4 CPU to 1 GPU, for the inference and agents. And so CPU [is in] high demand, and I try to make sure that we can meet the requirement from the customer." Intel still has some way to go. As The Register pointed out earlier this year, the chipmaker lost $267 million on revenues of $52.9 billion during 2025, compared to an $18.8 billion loss the year before. Maybe it will even turn a profit this year. ®
Categories: Linux fréttir

OpenAI floats buy-before-your-try AI availability guarantee

Wed, 2026-05-20 19:51
OpenAI is now offering its customers a guarantee that it will actually provide the service it's selling if customers need more AI sauce. All that's required is an annual spending commitment. AI is in short supply as demand – stoked by flat-rate subscriptions – races ahead of datacenter inference capacity. AI workloads require a lot of computing power, especially when they run for hours on end, a common scenario for AI agents. They require more computing power than OpenAI and its cloud partners can provide. Promised datacenter construction is unlikely to address this shortfall any time soon, assuming those projects are actually completed at all. The result has been thinly disguised rationing through usage limits, variable pricing, subscription restrictions, and token consumption arbitrage – swapping out expensive models for cheaper ones. OpenAI’s answer is a new offering meant to ensure it can deliver on its AI promises. "Today we announced OpenAI Guaranteed Capacity, a new offering that helps eligible customers plan for reliable access to OpenAI compute across supported cloud providers as they scale critical workflows," said Sachin Katti, who runs compute for OpenAI, in a LinkedIn post. "It gives customers a clearer framework to align forecasted demand, commercial commitments, and guaranteed shared capacity over time." According to Katti, AI adoption has made processor availability the salient constraint on modern software. He argues that companies dependent on AI workflows need to take steps to ensure infrastructure availability. Guaranteed Capacity is intended to ensure that if client demand grows, compute infrastructure will be available to support that growth. OpenAI says customers can make annual spending commitments ranging from one to three years, with discounts that scale according to duration. "Guaranteed Capacity includes certainty of access to compute based on spend levels, and customers can draw down from this commitment across the portfolio of OpenAI products," the company says. Not everyone is buying OpenAI's pitch. Santosh Ahuja, founder and CEO of enterprise infrastructure biz Pervasiviti and an experienced CTO and researcher, challenged Katti's announcement as a repackaging of basic cloud service obligations in a discussion thread reply. "Every hyperscaler solved 'reserve now, scale later' a decade ago," he wrote. "So the actual announcement is: OpenAI now offers what every cloud provider has treated as a basic procurement primitive but positions it as a strategic differentiator." Ahuja argues that OpenAI's AI evangelism can't keep up with its execution. "Enterprises don't need a 'clearer framework to align forecasted demand,'" he wrote. "They need deterministic SLAs with penalty clauses." In a message to The Register, Ahuja explained that the problem is not guaranteed capacity, it's the way OpenAI has framed cloud business assumptions as innovation. "Reserved instance models have existed at every hyperscaler for over a decade – AWS, GCP, Azure," he explained. "When you strip the marketing, OpenAI is acknowledging that demand has outrun their infrastructure planning and asking customers to pay for predictability that mature platforms deliver by default. "The real question enterprises should ask is whether these guarantees come with enforceable SLAs and financial penalties for breach, because without those, 'guaranteed' is just a word in a press release." Ahuja pointed to a response to Katti's post from Keith Strier, SVP of global AI markets at AMD ("Brilliant. We accept the challenge.") which he takes to mean, "We will try our best." "That's your supply chain partner speaking – and 'we will try our best' is not the language of guaranteed anything," Ahuja said. "It sounds like OpenAI is promising capacity they don't yet have firm commitments on from their own silicon suppliers. That's not a product launch. That's wishful thinking with a price tag." So too is OpenAI's preparation to go public, which reportedly could result in a corporate filing as soon as this week. ®
Categories: Linux fréttir

Fedora: Microsoft is all aboard, but Deepin is dumped

Wed, 2026-05-20 17:24
Microsoft has announced a new, Fedora-based Linux distro for Azure VMs, while Fedora has consigned the Deepin desktop to the bin. Fedora decided to remove a component maintained outside Red Hat. In the same week, another external company - granted, a slightly better-known one - decided to rebase one of its projects onto Fedora as its upstream distro. It’s the circle of life, or something. Fedora 💔Deepin Seven years after it added the Deepin Desktop Environment in Fedora 30, Tuesday's FESCo meeting decided to drop Deepin from the distro. The minutes say: AGREED: Retire all packages maintained by the deepinde-sig group The decision comes one year after the project called for a security review of the Deepin Desktop Environment, after openSUSE dropped the desktop following a negative security assessment. We reported on that decision at the time. SUSE asked Deepin for feedback, but didn’t get good enough answers – for which, some months later, the Chinese project issued an apology. Linux Deepin is very much still around: we most recently looked at version 25 in January, and, back in 2023, the project claimed it had passed three million installs of its paid Tongxin UOS desktop edition. It’s a very pretty Windows-like desktop environment, but it never made it to having its own Fedora spin – and it certainly won’t now. Microsoft ❤️ Fedora But as one door closes, another opens. Fedora is still winning new friends and allies, and mere days earlier, there was a surprise announcement at the Open Source Summit North America, which as we write is winding down. On Monday, Microsoft announced a new version of its in-house Linux distro, Azure Linux 4, along with a companion distro called Azure Container Linux. There have been products called Azure Linux for quite a while. It’s based on the much more minimal CBL-Mariner distro, which we tried in 2022. The Register reported on Azure Linux becoming generally available in 2023, and then on the release of version 3 in 2024. We also knew back then that the company was working on turning it into a more general-purpose server OS: we reported on it migrating LinkedIn to Azure Linux in place of CentOS Linux that same year. There isn’t very much information about Azure Linux 4 yet; the broader rollout will be at the Microsoft Build conference next month. For now, all you can do is fill in a form to register your interest. However, the announcement reveals that version 4 switches to Fedora as its upstream distro. It was already based on the RPM packaging tools, hinting at some Red Hat or SUSE heritage in there somewhere. There’s slightly more information about Azure Container Linux. This is a separate distro, an immutable host OS for running containers. The announcement says “Azure Container Linux is based on the Flatcar project.” Flatcar is the continuation of CoreOS Container Linux, which The Reg has covered since it released its first version in 2014. As Linux Weekly News reported that year, CoreOS was based on Google’s ChromeOS, but redesigned to host containers. Red Hat acquired CoreOS in 2018, and then two years later, discontinued Container Linux. It replaced the Google and Gentoo-based distro with a new one based on Red Hat’s own immutable tool chain, called Fedora CoreOS. German FOSS consultancy Kinvolk forked the CoreOS code and continued development under the name of Flatcar Container Linux. Kinvolk was acquired by Microsoft in 2021, but continued to work on Flatcar. Now it seems that, with the announcement of Azure Linux 4 as well as Azure Container Linux, Microsoft has two separate in-house distros: one based on Fedora, and one based on ChromeOS. For now, Flatcar is still trundling along the tracks just fine, but we suspect some future consolidation may be coming down the line. ®
Categories: Linux fréttir

Bye-bye, Gemini CLI; Google's gone and swapped you for a closed-source AI

Wed, 2026-05-20 16:44
Pour one out for the Gemini Command Line Interface. Come June 18, the open source development agent will stop serving most users in favor of the new Antigravity CLI, and developers aren’t happy that the replacement is far less open than the old tool. Google announced the Antigravity CLI at Google I/O this week, billing it as a way for the Chocolate Factory to unify its efforts in developing a command line interface for AI agents. One of the key arguments Google makes in a post about transitioning from Gemini CLI to Antigravity CLI is that the new one has improved support for multi-agent environments, but the company isn’t giving most of its users much of a choice on whether to switch. “On June 18, 2026, Gemini CLI and Gemini Code Assist IDE extensions will stop serving requests for Google AI Pro and Ultra, as well as those using it free of charge using Gemini Code Assist for individuals,” the Gemini CLI team wrote in their announcement of the transition. The change also affects Gemini Code Assist for GitHub, which won’t allow new installations beginning June 18, and will stop serving requests in the following weeks. Enterprises appear to be getting a pass, however, with Google noting that those using Gemini CLI or its IDE extensions through a Gemini Code Assist Standard or Enterprise license won’t see any changes in their access, nor will Gemini Code Assist for GitHub users accessing the tools through their enterprise Google Cloud accounts. “Gemini CLI will remain accessible via paid Gemini and Gemini Enterprise Agent Platform API keys,” Google explained. For everyone else, sorry: It’s Antigravity CLI or bust, but don’t expect the same experience. “There won't be 1:1 feature parity right out of the gate” between Gemini CLI and Antigravity CLI, Google added. Agent skills, hooks, subagents, and extensions are all being supported by Antigravity CLI at launch. But other stuff may take time to arrive, if it does at all. Pray we don’t alter the deal any further Take a look at the Gemini CLI GitHub page, and you'll find all the code that made it possible - it is an open-source project, after all. Surf over to the Antigravity CLI GitHub page and all you’ll see is a change log, readme, and a GIF file demonstrating the tool’s appearance. That’s right: Antigravity CLI isn’t open source - at least not from what Google has published so far - and it took developers no time at all to notice. Gemini CLI Lead Product Manager Dmitry Lyalin took to GitHub to make an announcement detailing some additional info about the forced CLI tool migration, and the comments are rife with people frustrated by the move. No small portion of the vitriol is targeted at apparent usage limits, with multiple people reporting they’d hit their weekly quota with just a couple of requests. The issues page for Antigravity CLI similarly has numerous posts asking Google to look into usage limits. Other posts accuse Google of using open-source contributions to improve a new closed-source product and generally express frustration with Google for killing yet another thing customers relied on. At the same time, Lyalin teased developers by telling them that, no, Gemini CLI isn’t truly gone if you’re willing to pay top dollar for it. “The project remains available to the community as an Apache 2.0 licensed repository with no changes,” Lyalin noted in his GitHub post. “You will continue to see us work on GitHub as we keep Gemini CLI updated with latest model releases, bugs and security fixes for our enterprise customers.” Now please open your wallets if you want access to this open-source product. Google didn’t respond to questions for this story. ®
Categories: Linux fréttir

Plex appeal fades as Lifetime Pass jumps to $750

Wed, 2026-05-20 15:51
Software platform Plex is killing its buy-once-use-forever model with an eye-watering hike of its Lifetime Plex Pass from $249.99 to $749.99. The increase will take effect on July 1, but does not apply to existing Lifetime Plex Pass holders. The existing monthly and annual subscription pricing for the service is also unchanged, suggesting Plex really would rather you subscribed. Lifetime Plex Passes used to cost $74.99, before increasing to $149.99 in 2014. It later fell back to $119.99 before jumping to $249.99 in April 2025. Just over a year later, the price is heading to $749.99. Plex is used primarily as a media server. Load it up with content and point clients at it. The project has its origins in the Xbox Media Center (XBMC) from which it was forked. The server software runs on Windows, macOS, Linux, and several network devices, and the client runs almost everywhere, including iOS and Android devices. Remote access to a Plex media server used to be free, but in 2025 Plex shifted that functionality to its paid tiers. Other paid-tier functionality includes hardware transcoding and media file downloads. "We've considered eliminating the Lifetime Plex Pass," Plex wrote. Considering the hike, it might as well have. An annual subscription costs $69.99, meaning it will take more than a decade before that $749.99 pays off. Unless, of course, Plex increases the subscription cost in the future. The increase is the latest in a long line of price hikes from media companies, though it is among the most extreme. Plex has made no secret of the fact that "recurring subscriptions help us sustain long-term development" – hence the attempt to make the one-off lifetime payment less attractive. The jump might be eye-watering, but it also reflects a growing realization across the tech and streaming industries. One-off payments don't cut it anymore and ads aren't enough, so a subscription is the way to go to sustain development. Although existing Lifetime Plex Pass customers are not affected, the hike highlights how important subscriptions have become across all sectors (we can't see Microsoft weaning itself off Microsoft 365 any time soon) and what "lifetime" actually means. ®
Categories: Linux fréttir

Those spared latest Meta job cuts forcibly reassigned to AI roles

Wed, 2026-05-20 13:27
Meta's massive role reshuffle begins today, with thousands of staff being transferred to AI-focused teams and their managers reportedly laid off. The tech giant is reassigning 7,000 workers to AI projects, eliminating around 10 percent of its current workforce, and closing 6,000 open positions, according to Reuters, which saw copies of the internal memos. The workforce changes, the latest in a series of moves that started 2022, will affect roughly 20 percent of Meta's approximately 78,000 employees. Janelle Gale, Meta's chief people officer, penned the memos to affected staff. Some have already begun their new AI-related duties, while the rest will be told of their fates today, she reportedly said. "As org leaders worked on the changes, many of them incorporated AI-native design principles ⁠into their new org structures," Gale's memo read. "We're now at the stage where many orgs can operate with a flatter structure with smaller teams of pods/cohorts that can move faster and with more ownership." This flatter structure will involve, in part, managers being either laid off or moved into roles where they are producing work instead of overseeing teams. Previous memos sent to staff in April stated that top engineers – those who represented the company's "strong software engineering talent" – were being "selected" for brand-new divisions within the business. Among these were the Applied AI Engineering and Agent Transformation Accelerator units, as well as Central Analytics. Once famed for letting its staff pick and choose their projects, Meta said those selected for this new AI mission had no say in the matter. Responding to an employee's question, Maher Saba, VP of AAI Engineering, wrote: "AAI is one of the company's highest priorities and we're resourcing it by moving our strongest talent to address it. Therefore, the transfers aren't optional." Both AI units were established for engineers to develop AI agents capable of automating and taking over duties previously undertaken by human employees. Those transferred to Central Analytics will work on ways of assessing productivity and analytics for agent development. According to Gale's memo, another new unit called Enterprise Solutions will soon be established, but Meta has not yet revealed details. The Register asked Meta for a statement, but it did not immediately respond. The Great Flattening Gale's language regarding "flatter structures" echoes chief Mark Zuckerberg's wording from Meta's January earnings report, promising to flatten teams over the coming year. "We're elevating individual contributors, and flattening teams," Zuck wrote in a post-earnings note on January 28. "We're starting to see projects that used to require big teams now be accomplished by a single very talented person. "I want to make sure as many of these very talented people as possible choose Meta as the place they can make the greatest impact – to deliver personalized products to billions of people around the world. And if we do this, then I think we'll get a lot more done and it's going to be a lot more fun." Reports surfaced around the same time about a major round of job cuts at the company, equivalent to 20 percent of its workforce, or around 15,000 roles, but it was unclear when or if this would materialize. Meta's latest round of layoffs follows a smaller-scale round in March, affecting 700 roles across Reality Labs, the social media division, and recruitment. The changes come against a backdrop of Meta investing heavily in AI, with the company saying it plans to spend between $162 billion and $167 billion this year, up from $118 billion in 2025. The company has reportedly also tried tempting top AI talent to join its ranks with nine-figure pay packets, and ex-OpenAI players with $100 million sign-on bonuses. The revolt Meta slashing roles to embrace AI replacements has led to protests across its Menlo Park HQ and internal Workspace comms platform, Reuters reports. First announced in April, Meta said it would be tracking mouse clicks and keystrokes to train AI rather than assess staff productivity. A company spokesperson told the BBC: "If we're building agents to help people complete everyday tasks using computers, our models need real examples of how people actually use them." They said the data is not used for any other purpose, and there are safeguards in place to protect sensitive content. But Meta staff have expressed their disdain for the changes in various ways, including by setting up an online petition – which now has over 1,000 signatures – and plastering flyers all over US offices referring to the company as an "Employee Data Extraction Factory." ®
Categories: Linux fréttir

Datacenter builds could be shielded from judicial review in UK planning reforms

Wed, 2026-05-20 12:49
The UK government may move to forestall objections to datacenter projects via an overhaul of planning regulations that would shield critical energy and infrastructure buildouts from legal challenges. Finance minister, Chancellor of the Exchequer Rachel Reeves, is preparing to unveil reforms allowing Parliament to designate key projects as having critical national importance to prevent them from being subject to judicial review. "For too long, vital infrastructure delivery has been delayed by judicial reviews of projects," a spokesperson for HM Treasury said in a ⁠statement. The Chancellor "is clear that Parliament must take back control to get Britain building the ⁠power plants, wind farms and grid connections that will bring ⁠bills down, strengthen our energy security, and ⁠deliver growth in every part of our country." This does not explicitly mention datacenters, but as The Register has reported previously, massive server farms have already been lumped in with energy generation as projects to be urgently fast-tracked by government. Back in 2024, server farms were given Critical National Infrastructure (CNI) designation - a move that a government a civil servant warned would allow the authorities to override opposition to datacenter sites by local residents. Last year, a datacenter project management biz urged the government go a step further and categorize large datacenter developments as Nationally Significant Infrastructure Projects (NSIPs). This would shift decision-making from local authorities to the national level, potentially expediting approvals for large projects, while removing any say that local residents may have in developments that affect them. Parliament’s Public Accounts Committee also published a report last year on Treasury plans to implement stronger governance to speed large infrastructure schemes, criticizing it for not including “technically complex” projects such as those involving “digital transformation and artificial intelligence (AI),” despite the government’s focus on these to deliver economic growth. Some are already welcoming the planning reforms before they have even been officially announced. “Today’s announcement is the right thing to do if we are serious about growth, energy security and getting Britain building again,” said Ben Brittain, director of public affairs at the Association for Consultancy and Engineering (ACE). “For too long, critical infrastructure projects have been delayed by blockers, layers of uncertainty and lengthy legal challenge, driving up costs and holding back investment." But the government should be wary, as public sentiment is turning against the growing number of datacenters being built and the AI technology that is driving their construction. Putting them beyond the reach of any legal challenge could easily exacerbate this. In America, for example, a recent survey found that people would sooner see a nuclear power plant sited in their locality than a datacenter. Gunshots were also fired at the home of an Indianapolis councilor who backed plans for a server farm in his area. In the UK, there have been protests against various datacenter building projects. Activist group Global Action Plan organized several at the end of February at sites including Iver in Buckinghamshire and Potters Bar in Hertfordshire where recently announced campuses are being constructed. HM Treasury had not responded to our requests for further information at the time of publication. ®
Categories: Linux fréttir

Microsoft says cu l8r to text message security

Wed, 2026-05-20 12:33
Microsoft has confirmed that SMS is on the way out as a method of authentication and recovery for personal Microsoft accounts. Fraud and dubious security were cited as reasons for the move: "SMS authentication is vulnerable to phishing and SIM-swap attacks." Passwordless accounts, passkeys, and verified email are the future, according to Microsoft. The announcement was first spotted by WindowsLatest and comes as passkeys are increasingly accepted as a default authentication standard. In April 2026, the UK's National Cyber Security Centre officially endorsed the technology and urged consumers to adopt it. For its part, Microsoft has promoted the use of passkeys for more than a year, declaring in 2025 that all new Microsoft accounts would be passwordless by default. As such, the days of SMS as a method of authentication and account recovery have been numbered for some time, and Microsoft's announcement confirms that users will be directed elsewhere. However, it did not state when it will pull the plug on the technology once and for all. Dropping SMS is laudable, but users will still need to learn a new authentication method. Microsoft promises to guide them through it - offering options to sign in with or create a passkey at login – yet that transition may prove easier said than done. Passkeys also have challenges, most notably when used over multiple devices. In that instance, a synchronization tool or password manager can help, but users might not be familiar with these technologies. Ultimately, SMS as a method of authentication and recovery for a Microsoft account is on the way out. For many security professionals, it is not a moment too soon. ®
Categories: Linux fréttir

'Workforce rebalancing' comes for Kyndryl, and delivery teams are in the firing line

Wed, 2026-05-20 12:15
EXCLUSIVE Kyndryl – the global technology services division spun out of IBM in 2021 – is cutting jobs to reduce overheads and "streamline our operations" in several countries. The business is this week entering a 45-day consultation with employees, where local law requires, under what insiders described as "workforce rebalancing," a euphemistic term that invariably means redundancies. Sources at the IT services biz told us more than 150 managers were called into an extraordinary meeting with HR on Monday. Personnel earmarked for redundancy will have meetings on Wednesday and an announcement is expected on Friday. The cuts are expected to affect delivery teams, with hundreds of people likely to be placed at risk of redundancy in the UK alone. "We had 150-plus [managers] in the call so it's basically everyone in every department who is an architect, consultant, subject matter expert," said a source. "Kyndryl employed loads of new VPs and they are not affected, just cutting the real workers. I do not know how the company will last with a significantly reduced delivery [capability]." The 45-day consultation, a requirement under UK law, indicates at least 100 staff will be at risk of losing their role, although no figures have been confirmed by Kyndryl. According to UK accounts for fiscal 2025, Kyndryl's UK arm employed 1,303 people in the year to March 2025, up from 1,232 in the prior year. A Kyndryl spokesperson told The Register: "As we continue to transform our business in support of serving our customers and sustainable growth, we expect a limited workforce rebalancing in some countries affecting a small percentage of our workforce to address labor costs and streamline our operations." The job cuts will not surprise anyone following Kyndryl. Interim chief financial officer Harsh Chugh, who came into the role in February when the permanent CFO departed amid a board review of accounting practices, warned of changes ahead during the company's earnings conference call earlier this month. Chugh said Kyndryl was taking $200 million of "charges associated with the workforce rebalancing actions, which we expect to incur substantially in the first quarter of fiscal 2027." Kyndryl's 2027 financial year began on April 1. "These actions are expected to yield annualized savings in the range of $400 million to $500 million in fiscal 2028. With the expectation that most of the charges will take place in the first quarter, we expect our first quarter adjusted pretax income to be a low point in earnings for the year with meaningful profit improvement expected for the remaining nine months of the year." According to fiscal 2026 financial results, Kyndryl's sales stagnated: revenue of $15.092 billion was largely flat against the $15.057 billion reported in the prior 12 months. Selling, general, and admin expenses went up by $63 million to $2.654 billion. Net profit shrank 21 percent to $198 million. During the conference call, Kyndryl chairman and CEO Martin Schroeter said the company is "taking workforce rebalancing actions to yield meaningful savings." "We're confident in our strategic direction and our financial position remains strong. We believe our focus on higher-value services, coupled with the actions we're taking to streamline our own operations, positions us to navigate the evolving ways our customers consume IBM's innovation while keeping us on track toward our multiyear objectives." Not all staff share that level of self-assurance. One insider told The Register that Kyndryl is too reliant on "big whales" – meaning too few big-ticket legacy contracts. The source claimed Kyndryl tends to jump onto the latest crazes, including agentic AI, when its real strength is in classic infrastructure services. "Despite all the bluster, we are really running IBM but without the hardware," one told us. "I'm tasked with getting X number of accreditations from my teams whether they are useful or not and spend too much of my time having strategic meetings about strategy plans and plays that don't come off." Kyndryl was formerly IBM's Global Technology Services division, minus Technical Support Services. It was an area that suffered revenue shrinkage and mounting financial losses in the years before it was spun out. Some 90,000 employees moved from IBM to Kyndryl in November 2021. As of this month, it employed 73,000 globally. Schroeter and his management team have tried to convince shareholders of the way forward – a future steeped in cloud services and AI. The share price peaked at $43.41 in July last year but has since tumbled to $12.26, hit badly by an accounting issue pertaining to cash management practices. ®
Categories: Linux fréttir

AMD says its $4K Ryzen AI Halo workstation practically pays for itself

Wed, 2026-05-20 12:00
AMD’s answer to Nvidia’s DGX Spark AI workstations, codenamed the Ryzen AI Halo, will be available for pre-order later next month for anyone with $3,999 burning a hole in their pocket. That might sound like a lot for an AI mini PC, but don’t worry. Compared to cloud APIs, it practically pays for itself. Or, well, that’s AMD’s sales pitch. The House of Zen argues that if you spend eight hours a day vibe coding, the system could save you $750 a month. Whether this helps you justify paying for hardware that less than a year ago could be found for between $2,200 and $2,999 or not, it’s (probably) not AMD being greedy here; the RAMpocalypse has been hard on everyone. Much like the DGX Spark, which now retails for $4,699, up from $3,999 when we reviewed it last fall, AMD’s rendition aims to provide a curated developer environment for running local models and agentic AI frameworks. This is really the core value proposition behind either of these devices. They aren’t the most powerful or fastest AI systems, but they’re able to run models that a few years ago would have cost $20K or more. A little box of TOPS The diminutive system measures in at 5.9 x 5.9 x 1.7 inches (150 x 150 x 43 mm) and is powered by a 120 watt Ryzen AI Max+ 395 APU, better known by its codename Strix Halo. The chip is backed by 128 GB of LPDDR5x 8000 MT/s memory, which feeds both its 16 Zen 5 cores and 40 RDNA 3.5 GPU compute units, providing up to 256 GB/s of bandwidth, more than a Ryzen 9000 Threadripper (non-Pro) system. For local AI enthusiasts, that’s enough to run models up to 200 billion parameters in size at 4-bit precision — just like the more expensive Spark. The bulk of the Ryzen AI Halo’s compute comes from its integrated graphics, which are capable of delivering roughly 56 teraFLOPS at 16-bit precision. While impressive for onboard graphics, that’s still between 55 and 88 percent slower than what the DGX Spark advertises. Unlike the Spark’s Blackwell-based GB10 APU, Strix Halo doesn’t support FP8 or FP4 data types in hardware. At BF16, the Spark delivers 125, at FP8 250, and FP4 500 teraFLOPS. Double those figures if you happen to find a workload that can leverage Nvidia’s 4:2 sparsity. That performance discrepancy won’t necessarily be obvious in every workload. In fact, in LLM inference, AMD claims the AI Halo generates tokens 4-14 percent faster than the Spark. The lower end of that roughly matches what we saw when we pitted the Spark against a similarly equipped HP Z2 Mini G1a back in December. The G1a packs the same silicon as AI Halo, and in Llama.cpp with the Vulkan backend, eked out a small but meaningful lead over the Spark in tokens per second generated. However, the speed any GPU can generate tokens at is largely dictated by effective memory bandwidth, not floating point performance. GPU compute has a much bigger impact on things like prompt processing time. In our testing, the Spark’s more capable tensor cores gave it a 2-3x lead in prompt processing. For shorter prompts, this isn’t all that noticeable, usually the difference between waiting 100 ms versus 200 ms or 300 ms, but for longer prompts, it did become more pronounced. We saw the Spark take similar leads in our image generation and fine tuning benchmarks, but it’s worth noting that AMD’s software stack has matured greatly since our initial review and the performance gap has likely closed somewhat since then. AMD’s AI Halo does have two things going for it that can’t be said of the Spark. Alongside the GPU is an XDNA 2-based neural processing unit (NPU) that AMD rates for 50 TOPS. What good that’ll do you depends heavily on the application in question. Many content creation apps have now been updated to take advantage of it, but the number of generative AI inference engines that could properly harness it was quite limited the last time we looked. The second thing AMD’s Ryzen AI Halo has going for it is that it’s a standard x86 box at its heart, and you can run Windows or your preferred flavor of Linux on it if that’s more your style. On the Spark, you’re stuck with a lightly customized version of Ubuntu 24.04. Beyond that, you’re coloring outside the lines. Particularly for developers building for Microsoft’s NPU-accelerated AI PC ecosystem, this is an obvious advantage. In terms of networking, AMD’s Spark-clone falls a bit flat. One of the hallmark features of Nvidia’s AI workstation is a 200 Gbps ConnectX-7 NIC, which allows for clustering of up to two and eventually four systems. AMD’s AI Halo has a single 10 Gbps NIC, which should help with downloading large model files in a timely manner. In theory, the system should be able to achieve high-speed networking over USB-4, but it’s not clear whether this is actually a supported use case. That said, Apple has already demonstrated just this using RDMA over Thunderbolt, so it should work so long as AMD has a playbook for configuring RDMA on its systems. AMD’s own AI lab As we mentioned earlier, much of the Ryzen AI Halo’s value proposition comes from being validated hardware with well documented playbooks for common use cases and known good software. Finding the right combination of device drivers, ROCm, HIP, SYCL, CUDA, PyTorch, TensorFlow, and JAX has long plagued the AI/ML devs regardless of which ecosystem you opt for. Having validated environments for workloads, whether it be vLLM, Llama.cpp, Ollama, ComfyUI, or something else ensures users spend more time doing something productive than debugging mismatched dependencies. At launch, AMD says the Ryzen AI Halo will ship with five preinstalled playbooks, with another 10 available online and additional playbooks to be added monthly. Additionally, customers will gain access to AMD’s developer program, cloud credits, and exclusive playbooks. More memory on the way The 128 GB Ryzen AI Halo will be available for pre-order next month starting at $3,999, but if that isn’t enough for you, AMD is already prepping a higher capacity version of the system with 192 GB of memory on board. That system will feature a refreshed Ryzen APU in the AI Max+ 495, which just like the rest of AMD’s 400-series lineup gets a modest clock bump to the CPU, GPU, and NPU, and not a whole lot else. Still, 192 GB of unified memory opens the door to even larger, more capable models, if you can stomach the presumably higher asking price. ®
Categories: Linux fréttir

ESA boss tires of being dragged around by NASA mood swings

Wed, 2026-05-20 11:34
European Space Agency (ESA) Director General Josef Aschbacher has taken a swipe at NASA and US policy, while calling for autonomy in human spaceflight via an opinion post titled "Are we pilots or are we passengers?" The May 18 post is emblematic of the hand-wringing within ESA over the last few years as NASA has lurched from plan to plan amid fluctuating priorities and funding. Aschbacher, it appears, has had enough. "Europe has become too exposed to decisions beyond its control," he said. NASA administrator Jared Isaacman recently announced changes that would pause, and likely cancel, the Lunar Gateway space station project in favor of a Moon base. The decision, along with scrapping the over-budget and delayed Mars Sample Return mission, does not sit well with ESA, which had a hand in both. Aschbacher warned of the potential for dependence on third parties for programs including human spaceflight. ESA removed reliance on Russia for missions such as ExoMars, and turbulence in US space policy has given the agency pause for thought. "Europe must decide whether it prefers to be dependent on others to send its explorers into space or to assume its role as a fully capable space power. As the head of the European Space Agency (ESA), I am convinced that autonomous human spaceflight is not a luxury. It is a necessary anchor for Europe to secure its freedom to unlock the scientific, economic, strategic and geopolitical benefits of space and to inspire a new generation to shape Europe’s future." In 2025, an agency insider referred to NASA as "an abusive spouse who could lash out at any moment in unpredictable ways." In 2026, Aschbacher's patience appears to be running out. "I'm glad," a source told The Register. "The US has fucked us around for too long." Aschbacher and ESA would not put it so bluntly. However, one of ESA's strengths is also one of its weaknesses. The agency has 23 member states. Political and funding decisions are imminent: the ESA Council meets in June, the Intermediate Ministerial Council is in December, and a full Council at Ministerial level is due in 2028. "If we started today," Aschbacher wrote, "it would still take us many years to build autonomous capability – we must act quickly. The cost of inaction would far outweigh the necessary investment." Aschbacher's comments came on the eve of the successful launch of the Smile spacecraft on a Vega-C rocket from Europe's Spaceport in French Guiana on May 19. The mission, a collaboration between ESA and the Chinese Academy of Sciences (CAS), will study how Earth responds to solar wind. "The current environment demands both diversified international partnerships and strengthened autonomous capabilities," wrote Aschbacher. "We must use this challenging moment and turn it into an opportunity to redefine our position." His next challenge is to persuade ESA's member states, each with their own budgetary pressures, to see things the same way. "History will not wait for Europe to feel comfortable and ready; it will move forward with or without us. The choice before Europe is clear: do we pilot, or are we merely passengers? We have everything we need. What remains are the confidence and political will to act." ®
Categories: Linux fréttir

GitHub says internal repos exfiltrated after poisoned VS Code extension attack

Wed, 2026-05-20 10:27
GitHub, the world's biggest code repository and DevOps platform, fell victim to a malicious Visual Studio Code (VS Code) extension. The company's initial assessment is that only internal repositories were exfiltrated. The incident was reported by GitHub on X, with follow-up posts revealing a "poisoned VS Code extension" as the cause. The Microsoft-owned code shack continues to "analyze logs, validate secret rotation, and monitor for any follow-on activity." One GitHub post references "the attacker's current claims of ~3,800 repositories" as consistent with its investigation. This may refer to a post attributed to TeamPCP, the malware crew linked to the Shai-Hulud worm, the code for which has been published and caused widespread damage. In a post, the crew advertised GitHub's internal source code for sale, claiming around 4,000 repositories. They said it was not a ransom and if no buyer was found, they would leak the code for free. Claims like these should be treated with caution. A key concern for GitHub users is whether private repositories are at risk, either immediately or in the future if the attackers have gained a foothold into internal systems via stolen credentials. Risks include leakage of commercial code and credentials. Although best practice is not to check secrets into any repository, public or private, some organizations are less disciplined about this when repositories are private. Last month, Wiz Research discovered a remote code execution flaw in GitHub.com and GitHub Enterprise Server (the self-hosted version), which the researchers said was "remarkably easy to exploit." The vulnerability was discovered using AI. Developer reactions to GitHub's latest problems combine alarm and resignation – plus some humor. "How did the attackers find a large enough uptime window to get in?" quipped one. GitHub is in some difficulty. This compromise comes after a surge in npm attacks, many related to Shai-Hulud code, which the company has failed to prevent despite being aware of the issue since September 2025. Further, the platform has reliability issues caused in part by AI bots hoovering public code to feed large language models – problems that led HashiCorp co-founder Mitchell Hashimoto to declare GitHub "no longer a place for serious work." Another said that "the era where a developer machine with source code access also has access to meaningful security systems should be over. Internal repository access should mean nothing... GitHub compromise could happen at any time, even from GitHub themselves." Issues with cloud platforms also increase the appeal of self-hosted systems such as the open source
Categories: Linux fréttir

Smaller suppliers invited to pitch for £2.9B UK defense tech framework

Wed, 2026-05-20 09:54
The Government Commercial Agency (GCA) - the UK procurement body formed from Crown Commercial Service and Cabinet Office commercial teams - is seeking greater input from customers and smaller suppliers as it develops a new version of its framework for security-focused digital and IT services. The GCA's current Digital and IT Professional Services (DIPS) framework, which runs until November 2027, is open only to buyers within the Ministry of Defence (MoD) and its 17 suppliers are mostly large defense companies and consultancies. The agency wants its replacement to be open to more buyers and suppliers, according to a procurement notice published on May 18 and linked material. The notice says that DIPS 2, which the GCA plans to run for eight years from September 2027 to August 2035, will be available to a wider group of "customers with defence and security requirements" including some outside the MoD, potentially the Home Office and Foreign Office. It has a total possible value of £2.88 billion including VAT. In a supplier engagement session held in March, the GCA said it wants to increase the participation of small and medium-sized enterprises (SMEs), which the government defines as companies with fewer than 250 staff along with turnover and balance sheet limits. It is considering having fewer lots than the six in DIPS 1 and outlined a range of options for this, including a possible lot specifically for projects worth less than £5 million. "We are learning lessons from Ukraine in terms of rapid technological development," Darren Corkindale, a deputy head for professional services in the Ministry of Defence's digital commercial service, told the session. He noted that the UK's Strategic Defence Review (SDR) is expected to include a clear focus on digital and technology including artificial intelligence, cybersecurity, and the electromagnetic domain, adding: "Never thought I'd really say this, but there's an emphasis in the SDR around transition to conflict, war readiness, war preparedness. Again, DIPS 2 needs to reflect that." The GCA plans to complete engagement with suppliers by the end of this year and publish its tender for the DIPS 2 framework in 2027. ®
Categories: Linux fréttir

Pages