<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[The Substrate]]></title><description><![CDATA[A newsletter about AI hardware, security, and geopolitics.]]></description><link>https://www.the-substrate.net</link><image><url>https://substackcdn.com/image/fetch/$s_!lNbv!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68cdadbd-07eb-4d91-aac0-462dd7d03695_1280x1280.png</url><title>The Substrate</title><link>https://www.the-substrate.net</link></image><generator>Substack</generator><lastBuildDate>Mon, 15 Jun 2026 08:47:55 GMT</lastBuildDate><atom:link href="https://www.the-substrate.net/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Erich Grunewald]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[the-substrate@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[the-substrate@substack.com]]></itunes:email><itunes:name><![CDATA[Erich Grunewald]]></itunes:name></itunes:owner><itunes:author><![CDATA[Erich Grunewald]]></itunes:author><googleplay:owner><![CDATA[the-substrate@substack.com]]></googleplay:owner><googleplay:email><![CDATA[the-substrate@substack.com]]></googleplay:email><googleplay:author><![CDATA[Erich Grunewald]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Where China’s AI chip supply chain stands in 2026]]></title><description><![CDATA[China&#8217;s domestic ecosystem is improving, but it remains constrained by several hard bottlenecks, especially in photolithography equipment and memory production.]]></description><link>https://www.the-substrate.net/p/where-chinas-ai-chip-supply-chain</link><guid isPermaLink="false">https://www.the-substrate.net/p/where-chinas-ai-chip-supply-chain</guid><dc:creator><![CDATA[Veronika Blablova]]></dc:creator><pubDate>Tue, 02 Jun 2026 17:41:39 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/aeee407b-e27c-4bc0-a281-c119ecc7bbd3_2624x1856.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>In a nutshell</h1><p>This post provides an overview of domestic Chinese AI chip<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> making, from design to manufacturing to packaging, explaining key terms and concepts as they come up. The Chinese government has for many years given top priority to, and invested heavily in, self-sufficiency in advanced chipmaking. Since the US bars Chinese companies from buying advanced AI chips, much of China&#8217;s compute in the coming years is likely to come from domestic chips. So China&#8217;s progress in making these chips matters to anyone trying to make sense of the US-China competition and Chinese AI progress.</p><p>In short, China remains heavily constrained by export controls across the AI chip supply chain, and is many years away from making globally competitive AI chips indigenously. Huawei is China&#8217;s leading AI chip designer, followed by Cambricon, Alibaba&#8217;s T-Head, Baidu&#8217;s Kunlunxin, and several startups that ship only modest amounts of chips. On paper, the best Chinese chips <a href="https://epoch.ai/gradient-updates/why-china-isnt-about-to-leap-ahead-of-the-west-on-compute">remain closer</a> to the hardware NVIDIA released about five years ago than to its current frontier. But beyond the hardware, another important gap is the software that runs on the chips. NVIDIA&#8217;s software ecosystem remains much more mature than Chinese alternatives, and for this and other reasons, Chinese developers continue to prefer NVIDIA for training workloads.</p><p>China is unable to fabricate both logic and memory chips at the quality and volume it needs. Because of their lack of access to the most advanced photolithography machines&#8212;both extreme ultraviolet (EUV) and, to a large extent, immersion deep ultraviolet (DUV)&#8212;Chinese companies struggle to produce competitive, economically viable chips. China&#8217;s fabrication process for logic chips is three to five years behind the world&#8217;s most advanced foundry, TSMC. Similarly, China&#8217;s leading memory company, CXMT, remains <a href="https://www.chinatalk.media/p/mapping-chinas-hbm-advancement">three to four years behind</a> the global leaders. China&#8217;s domestic photolithography efforts are reported to have partially produced EUV prototypes, but these are most likely far from reliable or fully functional tools.</p><h1>Contents</h1><ul><li><p><a href="https://www.the-substrate.net/i/200318016/in-a-nutshell">In a nutshell</a></p></li><li><p><a href="https://www.the-substrate.net/i/200318016/introduction">Introduction</a></p><ul><li><p><a href="https://www.the-substrate.net/i/200318016/what-this-post-covers">What this post covers</a></p></li><li><p><a href="https://www.the-substrate.net/i/200318016/us-export-controls-constrain-china-across-the-entire-ai-chip-supply-chain">US export controls constrain China across the entire AI chip supply chain</a></p></li></ul></li><li><p><a href="https://www.the-substrate.net/i/200318016/ai-chip-design">AI chip design</a></p><ul><li><p><a href="https://www.the-substrate.net/i/200318016/china-remains-dependent-on-western-chip-design-software">China remains dependent on Western chip design software</a></p></li><li><p><a href="https://www.the-substrate.net/i/200318016/chinas-ai-chip-design-ecosystem-is-broadening-quickly">China&#8217;s AI chip design ecosystem is broadening quickly</a></p></li><li><p><a href="https://www.the-substrate.net/i/200318016/chinese-ai-developers-still-prefer-nvidia-for-training">Chinese AI developers still prefer NVIDIA for training</a></p></li><li><p><a href="https://www.the-substrate.net/i/200318016/chinese-chips-still-trail-nvidia-on-performance-and-memory">Chinese chips still trail NVIDIA on performance and memory</a></p></li><li><p><a href="https://www.the-substrate.net/i/200318016/domestic-chip-shipments-are-growing-but-remain-highly-concentrated">Domestic chip shipments are growing but remain highly concentrated</a></p></li><li><p><a href="https://www.the-substrate.net/i/200318016/chinas-cluster-level-systems-are-limited-by-fab-capacities">China&#8217;s cluster-level systems are limited by fab capacities</a></p></li></ul></li><li><p><a href="https://www.the-substrate.net/i/200318016/logic-chips-fabrication">Logic chips fabrication</a></p><ul><li><p><a href="https://www.the-substrate.net/i/200318016/smic-leads-chinas-foundry-sector">SMIC leads China&#8217;s foundry sector</a></p></li><li><p><a href="https://www.the-substrate.net/i/200318016/smic-lags-three-to-five-years-behind-tsmc">SMIC lags three to five years behind TSMC</a></p></li><li><p><a href="https://www.the-substrate.net/i/200318016/hua-hong-might-expand-chinas-fab-capacity-over-the-long-term">Hua Hong might expand China&#8217;s fab capacity over the long term</a></p></li><li><p><a href="https://www.the-substrate.net/i/200318016/huawei-is-building-fabs-to-reduce-its-dependence-on-smic">Huawei is building fabs to reduce its dependence on SMIC</a></p></li></ul></li><li><p><a href="https://www.the-substrate.net/i/200318016/high-bandwidth-memory-fabrication">High-bandwidth memory fabrication</a></p><ul><li><p><a href="https://www.the-substrate.net/i/200318016/hbm-is-restricted-by-us-export-controls">HBM is restricted by US export controls</a></p></li><li><p><a href="https://www.the-substrate.net/i/200318016/chinas-memory-makers-are-moving-into-hbm-led-by-cxmt">China&#8217;s memory makers are moving into HBM, led by CXMT</a></p></li><li><p><a href="https://www.the-substrate.net/i/200318016/cxmt-trails-the-state-of-the-art-by-three-to-four-years">CXMT trails the state of the art by three to four years</a></p></li><li><p><a href="https://www.the-substrate.net/i/200318016/the-global-hbm-market-is-highly-concentrated-in-three-foreign-companies">The global HBM market is highly concentrated in three foreign companies</a></p></li><li><p><a href="https://www.the-substrate.net/i/200318016/chinas-hbm-capacity-is-growing-but-still-likely-insufficient">China&#8217;s HBM capacity is growing but still likely insufficient</a></p></li><li><p><a href="https://www.the-substrate.net/i/200318016/domestic-memory-limits-huaweis-next-chips">Domestic memory limits Huawei&#8217;s next chips</a></p></li><li><p><a href="https://www.the-substrate.net/i/200318016/advanced-packaging-is-another-constraint-on-chinese-ai-chips">Advanced packaging is another constraint on Chinese AI chips</a></p></li></ul></li><li><p><a href="https://www.the-substrate.net/i/200318016/semiconductor-manufacturing-equipment">Semiconductor manufacturing equipment</a></p><ul><li><p><a href="https://www.the-substrate.net/i/200318016/photolithography">Photolithography</a></p></li><li><p><a href="https://www.the-substrate.net/i/200318016/etch-clean-deposition-metrology-and-more">Etch, clean, deposition, metrology, and more</a></p></li></ul></li></ul><h1 style="text-align: justify;">Introduction</h1><p>Compute is one of the key inputs to AI progress, and US export controls prevent Chinese companies from buying advanced AI chips and the equipment needed to make them. Under these conditions, China acquires <a href="https://www.the-substrate.net/p/where-will-china-get-its-compute">compute in several ways</a>. It stockpiles and <a href="https://www.the-substrate.net/p/how-banned-ai-chips-end-up-in-china">smuggles</a> US-designed chips, rents <a href="https://www.the-substrate.net/p/how-much-us-compute-is-china-renting">cloud capacity</a> abroad, and tries to squeeze more performance out of limited hardware through software and systems engineering. But one of the most consequential channels in the long run is domestic production. If China can make more of its own AI chips, that reduces its future dependence on US and allied suppliers and makes it harder to constrain through export controls.</p><p>Compute determines how extensively AI companies can pursue frontier R&amp;D, how many training runs they can <a href="https://epoch.ai/data-insights/openai-compute-spend/">conduct</a>, how large those runs can be, and how they can deploy new models. In early 2025, the US was estimated to<a href="https://epoch.ai/data-insights/ai-supercomputers-performance-share-by-country"> account</a> for 75% of installed compute capacity, while China was a distant second at 15% and the EU trailed at 5%. China&#8217;s share is likely lower today, with <a href="https://epoch.ai/data/ai-chip-owners?view=graph&amp;tab=h100_equivalents&amp;proportion=share&amp;colorPinned=China">estimates</a> that it owns about 6% of compute capacity, as the gap has probably widened following US export controls on advanced AI chips. With export controls preventing China from legally importing US-designed AI chips, that gap largely reflects who can design, fabricate, and deploy the most advanced AI chips at scale.</p><p>A modern AI chip is a package that brings together compute dies, high-bandwidth memory, and the physical connections between them. The compute dies are the processors that perform the calculations. The memory sits next to them and feeds them data fast enough to keep them busy. A silicon interposer connects the compute and memory dies inside the package, routing dense electrical signals between them. Producing such a complex chip therefore depends on several capabilities at once, including cutting-edge chip-design software, front-end fabrication for both logic and memory, semiconductor manufacturing equipment, and back-end packaging.</p><h2 style="text-align: justify;">What this post covers</h2><p><strong>This post gives an overview of China&#8217;s AI chipmaking ecosystem.</strong> We look at four key parts of the supply chain and, for each, compare Chinese progress with the US and allied state of the art. The four are:</p><ul><li><p><strong>AI chip design.</strong> This is the layout of the compute die, containing the processor cores. The first step in AI chip production is designing the processor architecture. That architecture is then translated into a physical circuit layout using specialized software and verified to work as intended before being sent to a chip fabrication plant (&#8220;fab&#8221;) for manufacturing. These design choices are critical because they determine the performance, efficiency, and capabilities of the resulting chip.</p></li><li><p><strong>Logic fabrication.</strong> This is the manufacturing of the compute dies. Once the design is complete, the next step is fabrication. This highly specialized manufacturing process imprints a chip design onto silicon wafers through hundreds of sequential steps (depositing, patterning, and etching layers of materials) to form transistors and the metal connections between them. It is strategically important because only a small number of facilities worldwide can produce the most advanced chips.</p></li><li><p><strong>High-bandwidth memory (HBM) fabrication.</strong> This is the manufacturing of the memory stacks that sit next to the compute dies on the interposer. In addition to processors, AI systems rely on HBM to handle vast amounts of data. This part, therefore, focuses on the manufacture of vertically stacked memory chips packaged alongside the AI chip to deliver the fast data read/write speeds required by AI training and inference workloads.</p></li><li><p><strong>Semiconductor manufacturing equipment (SME).</strong> These are highly specialized machines (photolithography, deposition, etching and cleaning, process control, and others) used to carry out each step of logic and memory chip fabrication. Unlike the other parts of this analysis, SME refers to the physical tools that enable fabrication rather than to the process itself. It represents a key bottleneck in the global semiconductor supply chain because only a handful of companies can produce them.</p></li></ul><p>China is trying to indigenize every part of the supply chain at once, while being constrained by export controls at each stage. It is much further along in some parts than in others, but overall, it remains many years away from designing and producing frontier AI chips at scale using domestic equipment.</p><p>Although we&#8217;ve done our best to cover the most relevant information, this report has some gaps, and some of its findings are tentative. That is because Chinese AI chipmaking is, to some extent, shrouded in uncertainty&#8212;there is limited public information about how much progress Chinese companies are making, their plans, and, in some cases, even who the important actors are. For example, it is often hard to pin down how many units of a product have been ordered or shipped; performance figures often come from company materials or investor-facing roadmaps rather than third-party measurements; and announcements of breakthroughs in chips, fabrication, and equipment usually come from the companies themselves, local governments, or local media, all of whom have strong <a href="https://ucigcc.org/blog/the-accomplishments-and-contradictions-of-chinas-semiconductor-industrial-policy/">reasons</a> to overstate their progress, whether to win state subsidies, financing, government contracts, or present US and allied export controls as ineffective.</p><h2 style="text-align: justify;">US export controls constrain China across the entire AI chip supply chain</h2><p>Over the past several years, the US has implemented a set of restrictions aimed at both keeping the most advanced AI chips out of China and preventing Chinese AI chip makers from using key US and allied equipment and services. These restricted products and services include:</p><ul><li><p><strong>AI chips.</strong> The US bars AI chip designers like NVIDIA and AMD from selling their most powerful chips to China. As of May 2026, the most advanced chip that <a href="https://www.nbcnews.com/world/asia/us-approves-nvidia-h200-chip-exports-china-conditions-rcna253948">can be exported</a> to China is the NVIDIA H200, but even these sales are quite regulated: shipments of these products to China cannot exceed 50% of the volume sold to US customers; there is a 25% export tariff; and buyers must certify that the chips will not be used for military purposes.</p></li><li><p><strong>Foreign foundry access.</strong> US export controls prohibit Taiwan Semiconductor Manufacturing Company (TSMC), the world&#8217;s dominant chip manufacturer, from <a href="https://www.federalregister.gov/documents/2022/10/13/2022-21658/implementation-of-additional-export-controls-certain-advanced-computing-and-semiconductor">producing</a> advanced AI chips for Chinese chip designers. Foundries are <a href="https://www.trendforce.com/news/2024/11/11/news-four-key-takeaways-on-tsmcs-reported-halt-of-7nm-and-below-chip-supplies-to-china/">required</a> to review any 7nm-and-below shipment to Chinese designers and <a href="https://www.federalregister.gov/documents/2025/01/16/2025-00711/implementation-of-additional-due-diligence-measures-for-advanced-computing-integrated-circuits">presume</a> advanced logic chips are AI chips unless proven otherwise. (A &#8220;process node&#8221;, traditionally defined in nanometers, names a generation of fabrication technology. Each new generation reduces the size of the chip&#8217;s transistors, or features, fitting more into each square inch; a smaller process node is more advanced.)</p></li><li><p><strong>High-bandwidth memory.</strong> US export controls <a href="https://www.federalregister.gov/documents/2024/12/05/2024-28270/foreign-produced-direct-product-rule-additions-and-refinements-to-controls-for-advanced-computing">block</a> HBM sales to China, specifically HBM2E (2020) and later generations.</p></li><li><p><strong>Semiconductor manufacturing equipment.</strong> China is blocked from buying EUV photolithography tools, the most advanced DUV tools, and other <a href="https://www.federalregister.gov/documents/2023/10/25/2023-23049/export-controls-on-semiconductor-manufacturing-items">equipment </a>needed for advanced logic and memory production.</p></li><li><p><strong>Electronic design automation (EDA) software.</strong> US export controls <a href="https://www.csis.org/analysis/updated-october-7-semiconductor-export-controls">prohibit</a> sales of software needed to design chips using gate-all-around (GAAFET) transistors, the most advanced transistor architecture, used to design the most advanced AI chips (3nm and below).</p></li></ul><p>For a detailed list of products controlled by the US, see the <a href="https://www.ecfr.gov/current/title-15/subtitle-B/chapter-VII/subchapter-C/part-774">Commerce Control List</a>.</p><h1>AI chip design</h1><p>The first step in<a href="https://peterwildeford.substack.com/p/explainer-how-ai-chips-are-made"> the AI chipmaking process</a> is to design the chip. Simplifying a bit, designers define what the chip is supposed to do, how much compute it should deliver, how much power it can consume, and how it will interact with memory and software. Engineers then construct an abstract model of the circuit logic, which EDA <a href="https://www.synopsys.com/glossary/what-is-electronic-design-automation.html">software</a> synthesizes and then translates into a physical blueprint, using process design kits (PDKs), <a href="https://www.synopsys.com/glossary/what-is-a-process-design-kit.html">collections</a> of files and rules provided by manufacturers that define how transistors and wires can be built on a specific fabrication process. After extensive timing, power, and manufacturability checks, the design is finalized and sent for fabrication. The entire <a href="https://www.softwebsolutions.com/resources/gen-ai-in-chip-design/">process</a> is complex and expensive, as advanced AI chips often require over a year and a large engineering team to complete.</p><p>Chip fabrication companies typically operate under one of two models:</p><ul><li><p><em>Fabless</em> firms, such as NVIDIA and AMD, design chips but outsource manufacturing to specialized facilities (&#8220;foundries&#8221;), such as TSMC.</p></li><li><p><em>Integrated device manufacturers</em> (IDMs), such as Intel, both design and fabricate chips in-house.</p></li></ul><h2 style="text-align: justify;">China remains dependent on Western chip design software</h2><p>A modern AI chip contains tens to hundreds of billions of transistors arranged across more than a dozen metal layers, and without electronic design automation software, designing or verifying a chip of this scale by hand is practically <a href="https://cset.georgetown.edu/wp-content/uploads/The-Semiconductor-Supply-Chain-Issue-Brief-1.pdf">impossible</a>.</p><p><strong>Chinese chip designers remain heavily<a href="https://www.eetimes.com/u-s-restricts-eda-software-sales-to-china/"> dependent</a> on US EDA software.</strong> EDA software enables engineers to design, simulate, and verify complex integrated circuits before fabrication. Companies such as Synopsys (US), Cadence (US), and Siemens (Germany) together control about 75% of China&#8217;s overall EDA market.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> This dominance <a href="https://ucigcc.org/wp-content/uploads/2022/12/kleinhans-working-paper_-IGCC-2022.pdf">reflects</a> over two decades of high development costs, high switching costs, and close cooperation between foundries and EDA companies.</p><p>The US first restricted EDA software for GAAFET transistors, the most advanced transistor architecture, in 2022, extending those controls multilaterally in 2024. By May 2025, the Bureau of Industry and Security (BIS) had <a href="https://techcrunch.com/2025/05/30/us-imposes-new-rules-to-curb-semiconductor-design-software-sales-to-china/">escalated</a> to requiring export licenses for all EDA sales to China. But two months later, those broader requirements were partially <a href="https://www.cnbc.com/2025/07/03/us-lifts-chip-software-curbs-on-china-amid-trade-truce-synopsys-says-.html">reversed</a> as part of a deal on rare-earth magnets.</p><p>The effectiveness of EDA controls is further complicated by potential <a href="https://www.chinatalk.media/p/holes-in-the-chip-design-software">software piracy</a>, which is likely more feasible for smaller Chinese start-ups than large, multinational companies like Baidu and Alibaba. EDA software for advanced-node chips is difficult to pirate because it is tightly integrated with foundry PDKs, which are updated frequently and serve as a de facto license verification mechanism: a chip designer without a valid, current license will be detected when submitting designs for fabrication at foundries.</p><p>China&#8217;s nascent EDA industry is led by Beijing-based <a href="https://www.scmp.com/tech/tech-war/article/3182988/chinese-firm-aiming-break-us-dominance-chip-design-software-gets-ipo">Empyrean Technology</a>, with Primarius Technologies and smaller players such as Semitronix and X-Epic following. In early 2024, Empyrean <a href="https://www.thewirechina.com/2025/07/20/the-chip-catalyst/">held</a> around 6% of China&#8217;s EDA market, while Chinese EDA vendors collectively <a href="https://www.trendforce.com/news/2025/06/05/insights-chinas-eda-self-sufficiency-tops-10-in-2024-will-u-s-crackdown-boost-or-block-its-chip-push/">accounted</a> for a little over 10%. Capability gaps remain substantial, however. Although Empyrean and Primarius <a href="https://markets.financialcontent.com/stocks/article/tokenring-2025-10-24-chinas-eda-breakthroughs-a-leap-towards-semiconductor-sovereignty-amidst-global-tech-tensions">claim</a> support for 7nm and even 5nm designs, they seem to cover only <a href="https://www.institutmontaigne.org/ressources/pdfs/publications/great-power-chokepoints-chinas-semiconductor-industry-search-breakthroughs.pdf">part</a> of the full design flow, with persistent weaknesses in full-stack <a href="https://www.computerworld.com/article/3998008/us-to-block-chinas-access-to-essential-semiconductor-design-software.html">integration</a> across simulation, IP compatibility, and foundry certification. Fully closing the gap, especially for sub-7nm chip design, could take five to ten <a href="https://www.computerworld.com/article/3998008/us-to-block-chinas-access-to-essential-semiconductor-design-software.html">years</a> or more. Progress has been most visible in memory-chip design tooling&#8212;in August 2025, Empyrean <a href="https://www.trendforce.com/news/2025/08/19/news-empyrean-reportedly-unveils-chinas-first-full-process-eda-platform-for-memory-chip-production/">unveiled</a> China&#8217;s first full-process EDA platform for memory production, which is <a href="https://www.digitimes.com/news/a20250819PD217/eda-production-cxmt-localization-dram.html">reportedly being adopted</a> by China&#8217;s largest memory maker.</p><p>Although the broader May 2025 EDA restrictions were rescinded, this persistent dependency remains a latent vulnerability. The GAAFET-specific controls from 2022 remain in place, and Washington could re-impose broader restrictions at any time. If it did, Chinese chip designers would lose access to the tools required to design leading-edge chips, with limited domestic alternatives offering comparable capabilities. In practice, any future cutoff would bite <a href="https://www.chinatalk.media/p/holes-in-the-chip-design-software">gradually </a>rather than instantly, as Chinese designers likely pay for multi-year license agreements and could maintain EDA access through overseas subsidiaries and shell-company arrangements.</p><h2 style="text-align: justify;">China&#8217;s AI chip design ecosystem is broadening quickly</h2><p><strong>The number of AI chip designers in China has grown rapidly in recent years, and at least nine domestic AI chip firms, according to Chinese media, have <a href="https://www.chinadailyhk.com/hk/article/628818">shipped</a> more than 10,000 chips each.</strong> To put this in perspective, NVIDIA <a href="https://epoch.ai/data/ai-chip-sales">sold</a> more than 4 million GPUs in 2025. These per-firm volumes are small, but the breadth of the cohort signals gradual diversification of China&#8217;s domestic supply. Investment capital has surged, and in late 2025 and early 2026 alone, multiple second-tier companies, including the <a href="https://www.globaltimes.cn/page/202512/1351155.shtml">&#8220;Four Little Dragons&#8221;</a>: Biren, Enflame, MetaX, and Moore Threads (<a href="https://m.thepaper.cn/newsDetail_forward_32313806">sometimes</a> Iluvatar CoreX is included instead of Enflame), raced to public <a href="https://www.trendforce.com/news/2025/12/19/news-chinese-gpu-makers-pick-up-pace-in-going-public/">listings</a>. They were all <a href="https://restofworld.org/2025/china-chip-startups-nvidia-us-export/">founded</a> by engineering talent returning from American semiconductor giants, such as NVIDIA, AMD, and Qualcomm, to help build China&#8217;s indigenous AI chip ecosystem.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a></p><p>Of designing chips, fabricating them, and building the machines that fabricate them, design is the easiest market to enter, so it is where China&#8217;s domestic ecosystem is broadening fastest. Building a leading-edge logic fab <a href="https://www.construction-physics.com/p/how-to-build-a-20-billion-semiconductor">costs</a> $10-20 billion per facility, with a single EUV photolithography machine alone costing <a href="https://www.cnbc.com/2025/05/22/exclusive-look-at-high-na-asmls-new-400-million-chipmaking-colossus.html">$220 million</a> or more, whereas designing a chip primarily requires engineers, EDA software licenses, and access to a foundry willing to manufacture the design. China, however, is less constrained by capital than by access to SME, with US, Dutch, and Japanese export controls barring Chinese firms from buying most of the equipment a leading-edge fab requires.</p><p>Within this growing pool of AI chip designers, four companies&#8212;Huawei, Cambricon, Alibaba&#8217;s T-Head, and Baidu&#8217;s Kunlunxin&#8212;account for the vast majority of domestic shipments. The rest of this section profiles each of them, then turns to the smaller, second-tier designers.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/07qjv/13/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e9a45b32-e893-47dd-9a0a-771a70d28361_1220x1190.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3eae8ce9-210f-4558-959c-13cdef910b50_1220x1466.png&quot;,&quot;height&quot;:643,&quot;title&quot;:&quot;China's AI chip designers&quot;,&quot;description&quot;:&quot;&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/07qjv/13/" width="730" height="643" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><strong>Huawei, the crown jewel of China&#8217;s semiconductor ambitions, is the country&#8217;s most advanced AI chip designer.</strong> Its chip-design arm, HiSilicon, designs smartphone chips (Kirin), server CPUs (Kunpeng), and network chips, and, since 2018, the Ascend line of AI chips. Huawei is the primary domestic option for inference <a href="https://merics.org/en/comment/despite-huaweis-progress-nvidia-continues-dominate-ai-chips-market-china">workloads</a>, but is not yet competitive for full-scale model <a href="https://epoch.ai/gradient-updates/why-china-isnt-about-to-leap-ahead-of-the-west-on-compute/">training</a>.</p><p>The Huawei Ascend is by far the most important domestic AI chip, <a href="https://winbuzzer.com/2026/04/03/chinese-chipmakers-now-hold-41-of-chinas-ai-chip-market-xcxwbn/">accounting</a> for about half of China&#8217;s domestic AI chip <a href="https://wccftech.com/mizuho-huawei-will-likely-sell-over-700000-units-of-its-ascend-910-series-chips-in-2025-despite-smics-fairly-low-yields-of-30-percent/">shipments</a> in 2025. The Ascend 910B and 910C are widely regarded as the most viable Chinese chips for serious workloads. The Ascend 910C (2025) delivers weaker, but roughly comparable, computational performance to an NVIDIA H100 (2022).<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a></p><p>The Ascend chips are also the only domestically designed hardware known to have been used to train frontier-scale models.<strong> </strong>The largest publicly known example is Huawei&#8217;s <a href="https://arxiv.org/abs/2505.04519">Pangu Ultra MoE</a>, with 718 billion parameters. Other Chinese models, including Zhipu AI&#8217;s <a href="https://z.ai/blog/glm-5">GLM-5</a> and <a href="https://www.reuters.com/world/china/deepseek-v4-chinese-ai-model-adapted-huawei-chips-2026-04-24/">DeepSeek-V4</a>, have been optimized to run on Ascends, though they were likely trained with NVIDIA chips.</p><p>In 2019, the US placed Huawei on the Entity List, and <a href="https://www.federalregister.gov/documents/2020/08/20/2020-18213/addition-of-huawei-non-us-affiliates-to-the-entity-list-the-removal-of-temporary-general-license-and">a further 2020 rule</a> restricted the company&#8217;s access to TSMC. But Huawei worked around this for years. Through intermediaries, Huawei <a href="https://newsletter.semianalysis.com/p/huawei-ascend-production-ramp">routed</a> orders to TSMC and accumulated an estimated 2.9 million advanced TSMC-made dies, worth roughly $500 million, before that channel was shut down in October 2024. This stockpile has powered most of the Ascend 910C shipments through 2024 and 2025.</p><p>Semiconductor Manufacturing International Corporation (SMIC), China&#8217;s most advanced foundry, also fabricates Ascend variants, but yields on its 7nm processes remain poor (see the section on fabrication below). (&#8220;Yield&#8221; refers to the percentage of functional chips produced from a wafer out of the total number attempted. Higher yield means better manufacturing efficiency and lower cost per chip.) <strong> </strong>It is unclear how much SMIC has actually contributed to Ascend 910C production, as an October 2025 <a href="https://www.businesstimes.com.sg/companies-markets/huawei-used-tsmc-samsung-sk-hynix-components-top-ai-chips-techinsights">teardown</a> of a 910C found dies originally fabricated by TSMC in 2020, and no public teardown has yet confirmed a fully domestically fabricated Ascend 910C. As the TSMC stockpile depletes, Huawei is gradually shifting Ascend production to SMIC, a transition that requires redesigning each chip generation around SMIC&#8217;s less mature 7nm process.</p><p><strong>Cambricon Technologies, a<a href="https://www.trendforce.com/news/2025/08/27/news-is-cambricon-the-next-nvidia-or-unsustainable-growth-story-three-operational-risks-behind-the-epic-rally/"> spin-off</a> from the Chinese Academy of Sciences, is perhaps the most commercially<a href="https://www.trendforce.com/news/2025/12/15/insights-cambricon-remains-chinas-top-ai-chip-startup-rumored-2026-triple-output-faces-smic-limits/"> visible</a> of China&#8217;s dedicated AI chip startups. </strong>Cambricon initially earned almost all of its revenue from licensing AI chip designs to Huawei: in 2017 and 2018, Huawei <a href="https://cntechpost.com/2020/03/27/ipo-prospectus-unveils-cambricons-mystery/">accounted</a> for roughly 97-98% of Cambricon&#8217;s income. That model collapsed in late 2018 when Huawei moved to its own DaVinci/Ascend architecture. After losing Huawei as its anchor customer, Cambricon <a href="https://static.sse.com.cn/stock/disclosure/announcement/c/202006/000354_20200603_6Z3R.pdf">pivoted</a> in 2019 toward cloud data-center AI accelerators. The transition took years to pay off, with company financial summaries showing annual losses before Cambricon turned <a href="https://tool.stockstar.com/summary/caibao/688256">profitable</a> in 2025.</p><p>Its most advanced AI chip, the Siyuan 590, is built on a domestic 7nm SMIC process and has a <a href="https://www.machineyearning.io/p/chinas-silicon-vanguard">reported computational performance</a> on par with the A100 (2020) or about one-third of an H100 (2022). In practice, the chip is positioned for inference and small-scale fine-tuning rather than full-scale training. Analysts <a href="https://techbuzzchina.substack.com/p/cambricon-chinas-nvidiaor-nvidia">estimate</a> that it &#8220;lags behind NVIDIA&#8217;s A100 in speed and memory bandwidth&#8221; and &#8220;lacks the scalability and throughput needed for high-end AI training&#8221;.</p><p>However, the production shift has now begun to pay off. In 2025, Cambricon&#8217;s <a href="https://www.scmp.com/tech/tech-trends/article/3348446/biren-iluvatar-corex-post-triple-digit-revenue-growth-losses-persist-ai-chip-race">revenue</a> reached around $900 million (+450% year-on-year), and the company posted its first annual profit since its 2020 IPO. At its peak, the company&#8217;s share price <a href="https://techbuzzchina.substack.com/p/cambricon-chinas-nvidiaor-nvidia">surged</a> to a level that briefly made it the highest-priced stock on China&#8217;s A-share market.<strong> </strong>The next-generation Siyuan 690, expected in the second half of 2026, is &#8220;expected to rival NVIDIA&#8217;s H100 [2022] in performance&#8221;, according to Chinese <a href="https://www.trendforce.com/news/2025/08/27/news-is-cambricon-the-next-nvidia-or-unsustainable-growth-story-three-operational-risks-behind-the-epic-rally/">media</a>. On chip volumes, Cambricon shipped an <a href="https://techbuzzchina.substack.com/p/cambricon-chinas-nvidiaor-nvidia">estimated</a> 100,000-200,000 Siyuan 590s in 2025 and is <a href="https://www.bloomberg.com/news/articles/2025-12-04/cambricon-aims-to-triple-chip-output-to-replace-nvidia-in-china">targeting</a> roughly 500,000 chips in 2026, of which about 300,000 would be Siyuan 590s and 690s combined.</p><p><strong>Alibaba&#8217;s chip subsidiary T-Head has moved from a minor player to shipping several hundred thousand Zhenwu chips. </strong>In January 2026, T-Head officially<a href="https://technode.com/2026/01/30/alibabas-t-head-unveils-self-developed-ai-chip-zhenwu-810e/"> unveiled</a> the Zhenwu 810E, advertising performance comparable to that of the NVIDIA H20 (2024). (The H20 was a deliberately weakened chip designed by NVIDIA to stay under US export control thresholds.) By the time of the announcement, the chip had already been <a href="https://technode.com/2026/01/30/alibabas-t-head-unveils-self-developed-ai-chip-zhenwu-810e/">deployed</a> in clusters containing 10,000 chips on Alibaba Cloud, and <a href="https://www.chinadaily.com.cn/a/202601/30/WS697ccbf7a310d6866eb36b18.html">was reportedly</a> used by state-affiliated organizations such as the State Grid Corporation of China and the Chinese Academy of Sciences, as well as companies with close ties to Alibaba, such as XPeng Motors and Sina Weibo. In May 2026, T-Head unveiled a newer Zhenwu M890, which is <a href="https://www.wsj.com/tech/ai/alibaba-unveils-new-ai-chip-upgrades-ai-model-421b196d">advertised</a> as three times more powerful. In 2025, Alibaba <a href="https://www.reuters.com/technology/artificial-intelligence/huawei-looks-beyond-moores-law-2026-05-27/">shipped</a> about 270,000 chips.</p><p><strong>Baidu&#8217;s AI chip subsidiary Kunlunxin has the longest operating history of any Chinese AI chip program.</strong> Baidu began<a href="https://spectrum.ieee.org/china-ai-chip"> research</a> into chip design in 2011, aimed primarily at inference workloads for Baidu&#8217;s search engine. The third-generation P800 Kunlun chip, its most advanced, is the backbone of Baidu&#8217;s AI infrastructure. In April 2025, Baidu announced it had created a 30,000-chip P800 <a href="https://www.technology.org/2025/04/25/baidu-lights-up-30000-strong-kunlun-chip-cluster-aims-to-train-deepseek-like-ai-models/">cluster</a> capable of training AI models similar to DeepSeek. The P800 offers roughly a <a href="https://thediplomat.com/2026/05/chinas-plan-for-winning-the-ai-race-hinges-on-the-token-economy-not-chips/">third</a> of the computational performance of an H100 (2022), and Baidu has said it increasingly <a href="https://www.chinadailyhk.com/hk/article/623566">runs</a> its chatbot Ernie on a mix of self-developed Kunlun chips and other infrastructure. Baidu&#8217;s future<a href="https://scout.eto.tech/?id=5281"> plans</a> include:</p><ul><li><p>M100, designed for inference, to be launched in early 2026</p></li><li><p>M300, designed for both large-scale multimodal training and inference, to be launched in 2027</p></li></ul><p>By 2030, Baidu <a href="https://www.caixinglobal.com/2025-11-14/baidu-unveils-ambitious-ai-chip-roadmap-targeting-1-million-card-cluster-by-2030-102382580.html">aims</a> to operate a cluster of one million Kunlun chips, a dramatic scale-up from its 30,000-chip P800 deployment in 2025. If of the current generation, a million Kunlun chips would deliver around 330,000 H100-equivalents<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a> of training compute. Given that Baidu is developing a new generation of chips (M100 and M300) that may deliver better computational performance, the resulting cluster could provide substantially more compute. OpenAI&#8217;s Stargate facility in Abilene, Texas, <a href="https://www.datacenterdynamics.com/en/news/openai-and-oracle-to-deploy-450000-gb200-gpus-at-stargate-abilene-data-center/">plans</a> to house around 450,000 NVIDIA B200s. A million Kunlun chips by 2030 would therefore amount to about 30% of Stargate Abilene&#8217;s planned compute.</p><p>Kunlunxin historically <a href="https://news.samsung.com/global/baidu-and-samsung-electronics-ready-for-production-of-leading-edge-ai-chip-for-early-next-year">fabricated</a> its chips at Samsung Foundry rather than SMIC, relying on Kunlunxin&#8217;s absence from the Entity List and on its chips sitting in a performance tier that US rules still <a href="https://spectrum.ieee.org/china-ai-chip">permitted</a> foreign foundries to produce for Chinese customers. However, Samsung reportedly <a href="https://akihabaranews.com/samsung-halts-supply-of-ai-chips-to-baidu/">halted</a> Kunlun supply in early 2025 under tightening US export controls and has since <a href="https://www.digitimes.com/news/a20251125PD215/samsung-baidu-roadmap-partnership-chips.html">disappeared</a> from Baidu&#8217;s public chip roadmap.</p><p>Baidu&#8217;s and Alibaba&#8217;s chip design activities mirror a trend also seen in the US, with Google&#8217;s TPUs and Amazon&#8217;s Trainium chips. Other tech giants, such as ByteDance and Tencent, have taken a different path, leaning more on external merchant designers than on in-house silicon. Tencent is Enflame&#8217;s <a href="https://www.scmp.com/tech/tech-war/article/3276213/chinas-nvidia-wannabe-tencent-backed-ai-chip-start-enflame-flags-ipo-intention">largest</a> shareholder with a roughly 21% stake and an <a href="https://www.scmp.com/tech/big-tech/article/3334360/moore-threads-ipo-frenzy-fires-chinas-home-grown-gpu-drive">investor</a> in Moore Threads, while ByteDance <a href="https://www.scmp.com/tech/big-tech/article/3334360/moore-threads-ipo-frenzy-fires-chinas-home-grown-gpu-drive">invested</a> in Moore Threads and <a href="https://www.chinadailyhk.com/hk/article/628818">runs</a> workloads on chips from Kunlunxin and Cambricon.<strong> </strong>ByteDance is also reportedly<a href="https://www.eefocus.com/article/1959266.html"> developing</a> an AI inference chip with the first samples planned for the end of March 2026. It reportedly plans to produce at least 100,000 units in 2026 and scale to 350,000 units in cooperation with Samsung. (ByteDance<a href="https://scout.eto.tech/?id=5502"> denied</a> this information, while Samsung declined to comment.)</p><p><strong>Enflame is an AI chip<a href="https://finance.yahoo.com/news/chinas-nvidia-wannabe-tencent-backed-093000784.html"> supplier</a> for Tencent, its largest shareholder and overwhelmingly dominant customer.</strong> As Enflame is not on the Entity List, it has retained access to fabrication at TSMC, which means it can manufacture chips as long as they stay within the performance thresholds defined by US export controls, about one-third of the NVIDIA H100.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a></p><p>Enflame&#8217;s most advanced AI chip, the S60 (2024), has reportedly shipped approximately 60,000 units. Its computing performance is <a href="https://www.machineyearning.io/p/chinas-silicon-vanguard">reportedly</a> around 40% and its memory bandwidth around 20% of those of an H100 (2022). The S60 chip was <a href="https://www.techinsights.com/blog/enflame-s60-ai-accelerator-processor-floorplan-analysis">reportedly</a> manufactured by TSMC, drawing scrutiny from US authorities. If the reported specifications are accurate, the S60 would exceed the export-control threshold. TSMC has <a href="https://www.nbcnews.com/tech/tech-news/ai-chip-tsmc-enflame-techinsights-rcna259342">disputed</a> the classification, saying the chip does not meet the criteria for a controlled AI chip. One of the most important deployments is a 10,000-chip S60<a href="https://en.eeworld.com.cn/mp/XSY/a399497.jspx"> cluster</a>, one of the largest publicly reported clusters of domestic Chinese AI chips.</p><p><strong>MetaX was listed on the Shanghai Stock Exchange in December 2025. Its revenue has <a href="https://www.trendforce.com/news/2026/03/09/news-china-gpu-race-intensifies-cambricon-turns-profitable-in-2025-moore-threads-metax-narrow-losses/">grown</a> rapidly, from roughly $8 million in 2023 to $110 million in 2024 to about $230 million in 2025.</strong> Its newest C600 chip, advertising 144 GB of HBM3E memory, is marketed as competitive with NVIDIA Hopper chips (the H100 and H200) for both training and inference workloads; its memory capacity exceeds that of the H100 (80 GB) and is on par with that of the H200 (141 GB). It is <a href="https://www.financialcontent.com/article/tokenring-2025-12-17-metaxs-soaring-debut-signals-chinas-bold-bid-for-semiconductor-self-sufficiency">planned</a> to enter mass production in the first half of 2026. The HBM3E itself is almost certainly imported, as CXMT does not yet produce HBM3E domestically. It may come from the HBM stockpile created before the December 2024 US HBM export <a href="https://www.federalregister.gov/documents/2024/12/05/2024-28270/foreign-produced-direct-product-rule-additions-and-refinements-to-controls-for-advanced-computing">ban</a> came into effect. Where the C600 logic die is fabricated has not been publicly reported. MetaX historically had to <a href="https://www.trendforce.com/news/2024/06/06/news-chinese-ai-chip-companies-reportedly-seek-tsmc-production-by-downgrading-chip-designs/">downgrade</a> its chip designs to stay within US export thresholds and keep fabricating with TSMC, but the C600 is now <a href="https://www.financialcontent.com/article/tokenring-2025-12-17-metaxs-soaring-debut-signals-chinas-bold-bid-for-semiconductor-self-sufficiency">touted</a> as a &#8220;fully domestically produced&#8221; chip. SMIC is effectively the only Chinese foundry capable of producing it at the required node, so it is the most plausible answer (although this is speculation). MetaX <a href="https://www.trendforce.com/news/2026/01/12/news-chinas-iluvatar-corex-reportedly-to-unveil-2026-28-gpu-roadmap-targeting-nvidia-h200-b200/">holds</a> roughly 1% of China&#8217;s AI accelerator market.</p><p><strong>Moore Threads is often <a href="https://www.scmp.com/tech/big-tech/article/3335337/chinas-little-nvidia-aspires-become-leading-global-player-after-shares-soar-468">described</a> as the Chinese startup most directly trying to replicate NVIDIA&#8217;s approach. </strong><a href="https://www.scmp.com/tech/big-tech/article/3327614/meet-moore-threads-how-former-nvidia-vice-president-created-chinese-gpu-star">Founded</a> in 2020 by NVIDIA&#8217;s former China general manager, James Zhang Jianzhong, it is the only Chinese firm building GPUs with real graphics-rendering capability alongside AI compute, rather than the AI-focused data center GPU designs offered by peers like Biren and MetaX<strong>.</strong><a href="https://finance.yahoo.com/news/moore-threads-ipo-frenzy-energises-093000893.html"> Backed</a> by Tencent, ByteDance, and Sequoia China, it was <a href="https://www.scmp.com/tech/tech-trends/article/3337217/moore-threads-unveils-new-ai-chips-challenge-nvidia">listed</a> on Shanghai&#8217;s STAR Market in late 2025. Moore Threads has launched four GPU architectures since 2021, though the first two were aimed at desktop and professional graphics rather than AI, and only the last two, Quyuan (2023) and Pinghu (2024), <a href="https://www.trendforce.com/news/2025/10/07/news-chinas-gpu-trio-rise-as-nvidia-retreats-decoding-moore-threads-metax-and-cambricon/">targeted</a> AI compute.</p><p>The current flagship is the MTT S5000, built on the Pinghu architecture, and Moore Threads <a href="https://en.tmtpost.com/post/7612583">claims</a> it is designed to rival the NVIDIA H100 (2022). Actual shipment volumes have not been publicly disclosed, and Moore Threads <a href="https://www.trendforce.com/news/2026/01/12/news-chinas-iluvatar-corex-reportedly-to-unveil-2026-28-gpu-roadmap-targeting-nvidia-h200-b200/">holds</a> under 1% of China&#8217;s AI computing, graphics, and SoC segments combined. At its inaugural developer conference in December 2025, it unveiled the Huashan chip, which it claims will exceed an H100 in compute, memory bandwidth, and capacity, with some metrics approaching Blackwell-generation NVIDIA chips. Independent verification is pending. Huashan is <a href="https://www.trendforce.com/news/2025/12/22/news-chinas-moore-threads-unveils-huashan-ai-chip-reportedly-takes-aim-at-nvidias-hopper/">slated</a> for mass production sometime in 2026, though Moore Threads has not disclosed a specific quarter.</p><p><strong>Biren Technology was one of the earliest Chinese startups to attract serious outside attention as a possible domestic answer to NVIDIA.</strong> In 2022, its BR100 was <a href="https://www.hpcwire.com/2022/08/22/chinese-startup-biren-details-br100-gpu/">presented</a> as a flagship chip with a claimed peak computational performance roughly on par with an NVIDIA H100 (2022), though this was never independently verified. The October 2022 US export controls forced TSMC to <a href="https://www.scmp.com/tech/tech-war/article/3196938/tsmc-said-suspend-production-chinese-chip-start-biren-amid-us-curbs">suspend</a> production of Biren&#8217;s chips. A year later, in October 2023, the US <a href="https://finance.yahoo.com/news/tech-war-us-sanctions-biren-093000121.html">added</a> Biren to the Entity List, cutting the company off from TSMC entirely. Biren entered the Hong Kong Stock Exchange in January 2026, backed by a consortium including state-linked funds. Its chips have been <a href="https://www.mlex.com/mlex/articles/2402297/china-s-telecom-giants-step-up-ai-chip-push-as-beijing-seeks-tech-independence">deployed</a> at China Mobile&#8217;s intelligent computing center in Hohhot, Inner Mongolia, though the specific number of chips has not been publicly disclosed.</p><p><strong>Iluvatar CoreX is another Chinese firm in the data center GPU category (alongside Biren and MetaX). </strong>Per its recent HKEX <a href="https://www1.hkexnews.hk/listedco/listconews/sehk/2025/1230/2025123000019.pdf">filing</a>, the company shipped nearly 10,000 training cards and 14,000 inference cards in the first nine months of 2025. Its announced roadmap and performance claims are considerably more ambitious than those shipment volumes suggest: Iluvatar <a href="https://www.trendforce.com/news/2026/01/28/news-chinas-illuvatar-corex-unveils-bold-gpu-roadmap-reportedly-eyeing-nvidias-rubin-by-2027/">claims</a> its current Tianshu architecture outperforms the NVIDIA H200 (2024), and has <a href="https://www.scmp.com/tech/big-tech/article/3341368/iluvatar-corex-targets-nvidias-rubin-gpu-road-map-amid-china-chip-push?module=top_story&amp;pgtype=section">published</a> a four-generation roadmap promising to match Blackwell by 2026 and surpass Rubin by 2027. Since NVIDIA started shipping Blackwells in 2024, these targets would place Iluvatar roughly 1-2 years behind NVIDIA. (We have not been able to verify these claims.) Per TrendForce&#8217;s <a href="https://www.trendforce.com/news/2026/01/12/news-chinas-iluvatar-corex-reportedly-to-unveil-2026-28-gpu-roadmap-targeting-nvidia-h200-b200/">analysis</a>, the older Iluvatar TianGai-100 (2021) and TianGai-150 (2023) chips deliver about 0.15x and 0.2x the computational performance of the NVIDIA H100 (2022), respectively. Iluvatar <a href="https://www.trendforce.com/news/2026/01/12/news-chinas-iluvatar-corex-reportedly-to-unveil-2026-28-gpu-roadmap-targeting-nvidia-h200-b200/">holds</a> only about 0.3% of the domestic data center GPU market.</p><h2 style="text-align: justify;">Chinese AI developers still prefer NVIDIA for training</h2><p><strong>Chinese developers still overwhelmingly prefer NVIDIA hardware for training workloads, a clear signal that a substantial gap remains between Chinese AI chips and Western competitors.</strong> Chinese companies continue to try to acquire NVIDIA chips through every available channel. For instance, ByteDance, Alibaba, and Tencent collectively <a href="https://asia.nikkei.com/business/tech/semiconductors/bytedance-alibaba-and-tencent-stockpile-billions-worth-of-nvidia-chips">ordered</a> roughly one million NVIDIA H20 chips ahead of an anticipated ban in early 2025, and orders from Chinese <a href="https://www.cnbc.com/2025/12/12/nvidia-considers-increasing-h200-chip-output-due-to-robust-china-demand-reuters-sources-say.html">firms</a> reportedly <a href="https://www.reuters.com/world/china/nvidia-sounds-out-tsmc-new-h200-chip-order-china-demand-jumps-sources-say-2025-12-31/">exceeded</a> two million units when the Trump administration later allowed H200 exports in January 2026. (It is not clear whether these H200s have in fact been shipped, however. But the demand seems to be there.)</p><p>Some Chinese companies have also <a href="https://www.ft.com/content/96fe9898-a3a4-4a33-be1d-da06bdb6cb2b">reportedly</a> leased large NVIDIA GPU clusters located in Southeast Asia to train frontier AI models, and state-linked entities have <a href="https://www.reuters.com/technology/chinese-entities-turn-amazon-cloud-its-rivals-access-high-end-us-chips-ai-2024-08-23/">accessed</a> restricted chips via AWS and Azure. Epoch AI <a href="https://epoch.ai/blog/chip-smuggling">estimates</a> that between 290,000 and 1.6 million H100-equivalents (about a third of China&#8217;s total AI compute capacity) have been smuggled into the country through 2025.</p><p><strong>Chinese AI chip software&#8212;Huawei&#8217;s CANN, Cambricon&#8217;s NeuWare, and Enflame&#8217;s TopsRider&#8212;remains substantially less mature than NVIDIA&#8217;s CUDA software. </strong>CUDA lets developers program GPUs for highly parallel AI workloads. Introduced by NVIDIA in 2006, it is highly optimized and <a href="https://developer.nvidia.com/cudnn">used</a> by every major AI framework&#8212;PyTorch, TensorFlow, and JAX&#8212;as the primary backend for training on NVIDIA hardware. Developers have <a href="https://www.chinatalk.media/p/can-huawei-compete-with-cuda">described</a> Huawei Ascends as &#8220;difficult and unstable to use&#8221; and &#8220;a road full of pitfalls&#8221;. In 2025, the Financial Times <a href="https://www.ft.com/content/eb984646-6320-4bfe-a78d-a1da2274b092">reported</a>, citing anonymous sources, that Beijing had <a href="https://www.artificialintelligence-news.com/news/deepseek-reverts-nvidia-r2-model-huawei-ai-chip-fails">urged</a> DeepSeek to train its R2 model on Huawei Ascend chips, but persistent technical failures forced the company to <a href="https://siliconangle.com/2025/08/14/deepseek-r2-model-release-reportedly-held-back-faulty-huawei-chi">revert</a> to NVIDIA for training, keeping Ascend only for inference.</p><p>In addition to their own software, Chinese chip designers also publish adapters that let models written for NVIDIA GPUs run on domestic hardware with minimal code changes, marketed as &#8220;CUDA <a href="https://techwireasia.com/2025/11/baidu-m100-m300-chinese-ai-chip-development/">compatibility</a>&#8221;. For example, <a href="https://github.com/baidu/vLLM-Kunlun">Baidu</a> and <a href="https://github.com/MetaX-MACA/mcPytorch">MetaX</a> provide a PyTorch adapter, while Huawei offers its <a href="https://github.com/Ascend/pytorch">torch_npu</a> adapter for Ascend. Public adoption of these adapters appears modest, but GitHub activity likely understates domestic adoption, since Chinese developers may use other source code hosting platforms, such as <a href="https://gitee.com/">Gitee</a>.</p><p>Adapting a single large model to a domestic Chinese chip is <a href="https://36kr.com/p/3658329871426184">reported</a> to take one to two months of engineering effort, and only a few dozen models are currently compatible with Chinese chips out of the more than two million available on Hugging Face. To accelerate adoption, Huawei has embedded engineering <a href="https://www.chinatalk.media/p/can-huawei-compete-with-cuda">teams</a> directly at customer sites at Baidu, iFlytek, and Tencent, and has <a href="https://www.scmp.com/tech/tech-war/article/3320852/tech-war-huawei-open-source-ai-chip-toolkit-take-nvidias-proprietary-platform">open-sourced</a> CANN to seed a third-party developer ecosystem. Meanwhile, researchers at the Chinese Academy of Sciences have separately developed a <a href="https://arxiv.org/abs/2505.02146">QiMeng-Xpiler</a> transcompiler that uses LLMs and symbolic synthesis to automatically translate tensor programs across hardware platforms, achieving an average accuracy of around 95%, which, if it matures, could reduce months of manual porting to an automated pipeline. But closing the CUDA gap can take years, and fragmentation across multiple Chinese stacks is <a href="https://scout.eto.tech/?id=5243">recognized</a> as a major barrier to adoption, since each chip requires its own porting effort.</p><h2 style="text-align: justify;">Chinese chips still trail NVIDIA on performance and memory</h2><p><strong>The performance gap between American and Chinese AI hardware remains substantial.</strong> The NVIDIA B300 (2025) delivers about 5x the computational performance, 2.5x the memory bandwidth, and 2.3x the memory capacity of the strongest Chinese AI chip in use, the <a href="https://cset.georgetown.edu/publication/pushing-the-limits-huaweis-ai-chip-tests-u-s-export-controls/">Huawei Ascend 910C</a> (2025). Outside of Huawei, most Chinese designers cluster around or below roughly 0.3x of an H100 (2022) in peak computational performance, the level of the older A100 (2020), which is also where the US <a href="https://cset.georgetown.edu/article/bis-2023-update-explainer/">set</a> its export control thresholds.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/BKg29/3/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/333f2da8-e231-42e9-b064-9aaf3ed8e983_1220x806.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/07073517-da3c-4005-b4e3-dbfcf9ca3d3e_1220x926.png&quot;,&quot;height&quot;:586,&quot;title&quot;:&quot;Comparison of computing performance of Chinese chips with AMD &amp; Nvidia (H100 equivalents)&quot;,&quot;description&quot;:&quot;&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/BKg29/3/" width="730" height="586" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p>According to Huawei&#8217;s public roadmap, its next-generation Ascend line features two accelerators based on the same die: the 950PR, for the prefill stage of inference, and the 950DT, for the decode stage and for training. The 950PR has <a href="https://www.trendforce.com/news/2026/03/23/news-huawei-debuts-atlas-350-on-ascend-950pr-with-in-house-hbm-touting-2-8x-h20-performance/">roughly half</a> the computational performance of the 910C and half its memory bandwidth. Prefill is compute-bound and benefits more from memory capacity than from memory bandwidth, so the 950PR comes with cheaper, lower-bandwidth HBM. The 950DT takes the same die in the other direction, pairing it with a <a href="https://www.huaweicentral.com/huawei-announces-self-made-hbm-memory-to-boost-ai-chips-potential/">faster</a> domestic HBM to deliver 25% higher memory bandwidth than the 910C. These design choices are likely driven at least partly by two constraints. First, its TSMC die stockpile is <a href="https://newsletter.semianalysis.com/p/huawei-ascend-production-ramp">running</a> out, so new chips must be fabricated domestically instead. Second, the December 2024 US <a href="https://www.federalregister.gov/documents/2024/12/05/2024-28270/foreign-produced-direct-product-rule-additions-and-refinements-to-controls-for-advanced-computing">ban</a> on HBM exports has cut off its supply of imported HBM, leaving it to fall back on a weaker domestic substitute.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a></p><p>What matters in the strategic competition over AI is <a href="https://www.the-substrate.net/p/for-chip-exports-quantity-is-at-least">the total compute a country can deploy</a>, not any single chip&#8217;s performance. Compute is largely fungible&#8212;for example, inference capacity can free up training capacity and vice versa, and to some extent, larger volumes of chips with lower performance and memory bandwidth can substitute for fewer, better chips. Therefore, what matters is not only the quality of Chinese AI chips but also how many it can make.</p><h2 style="text-align: justify;">Domestic chip shipments are growing but remain highly concentrated</h2><p>The domestic chip design ecosystem is growing, as Chinese chipmakers <a href="https://www.reuters.com/world/china/chinese-chipmakers-claim-nearly-half-of-local-market-nvidias-lead-shrinks-idc-2026-04-01/">captured</a> about 40% of China&#8217;s AI chip market in 2025, collectively <a href="https://epoch.ai/data/ai-chip-sales?view=graph&amp;tab=count&amp;timePeriod=annual">selling</a> an estimated 1.7 million chips that year, or 38% as many as NVIDIA&#8217;s 4.5 million worldwide<strong>.</strong> Similarly, the number of active AI chip designers has grown from two prior to 2018 (Huawei HiSilicon and Cambricon) to at least nine in 2026. Some of these AI chip designers likely see little demand because their products are not competitive. But even designers with more competitive products, such as Huawei, still can&#8217;t ship in the volumes they&#8217;d need to match US rivals, because Chinese chip fabrication companies can&#8217;t manufacture enough chips (see the section on fabrication below). Here&#8217;s how many chips each company shipped in 2025, according to reports:</p><ul><li><p>Huawei <a href="https://www.reuters.com/world/china/chinese-chipmakers-claim-nearly-half-of-local-market-nvidias-lead-shrinks-idc-2026-04-01/">shipped</a> about 810,000 Ascends</p></li><li><p>Alibaba T-Head <a href="https://www.scmp.com/tech/article/3341860/alibaba-ai-chip-push-hits-100000-mark-beating-local-rival-cambricon-sources">shipped</a> about 270,000 Zhenwus</p></li><li><p>Cambricon <a href="https://techbuzzchina.substack.com/p/cambricon-chinas-nvidiaor-nvidia">shipped</a> about 150,000 Siyuan 590s</p></li><li><p>Baidu Kunlunxin <a href="https://kr-asia.com/baidu-integrates-ai-into-search-as-rivals-pursue-superapp-ambitions">shipped</a> about 60,000 Kunlun P800s</p></li><li><p>Enflame <a href="https://pdf.dfcfw.com/pdf/H2_AN202601221818280973_1.pdf">shipped</a> about 60,000 chips</p></li><li><p>Iluvatar CoreX <a href="https://www1.hkexnews.hk/listedco/listconews/sehk/2025/1230/2025123000019.pdf">shipped</a> about 30,000 chips</p></li><li><p>MetaX <a href="https://hellochinatech.com/p/metax-paradox-china-ai-chips">shipped</a> about 25,000 chips</p></li><li><p>Moore Threads shipped about 20,000 chips</p></li><li><p>Biren shipped about 4,500 to 10,000 BR106 chips</p></li></ul><p><strong>Based on this shipment data and performance specifications, we estimate that Chinese AI chip designers collectively shipped about 770,000 H100-equivalents in 2025.</strong><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-8" href="#footnote-8" target="_self">8</a> That <a href="https://epoch.ai/data/ai-chip-sales">represents</a> about 7% of the compute NVIDIA is estimated to have sold in 2025, and about 4-5% of total global AI compute sold in 2025. Chinese aggregate shipments also equal only about two-thirds of the compute that OpenAI&#8217;s single Stargate <a href="https://www.datacenterdynamics.com/en/news/openai-and-oracle-to-deploy-450000-gb200-gpus-at-stargate-abilene-data-center/">facility</a> in Abilene, Texas, will deliver once its planned 450,000 NVIDIA B200 chips are installed.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-9" href="#footnote-9" target="_self">9</a></p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/TPx81/4/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3cb8cd0d-2a58-4a7e-a5c0-7a02e0aa92f1_1220x414.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fb253990-ca92-4184-9e53-b026e8f9fe26_1220x534.png&quot;,&quot;height&quot;:259,&quot;title&quot;:&quot;Computing power of Chinese AI chip shipments (H100-equivalents)&quot;,&quot;description&quot;:&quot;&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/TPx81/4/" width="730" height="259" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><h2 style="text-align: justify;">China&#8217;s cluster-level systems are limited by fab capacities</h2><p>As the semiconductor industry shifts away from single-chip performance metrics toward rack- and data-center-level <a href="https://www.fabricatedknowledge.com/p/the-data-center-is-the-new-compute">compute</a>&#8212;where what matters is the combined performance of chips, interconnects, and memory across the system&#8212;Chinese designers are doing the same, clustering more chips together to deliver high aggregate compute.</p><p>The leading <a href="https://semianalysis.com/2025/04/16/huawei-ai-cloudmatrix-384-chinas-answer-to-nvidia-gb200-nvl72/">example</a> is the Huawei CloudMatrix 384, a 384-chip Ascend cluster that Huawei claims to beat NVIDIA&#8217;s 72-chip <a href="https://www.nvidia.com/en-us/data-center/gb200-nvl72/">GB200 NVL72</a> on some training benchmarks. The comparison isn&#8217;t apples-to-apples, though. CloudMatrix packs roughly five times as many chips and is <a href="https://semianalysis.com/2025/04/16/huawei-ai-cloudmatrix-384-chinas-answer-to-nvidia-gb200-nvl72/">estimated</a> to draw roughly four times the power.</p><p>The clustering strategy itself is universal; NVIDIA, <a href="https://ir.amd.com/news-events/press-releases/detail/1261/amd-showcases-helios-rack-scale-platform-built-on-the-open-compute-project-open-rack-for-ai-introduced-by-meta">AMD</a>, and hyperscalers like <a href="https://www.trendforce.com/news/2025/11/07/news-google-unveils-7th-gen-tpu-ironwood-with-9216-chip-superpod-taking-aim-at-nvidia/">Google</a> and <a href="https://aws.amazon.com/blogs/aws/amazon-ec2-trn2-instances-and-trn2-ultraservers-for-aiml-training-and-inference-is-now-available/">AWS</a> all build dense rack-scale systems. Huawei is extending the same strategy with its next-generation Atlas 950 SuperPoD, which it <a href="https://www.huawei.com/en/news/2026/3/mwc-superpod-ai">says</a> can scale up to 8,192 Ascend NPUs.</p><p>This brute-force approach is <a href="https://newsletter.semianalysis.com/p/huawei-ai-cloudmatrix-384-chinas-answer-to-nvidia-gb200-nvl72">feasible</a> as far as the power budget and interconnect can carry it, but it compounds two structural problems. At the cluster level, scaling efficiency erodes as clusters grow. With aggregate performance rising, a larger share is lost to interconnect bandwidth, latency, and synchronization overhead, so the gain per added chip diminishes. And at the absolute level, China also cannot manufacture as many advanced chips as the US-led ecosystem can, which limits how far clustering more chips can compensate for the performance gap.</p><h1>Logic chips fabrication</h1><p>Even the best chip designs ultimately depend on advanced fabrication processes to manufacture the physical chips. This manufacturing is generally done by foundries. A foundry is a company that fabricates chips designed by other companies, such as Apple, NVIDIA, or Huawei. Foundries own and operate the extraordinarily expensive and complex fabrication facilities (&#8221;fabs&#8221;) required to physically produce chips. This model, <a href="https://spectrum.ieee.org/morris-chang-foundry-father">pioneered</a> by TSMC in 1987, allowed chip design and manufacturing to specialize independently, unlike the older integrated IDM model, where a single company like Intel or Samsung handles both design and fabrication in-house.</p><p>Foundries use advanced production processes (process &#8220;nodes&#8221;) that determine, among other things, how small and densely packed transistors and other chip features can be. More advanced nodes enable more performant and energy-efficient chips.</p><p>Until recently, the main bottleneck to scaling AI compute globally was power capacity for data centers. However, compute scaling now <a href="https://www.cnas.org/publications/reports/american-ai-companies-cant-get-enough-chips">seems to be bottlenecked</a> by AI chip production. AI chip production, in turn, has previously been bottlenecked by advanced packaging, but is now increasingly bottlenecked by the production of logic and memory chips. China has always faced a somewhat separate set of constraints. It has <a href="https://www.brookings.edu/articles/how-will-the-united-states-and-china-power-the-ai-race/">plenty of electricity</a>, so for years the main thing holding it back has been access to the most advanced chips, which are export-controlled. And as elsewhere, China&#8217;s domestic AI chip production is limited by its ability to produce competitive logic and memory chips at scale, a difficulty exacerbated by its lack of access to the most advanced chipmaking equipment.</p><p>The leading chipmaker is TSMC, which <a href="https://semiwiki.com/semiconductor-manufacturers/tsmc/361243-tsmc-2025-update-riding-the-ai-wave-amid-global-expansion/">produces</a> over 90% of the world&#8217;s most advanced chips (at 3nm and below), mostly in Taiwan. TSMC is where NVIDIA, AMD, Google, and Amazon have their AI chips <a href="https://www.reuters.com/business/retail-consumer/energy-use-forcing-rethink-ai-chip-design-tsmc-says-2026-05-28/">made</a>. But with export controls preventing Chinese chip designers from fabricating advanced AI chips via TSMC, they must rely on domestic chipmakers.</p><h2 style="text-align: justify;">SMIC leads China&#8217;s foundry sector</h2><p><strong>The leading Chinese chipmaker, SMIC, is the world&#8217;s third-largest foundry by revenue, behind TSMC and Samsung, with roughly <a href="https://www.trendforce.com/presscenter/news/20250901-12691.html">5%</a> of the global foundry market. </strong>SMIC operates ten <a href="https://brief.bismarckanalysis.com/p/chinas-struggle-to-manufacture-advanced">facilities</a> across four Chinese cities (Shanghai, Beijing, Tianjin, and Shenzhen), <a href="https://stockanalysis.com/quote/hkg/0981/employees/">employs</a> over 20,000 people, and <a href="https://www.trendforce.com/news/2026/02/11/news-smic-posts-record-9-3b-in-2025-sales-7nm-yields-reportedly-weigh-on-margins/">achieved</a> revenue of roughly $9.3 billion in 2025. (TSMC&#8217;s 2025 revenue was approximately <a href="https://www.stocktitan.net/sec-filings/TSM/6-k-taiwan-semiconductor-manufacturing-co-ltd-current-report-foreign--c7605b09d8e8.html">$120 billion</a>, about 13 times that of SMIC.)</p><p>The company is heavily state-linked. As of late 2024, identifiable state-affiliated shareholders <a href="https://www.yicaiglobal.com/star50news/2025_03_286809128691839270912">hold at least a quarter</a> of the company, and local governments co-invest in its regional fab subsidiaries.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-10" href="#footnote-10" target="_self">10</a> In 2020, the US<a href="https://2017-2021.commerce.gov/news/press-releases/2020/12/commerce-adds-chinas-smic-entity-list-restricting-access-key-enabling.html"> placed</a> SMIC on the Entity List, citing risk of diversion to the People&#8217;s Liberation Army under the Chinese <a href="https://en.wikipedia.org/wiki/Military%E2%80%93civil_fusion">military-civil fusion</a> strategy.</p><p>Whereas TSMC now<a href="https://www.tsmc.com/english/dedicatedFoundry/technology/logic/l_A16"> mass-produces</a> chips at 3nm and is ramping 2nm<a href="https://globalsemiresearch.substack.com/p/decoding-tsmcs-advanced-process-roadmap"> production</a> with advanced nanosheet transistors, <strong>SMIC&#8217;s most<a href="https://marklapedus.substack.com/p/can-china-make-5nm-chips"> advanced</a> process in volume is its 7nm node, roughly the generation that TSMC <a href="https://www.tsmc.com/english/news-events/blog-article-20200801">introduced</a> in 2018.</strong> SMIC has also progressed to a more advanced node (N+3), which has been likened to 5nm. However, TechInsights&#8217; <a href="https://www.techinsights.com/blog/smic-n3-confirmed-kirin-9030-analysis-reveals-how-close-smic-5nm">analysis</a> places the process somewhere between 7nm and 5nm, with a transistor <a href="https://semiwiki.com/semiconductor-services/techinsights/365118-forwarded-this-email-subscribe-here-for-more-kirin-9030-hints-at-smics-possible-paths-toward-300-mtr-mm2-without-euv/">density</a> of less than <a href="https://x.com/Jukanlosreve/status/1904055677603242410">125 million transistors per square millimeter</a>, comparable to TSMC&#8217;s 6nm rather than true 5nm. Initial wafer costs on this node are <a href="https://www.trendforce.com/news/2025/03/28/news-smic-reported-to-complete-5nm-chips-by-2025-but-costs-may-be-50-higher-than-tsmcs/">estimated</a> to be approximately 50% higher than TSMC&#8217;s equivalent process, with yields reaching about one-third of TSMC&#8217;s. On well-established advanced nodes, TSMC typically reaches <a href="https://patentpc.com/blog/5nm-vs-3nm-chips-performance-gains-and-market-adoption-rates-latest-data">yields</a> of 80-90%. For modern multi-die AI chips like the Ascend 910C, which combines two logic dies and eight HBM stacks in one package, that front-end wafer yield is only part of the picture; the effective chip yield is the front-end yield multiplied by the <a href="https://anysilicon.com/cowos-package/">packaging yield</a>, compounding to a lower number than the wafer-level figure suggests.</p><h2 style="text-align: justify;">SMIC lags three to five years behind TSMC</h2><p>As measured by process node alone, SMIC remains about <a href="https://asia.nikkei.com/business/tech/semiconductors/china-s-chip-capabilities-just-3-years-behind-tsmc-teardown-shows">three</a> to <a href="https://newsletter.semianalysis.com/p/fab-whack-a-mole-chinese-companies">five</a> years behind TSMC at the leading edge; factoring in yield, cost, and production volume, the effective gap is wider still. In practical terms, a smaller process node means more transistors can be packed into the same chip area, while consuming significantly less power per operation. For example, TSMC&#8217;s 3nm process <a href="https://www.phonearena.com/news/tsmc-3nm-chips-will-contain-nearly-300-million-transistors-per-square-mm_id123963">fits</a> roughly three times as many transistors per square millimeter as a 7nm process (about 300 million versus about 100 million). For AI chips, this translates directly into more compute per chip, lower power consumption, and lower cost per unit of performance. The node gap is why SMIC-fabricated AI chips are larger, need more energy, and cost more per unit of compute than their TSMC-fabricated equivalents.</p><p>The most important blocker for SMIC is a lack of access to ASML&#8217;s EUV photolithography machines, which are highly useful for manufacturing at 7nm and 5nm and are essential for more advanced nodes. SMIC ordered an EUV scanner from ASML in April 2018, but the deal was blocked by the Dutch in 2019 <a href="https://csis-website-prod.s3.amazonaws.com/s3fs-public/2023-09/230928_Allen_Post_October7.pdf">after pressure</a> from the US. (Chinese progress in domestic photolithography and other equipment is discussed in a section below.) Without EUV, SMIC must use a technique called multipatterning&#8212;layering multiple exposures to effectively shrink feature sizes&#8212;using older DUV immersion machines. Multipatterning raises costs, reduces throughput, and makes it harder to attain good yield.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/ZglHq/4/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/52a69425-b540-41f2-bfdc-01d7245d035f_1220x944.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/265b9ce2-357a-41cd-b466-45230824e7fa_1220x1014.png&quot;,&quot;height&quot;:482,&quot;title&quot;:&quot;Key Chinese logic fabs compared to TSMC&quot;,&quot;description&quot;:&quot;&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/ZglHq/4/" width="730" height="482" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p>By the end of 2025, SMIC&#8217;s total manufacturing capacity <a href="https://kr-asia.com/smic-posts-revenue-growth-in-q4-as-expansion-weighs-on-margins">reached</a> approximately 1.1 million wafers per month (measured in 8-inch equivalents), with utilization at roughly 95%. TSMC <a href="https://www.tsmc.com/english/dedicatedFoundry/manufacturing/fab_capacity">produces</a> roughly 1.4 million 12-inch-equivalent wafers per month. Since a 12-inch wafer has about <a href="https://www.wevolver.com/article/200-mm-wafer-vs-300-mm-wafer-a-technical-comparison-for-engineers">2.3</a> times the usable area of an 8-inch wafer, converting to a common standard puts SMIC&#8217;s output at roughly a third of TSMC&#8217;s in raw wafer area, though this overstates SMIC&#8217;s relative position, since most of its output is on mature nodes (45nm and above) while a bigger proportion of TSMC&#8217;s capacity serves the world&#8217;s most advanced chips. In 2025, SMIC&#8217;s capital expenditure <a href="https://www.trendforce.com/news/2026/03/30/news-smic-2026-action-plan-points-to-above-industry-growth-capex-held-in-line-with-2025/">hit</a> a company record of $8.1 billion, with a stated goal of adding one new fab per year. (TSMC&#8217;s 2025 capex <a href="https://www.trendforce.com/news/2026/01/14/news-tsmc-earnings-preview-150b-capex-over-next-three-years-tops-five-key-focuses/">was $40-42 billion</a>, five times that of SMIC.)</p><p>If SMIC were to allocate its entire advanced-node capacity to Huawei Ascend production, it could theoretically <a href="https://newsletter.semianalysis.com/p/huawei-ascend-production-ramp">make tens of millions</a> of Ascends per year by 2026-2027. (For comparison, NVIDIA sold <a href="https://epoch.ai/data/ai-chip-sales?view=graph&amp;tab=h100_equivalents">an estimated 4.5 million</a> GPUs in 2025.) The point is not that such an allocation is plausible, but that even an extreme reallocation of SMIC&#8217;s advanced-node capacity would still leave China short once HBM supply, packaging, yields, and other customers are taken into account. China&#8217;s AI build-out, spanning frontier model training, inference infrastructure, and state-directed deployments across industry and government, implies a scale of chips that SMIC would struggle to meet, especially given the performance gap between Ascend chips and leading NVIDIA hardware.</p><p><strong>Although SMIC can manufacture 7nm chips, over 75% of its capacity (as of Q1 2025) is <a href="https://www.trendforce.com/news/2025/05/09/news-smics-u-s-revenue-share-climbs-to-13-in-q1-warns-of-4-6-q2-sales-dip/">dedicated</a> to mature nodes (45nm and above)</strong>, serving the broad ecosystem of Chinese fabless companies designing chips for smartphones, consumer electronics, industrial automation, displays, and automotive applications. A natural question is whether this mature-node capacity could be redirected to AI chip production. In theory, it could, but in practice, the most advanced available process is strongly preferred because each node generation roughly doubles transistor density, which directly translates into more compute per chip and lower cost per operation. Building additional advanced-node capacity is difficult. It requires specialized <a href="https://www.globenewswire.com/news-release/2025/02/27/3034012/28124/en/Semiconductor-Manufacturing-Equipment-Industry-Research-2025-Market-Set-to-Reach-USD-155-09-Billion-by-2029-with-Applied-Materials-ASML-Tokyo-Electron-Lam-Research-and-KLA-Dominati.html">equipment</a> (the best of which is made in the US, Japan, and the Netherlands and is subject to export controls), deep process knowledge accumulated over years of production experience, and billions in capital.</p><h2 style="text-align: justify;">Hua Hong might expand China&#8217;s fab capacity over the long term</h2><p>Beyond SMIC, another notable Chinese chipmaker, Hua Hong, has so far made only mature-node chips, not the advanced-node chips used in AI hardware. Hua Hong Group (comprising Hua Hong Semiconductor and Huali) is China&#8217;s second-largest foundry and has historically focused on specialty and analog processes for automotive and industrial markets.</p><p>In March 2026, Reuters <a href="https://www.reuters.com/world/asia-pacific/chinas-no-2-chipmaker-readies-7-nm-production-beijing-ramps-up-self-suffiency-2026-03-16/">reported</a> that Huali is preparing a 7nm process at its Shanghai plant, with support from Huawei and SiCarrier (a Huawei-linked SME company covered in the equipment section below), aiming for an <a href="https://finance.yahoo.com/news/exclusive-china-no-2-chipmaker-034753815.html">initial</a> capacity of a few thousand wafers per month by the end of 2026. If successful, Hua Hong would become the second Chinese foundry after SMIC capable of producing 7nm chips, which could represent a meaningful expansion of domestic AI chip production, since Ascends are fabricated on 7nm processes.</p><p>However, as with SMIC, Huali is constrained by US export controls on equipment. In April 2026, BIS sent <a href="https://www.reuters.com/world/china/us-orders-chip-equipment-companies-halt-some-shipments-hua-hong-chinas-second-2026-04-28/">letters</a> to major US tool suppliers, including Applied Materials, Lam Research, and KLA, directing them to halt some shipments to Hua Hong or Huali-linked facilities. That makes Huali&#8217;s 7nm ramp highly uncertain, and even at an initial planned capacity of just a few thousand wafers per month, it would remain far behind SMIC, whose 7nm-and-below output is <a href="https://newsletter.semianalysis.com/p/huawei-ascend-production-ramp">estimated</a> at around 45,000-60,000 wafers per month. Huali is unlikely to meaningfully contribute to Chinese AI chip production for at least several years.</p><h2 style="text-align: justify;">Huawei is building fabs to reduce its dependence on SMIC</h2><p>Besides the established fabs, Huawei is moving beyond a fabless model to become an IDM, a company that designs, fabricates, and packages its own chips end-to-end, as Intel and Samsung do. Huawei is now <a href="https://www.digitimes.com/news/a20250515PD215/huawei-dram-dongguan-government-shenzhen.html">reported</a> to operate at least 11 fabs across China, covering memory and logic chips, through direct ownership or control of seven affiliated chipmaking companies. These entities are structured to obscure Huawei&#8217;s involvement, making it harder to assess Huawei&#8217;s true chipmaking capabilities or to identify sanctionable entities. At least five of the 11 fabs are reportedly capable of process nodes of 7nm and below, though this claim remains unverified and difficult to reconcile with the broader constraints on China&#8217;s advanced manufacturing.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-11" href="#footnote-11" target="_self">11</a></p><p><a href="https://www.ft.com/content/afd618f8-12c9-4297-b2a9-49f7dc548da4">Huawei</a> is also building three fabs in Shenzhen&#8217;s Guanlan district, under development since 2022, each run by a different actor linked to Huawei. One is Huawei&#8217;s own facility for 7nm smartphone and Ascend chips. A second is run by SiCarrier, the state-backed equipment maker that spun out of a Huawei lab. The third is operated by SwaySure, which supplies memory chips for Huawei&#8217;s cars and consumer devices. Huawei&#8217;s own plant was not expected to reach full operation before 2026, as the company plans to rely heavily on still-untested domestic equipment. By building its own fabs, Huawei seeks to bypass the SMIC bottleneck. SMIC&#8217;s advanced-node capacity is limited, shared among multiple Chinese chip designers, and yields on the large dies typical of AI chips have been a persistent <a href="https://biz.chosun.com/it-science/ict/2024/06/27/YSH7H767CJEWNMC2OOE7G5KIWY">challenge</a>.</p><h1 style="text-align: justify;">High-bandwidth memory fabrication</h1><p><strong>Of all the bottlenecks in China&#8217;s AI chip supply chain, HBM <a href="https://www.chinatalk.media/p/will-china-hit-the-hbm-wall">may be the most important</a>.</strong> When people talk about AI chips, they usually focus on processing speed&#8212;how many calculations a chip can perform per second. But in practice, a chip often sits idle, waiting for data. Just as a factory&#8217;s output depends not only on how fast its workers assemble parts but also on how quickly those parts reach the assembly line, so a chip&#8217;s actual performance depends on memory bandwidth, or how fast data can flow from memory to the processor. In this analogy, HBM is both the stockroom and the conveyor belt: it stores data close to the chip and delivers it in many parallel streams, fast enough to keep the processor busy. As such, it is a crucial component of modern AI chips.</p><p>HBM is an assembly of multiple thinned <a href="https://www.rambus.com/blogs/hbm3-everything-you-need-to-know/">stacked</a> dynamic random-access memory (DRAM) dies&#8212;typically eight to twelve in current generations, with sixteen-high stacks newly emerging&#8212;bonded together and mounted alongside the processor on a shared silicon interposer (a thin silicon &#8220;bridge&#8221; that routes signals between the processor and memory stacks). DRAM is the standard working memory used in virtually all computing devices, temporarily storing the data that processors need to access quickly. But standard DRAM is like a narrow pipe&#8212;it can only feed data to the processor in a single thin stream at a time. HBM works more like a broad river, delivering many parallel streams of data simultaneously, which is what AI chips need to process massive models without stalling. DRAM is in short supply worldwide because each gigabyte of HBM takes roughly <a href="https://investors.micron.com/static-files/a531c7f0-fca2-48f3-8f24-79c945aaa2d2">three</a> or <a href="https://www.trendforce.com/news/2025/12/26/news-ai-reportedly-to-consume-20-of-global-dram-wafer-capacity-in-2026-hbm-gddr7-lead-demand/">four</a> times the manufacturing capacity of a gigabyte of standard memory. As key producers shift their fabs toward AI-grade HBM and data centers drive surging demand for server-grade DRAM, conventional memory for phones, PCs, and servers gets squeezed, driving a <a href="https://dropreference.com/en/blog/news/shortage-ddr5-ram-2026">sharp</a> global spike in DRAM prices from early October 2025, with consumer DDR5 roughly tripling within a few months.</p><p>Producing HBM reliably is technically difficult. Each stack requires thinning individual DRAM dies to <a href="https://www.viksnewsletter.com/p/why-is-hbm-so-hard-to-manufacture">roughly 30-50 microns</a> (a fraction of their original thickness), then <a href="https://newsroom.lamresearch.com/high-bandwidth-memory-explained-semi-101?blog=true">etching</a> thousands of microscopic holes straight through the silicon, called through-silicon vias (TSVs), and filling them with copper to create vertical electrical connections between layers. The dies are then bonded together one by one, each requiring sub-micron alignment. A failure at any layer can damage the entire stack. Only three companies&#8212;SK Hynix and Samsung (both South Korea), and Micron (US)&#8212;<a href="https://www.astutegroup.com/news/general/sk-hynix-holds-62-of-hbm-micron-overtakes-samsung-2026-battle-pivots-to-hbm4/">control</a> essentially all of the <a href="https://counterpointresearch.com/en/insights/global-dram-and-hbm-market-share">global</a> HBM supply chain.</p><h2 style="text-align: justify;">HBM is restricted by US export controls</h2><p>This concentration in US-allied hands made HBM a viable target for export controls. When the US announced countrywide export restrictions on HBM more advanced than HBM2E to China in December 2024, Chinese companies had already been<a href="https://rhg.com/research/slaying-self-reliance-us-chip-controls-in-bidens-final-stretch/"> stockpiling</a> for months. By the time controls took effect, Chinese entities had <a href="https://newsletter.semianalysis.com/p/huawei-ascend-production-ramp">reportedly</a> procured approximately 13 million HBM stacks. That would be enough for roughly 1.6 million Huawei Ascend 910Cs if allocated entirely to that chip, though in practice Huawei and other Chinese AI chip designers may compete for the same limited pool. Some HBM continued to enter China through structural loopholes. For example, the initial rules did not<a href="https://ai-frontiers.org/articles/high-bandwidth-memory-critical-gaps-us-export-controls"> restrict</a> HBM already attached to simple processors, nor did they cover the<a href="https://www.chinatalk.media/p/mapping-chinas-hbm-advancement"> equipment</a> needed to manufacture HBM domestically. However, the stockpile is finite, and SemiAnalysis<a href="https://newsletter.semianalysis.com/p/huawei-ascend-production-ramp"> estimates</a> that China most likely ran out of stockpiled HBM in late 2025. China <a href="https://www.reuters.com/world/china/china-wants-us-relax-ai-chip-export-controls-trade-deal-ft-reports-2025-08-10/">requested</a> relaxed HBM restrictions in trade talks held in August 2025&#8212;not photolithography tools or access to TSMC&#8212;which signals that Chinese officials believe this is what most constrains domestic AI chipmaking.</p><h2 style="text-align: justify;">China&#8217;s memory makers are moving into HBM, led by CXMT</h2><p><strong>ChangXin Memory Technologies (CXMT) is China&#8217;s largest and most advanced DRAM player.</strong><a href="https://www.cxmt.com/en/"> It was founded</a> in 2016 after a failed <a href="https://americanaffairsjournal.org/2022/11/the-purges-that-upended-chinas-semiconductor-industry/">state-backed bid</a> to acquire Micron.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-12" href="#footnote-12" target="_self">12</a> Blocked from buying the technology, Beijing pivoted to building it. CXMT&#8217;s rise has been quite rapid, its<a href="https://www.wsj.com/tech/the-chinese-company-taking-on-the-worlds-memory-chip-giants-78dfea55"> </a>revenue <a href="https://www.digitimes.com/news/a20260327VL210/cxmt-memory-chips-demand-revenue-ipo-2025.html">reaching</a> about $8 billion in 2025, more than doubling from 2024. Its global DRAM market share has risen to around <a href="https://www.digitimes.com/news/a20250421PD218/cxmt-dram-samsung-sk-hynix-2025.html">5-7%</a> by revenue as of 2025. This rise was helped by the acquisition of legacy<a href="https://www.eetimes.com/changxin-emerging-as-chinas-first-only-dram-maker/"> knowledge</a> from the bankrupt German memory maker Qimonda, extensive talent<a href="https://scout.eto.tech/?id=4070"> recruitment</a> from Samsung, SK Hynix, Intel, Micron, Applied Materials, and ASML, and knowledge of Samsung&#8217;s DRAM<a href="https://wccftech.com/former-samsung-executives-and-employees-charged-for-leaking-billion-dollar-10nm-dram-tech-to-chinas-cxmt/"> processes</a> acquired through former employees. (In December 2025, Korean prosecutors<a href="https://www.wsj.com/tech/the-chinese-company-taking-on-the-worlds-memory-chip-giants-78dfea55"> indicted</a> ten people, including a former Samsung executive, on charges of systematically transferring trade secrets to CXMT, allegedly causing Samsung billions of dollars in losses.)</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/krdje/5/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7adcbeb6-9f0a-4201-818f-813e1080a5c1_1220x746.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/82a8abf5-29e0-415b-a4fc-2e5322014759_1220x816.png&quot;,&quot;height&quot;:400,&quot;title&quot;:&quot;Global DRAM market shares&quot;,&quot;description&quot;:&quot;&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/krdje/5/" width="730" height="400" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p>Beyond CXMT, several other Chinese firms play narrower roles in the memory and HBM supply chain. Fujian Jinhua (FJICC), which the United States <a href="https://www.federalregister.gov/documents/2024/12/05/2024-28267/additions-and-modifications-to-the-entity-list-removals-from-the-validated-end-user-veu-program">placed</a> on its Entity List in 2018 over alleged theft of trade secrets from Micron, <a href="https://kr-asia.com/china-makes-inroads-in-dram-chips-in-challenge-to-samsung-and-micron">produces</a> small batches of lower-grade DRAM with support from Huawei. SwaySure, a Huawei-backed company <a href="https://www.federalregister.gov/documents/2024/12/05/2024-28267/additions-and-modifications-to-the-entity-list-removals-from-the-validated-end-user-veu-program">added</a> to the same list in December 2024, is working on HBM stacking and, as of 2025, plans to ship <a href="https://winbuzzer.com/2025/05/04/satellite-images-show-huaweis-expanding-chip-production-in-china-xcxwbn/">sample</a> HBM to Huawei. Wuhan Xinxin, which shares a parent company with flash memory maker Yangtze Memory Technologies (YMTC), began <a href="https://asia.nikkei.com/business/tech/semiconductors/huawei-s-chip-and-display-suppliers-accelerate-china-s-ai-push">ramping up HBM2 production</a> in 2024, <a href="https://technode.com/2024/05/16/chinese-firms-cxmt-and-wuhan-xinxin-make-progress-in-high-bandwidth-memory-production-for-ai-chips/">aiming</a> for about 3,000 HBM wafers per month. YMTC itself makes mainly NAND flash, but is now <a href="https://www.digitimes.com/news/a20250902PD231/ymtc-dram-hbm-cxmt-memory.html">moving into DRAM production</a> in partnership with CXMT to reach the HBM market. (NAND flash is persistent storage memory, different from the faster, but ephemeral DRAM working memory described earlier.) Tongfu Microelectronics, the world&#8217;s <a href="https://www.trendforce.com/news/2026/01/19/news-chinas-osat-giants-step-up-tongfu-microelectronics-to-raise-rmb-4-4b-jcet-backs-chip-fund/">fourth-largest</a> chip packaging and testing firm, handles the assembly of HBM rather than die manufacturing, and says it has <a href="https://www.digitimes.com/news/a20250203VL214/tfme-nikkei-production-packaging-expansion.html">begun</a> trial HBM2 packaging, which most likely means joining finished HBM stacks to compute dies rather than production of the HBM stacks.</p><h2 style="text-align: justify;">CXMT trails the state of the art by three to four years</h2><p><strong>DRAM and HBM progress is usually measured by generation. CXMT&#8217;s most advanced mass-production DRAM is DDR5, a generation that leading manufacturers like SK Hynix and Samsung first <a href="https://www.theregister.com/on-prem/2020/10/06/sk-hynix-slips-past-rivals-samsung-and-micron-to-launch-worlds-first-ddr5-dram-sticks/1093757">shipped</a> in 2020 and <a href="https://finance.yahoo.com/news/chinas-cxmt-challenges-samsung-sk-093000090.html">scaled</a> production in 2021, putting CXMT roughly three to four years behind</strong>. CXMT&#8217;s DDR5 yields were <a href="https://www.trendforce.com/news/2024/12/30/news-chinese-dram-giant-cxmt-reportedly-achieves-80-ddr5-yield-targeting-90-by-2025/">reported</a> at about 80%, up from roughly 50% at the start of production, and DDR5 was expected to account for more than 60% of the company&#8217;s output by the end of 2025.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/KI7ii/4/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/65b39108-1a9e-4d1b-96b4-af35a89c4a7c_1220x802.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/31454f2c-a5a2-427d-aa1e-add96b9a244a_1220x872.png&quot;,&quot;height&quot;:414,&quot;title&quot;:&quot;HBM generations&quot;,&quot;description&quot;:&quot;&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/KI7ii/4/" width="730" height="414" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p>For AI chips, what matters is not commodity DRAM but HBM. The leading HBM makers are already moving into HBM4, with SK Hynix <a href="https://news.skhynix.com/sk-hynix-completes-worlds-first-hbm4-development-and-readies-mass-production/">preparing</a> mass production in 2026, while CXMT is still trying to scale earlier generations. Since the second half of 2025, CXMT has been <a href="https://wccftech.com/china-cxmt-ships-out-pivotal-hbm3-samples-to-huawei/">providing</a> HBM3 samples to Huawei and other Chinese AI chip designers, though initial shipments suffer from low yields of around 50%. The company has announced <a href="https://www.digitimes.com/news/a20250618PD216/cxmt-ddr5-hbm3e-launch-2027.html">ambitions</a> to enter HBM3E production in 2027 and is <a href="https://www.scmp.com/news/china-future-tech/semiconductors/article/3338246/chinas-dram-giant-cxmt-plans-us42-billion-ipo-shanghais-star-market">proceeding</a> with a $4.2 billion IPO on the Shanghai STAR Market.</p><p>CXMT <a href="https://www.chinatalk.media/p/will-china-hit-the-hbm-wall">faces</a> different bottlenecks at each HBM generation. Its current 1z/G4 DRAM node appears sufficient for HBM3, and further DUV multipatterning may <a href="https://www.chinatalk.media/p/will-china-hit-the-hbm-wall">support</a> HBM4. Beyond that, however, producing economically viable HBM becomes much harder without EUV, as further scaling would require increasingly complex multipatterning with worse yield. A second wall emerges with the base die, since every HBM stack includes a logic die that routes signals in and out of the stack, and HBM4 and later generations require that logic die to be built at advanced nodes to keep up with rising bandwidth demands. SK Hynix <a href="https://www.digitimes.com/news/a20240717PD204/tsmc-hbm4-5nm-micron-samsung-sk-hynix.html">outsources</a> its base-die fabrication to TSMC (at 5nm and below). Micron, by contrast, manufactures its HBM4 base die in-house, though it reportedly struggled to meet NVIDIA&#8217;s <a href="https://www.trendforce.com/presscenter/news/20260108-12869.html">high-speed requirements</a> and faced <a href="https://www.trendforce.com/news/2025/08/29/news-memory-giants-diverge-on-hbm-base-die-micron-reportedly-delays-foundry-shift-risks-losing-edge/">delays</a>, and will switch to TSMC for <a href="https://www.trendforce.com/news/2026/05/22/news-micron-turns-more-upbeat-on-outlook-reportedly-sets-2027-hbm4e-ramp-with-tsmc-for-standard-and-custom-logic-dies/">HBM4E</a>. Without access to either advanced-node foundries or its own advanced-node logic process, CXMT will likely have to use 7nm alternatives at best, potentially producing functional but <a href="https://www.chinatalk.media/p/will-china-hit-the-hbm-wall">lower-bandwidth</a> HBM4. In short, some HBM3 is already in production in China; HBM4 will be feasible but with reduced bandwidth due to base-die limitations; and HBM4E and beyond will be very difficult to produce competitively at scale without EUV machines.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/FXnus/6/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ea177fc2-16c0-4883-b925-79d4ab13b6c2_1220x1360.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/537a27cb-1512-4af4-b851-bcaf44663f26_1220x1430.png&quot;,&quot;height&quot;:627,&quot;title&quot;:&quot;Global DRAM and HBM memory manufacturer comparison&quot;,&quot;description&quot;:&quot;&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/FXnus/6/" width="730" height="627" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><h2 style="text-align: justify;">The global HBM market is highly concentrated in three foreign companies</h2><p>In the global HBM market, concentration remains extreme. According to one source, in 2025, SK Hynix <a href="https://korea.counterpointresearch.com/global-dram-and-hbm-market-share-quarterly/">accounted for</a> 62% of HBM revenue, Micron 20%, and Samsung 18%. SK Hynix&#8217;s lead <a href="https://www.cnbc.com/2024/05/24/samsungs-hbm-chips-are-failing-nvidia-tests-reuters.html">comes</a> from an early bet on HBM that paid off as demand for AI chips exploded. Samsung, long the world&#8217;s largest DRAM maker, has struggled to make competitive HBM, and its share <a href="https://korea.counterpointresearch.com/global-dram-and-hbm-market-share-quarterly/">fell</a> to 13% in the first quarter of 2025 before recovering to 22% in the last quarter. Micron, historically holding a smaller share of the global DRAM market, has moved aggressively into HBM and <a href="https://www.cnbc.com/2025/12/03/micron-stops-selling-memory-to-consumers-demand-spikes-from-ai-chips.html">announced</a> in December 2025 that it would exit the consumer memory market entirely to prioritize AI data-center customers.</p><p>CXMT holds effectively zero share&#8212;its DRAM production is concentrated in commodity DDR4 and early-generation DDR5 rather than HBM. The key question is not whether CXMT can compete with SK Hynix globally, but whether it can produce enough HBM to keep the Huawei Ascend production line running.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/fpa9B/3/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8e0bc470-4eb5-4801-8a61-b62841663489_1220x746.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2f0b1cac-f87f-49ae-a39c-24d31ef5b384_1220x816.png&quot;,&quot;height&quot;:400,&quot;title&quot;:&quot;Global HBM market shares&quot;,&quot;description&quot;:&quot;&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/fpa9B/3/" width="730" height="400" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><h2 style="text-align: justify;">China&#8217;s HBM capacity is growing but still likely insufficient</h2><p>CXMT&#8217;s expansion has been remarkable. In early 2026, CXMT was <a href="https://economy.ac/news/2026/02/202602288024">producing</a> approximately 240,000 DRAM wafers per month, with plans to expand to 300,000 later in 2026. It aims to<a href="https://economy.ac/news/2026/02/202602288024"> allocate</a> about 20% (60,000 wafers) to HBM3 production, a level that would have seemed <a href="https://www.digitimes.com/news/a20250421PD218/cxmt-dram-samsung-sk-hynix-2025.html">implausible</a> just a few years earlier. To put this scale in context, Samsung&#8217;s annual DRAM<a href="https://economy.ac/news/2026/02/202602288024"> capacity</a> totaled 7.6 million wafers in 2025, compared with 6 million for SK Hynix and 3.6 million for Micron. Therefore, CXMT&#8217;s annual output sits at roughly half of SK Hynix&#8217;s level.</p><p>In DRAM, the global leaders are several generations ahead of CXMT. CXMT&#8217;s DRAM production reportedly <a href="https://www.digitimes.com/news/a20250213VL204/dram-cxmt-16nm-production-development.html">uses a 16nm</a> process, roughly the generation that SK Hynix and Samsung <a href="https://www.digitimes.com/news/a20250213VL204/dram-cxmt-16nm-production-development.html">began</a> using in 2016.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-13" href="#footnote-13" target="_self">13</a> As of 2026, SK Hynix, <a href="https://semiconductor.samsung.com/news-events/news/samsung-ships-industry-first-commercial-hbm4-with-ultimate-performance-for-ai-computing/">Samsung</a>, and <a href="https://investors.micron.com/node/48496/pdf">Micron</a> are several generations ahead. (These node labels are not directly comparable to logic nodes as they refer to DRAM production processes.)</p><p>Estimates of CXMT&#8217;s 2026 HBM production vary widely. One<a href="https://newsletter.semianalysis.com/p/huawei-ascend-production-ramp"> estimate</a> puts output at about 2 million stacks, enough for about 250,000 Ascend 910C chips. A more optimistic<a href="https://aifrontiersmedia.substack.com/p/high-bandwidth-memory-the-critical"> projection</a> forecasts about 7 million stacks in 2026, enough for about 600,000 Ascend 910Cs after accounting for a 70% packaging yield, which roughly matches Huawei&#8217;s stated 2026 production <a href="https://www.huaweicentral.com/huawei-plans-600000-ascend-910c-chips-by-2026-to-block-nvidia-in-china/">targets</a> (though other Chinese chip designers would also draw on the same limited domestic pool).</p><h2 style="text-align: justify;">Domestic memory limits Huawei&#8217;s next chips</h2><p>The Ascend 910C uses eight stacks of HBM2E memory for 3.2 TB/s (terabytes per second) of memory bandwidth, roughly on par with an NVIDIA H100 (2022), but only about 40% of a Blackwell B200 (2024, 8 TB/s). Because CXMT&#8217;s domestic HBM3 samples are still in early production and imports of HBM2E and above are banned, Huawei has <a href="https://www.reuters.com/business/media-telecom/chinas-huawei-hypes-up-chip-computing-power-plans-fresh-challenge-nvidia-2025-09-18/">developed</a> its own HBM-like memory in-house (branded HiBL and HiZQ) rather than using standard HBM from SK Hynix, Samsung, or Micron.</p><p>The Ascend 950PR, <a href="https://www.cfr.org/articles/chinas-ai-chip-deficit-why-huawei-cant-catch-nvidia-and-us-export-controls-should-remain">released</a> in March 2026, pairs the chip with 128 GB of memory at 1.6 TB/s, which is half the bandwidth of the Ascend 910C. The Ascend 950DT (Q4 2026) <a href="https://www.techradar.com/pro/huawei-ascend-950-vs-nvidia-h200-vs-amd-mi300-instinct-how-do-they-compare">uses</a> 144 GB of HiZQ 2.0 at 4 TB/s. The NVIDIA B300 (2025) delivers 8 TB/s with 288 GB of HBM3E, or in other words, twice the bandwidth and twice the capacity of Huawei&#8217;s best 2026 chip. An <a href="https://www.cfr.org/articles/chinas-ai-chip-deficit-why-huawei-cant-catch-nvidia-and-us-export-controls-should-remain">analysis</a> of NVIDIA&#8217;s and Huawei&#8217;s 2027 plans suggests that NVIDIA chips could offer five times as much memory bandwidth as Huawei chips.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/HjDvr/3/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3ad5c42b-9f24-4a7e-8f65-af98516119cf_1220x1090.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/88835413-5161-460c-8130-3727a77480c8_1220x1160.png&quot;,&quot;height&quot;:554,&quot;title&quot;:&quot;Best Chip Memory Bandwidth (TB/s) by Company &amp; Year&quot;,&quot;description&quot;:&quot;&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/HjDvr/3/" width="730" height="554" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p>DRAM manufacturing is less dependent on EUV lithography than leading-edge logic chip manufacturing, since it requires fewer critical layers and its repetitive cell architecture is a natural fit for multipatterning techniques. <a href="https://news.samsung.com/global/samsung-announces-industrys-first-euv-dram-with-shipment-of-first-million-modules">Samsung</a> and <a href="https://www.techspot.com/news/109355-sk-hynix-asml-debut-world-first-high-na.html">SK Hynix</a> began incorporating EUV into mass production around 2020 and 2021, respectively, and<a href="https://www.tomshardware.com/tech-industry/semiconductors/hbm-roadmaps-for-micron-samsung-and-sk-hynix-to-hbm4-and-beyond"> Micron</a> more recently, in 2024,<a href="https://www.chinatalk.media/p/will-china-hit-the-hbm-wall"> employing</a> EUV machines for a single layer only. For the current-generation DRAM that CXMT is targeting, DUV tools are broadly sufficient, but EUV will become<a href="https://semiengineering.com/euvs-future-looks-even-brighter/"> necessary</a> as HBM scales further.</p><h2 style="text-align: justify;">Advanced packaging is another constraint on Chinese AI chips</h2><p>For China, HBM is also a packaging problem, as even if domestic firms can make the DRAM dies, they still need to stack and connect them. When AMD and SK Hynix first commercialized HBM in 2015, the central challenge wasn&#8217;t making the memory work, but solving the surrounding integration problems, such as thinning wafers without mechanical stress and cracking, etching and filling TSVs precisely through 30-50 micron silicon, bonding successive die layers without misalignment<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-14" href="#footnote-14" target="_self">14</a> and testing assembled stacks before permanent attachment. Each step requires specialized equipment, and yield implications compound, since a failure rate of even a few percent per die level can produce substantial stack-level losses.</p><p>Besides stacking the memory dies, advanced packaging covers a broader set of <a href="https://www.iaps.ai/research/how-ai-chips-are-made">techniques</a> for connecting multiple dies and chiplets (small chips designed to be combined with others in a single package). It is also applied to package the finished HBM stacks with a logic die. Consequently, for China, the bottlenecks extend beyond producing the HBM dies, since integrating them with the logic chips poses a challenge of its own.</p><p>The industry is also beginning to explore <a href="https://newsroom.lamresearch.com/3D-DRAM-architecture-proposal">3D DRAM</a>, in which memory cells are stacked vertically within a single chip. In conventional DRAM, cells sit side by side on a single flat layer of silicon, and <a href="https://semiengineering.com/baby-steps-towards-3d-dram/">capacity</a> has been increased by shrinking the cells and packing them more closely together. This approach, however, is close to its physical <a href="https://www.allpcb.com/allelectrohub/3d-dram-roadmap-and-production-timeline">limits</a>. 3D DRAM instead <a href="https://ieeexplore.ieee.org/document/10631471">builds </a>upward, stacking several layers of cells in the same footprint, as shown in the photo below. (This is distinct from HBM, which bonds separate, fully fabricated DRAM dies on top of one another in a packaging step.)</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!nieY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c338b3e-f76c-41cf-bdf6-b057687d26ab_600x388.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nieY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c338b3e-f76c-41cf-bdf6-b057687d26ab_600x388.png 424w, https://substackcdn.com/image/fetch/$s_!nieY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c338b3e-f76c-41cf-bdf6-b057687d26ab_600x388.png 848w, https://substackcdn.com/image/fetch/$s_!nieY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c338b3e-f76c-41cf-bdf6-b057687d26ab_600x388.png 1272w, https://substackcdn.com/image/fetch/$s_!nieY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c338b3e-f76c-41cf-bdf6-b057687d26ab_600x388.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nieY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c338b3e-f76c-41cf-bdf6-b057687d26ab_600x388.png" width="600" height="388" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4c338b3e-f76c-41cf-bdf6-b057687d26ab_600x388.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:388,&quot;width&quot;:600,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!nieY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c338b3e-f76c-41cf-bdf6-b057687d26ab_600x388.png 424w, https://substackcdn.com/image/fetch/$s_!nieY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c338b3e-f76c-41cf-bdf6-b057687d26ab_600x388.png 848w, https://substackcdn.com/image/fetch/$s_!nieY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c338b3e-f76c-41cf-bdf6-b057687d26ab_600x388.png 1272w, https://substackcdn.com/image/fetch/$s_!nieY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c338b3e-f76c-41cf-bdf6-b057687d26ab_600x388.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Image borrowed from <a href="https://www.allpcb.com/allelectrohub/3d-dram-roadmap-and-production-timeline">AllElectroHub</a>.</figcaption></figure></div><p>A shift toward 3D DRAM could work modestly in China&#8217;s favor, because it would move more of the manufacturing difficulty away from photolithography and toward <a href="https://www.the-substrate.net/p/china-is-making-strides-in-etching">high-aspect-ratio etching</a> and thin-film deposition (both covered in the equipment section below). Those are areas where China&#8217;s domestic equipment industry is still behind, but where it has made <a href="https://cset.georgetown.edu/article/inside-beijings-chipmaking-offensive/">faster</a> progress than in photolithography.</p><h1>Semiconductor manufacturing equipment</h1><p>The dominant cost of a new fab is equipment, which makes up about <a href="https://www.construction-physics.com/p/how-to-build-a-20-billion-semiconductor">70-80% of the total investment</a>, with the physical building accounting for the rest. The full production <a href="https://www.semiconductors.org/chipmakers-are-ramping-up-production-to-address-semiconductor-shortage-heres-why-that-takes-time/">process </a>involves about 1,000 individual <a href="https://semiengineering.com/battling-fab-cycle-times/">steps</a> and takes more than four months from blank wafer to finished AI chip. Each step requires its own specialized tools <a href="https://www.globenewswire.com/news-release/2025/02/27/3034012/28124/en/Semiconductor-Manufacturing-Equipment-Industry-Research-2025-Market-Set-to-Reach-USD-155-09-Billion-by-2029-with-Applied-Materials-ASML-Tokyo-Electron-Lam-Research-and-KLA-Dominati.html">produced</a> by a handful of companies in the US, Japan, and the Netherlands. As described in <a href="https://static1.squarespace.com/static/64edf8e7f2b10d716b5ba0e1/t/657ad0033899ee32c2fd6a38/1702547474572/Introduction+to+AI+chip+making+in+China+%5Bfinal%5D.pdf">this report</a>, the key stages, repeated hundreds of times to build the chip layer by layer, work as follows:</p><ul><li><p><em>Deposition</em><strong> </strong>coats the wafer with an ultra-thin layer of new material, either a metal for wiring or an insulator to keep layers separate.</p></li><li><p><em>Lithography</em><strong> </strong>then projects a circuit pattern onto a light-sensitive coating (<em>photoresist</em>) on that layer, using ultraviolet light through a mask (like a stencil). This step is central to shrinking the chip&#8217;s features&#8212;how small they can be printed&#8212;and uses the single most expensive piece of equipment in the fab, the photolithography scanner.</p></li><li><p><em>Etching</em><strong> </strong>carves away material wherever the pattern is exposed, cutting the intended circuit structures into the silicon.</p></li><li><p><em>Ion implantation</em> injects specific atoms into targeted regions of the wafer to change their electrical properties&#8212;this is how transistors get their ability to switch on and off.</p></li><li><p><em>Chemical mechanical planarization</em> (CMP) polishes the wafer surface perfectly flat after each layer is built, so subsequent steps, like deposition, etching, or patterning, can proceed cleanly; without it, tiny bumps from earlier layers would compound into defects.</p></li><li><p><em>Cleaning</em><strong> </strong>removes residues, particles, and chemical contaminants between steps, as contamination at the nanometer scale can ruin an entire chip.</p></li><li><p><em>Metrology and inspection</em> tools measure dimensions, check alignment between layers, and scan for defects throughout the process. Early detection allows some <a href="https://www.researchgate.net/publication/289491366_Wafer_rework_strategies_at_the_photolithography_stage">defects</a> to be repaired, or allows the manufacturer to discard defective wafers early on.</p></li></ul><p>After all layers have been built, the wafer enters testing, and each individual chip (&#8220;die&#8221;) on the wafer is probed to identify which ones work. The wafer is then diced into individual chips, and the working dies move to packaging, where they are mounted in protective housings with electrical connections. For many AI chips, advanced packaging is an additional critical step, with technologies like TSMC&#8217;s CoWoS placing the processor die alongside HBM stacks on a shared silicon interposer, enabling the fast data transfer that AI workloads require.</p><h2>Photolithography</h2><p><strong>Photolithography, the most complex and important type of SME, is where China is furthest behind.</strong><a href="https://www.youtube.com/watch?v=MiUHjLxm3V0"> </a>EUV photolithography, essentially necessary for chips at 5nm and below, is the most complex piece of equipment in the semiconductor supply chain. Lithography machines alone <a href="https://www.construction-physics.com/p/how-to-build-a-20-billion-semiconductor">account for</a> roughly 20% of a new fab&#8217;s total capital investment. A modern chip contains tens of billions of transistors, spread across many precisely aligned layers. Although even at the most advanced nodes, fabs <a href="https://www.asml.com/en/products/euv-lithography-systems">use</a> DUV for the bulk of layers, the most critical, smallest-feature layers require EUV. Without EUV for those critical layers, a foundry must substitute DUV multipatterning (printing the same layer in multiple passes), which is slower, more error-prone, and thus more expensive. For the most advanced nodes, as well as future ones, DUV multipatterning may not be able to replace EUV at all.</p><p>Shanghai Micro Electronics Equipment (SMEE) is China&#8217;s oldest photolithography maker, but has <a href="https://www.csis.org/blogs/strategic-technologies-blog/breakthroughs-or-boasts-assessing-recent-chinese-lithography">reached</a> only a 4% market share, and this only in i-line tools, a technology <a href="https://global.canon/en/news/2019/20191210.html">used</a> for MEMS, sensors, and power electronics rather than advanced-node fabrication. The global lithography market is a near-monopoly, with ASML (Netherlands) <a href="https://www.mordorintelligence.com/industry-reports/semiconductor-lithography-equipment-market">holding</a> over 85% of DUV immersion shipments and being the sole supplier of EUV systems. Nikon (Japan) still produces <a href="https://jpkleinhans.de/home/DPC2024_Lithography_Chapter.pdf">immersion</a> <a href="https://www.nikonprecision.com/products-and-technology/immersion-and-multiple-patterning/nsr-s636e/">DUV scanners</a> capable of supporting advanced-node multipatterning, though these are older argon fluoride systems, produced at marginal scale. Canon (Japan) has stepped back from DUV photolithography to pursue <a href="https://global.canon/en/news/2023/20231013.html">nanoimprint lithography</a>, a different paradigm that has seen little adoption, alongside i-line steppers for legacy nodes. Together, Nikon and Canon account for only <a href="https://www.trendforce.com/insights/asml-euv">about 6% </a>of the lithography equipment market. Neither company makes EUV machines.</p><p>Photolithography machines <a href="https://worksinprogress.co/issue/the-worlds-most-complex-machine/">are extraordinarily complex</a>, perhaps the most complex machines ever built. An EUV machine, for instance, needs a powerful light source and mirrors polished to near-atomic perfection. Every part must be made and assembled with extreme precision, and must operate with extreme precision and control. The entire system has to run in a vacuum, because otherwise its short-wavelength light would be absorbed by air. A single EUV machine <a href="https://www.asml.com/en/news/stories/2022/busting-asml-myths">contains</a> around 100,000 parts, and shipping it requires 40 freight containers, 3 cargo planes, and 20 trucks.</p><h3 style="text-align: justify;">Export controls have cut China off from advanced lithography</h3><p>Since 2019, ASML has been <a href="https://www.scmp.com/news/china/science/article/3295209/how-chinas-award-winning-euv-breakthrough-sidesteps-us-chip-ban">blocked</a> from exporting its EUV systems to China after the US pressured the Dutch government. In 2023 and 2024, Dutch export controls were <a href="https://www.csis.org/analysis/csis-translation-updated-dutch-export-controls-semiconductor-manufacturing-equipment-and">expanded</a> to cover most advanced DUV<a href="https://www.asml.com/news/press-releases/2023/statement-regarding-partial-revocation-export-license"> systems</a>, with both the Netherlands and Japan introducing parallel<a href="https://www.csis.org/analysis/japan-and-netherlands-announce-plans-new-export-controls-semiconductor-equipment"> controls</a> on these systems. China&#8217;s companies<a href="https://www.tomshardware.com/tech-industry/china-will-likely-reduce-purchase-of-chipmaking-tools-this-year-as-homegrown-toolmakers-ramp-up"> stockpiled</a> wafer fabrication equipment in 2024, before the restrictions took effect, accounting for some<a href="https://finance.yahoo.com/news/chinas-purchases-chipmaking-equipment-decline-064853951.html"> </a>40% of global <a href="https://finance.yahoo.com/news/chinas-purchases-chipmaking-equipment-decline-064853951.html">sales</a>. ASML&#8217;s older 1980i immersion <a href="https://selectcommitteeontheccp.house.gov/sites/evo-subsites/selectcommitteeontheccp.house.gov/files/evo-media-document/selling-the-forges-of-the-future.pdf">DUV shipments</a> to China jumped from 14 units in 2021 to 89 in 2024, while sales of the more advanced 2050i and 2100i, both hit by Dutch controls in 2023-2024, collapsed to zero. By 2024, 70% of ASML&#8217;s <a href="https://selectcommitteeontheccp.house.gov/sites/evo-subsites/selectcommitteeontheccp.house.gov/files/evo-media-document/selling-the-forges-of-the-future.pdf">global</a> immersion DUV shipments went to China. Despite export controls, ASML&#8217;s revenue from China<a href="https://www.mbi-deepdives.com/asmls-china-question/"> increased</a> from about 15% in 2021 to <a href="https://www.caixinglobal.com/2026-01-29/asml-expects-china-revenue-drop-following-backlog-fueled-surge-102409285.html">about</a> 33% in 2025. This spending surge has partly been<a href="https://www.163.com/dy/article/KE6MFEQT0556HYPH.html"> interpreted</a> as pre-emptive stockpiling of DUV systems in anticipation of further tightening.</p><h3 style="text-align: justify;">DUV lithography</h3><p>DUV machines come in two types, dry and <a href="https://www.asianometry.com/p/a-deep-dive-into-immersion-lithography">immersion</a>. In an immersion system, the gap between the lens and the silicon wafer is filled with a liquid, typically purified water, instead of air. Water has a higher refractive index than air, which shortens the light&#8217;s effective wavelength as it crosses that gap, and a shorter effective wavelength lets the machine print finer features than a dry system does. Immersion DUV machines are relevant for AI chipmaking in a few ways:</p><ul><li><p><strong>Direct fabrication of older nodes.</strong> With multipatterning, immersion DUV can <a href="https://www.techinsights.com/blog/chinas-smic-plays-7-nm-card">produce</a> <a href="https://www.tsmc.com/english/dedicatedFoundry/technology/logic">chips</a> at 7nm (2018), and potentially 5nm (2020).</p></li><li><p><strong>Non-critical layers for advanced-node processes.</strong> Even in the most advanced-node processes, only a subset of layers <a href="https://www.asml.com/en/products/euv-lithography-systems">require</a> EUV. The remaining layers are still patterned with immersion DUV, so scaling production below 5nm still requires many immersion DUV machines.</p></li><li><p><strong>HBM production. </strong>HBM2E and earlier generations are made entirely with DUV. Even the newer HBM3E and HBM4 DRAM dies <a href="https://semiwiki.com/forum/threads/samsung-1a-dram-re-design-considered-for-hbm-competitiveness.21262/">use</a> EUV only for a handful of the most critical layers; the bulk of advanced DRAM production still relies heavily on immersion DUV.</p></li></ul><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/is6hQ/3/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b8427c8a-f2f9-4c30-af18-a23ce7662c3e_1220x1724.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e40d202d-f410-4088-8548-7d30c7a26c60_1220x1914.png&quot;,&quot;height&quot;:697,&quot;title&quot;:&quot;Generations of lithography machines&quot;,&quot;description&quot;:&quot;Shorter wavelength can print smaller features on a chip.&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/is6hQ/3/" width="730" height="697" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><h3 style="text-align: justify;">DUV multipatterning cannot fully substitute for EUV</h3><p>Beyond EUV research, what matters practically is what China can produce with its existing DUV tools. For mature nodes at 45nm and above, where dry DUV suffices, China is well supplied by a mix of domestic and imported systems&#8212;SMIC, Hua Hong, and Nexchip collectively run millions of wafer starts per month at these nodes. For 28nm and below, Chinese fabs <a href="https://www.cnas.org/publications/commentary/cnas-insights-the-export-control-loophole-fueling-chinas-chip-production">rely</a> on imported immersion DUV tools from ASML, a stockpile of machines bought before Dutch and Japanese export controls tightened in <a href="https://www.csis.org/analysis/japan-and-netherlands-announce-plans-new-export-controls-semiconductor-equipment">2023</a>, <a href="https://www.reuters.com/technology/dutch-government-retakes-export-control-over-two-asml-tools-us-2024-09-06/">2024</a>, and <a href="https://www.csis.org/analysis/csis-translation-january-2025-updated-japanese-export-controls-high-performance">2025</a>, as well as some less advanced immersion DUV tools that remain unrestricted.</p><p>As described in the section on chip fabrication, 7nm chips can be made without EUV using immersion DUV multipatterning; TSMC&#8217;s <a href="https://www.techpowerup.com/255097/tsmc-expects-most-7nm-customers-to-move-to-6nm-density">first</a> 7nm process used this approach before later versions introduced EUV. SMIC has used the same basic workaround for Huawei&#8217;s Kirin chips. But this approach comes with steep trade-offs in cost, yield, and throughput, as each additional pass must align at the nanometer scale. ASML&#8217;s most advanced <a href="https://www.asml.com/en/products/duv-lithography-systems">DUV immersion scanners</a> (the Twinscan NXT series) can achieve overlay <a href="https://www.asml.com/en/products/duv-lithography-systems/twinscan-nxt2050i">accuracy</a> of around 2.5nm and throughput of 200 to 300 wafers per hour.</p><p>China holds a substantial installed base of ASML DUV <a href="https://scout.eto.tech/?id=5239">scanners</a>, including immersion systems capable of supporting 7nm production. But while the approach works, the volume is limited, and the economics are poor. The <a href="https://asiatimes.com/2024/02/smic-to-sell-huawei-costly-inefficient-5nm-chips/">yields</a> are roughly one-third of TSMC&#8217;s, and costs per wafer are approximately 50% higher.</p><p>Another long-term concern is that these machines age. Imported DUV tools require <a href="https://www.cnas.org/publications/commentary/cnas-insights-the-export-control-loophole-fueling-chinas-chip-production">maintenance</a> every six months from ASML itself; without this maintenance, the existing Chinese fleet would likely degrade significantly within a year. Spare parts, software updates, and servicing for the banned immersion DUV systems <a href="https://www.scmp.com/tech/tech-war/article/3278535/china-hit-hard-new-dutch-export-controls-asml-chip-making-equipment">are also restricted</a>. China&#8217;s domestic alternatives are still catching up: SMEE is <a href="https://www.bloomberg.com/news/articles/2023-08-02/china-chip-firms-surge-on-report-of-technology-breakthrough?srnd=technology-vp">working toward</a> an immersion DUV system suitable for 28nm process node production, and Yuliangsheng <a href="https://www.trendforce.com/news/2025/09/17/news-smic-said-to-test-chinese-made-duv-lithography-tool-from-sicarrier-affiliate-amid-ai-chip-push/">supplied</a> its first 28nm system to SMIC in 2025 for early testing, reportedly <a href="https://www.trendforce.com/news/2025/11/10/news-decoding-chinas-lithography-push-to-challenge-asml-from-sicarrier-to-alternative-euv-paths/">experimenting</a> with multipatterning to produce 7nm chips. Domestic tools capable of supporting sub-10nm production (even with multipatterning) are, however, <a href="https://www.tomshardware.com/tech-industry/semiconductors/chinas-largest-foundry-testing-first-domestic-immersion-duv-lithography-tool-smic-takes-significant-step-on-road-to-wafer-fab-equipment-self-sufficiency">not expected</a> before 2030.</p><h3 style="text-align: justify;">EUV lithography</h3><p>It took ASML about <a href="https://medium.com/@ASMLcompany/the-20-year-journey-to-the-chips-of-tomorrow-4df3ac1ebc72">two decades</a> to go from serious development of EUV to its first commercial shipment. The difficulty is spread across several uniquely complex subsystems, each made by only one or two suppliers in the world:</p><ul><li><p><strong>Light source. </strong>Tin droplets are <a href="https://www.asml.com/en/news/stories/2022/making-euv-lab-to-fab">fired</a> into a vacuum chamber 50,000 times per second. A high-power CO2 laser hits each droplet twice, vaporizing it into a plasma 40 times hotter than the Sun&#8217;s surface, which emits EUV light. The drive laser is <a href="https://www.trumpf.com/en_US/products/lasers/euv-drive-laser/">supplied</a> exclusively by Trumpf (Germany), while the full light source is assembled by Cymer in San Diego, California, which ASML acquired in 2013.</p></li><li><p><strong>Collector mirror.</strong> A large curved mirror channels EUV photons from the plasma into the optical system. Only Zeiss (Germany) can <a href="https://www.zeiss.com/semiconductor-manufacturing-technology/inspiring-technology/euv-lithography.html">build</a> it.</p></li><li><p><strong>Multi-layer mirrors.</strong> The scanner contains 10-14 mirrors polished to near-atomic smoothness. Each reflects only about 70% of EUV light, so only about 2% of the source&#8217;s output <a href="https://www.asml.com/en/technology/lithography-principles/light-and-lasers">reaches</a> the wafer. Zeiss (Germany) is again the sole producer.</p></li><li><p><strong>Photomasks.</strong> The specialized low-defect substrates that carry the circuit pattern.  HOYA and AGC (both from Japan) <a href="https://semiconductorinsight.com/blog/hoya-expands-euv-photomask-blank-capabilities-strengthening-global-semiconductor-supply-chain/">produce</a> them and together <a href="https://www.intelmarketresearch.com/euv-mask-blanks-market-11463">hold</a> about 93% of the global EUV photomask market.</p></li><li><p><strong>Pellicle.</strong> A thin protective film shields the photomask during printing. Mitsui Chemicals (Japan) <a href="https://jp.mitsuichemicals.com/en/release/2021/2021_0526.htm">manufactures</a> them commercially under a license from ASML.</p></li></ul><p>That is why even serious Chinese progress in prototype systems should be interpreted cautiously. Building something that emits EUV light is not the same as building something that can support high-volume, commercially competitive chip production.</p><p>SiCarrier, a Shenzhen state-backed semiconductor equipment maker with close ties to Huawei, has <a href="https://www.techinsights.com/blog/china-analysis-inside-sicarriers-ambitious-wfe-roadmap">emerged</a> as the key coordinator linking domestic photolithography efforts and extending into EUV-adjacent optics through its stake in Zetop Technologies.</p><p>The absence of publicly verified data for Chinese tools makes tracking progress difficult. With that caveat, here&#8217;s what we can piece together. The table below maps all entities active in China&#8217;s EUV and advanced DUV ecosystem&#8212;equipment makers, coordinators, state research institutes, universities, and defense-industrial groups&#8212;alongside their specific subsystem focus and current maturity.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/AaAVZ/7/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e7b89c2b-80e9-402c-82bc-2096096a7a12_1220x1650.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/12562185-5a85-49cd-b179-183599056f41_1220x1720.png&quot;,&quot;height&quot;:718,&quot;title&quot;:&quot;Key entities involved in EUV development&quot;,&quot;description&quot;:&quot;&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/AaAVZ/7/" width="730" height="718" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><h3 style="text-align: justify;">China&#8217;s EUV prototypes are still far from production tools</h3><p>Reuters <a href="https://www.reuters.com/world/china/how-china-built-its-manhattan-project-rival-west-ai-chips-2025-12-17/">reported</a> in December 2025 that China had constructed a prototype EUV photolithography machine inside a high-security facility in Shenzhen, under the oversight of China&#8217;s Central Science and Technology Commission, a top Party-level body <a href="https://thediplomat.com/2023/08/the-party-rules-chinas-new-central-science-and-technology-commission/">created</a> in 2023 to centralize national strategy over science and technology. The prototype was reportedly completed in early 2025 and is undergoing testing. It can generate EUV light but has not yet produced functional chips. The prototype would still need dramatic improvements in source power, conversion efficiency, mirror reflectivity, uptime, and precision optics before approaching commercial viability, as the gap between <a href="https://xcancel.com/onni_aarne/status/2001718716766077127">generating</a> EUV light and actual chip production is substantial. The machine is described as physically much larger than ASML&#8217;s commercial systems, filling nearly an entire factory floor.</p><p>The project was built partly using components from older, pre-controls-era ASML machines acquired through secondary markets. Former ASML engineers, recruited with signing bonuses of $420,000 to $700,000 plus housing subsidies, played a significant role. The program has been <a href="https://www.japantimes.co.jp/business/2025/12/18/tech/china-west-ai-chips/">described</a> as analogous to the Manhattan Project: a top-down national effort, designated as one of Xi Jinping&#8217;s top strategic priorities, that mobilizes thousands of engineers across state research institutes and private firms under Huawei&#8217;s coordination. Work is reportedly conducted in secured facilities under strict secrecy, with some recruits issued identification cards under false names, instructed to use aliases, and teams deliberately isolated from one another.</p><h3 style="text-align: justify;">China&#8217;s lack of EUV lithography severely constrains its chip manufacturing</h3><p>Several key EUV components are <a href="https://entropycapital.substack.com/p/asmls-supply-chain-bill-of-materials">produced by only a single supplier</a>, exclusively for ASML. Only Trumpf (Germany) makes the CO2 drive laser. Only Zeiss (Germany) makes the multi-layer mirrors and optics (and since 2016, ASML has owned a 25% stake in Zeiss&#8217;s Semiconductor Manufacturing Technology subsidiary). Cymer, which assembles the full EUV light source, has been a wholly-owned ASML subsidiary since 2013. Berliner Glas, a specialty optics supplier, was acquired by ASML in 2020. Because these components have no economic alternative, Chinese firms could not procure them through normal commercial channels even in the absence of export controls. (In fact, our understanding is that ASML&#8217;s suppliers&#8217; suppliers are also contractually prevented from selling parts to other companies.) And because no ASML EUV machines have ever shipped to China, Chinese researchers have no deployed system to study directly.</p><p>China&#8217;s progress is uneven across these subsystems, but it lags far behind on all of them. For the multi-layer mirrors that bounce EUV light through the scanner, ASML <a href="https://www.asml.com/en/technology/lithography-principles/light-and-lasers">uses</a> 10-14 molybdenum-silicon mirrors polished to near-atomic smoothness, each reflecting about 70% of EUV light. Only <a href="https://www.zeiss.com/semiconductor-manufacturing-technology/inspiring-technology/euv-lithography.html">Zeiss SMT (Germany)</a> can produce them at production grade. Chinese researchers at the Changchun Institute of Optics, Fine Mechanics and Physics (CIOMP), a Chinese Academy of Sciences institute, have <a href="https://www.habtoorresearch.com/programmes/implication-china-acquisition-lithography/">achieved</a> a reflectivity of about 65% using domestic Mo/Si mirrors. The commercial arm, <a href="https://www.trendforce.com/news/2025/10/21/news-amies-rises-in-chinas-lithography-push-reportedly-capturing-90-of-domestic-market/">Zetop Technologies</a>, part-owned by SiCarrier and CIOMP, is working to scale these mirrors for lithography. Even this five-point reflectivity gap compounds across 10-14 sequential mirror bounces, meaning a Chinese system at 65% delivers only 35-50% as much EUV light to the wafer as ASML&#8217;s.</p><p>This has forced Chinese researchers toward entirely different light sources, perhaps developed in parallel with a laser-produced plasma (LPP) light source similar to that used by ASML:</p><ul><li><p>Tsinghua University is <a href="https://www.scmp.com/news/china/science/article/3235419/china-plans-build-giant-chip-factory-driven-particle-accelerator">developing</a> a steady-state microbunching (SSMB) system that would use a 100-150-meter-circumference particle accelerator (roughly the size of two basketball courts) to generate continuous EUV light, targeting a power output above 1,000 W, higher than ASML&#8217;s about 600 W. Rather than shrink the lithography machine for export as ASML does, the <a href="https://www.scmp.com/news/china/science/article/3235419/china-plans-build-giant-chip-factory-driven-particle-accelerator">proposed design</a> co-locates multiple lithography machines around one central accelerator in a single large facility, effectively a chipmaking campus built around one light source. Construction of a dedicated SSMB-EUV facility in Xiong&#8217;an <a href="https://markets.financialcontent.com/wral/article/tokenring-2025-12-24-chinas-secret-lithography-race-prototyping-euv-and-extending-duv-life">began</a> in early 2025.</p></li><li><p>Harbin Institute of Technology is <a href="https://asiatimes.com/2025/12/made-in-china-euv-machine-targets-ai-chip-output-by-2028/">pursuing</a> a laser-discharge plasma (LDP) source, which uses a solid-state laser combined with high-voltage electrode discharge to vaporize tin droplets and reportedly reaches around 100 W. LDP <a href="https://asiatimes.com/2025/12/made-in-china-euv-machine-targets-ai-chip-output-by-2028/">scales poorly</a> to the powers needed for advanced-node lithography and is more commonly used for photomask defect inspection, so this path may end up as a supporting technology rather than a direct replacement for ASML&#8217;s light source. Both approaches are scientifically novel but unproven at high-volume manufacturing scale.</p></li></ul><p>Most industry analysts do not expect a Chinese <a href="https://www.trendforce.com/news/2025/11/10/news-decoding-chinas-lithography-push-to-challenge-asml-from-sicarrier-to-alternative-euv-paths/">DUV immersion tool</a> capable of supporting 7nm production via multipatterning <a href="https://www.tomshardware.com/tech-industry/semiconductors/chinas-largest-foundry-testing-first-domestic-immersion-duv-lithography-tool-smic-takes-significant-step-on-road-to-wafer-fab-equipment-self-sufficiency">before 2030</a>, and even then, the resulting tools would likely lag ASML&#8217;s in yield and cost. Faster indigenization later this decade is possible&#8212;SMEE has announced <a href="https://www.bloomberg.com/news/articles/2023-08-02/china-chip-firms-surge-on-report-of-technology-breakthrough?srnd=technology-vp">systems capable of 28nm production</a>, and SiCarrier tools are reportedly being <a href="https://www.trendforce.com/news/2025/09/17/news-smic-said-to-test-chinese-made-duv-lithography-tool-from-sicarrier-affiliate-amid-ai-chip-push/">tested</a> by SMIC&#8212;but a first-generation domestic immersion tool would almost certainly be a long way from competitive. For EUV, Beijing&#8217;s stated<a href="https://www.csis.org/blogs/strategic-technologies-blog/breakthroughs-or-boasts-assessing-recent-chinese-lithography"> target</a> for a prototype capable of producing functional chip patterns is <a href="https://asiatimes.com/2025/12/made-in-china-euv-machine-targets-ai-chip-output-by-2028/">2028</a>, while 2030 is the more widely cited realistic expectation among <a href="https://www.trendforce.com/news/2025/11/10/news-decoding-chinas-lithography-push-to-challenge-asml-from-sicarrier-to-alternative-euv-paths/">outside analysts</a>. A commercially viable EUV system is likely a <a href="https://finance.yahoo.com/news/china-eyes-mastery-euv-lithography-093000311.html">decade or more away</a>.</p><p>Even achieving a working prototype would not mean commercial readiness. ASML <a href="https://www.asml.com/en/news/stories/2022/making-euv-lab-to-fab">shipped</a> its first EUV prototype in 2006 but did not make its <a href="https://optics.org/news/8/4/30">first commercial shipment</a> (the NXE:3400B) until 2017, 11 years later, and EUV-made chips <a href="https://www.engadget.com/2019-08-07-samsung-7-nanometer-euv-processor-galaxy-s10.html">did not appear</a> in consumer products until 2019 with Samsung&#8217;s Galaxy Note 10. China may compress that timeline somewhat by hiring former ASML employees, <a href="https://www.japantimes.co.jp/business/2025/12/18/tech/china-west-ai-chips/">buying older ASML components</a> on the secondary market, and studying ASML&#8217;s own development pathway. Reverse engineering is more feasible for DUV than for EUV, as Chinese fabs operate many ASML DUV machines they can disassemble and study, whereas no EUV machine has ever been delivered to China. Still, turning a working prototype into commercially viable chip production could <a href="https://finance.yahoo.com/news/china-eyes-mastery-euv-lithography-093000311.html">take</a> years or decades.</p><p>Reports of China&#8217;s EUV prototypes and other hardware advancements must be interpreted with caution. China&#8217;s centrally directed industrial policy creates strong incentives for local governments, universities, and firms to exaggerate progress. Announcing breakthroughs serves as currency for recognition and funding within Beijing&#8217;s competitive industrial ecosystem, and there is little vetting of these claims. The most notorious example is HSMC, a Wuhan-backed company that <a href="https://www.caixinglobal.com/2021-03-02/four-things-to-know-about-chinas-185-billion-failed-chip-champ-101668910.html">burned through</a> more than $2 billion without producing a single chip. The same system also fragments the supply chain. Provincial and municipal governments <a href="https://ucigcc.org/blog/the-accomplishments-and-contradictions-of-chinas-semiconductor-industrial-policy/">compete</a> to nurture their own fully self-contained semiconductor clusters rather than coordinate around a single national effort, resulting in duplicated projects, small fabs, and spread-out capital. The prototypes may well constitute real scientific achievements, but even if so, they are many years away from being capable of volume production.</p><p>While China is making measurable progress on DUV and early-stage EUV machines, ASML is developing High-NA EUV machines, capable of <a href="https://www.asml.com/en/news/stories/2024/5-things-high-na-euv">printing</a> features roughly 1.7 times smaller than standard EUV with less multipatterning. Intel has installed the first High-NA EUV tool, while TSMC appears to be <a href="https://www.reuters.com/business/asml-says-first-chips-new-high-na-machines-arrive-months-2026-05-19/">postponing</a> adoption due to its high cost.</p><h2>Etch, clean, deposition, metrology, and more</h2><p>China&#8217;s SME self-sufficiency remains limited overall. The share of domestically manufactured equipment installed in Chinese fabs <a href="https://www.trendforce.com/news/2026/01/12/news-chinas-domestic-chip-equipment-adoption-beats-2025-target-at-35-led-by-naura-amec/">reached</a> roughly 35% in 2025, surpassing the government&#8217;s 30% target. But this aggregate figure is most likely skewed by mature and legacy-node tools (at 28nm and above), where domestic substitution is easier, rather than reflecting fully indigenous advanced-node production. The equivalent share for advanced-node equipment (7nm and below) is almost certainly much lower, but no reliable public breakdown exists.</p><p>Beijing is now pushing for <a href="https://www.trendforce.com/news/2026/02/20/news-china-reportedly-ramps-up-chip-tool-push-sets-70-target-by-2027-smee-naura-at-forefront/">70%</a> by 2027. Even so, China&#8217;s total equipment spending <a href="https://www.semi.org/en/SEMI-Reports-Global-Semiconductor-Equipment-Billings-Reached-135-Billion-in-2025">remained</a> near record levels at $49 billion in 2025. <a href="https://cset.georgetown.edu/article/inside-beijings-chipmaking-offensive/">Analysis</a> of global market share data from 2019 to 2024 shows that Chinese companies are making real inroads in several equipment categories, even as the most advanced segments remain out of reach. Whether the 70% target is achievable depends heavily on the type of equipment. Whereas domestic substitution in etching and deposition already exceeds 40%, photolithography, the most critical and expensive step, <a href="https://www.trendforce.com/news/2026/01/12/news-chinas-domestic-chip-equipment-adoption-beats-2025-target-at-35-led-by-naura-amec/">remains</a> around 18% overall, a figure that reflects tools for older process nodes and packaging rather than the DUV tools relevant for advanced semiconductor manufacturing. For advanced front-end production, China still has no competitive domestic immersion DUV tool <a href="https://www.csis.org/blogs/strategic-technologies-blog/breakthroughs-or-boasts-assessing-recent-chinese-lithography">shipping at scale</a>.</p><p>Deposition tools coat wafers with ultra-thin layers of material&#8212;metal for wiring, insulators to electrically separate chip features, or barrier films to prevent contamination. China&#8217;s position varies dramatically across the three main types:</p><ul><li><p>In physical vapor deposition (PVD), which vaporizes solid material to coat the wafer, Naura has <a href="https://cset.georgetown.edu/article/inside-beijings-chipmaking-offensive/">grown</a> from 1% to roughly 10% of the global market between 2019 and 2024. Applied Materials (US) still leads with <a href="https://cset.georgetown.edu/article/inside-beijings-chipmaking-offensive/">roughly 86%</a>, though it is <a href="https://uk.finance.yahoo.com/news/mizuho-downgrades-applied-materials-china-112807594.html">losing</a> 2-4 percentage points per year to Chinese competitors. Naura&#8217;s gains are primarily at mature nodes; for advanced-node processes, Applied Materials remains dominant.</p></li><li><p>In chemical vapor deposition (CVD), which uses chemical reactions to build up thin films, Chinese firms <a href="https://cset.georgetown.edu/article/inside-beijings-chipmaking-offensive/">rose</a> from 1% to 7% of the global market between 2019 and 2024, led by Piotech and Naura. The US still holds 78%. At YMTC (Yangtze Memory Technologies, China&#8217;s largest NAND flash maker), Piotech now supplies nearly a third of the installed CVD equipment, up from 15%, <a href="https://www.scmp.com/tech/big-tech/article/3339366/great-chip-leap-chinas-semiconductor-equipment-self-reliance-surges-past-targets">displacing</a> foreign tools.</p></li><li><p>In atomic layer deposition (ALD), the most advanced and precise category, which deposits films one atom layer at a time and is essential for leading-edge nodes, China <a href="https://cset.georgetown.edu/article/inside-beijings-chipmaking-offensive/">holds</a> less than 1% of the global market as of 2024. ASM International (Netherlands), Tokyo Electron (Japan), Kokusai (Japan), and Lam Research (US) together control <a href="https://chipexplorer.eto.tech/?filter-choose=input-resource&amp;parentNode=N35&amp;selectedNode=N41">about 88%</a> of the segment, making ALD the widest gap in China&#8217;s equipment portfolio.</p></li></ul><p>Etching tools carve circuit patterns into the wafer after lithography. Dry etching, which uses reactive gas plasmas rather than liquid chemicals to remove material and is <a href="https://cset.georgetown.edu/article/inside-beijings-chipmaking-offensive/">relied</a> on more heavily than wet etching at advanced nodes for its greater precision, is where Chinese firms have made real progress. The global dry etch market remains <a href="https://cset.georgetown.edu/article/inside-beijings-chipmaking-offensive/">dominated</a> by US firms Lam Research and Applied Materials (together holding roughly 60%) and Japan&#8217;s Tokyo Electron (about 15%). Chinese companies Naura and AMEC together hold roughly 11% of the market, up from under 3% in 2019, though these sales are largely equipment for older process nodes.</p><p>On the specific capability of high-aspect-ratio etching, which is critical for 3D memory and HBM stacking, AMEC produces a tool capable of etching features with a 60:1 aspect ratio (a hole sixty times deeper than it is wide), a notable achievement but <a href="https://www.the-substrate.net/p/china-is-making-strides-in-etching">roughly 6-8 years behind</a> the global frontier. Its tool is comparable to what non-Chinese manufacturers offered around 2018 and 2019 and cannot support HBM production beyond HBM2E. A successor promising 90:1 aspect ratios is under development and would narrow the gap to 2-3 years, but its market <a href="https://www.the-substrate.net/p/china-is-making-strides-in-etching">entry</a> now looks closer to 2027 than the 2025 timeline AMEC originally signaled. For comparison, Lam Research&#8217;s latest cryogenic etch tools&#8212;a technique not yet available in Chinese equipment&#8212;are designed to <a href="https://filecache.mediaroom.com/mr5mr_lamresearch/182770/Counterpoint_Research_Paper_Scaling_to_1000-Layer_3D_NAND_in_the_AI_Era.pdf">reach</a> ratios above 100:1. AMEC is investing heavily to close this gap, with <a href="https://www.the-substrate.net/p/china-is-making-strides-in-etching">R&amp;D spending</a> up 54% year-on-year in the first half of 2025, but it still spends roughly a fifth of what Lam Research allocates to R&amp;D.</p><p>Cleaning tools remove residues, particles, and chemical contaminants between fabrication steps; contamination at the nanometer scale can ruin an entire chip, so cleaning accounts for a large share of the process steps. The global cleaning market is moderately concentrated. SCREEN Semiconductor Solutions (Japan), Tokyo Electron (Japan), Lam Research and Applied Materials (US), plus China&#8217;s ACM Research, together <a href="https://www.mordorintelligence.com/industry-reports/wafer-cleaning-equipment-market">account for</a> about 65% of revenue in 2024, with SCREEN retaining overall leadership. ACM is the only Chinese firm in the global top tier. Cleaning is also China&#8217;s strongest equipment sub-sector, with over 50% <a href="https://siliconangle.com/2025/12/30/china-targets-semiconductor-self-sufficiency-50-rule-imposed-local-chipmakers/">domestic self-sufficiency</a>, driven by ACM and Naura. ACM tools are now <a href="https://www.trendforce.com/news/2025/12/10/news-chinas-acm-reportedly-lands-memory-giant-orders-eyes-hbm4-compatible-cleaning-systems-by-2026/">deployed</a> in HBM production at SK Hynix and Micron, and ACM is targeting HBM4-compatible systems by 2026.</p><p>CMP tools polish the wafer surface flat between layers. Chinese firms&#8217; share of the global CMP market <a href="https://cset.georgetown.edu/article/inside-beijings-chipmaking-offensive/">grew</a> from roughly 3% to 11% between 2019 and 2024, driven almost entirely by Hwatsing Technology. Applied Materials (US) still leads at roughly 60% of the global market with more advanced capabilities. Hwatsing is also expanding into ion implantation through its <a href="https://www.trendforce.com/news/2025/04/07/news-ma-and-tech-breakthroughs-positive-signals-from-chinese-semiconductor-equipment-makers/">acquisition</a> of Xinyu Semiconductor, a strategically important move because ion implantation is one of China&#8217;s weakest equipment segments, with domestic self-sufficiency at just <a href="https://www.trendforce.com/news/2024/08/12/news-china-makes-progress-in-chip-tool-self-sufficiency-yet-lithography-remains-a-key-bottleneck/">1.4%</a> and two US firms (Applied Materials and Axcelis) controlling over 80% of the global market.</p><p>Metrology and inspection tools measure feature dimensions, check layer-to-layer alignment, and scan for defects throughout fabrication&#8212;essential for yield at advanced nodes, where a defect of a few nanometers can ruin a chip. The segment is even more concentrated than testing. KLA (US) <a href="https://drrobertcastellano.substack.com/p/klas-market-share-growth-in-process">holds</a> roughly 63% of the global metrology and inspection market as of 2024, up from about 50% in 2010, with Applied Materials a distant second at under 8%. KLA&#8217;s share in reticle (photomask) inspection <a href="https://markets.financialcontent.com/stocks/article/tokenring-2025-10-22-kla-corporation-leads-the-charge-process-control-dominance-fuels-bullish-semiconductor-sentiment-amidst-ai-boom">exceeds</a> 80%. In China, the domestic substitution rate for metrology tools <a href="https://www.trendforce.com/news/2026/01/12/news-chinas-domestic-chip-equipment-adoption-beats-2025-target-at-35-led-by-naura-amec/">reached</a> roughly 25% in 2025 (up from about 15% in 2022), but this figure aggregates all process nodes and is almost certainly concentrated at mature nodes; no Chinese firm offers a competitive alternative for advanced-node inspection.</p><p>Semiconductor testing verifies that finished chips work correctly, and the complexity of the test depends on the chip&#8217;s complexity. In linear and discrete testing (the simplest category, used for basic components like transistors and diodes) Chinese firms Hangzhou Changchuan and AccoTEST have <a href="https://cset.georgetown.edu/article/inside-beijings-chipmaking-offensive/">captured</a> nearly 70% of the global market. China has also entered burn-in testing (which stress-tests chips at high temperatures to catch early failures; 9% global share) and SoC testing (which verifies complex integrated circuits, such as AI chips; 5% global share) since 2020. However, the most advanced SoC and memory test segments remain <a href="https://www.mordorintelligence.com/industry-reports/semiconductor-test-equipment-market">dominated</a> by Advantest (Japan, about 58% global share) and Teradyne (US, about 23%).</p><p>Advanced packaging tools are increasingly important for AI chips, which rely on integrating multiple smaller chips (&#8220;chiplets&#8221;) into a single package. Two relevant variants of advanced packaging are 2.5D, where multiple chips are arranged on an &#8220;interposer&#8221; chip that facilitates communication, and 3D, where chips are stacked on top of each other. Such techniques include <a href="https://www.the-substrate.net/p/making-through-silicon-vias-is-not">through-silicon vias</a> (TSVs), Chip-on-Wafer-on-Substrate (CoWoS), and hybrid bonding. The advanced packaging process needs many different SME, including both wafer fabrication tools and assembly tools. The global market for the wafer fabrication tools used in advanced packaging is <a href="https://cset.georgetown.edu/article/inside-beijings-chipmaking-offensive/">dominated</a> by the US (65%), with Japan a distant second (15%), and China third (7%).</p><p>Chinese firms&#8217; gains are concentrated in specific segments. They have gained about 10 percentage points in global share in etch and clean tools for advanced packaging since 2019, and in packaging deposition, they have surpassed Japan to become the second-largest supplier behind the US. Naura, for example, <a href="https://www.digitimes.com/news/a20260326PD217/naura-technology-amec-3d-packaging-equipment-investment.html">launched</a> a TSV etcher for 12-inch wafers in 2020 and has since developed a plasma etching process that produces TSVs with 11.8-micron openings at 116-micron depth. But in higher-value advanced-packaging segments, especially hybrid bonding, China is likely at an early stage of development. Hybrid bonding is already used in current 3D stacking products such as <a href="https://www.amd.com/en/products/processors/technologies/3d-v-cache.html">AMD&#8217;s</a> 3D V-Cache and in <a href="https://www.sisajournal-e.com/news/articleView.html?idxno=415596">YMTC&#8217;s</a> NAND architecture, and may become important for future <a href="https://www.trendforce.com/news/2026/03/06/news-industry-weigh-825-900-%CE%BCm-hbm-thickness-for-20-high-stacks-potentially-slowing-hybrid-bonding/">HBM</a> generations. Naura reportedly <a href="https://www.trendforce.com/news/2026/03/26/news-naura-reportedly-unveils-hybrid-bonding-tool-at-semicon-china-sicarrier-last-years-lithography-standout-misses-show/">unveiled</a> one of China&#8217;s first 12-inch die-to-wafer hybrid bonding systems in March 2026.</p><p>Across these categories, a handful of state-backed national champions are carving out positions in moderately complex equipment segments such as etching, cleaning, CVD, and CMP at mature nodes, and are <a href="https://cset.georgetown.edu/article/inside-beijings-chipmaking-offensive/">expected</a> to reinvest their revenue into R&amp;D targeting harder segments like advanced-node ALD, ion implantation, and lithography, though Chinese firms still hold under 1% of the global ALD market and a near-zero share of lithography.</p><p>The four most important Chinese SME companies are Naura (etching, deposition, cleaning, and thermal processing), <a href="https://www.the-substrate.net/p/china-is-making-strides-in-etching">AMEC</a> (etching and thin-film deposition), <a href="https://www.digitimes.com/news/a20251111VL204/equipment-growth-revenue-2025-sales.html">Piotech</a> (CVD), and <a href="https://www.digitimes.com/news/a20260409VL209/cmp-equipment-manufacturing-demand-packaging.html">Hwatsing</a> (CMP). Naura is the largest. Its 2025 revenue is <a href="https://www.digitimes.com/news/a20260213PD212/naura-technology-ic-manufacturing-equipment-localization-2025.html">forecast</a> at about $7 billion (up from $4.2 billion in 2024), and it has <a href="https://www.scmp.com/tech/big-tech/article/3302104/chinas-naura-climbs-ranks-worlds-top-chipmaking-equipment-suppliers">climbed</a> to sixth in the global ranking of SME suppliers by revenue in 2025&#8212;the only Chinese firm in the top tier, trailing only ASML, Applied Materials, Lam Research, Tokyo Electron, and KLA. Its order backlog <a href="https://www.scmp.com/tech/big-tech/article/3339366/great-chip-leap-chinas-semiconductor-equipment-self-reliance-surges-past-targets">runs through</a> Q1 2027.</p><p>The Chinese state is investing heavily in its semiconductor industry. Big Fund III, the third phase of the National Integrated Circuit Industry Investment Fund, <a href="https://fortune.com/asia/2024/05/28/more-confident-china-doubling-down-big-fund-iii-semiconductors-development-us-controls/">launched</a> in May 2024 with $48 billion in capital, <a href="https://asia.nikkei.com/business/tech/semiconductors/china-s-3rd-semiconductor-big-fund-starts-spending-47bn-war-chest">spending</a> on materials, wafer-fabrication equipment, and chip companies across the supply chain rather than photolithography specifically. That headline figure is roughly on par with the <a href="https://www.csis.org/analysis/world-chips-acts-future-us-eu-semiconductor-collaboration">&#8364;43 billion</a> of funding in the EU Chips Act and the $39 billion in direct grants and loans in the US CHIPS Act, though the CHIPS Act also includes an estimated $46 billion in tax credits for chip manufacturing. <a href="https://www.all-about-industries.com/china-is-investing-significantly-more-money-in-its-chip-industry-a-18a45e6f8fce9b0a0ee34571cb1d7963/">Big Fund I</a> ($19 billion, launched 2014) and <a href="https://www.scmp.com/tech/science-research/article/3020172/china-said-complete-second-round-us29-billion-fund-will">Big Fund II</a> ($29 billion, launched 2019) added another $48 billion in combined headline commitments, bringing the three phases&#8217; total to about $95 billion.</p><p>On top of that, at least 15 local-government semiconductor <a href="https://www.bruegel.org/policy-brief/lessons-europe-chinas-quest-semiconductor-self-reliance">funds</a> add another $25 billion, with broader state subsidies (grants, tax incentives, low-interest loans, land concessions) estimated at around $50 billion more. These headline numbers come with caveats. The Big Fund disbursement has been <a href="https://technode.com/2022/08/12/silicon-why-is-china-investigating-the-state-backed-semiconductor-big-fund/">marred</a> by corruption investigations, with China&#8217;s Central Commission for Discipline Inspection launching a 2022 sweep of leading figures across the Big Fund and its management firms. Cases like Wuhan&#8217;s HSMC underscore that committed funding doesn&#8217;t always translate into effective semiconductor capability.</p><p><em>Veronika Blablov&#225; was funded through the <a href="https://www.pivotal-research.org/fellowship">Pivotal Research Fellowship</a> during this work.</em></p><p><em>We&#8217;re grateful to the following people for providing thoughtful and constructive feedback on earlier drafts of this work: Raghav Akula, James Nicholas Bryant, Naci Cankaya, Jacob Feldgoise, Jack Freed, Hamish Low, Antoine Maier, Konstantin Pilz, Robi Rahman, Maxwell Roberts, Filip &#352;ebok, and Kar Mun Nicole Wong. Their careful reading, questions, and suggestions substantially improved the final piece. Any remaining errors or shortcomings are entirely our own.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Throughout this text, &#8220;AI chip&#8221; refers to data center chips&#8212;the hardware used to train and run frontier AI models&#8212;rather than other AI chips, such as those found in phones, cameras, cars, and other devices.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Reuters <a href="https://www.reuters.com/world/china/synopsys-halts-china-sales-due-us-export-restrictions-internal-memo-shows-2025-05-30">reports</a> 70%, and the Financial Times <a href="https://www.ft.com/content/2c0db765-03ac-4820-8a02-806469848bee">reports</a> 80%.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Biren Technology draws on founding talent <a href="https://www.thinkchina.sg/technology/us-and-chinese-chipmakers-tread-different-paths-ai-gold-rush">linked</a> to SenseTime as well as NVIDIA, Qualcomm, and AMD; Enflame Technology and MetaX were founded by former AMD engineers; and Moore Threads was established by a former China executive of NVIDIA.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Based on DeepSeek&#8217;s <a href="https://www.tomshardware.com/tech-industry/artificial-intelligence/deepseek-research-suggests-huaweis-ascend-910c-delivers-60-percent-nvidia-h100-inference-performance">testing</a>, the Ascend 910C delivers 60% of the H100&#8217;s inference performance.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>An &#8220;H100-equivalent&#8221; is a unit gained by dividing a given chip&#8217;s theoretical computational performance with the computational performance of an NVIDIA H100. Given that this conversion uses theoretical computational performance, and ignores memory bandwidth and other specifications, it&#8217;s most relevant to training workloads, which are typically compute-bound.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>Under the BIS Advanced Computing rules (October 2022, updated October 2023), AI chips above a Total Processing Performance (TPP) of 4800 (or above 1600 combined with a Performance Density (PD) of 5.92) require a license for export to Chinese customers. In addition, Chinese chip designers that use US EDA software need a license to send chip-design files to foreign foundries, and foundries face a due-diligence red flag for chip designs exceeding 50 billion transistors and with HBM. See <a href="https://cset.georgetown.edu/article/bis-2023-update-explainer/">CSET</a> for more on the October 2023 updates.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p>In May 2026, Huawei <a href="https://globalsemiresearch.substack.com/p/huaweis-tau-scaling-law-a-technical">published</a> a paper on what it calls the &#8220;Tau (&#964;) Scaling Law&#8221;, a proposed successor to Moore&#8217;s Law that measures progress in chips by minimizing the time constant &#964; (signal propagation delay) rather than by shrinking transistors. The aim is to continue raising performance and density through a &#8220;LogicFolding&#8221; architecture that distributes a chip&#8217;s internal circuits across vertically stacked wafer layers. Using this design, Huawei <a href="https://www.reuters.com/world/asia-pacific/huawei-proposes-new-path-chip-development-amid-us-sanctions-2026-05-25/">aims</a> to &#8220;shorten wiring inside chips and considerably improve performance&#8221;. Huawei says Ascend chips are expected to fully adopt the approach by 2030. The fact that Huawei is pursuing this kind of architectural workaround is evidence that it is not capable of manufacturing cutting-edge AI chips.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-8" href="#footnote-anchor-8" class="footnote-number" contenteditable="false" target="_self">8</a><div class="footnote-content"><p>This estimate is based on data compiled from reports on shipment quantities and per-chip technical specs. The 2025 shipment estimates draw on <a href="https://www.reuters.com/world/china/chinese-chipmakers-claim-nearly-half-of-local-market-nvidias-lead-shrinks-idc-2026-04-01/">IDC via Reuters</a> for Huawei (~812K Ascend 910Cs and 910Bs), <a href="https://www.bloomberg.com/news/articles/2025-12-04/cambricon-aims-to-triple-chip-output-to-replace-nvidia-in-china">Bloomberg</a> for Cambricon (~150K Siyuan 590s, derived from the 2026 500K target framed as &#8220;more than triple 2025 output&#8221;), <a href="https://www.reuters.com/technology/artificial-intelligence/huawei-looks-beyond-moores-law-2026-05-27/">Reuters</a> for Alibaba T-Head (~265K, most likely Zhenwus), <a href="https://pdf.dfcfw.com/pdf/H2_AN202601221818280973_1.pdf">IPO filings</a> for Enflame (45,362 S60 chips produced January-September 2025, annualized to ~60K), <a href="https://kr-asia.com/baidu-integrates-ai-into-search-as-rivals-pursue-superapp-ambitions">KrAsia</a> for Baidu Kunlunxin (~60K Kunlun P800), the <a href="https://www1.hkexnews.hk/listedco/listconews/sehk/2025/1230/2025123000019.pdf">Iluvatar CoreX HKEX prospectus</a> (23.5K disclosed in Sep 2025, annualized to 30K), <a href="https://hellochinatech.com/p/metax-paradox-china-ai-chips">HelloChinaTech</a> for MetaX (~25K), and partial-year industry <a href="https://www1.hkexnews.hk/listedco/listconews/sehk/2025/1222/2025122200020_c.pdf">filings</a> for Biren (2,216 BR106 shipped in H1 2025, against 9,344 BR106 + 298 BR110 across full-year 2024; the prospectus notes that H2 2024 sales run higher than H1 2024 and that H1 2025 already exceeded H1 2024, so we extrapolate the full year to roughly 4,500-10,000 &#8212; from doubling H1 at the low end to repeating 2024&#8217;s level at the high end, we use ~10,000 for the visual) and a revenue-based estimate for Moore Threads, which discloses revenue, not unit shipments (The <a href="https://static.sse.com.cn/stock/disclosure/announcement/c/202509/002098_20250919_IP6Z.pdf">prospectus </a>puts AI-compute products at 94.85% of H1 2025 revenue, all built on two data-center chips: S4000 and S5000. Full-year 2025 <a href="https://www.stcn.com/article/detail/3816317.html">revenue </a>was RMB 1.51B; at ~95% AI-compute, that&#8217;s ~RMB 1.43B. Dividing by an assumed per-GPU average selling price of RMB 50,000-100,000 implies ~14,000-30,000 chips. As it is reported as cluster systems, not standalone cards; the estimate likely overstates the chip count, so we use ~20K for the visual.). The per-chip performance data is from Epoch AI&#8217;s <a href="https://epoch.ai/data/machine-learning-hardware">ML Hardware dataset</a>, supplemented by reported specs or specs estimates for newer Chinese chips not yet in the dataset. The aggregate compute is calculated as Chips shipped &#215; FP16 TFLOPS per chip, summed across designers, and expressed in H100-equivalents (each chip&#8217;s FP16 throughput &#247; the 989 TFLOPS of an NVIDIA H100). This makes for a total of about 770,000 H100-equivalents. Note that our tallies sum to about 1.4 million units, whereas Epoch AI reports 1.7 million units sold in 2025 by Chinese AI chip designers, so the real number might be somewhat higher.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-9" href="#footnote-anchor-9" class="footnote-number" contenteditable="false" target="_self">9</a><div class="footnote-content"><p>OpenAI&#8217;s Stargate Abilene facility is <a href="https://www.datacenterdynamics.com/en/news/openai-and-oracle-to-deploy-450000-gb200-gpus-at-stargate-abilene-data-center/">planned</a> to house &#8220;more than 450,000 NVIDIA GB200 GPUs&#8221;. These are likely 450,000 B200 GPUs. At about 2.5 H100-equivalents per B200 <a href="https://www.nvidia.com/en-us/data-center/gb200-nvl72/">chip</a>, the facility&#8217;s planned installed compute is about 1.1 million H100-equivalents. The comparison contrasts one year of Chinese chip shipments against a single US facility&#8217;s installed capacity, so it is not an apples-to-apples comparison.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-10" href="#footnote-anchor-10" class="footnote-number" contenteditable="false" target="_self">10</a><div class="footnote-content"><p>As of <a href="https://www.yicaiglobal.com/star50news/2025_03_286809128691839270912">end-2024</a>, the largest state-affiliated shareholder is Datang Holdings (about 14%), a subsidiary of the state-owned China Information and Communication Technologies Group Corporation (CICT), which provides communication infrastructure to the People&#8217;s Liberation Army. The National Integrated Circuit Industry Investment Fund&#8212;China&#8217;s primary state-backed vehicle for financing semiconductor development, known as the &#8220;Big Fund&#8221;&#8212;holds roughly 8% through its <a href="https://www.chipriskmonitor.com/crm-smic">subsidiary</a> Xinxin (Hong Kong) Investment, with an additional 1.6% via Big Fund II. CICT itself directly holds roughly 1%.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-11" href="#footnote-anchor-11" class="footnote-number" contenteditable="false" target="_self">11</a><div class="footnote-content"><p>The claim that five of Huawei&#8217;s 11 affiliated fabs are capable of 7nm-and-below processes originates from South Korean media outlets The Elec and Quasarzone, citing unnamed industry sources, and was subsequently reported in English by <a href="https://www.digitimes.com/news/a20250515PD215/huawei-dram-dongguan-government-shenzhen.html">Digitimes</a> and <a href="https://wccftech.com/huawei-strengthens-its-grip-over-the-chip-supply-chain/">WCCFTech</a>. Huawei has not confirmed the claim, and no independent teardown or facility audit has confirmed 7nm output from any of Huawei&#8217;s affiliated fabs apart from SMIC itself.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-12" href="#footnote-anchor-12" class="footnote-number" contenteditable="false" target="_self">12</a><div class="footnote-content"><p>In 2015, Tsinghua Unigroup, a conglomerate majority-owned by Tsinghua University and backed by China&#8217;s state semiconductor fund, made a $23 billion bid for Micron, which would have been the largest-ever Chinese <a href="https://www.cnbc.com/2015/07/13/chinas-tsinghua-unigroup-makes-23b-bid-for-micron-technology.html">acquisition</a> of a US company. The Committee on Foreign Investment in the United States (CFIUS), the US government&#8217;s foreign investment watchdog, made clear it would not <a href="https://www.scmp.com/tech/enterprises/article/1839280/tsinghua-unigroup-us23-bln-bid-micron-will-face-close-scrutiny-us">approve</a> the deal on national security grounds.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-13" href="#footnote-anchor-13" class="footnote-number" contenteditable="false" target="_self">13</a><div class="footnote-content"><p>Until early 2025, CXMT&#8217;s most advanced production sat at the threshold of the 2022 US <a href="https://www.tomshardware.com/pc-components/ssds/chinas-memory-maker-cxmt-reportedly-violates-us-export-rules-with-its-18nm-3d-dram-chipmaker-blatantly-presented-new-tech-at-industry-conference-report">export</a> rules, which originally prohibited the supplying or servicing of fab equipment for DRAM at nodes below 18nm half-pitch. CXMT <a href="https://www.digitimes.com/news/a20240122PD227/cxmt-ymtc-memory-chips-development.html">reportedly</a> told US authorities that it uses a 18.5nm process node (just outside the restricted threshold), allowing it to keep sourcing and servicing advanced foreign fab tools. At the same time, the company <a href="https://newsletter.semianalysis.com/p/intel-genai-for-yield-tsmc-cfet-and">presented</a> research on a more aggressive 18nm half-pitch DRAM built with gate-all-around transistors&#8212;signaling capability that runs ahead of what the fab manufactures at scale. In December 2024, BIS <a href="https://www.federalregister.gov/documents/2024/12/05/2024-28270/foreign-produced-direct-product-rule-additions-and-refinements-to-controls-for-advanced-computing">tightened</a> the rules and closed this potential loophole. The controls no longer depend on the node label and target memory cell area and DRAM density instead.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-14" href="#footnote-anchor-14" class="footnote-number" contenteditable="false" target="_self">14</a><div class="footnote-content"><p>The two principal bonding approaches used for HBM stacking are thermocompression bonding with non-conductive film (TC-NCF), which applies high temperature (about 300&#176;C) and force to bond dies via solder micro-bumps, and mass reflow with mold underfill (MR-MUF), which bonds at lower temperature and lower force and provides substantially better thermal dissipation. MR-MUF is the superior process, delivering higher yield than TC-NCF and handling heat far more<a href="https://news.skhynix.com/rulebreaker-revolutions-mr-muf-unlocks-hbm-heat-control/"> effectively</a> as stack heights increase. Whereas SK Hynix uses MR-MUF, Samsung and Micron have been unable to <a href="https://www.nomadsemi.com/p/deep-dive-on-hbm?hide_intro_popup=true">replicate</a> it due to both technical and patent barriers. CXMT&#8217;s HBM products are expected to rely on TC-NCF bonding. However, a reported<a href="https://www.tomshardware.com/pc-components/ram/ymtc-partners-with-cxmt-for-hbm"> partnership</a> between CXMT and YMTC could change this trajectory. YMTC&#8217;s Xtacking architecture&#8212;a wafer-to-wafer hybrid bonding process used in mass production of 3D NAND&#8212;is relevant because the HBM industry may migrate toward hybrid bonding for future generations. CXMT brings the DRAM process; YMTC brings the stacking technique.</p></div></div>]]></content:encoded></item><item><title><![CDATA[How good is China at export controls?]]></title><description><![CDATA[China&#8217;s export control agency has a fraction of BIS's staff, but makes up for it through politically centralized rulemaking and well-integrated interagency enforcement.]]></description><link>https://www.the-substrate.net/p/how-good-is-china-at-export-controls</link><guid isPermaLink="false">https://www.the-substrate.net/p/how-good-is-china-at-export-controls</guid><dc:creator><![CDATA[Maxwell K. Roberts]]></dc:creator><pubDate>Sun, 24 May 2026 14:03:33 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/1763135a-56eb-4d2b-ba70-ac3902705a96_2624x1856.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Capacity matters!</p><p>When the US and China make major announcements about new export controls, you shouldn&#8217;t just focus on the rules; you should also consider whether those governments can enforce them and whether companies will comply. Having written at length about improving US export control capacity (including about <a href="https://www.the-substrate.net/p/bis-is-getting-more-fundingheres">BIS funding</a>, <a href="https://www.the-substrate.net/p/bis-should-build-a-lean-mean-data">BIS software and data modernization</a>, and <a href="https://www.the-substrate.net/p/bis-should-use-ai-to-control-ai-chips">potential BIS use of AI</a>), I now want to cover China&#8217;s export control capacity, defined here as the ability to translate political intent into rules, administer licensing under those rules, and identify and halt violations of the rules.</p><p>As far as I can tell from trying to measure China&#8217;s export control capacity:</p><ul><li><p>China seems to retaliate against US export control actions much faster than the US can take them, probably because of some combination of a more streamlined interagency process, the US telegraphing its moves ahead of time, and China keeping a stock of prepared retaliatory measures.</p></li><li><p>China might be a little slower per employee than the US at processing export control license applications, but the error bars on how many license applications China receives are so wide that it&#8217;s hard to tell.</p></li><li><p>China takes a meaningfully different approach than the US on export control enforcement, leaning much harder on interagency partners, and seems to be producing more enforcement cases.</p></li></ul><h1>China&#8217;s export control rulemaking capacity</h1><p>To enforce an export control rule, one must first make an export control rule. The US has dozens of professional bureaucrats and a bustling think tank ecosystem dedicated to this, resulting in such blockbusters as the <a href="https://www.federalregister.gov/documents/2022/10/13/2022-21658/implementation-of-additional-export-controls-certain-advanced-computing-and-semiconductor">October 2022 AI chip and semiconductor manufacturing equipment rule</a>, the <a href="https://www.federalregister.gov/documents/2025/01/16/2025-00711/implementation-of-additional-due-diligence-measures-for-advanced-computing-integrated-circuits">Foundry Due Diligence Rule</a>, and the great-granddaddy of them all, the now-rescinded, never-replaced <a href="https://www.federalregister.gov/documents/2025/01/15/2025-00636/framework-for-artificial-intelligence-diffusion">Framework for Artificial Intelligence Diffusion</a>.</p><p>In the Chinese system, export control rules are made by the Bureau of Industry, Security, Import, and Export Controls (BISIEC<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>), housed within the Ministry of Commerce (MOFCOM<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>). According to China&#8217;s <a href="https://cset.georgetown.edu/publication/china-dual-use-export-control-regulation/">Export Control Regulations</a>:<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a></p><blockquote><p>The State Council main oversight department for commerce, in conjunction with other relevant national departments, shall formulate and adjust policies on dual-use item export control, and major policies shall be reported for approval to the State Council, or to the State Council and the Central Military Commission (CMC).</p></blockquote><p>On the surface, this looks a lot like the US interagency process! BIS makes rules, in consultation with other interested parties, and senior political leadership signs off on major actions. However, the US interagency process is sometimes very slow and always very contentious. The December 2024 rule on high-bandwidth memory and semiconductor manufacturing equipment was largely finished by August, according to information that <a href="https://www.bloomberg.com/news/articles/2024-07-31/us-weighs-new-restrictions-on-china-s-access-to-ai-memory-chips">leaked to Bloomberg</a>. The Trump administration has been trying and failing to find consensus on an AI diffusion replacement for a year now, and early drafts have <a href="https://www.bloomberg.com/news/articles/2026-03-05/us-drafts-rules-for-sweeping-power-over-nvidia-s-global-sales">also leaked to Bloomberg</a>. (I truly believe that the greatest step DC policy practitioners could all take for US national security is, <em>for the love of God, stop leaking to Bloomberg</em>.)</p><p>The Chinese interagency process is less prone to leaks to Bloomberg. And given how quickly China has responded to US export control and tariff actions, the Chinese rulemaking system seems <em>incredibly fast</em>:</p><ul><li><p>On December 2, 2024, the US published a rule including <a href="https://www.federalregister.gov/documents/2024/12/05/2024-28270/foreign-produced-direct-product-rule-additions-and-refinements-to-controls-for-advanced-computing">140 Entity List additions and new controls on high-bandwidth memory and semiconductor manufacturing equipment</a>. On December 3, 2024, China responded with <a href="https://www.mofcom.gov.cn/zwgk/zcfb/art/2024/art_3d5e990b43424e60828030f58a547b60.html">licensing policy changes for gallium, germanium, antimony, superhard items, and military end-users</a> (one day later).</p></li><li><p>On February 1, 2025, the US imposed a <a href="https://www.whitehouse.gov/fact-sheets/2025/02/fact-sheet-president-donald-j-trump-imposes-tariffs-on-imports-from-canada-mexico-and-china/">10% tariff under the International Emergency Economic Powers Act on Chinese imports</a>. On February 4, 2025, China responded with <a href="https://www.mofcom.gov.cn/zwgk/zcfb/art/2025/art_e623090907fc4e1092f0a4db72f57b95.html">export controls on tungsten, tellurium, bismuth, molybdenum, and indium items</a> (three days later).</p></li><li><p>On April 2, 2025, the US imposed a <a href="https://www.federalregister.gov/documents/2025/04/07/2025-06063/regulating-imports-with-a-reciprocal-tariff-to-rectify-trade-practices-that-contribute-to-large-and">34% reciprocal tariff</a> on China (stacked with previous tariffs). On April 4, 2025, China responded with <a href="https://www.mofcom.gov.cn/zwgk/zcfb/art/2025/art_9c2108ccaf754f22a34abab2fedaa944.html">export controls on seven rare earth elements</a> (two days later).</p></li><li><p>On September 29, 2025, BIS published the <a href="https://www.federalregister.gov/documents/2025/09/30/2025-19001/expansion-of-end-user-controls-to-cover-affiliates-of-certain-listed-entities">Affiliates Rule</a>, making companies more than 50% owned by Entity List parties or military end users subject to export controls. On October 29, 2025, China responded with a six-announcement package, including <a href="https://www.mofcom.gov.cn/zwgk/zcfb/art/2025/art_59ec4f6bec0b459aa4a30c4bbd0a41c1.html">additional rare earth elements</a> and <a href="https://www.mofcom.gov.cn/zwgk/zcfb/art/2025/art_6cb42957741440c6984de696b70df9ae.html">rare earth technology controls</a>, among other measures (thirty days later).</p></li></ul><p>This is not an exhaustive list, but it makes the point that China is generally able to respond to US actions within a week, with the massive and complex October package taking only slightly longer. I have at least three theories about why China is so quick to respond, which aren&#8217;t mutually exclusive:</p><ol><li><p>Maybe China&#8217;s interagency process is far more top-down and centralized than the consensus-based US equivalent, and sacrifices debate for the sake of speed</p></li><li><p>Maybe BISIEC has a bank of pre-written, pre-approved retaliatory actions to publish as soon as the US does something (or, more depressingly, maybe the US system is so leaky that China always has lots of advance warning to prep retaliation)</p></li><li><p>Maybe China doesn&#8217;t care as much about narrowly tailoring controls to avoid pain to industry, and/or is just writing rules that are much less sophisticated and interesting than US rules (blanket raw material controls as opposed to detailed rules like US controls on chips and semiconductor manufacturing equipment)</p></li></ol><p>The US government could draw some lessons from China on how to move faster. The US has a participatory, debate-oriented interagency process that ensures all stakeholders are heard and likely results in better policy. However, some consolidation and centralization of the process may be worthwhile when a lack of speed makes good policy moot&#8212;for example, if China can stockpile controlled items well ahead of a rule taking effect.</p><h1>China&#8217;s export control licensing capacity</h1><p>Once you make an export control rule, people will apply for permission to export things restricted under the rule, and you will have to process export control licenses. If you don&#8217;t have the capacity to evaluate license applications, even routine, low-risk ones will drag out forever, creating billions in unnecessary lost sales for your exporters.</p><p>BIS and BISIEC have both experienced versions of this problem. After mass staff departures at BIS last year, <a href="https://www.bloomberg.com/news/articles/2026-04-10/trump-s-ai-chip-export-push-stymied-by-bureaucratic-bottleneck">licensing timelines have started to stretch to several months</a>, even for routine exports to US allies. China faced <a href="https://www.reuters.com/business/autos-transportation/worlds-auto-supply-chain-is-hands-few-chinese-bureaucrats-2025-06-05/">severe delays in issuing rare earth export licenses</a> in the summer of 2025 after expanding the scope of its controls that spring. In both cases, the delays were a mix of deliberate policy (BIS applying extra leadership scrutiny and holding some applications; BISIEC potentially slow-walking approvals to ramp up pressure) and staffing problems (BIS losing lots of staff from a stable baseline; BISIEC trying to ramp up to match the increased scope of controls).</p><p>What is the best way to measure China&#8217;s licensing capacity? One way to start is by comparing the number of dedicated licensing staff between BIS and BISIEC:</p><ul><li><p><strong>BIS Total Employees:</strong> 436<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a></p></li><li><p><strong>BISIEC Total Employees:</strong> 30 (60 including detailees)<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a></p></li><li><p><strong>BIS Licensing Employees (estimated): </strong>At most 100;<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a> likely ~80</p></li><li><p><strong>BISIEC Licensing Employees (estimated): </strong>Maybe 30, including detailees?<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a></p></li></ul><p>BISIEC has about 14% as many staff as BIS overall, but proportionately more licensing officers, about 38% as many as BIS, though both licensing staff numbers are estimates. According to its <a href="https://www.bis.gov/media/documents/bis-fiscal-year-2023-annual-report.pdf">Fiscal Year 2023 Annual Report</a>, BIS processed about 38,000 licenses in fiscal year 2023, which ran from October 2022 to September 2023, the most recent fiscal year for which data is available. In March 2023, BIS probably had about 80 licensing officers,<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-8" href="#footnote-8" target="_self">8</a> comparable to December 2025.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-9" href="#footnote-9" target="_self">9</a> With 250 working days per year, that means BIS&#8217;s licensing efficiency was about 1.9 licenses processed per officer per day.</p><p>There are important caveats to that number as a measure of licensing capacity. The 38,000 number is the number of applications that reached an end state in fiscal year 2023 (approved, denied, or returned without action). However, it&#8217;s not clear from external data whether BIS was bottlenecked by licensing officers or by the number of licenses received&#8212;maybe BIS could have processed far more licenses, but businesses just didn&#8217;t apply for that many. My impression is that BIS licensing officers were quite overworked when I was there, and that BIS was bottlenecked by them.</p><p>If BISIEC is about as efficient as BIS at processing licenses, then it could process about 14,000 licenses per year. But how many license applications is BISIEC receiving?</p><p>This turns out to be maddeningly difficult to figure out. China&#8217;s General Administration of Customs publishes rare earth exports by tonnage and value, but not by count of transactions. One extremely rough way to estimate this is to take a <a href="https://www.reuters.com/world/china/eu-firms-brace-more-shutdowns-due-china-rare-earth-controls-despite-summit-2025-09-17/">Reuters report of 140 export license applications</a> by European companies from April to September 2025,<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-10" href="#footnote-10" target="_self">10</a> annualize that to 280 licenses per year, and multiply that by the reciprocal of Europe&#8217;s share of China&#8217;s magnet exports (about 5x).<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-11" href="#footnote-11" target="_self">11</a> That gives an estimate of 1,400 applications, which you can then multiply by the approval rate of &#8220;less than a quarter&#8221; mentioned in the Reuters article to get up to 350 applications processed and approved.</p><p>There are a couple of reasons to be cautious about that calculation. License applications track the number of transactions rather than the tonnage of exports, so Europe&#8217;s share of exports by tonnage might misrepresent its share of licenses in either direction. The actual number could also be higher, both because China controls many items besides rare earth magnets, and because China might be denying Europe licenses by indefinitely delaying approvals, even though the two are not in a trade war. BISIEC may be less efficient per employee than BIS at processing applications, but there are too many confounding factors to make a confident judgment.</p><h1>China&#8217;s export control enforcement capacity</h1><p>As noted earlier, BISIEC is considerably smaller than BIS, especially in enforcement. It has 14% as many staff as BIS, but proportionately more licensing officers (38% as many) and proportionately fewer enforcement agents (7% as many):</p><ul><li><p><strong>BIS Total Employees:</strong> 436<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-12" href="#footnote-12" target="_self">12</a></p></li><li><p><strong>BISIEC Total Employees:</strong> 30 (60 including detailees)<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-13" href="#footnote-13" target="_self">13</a></p></li><li><p><strong>BIS Enforcement Employees: </strong>136<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-14" href="#footnote-14" target="_self">14</a></p></li><li><p><strong>BISIEC Enforcement Employees (estimated): </strong>Maybe 9?<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-15" href="#footnote-15" target="_self">15</a></p></li></ul><p>Because BIS has far more enforcement staff than BISIEC (136 BIS agents to about nine BISIEC agents), I think BISIEC leans on other agencies for enforcement help even more than BIS does. Lack of workload isn&#8217;t a sufficient explanation&#8212;China&#8217;s <a href="https://www.mofcom.gov.cn/zwgk/zcfb/art/2025/art_c03d1e511b2b486e829d68e8f1422aff.html">dual-use export control list </a>contains hundreds of items beyond rare earths, similar to the US Commerce Control List, and there have been <a href="https://user.guancha.cn/main/content?id=1503801">several documented cases</a> of attempted rare earth smuggling.</p><p>BIS was <a href="https://www.federalregister.gov/agencies/export-administration-bureau">born in 1987</a> as the Bureau of Export Administration (BXA)&#8212;a pure licensing agency that only gradually acquired in-house enforcement capabilities. It&#8217;s not obvious that in-house law enforcement is the right way to go for a highly technical regulatory agency&#8212;BIS gradually evolved that way, probably out of a combination of (1) bureaucratic will to survive and (2) the fact that successfully investigating and prosecuting export control cases often required technical regulatory, scientific, and engineering knowledge that BIS itself was best placed to provide.</p><p>BISIEC has three Investigation and Enforcement divisions on its <a href="https://aqygzj.mofcom.gov.cn/gywm/art/2014/art_8115302d90e84fd8a0f790a9c3631557.html">organizational chart</a>, but with at best nine staff between them, I doubt they do any meaningful enforcement. Even with 150 agent slots in the budget, BIS felt so strapped for enforcement capacity that in its fiscal year 2026 budget request, it asked for funding to hire 193 more.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-16" href="#footnote-16" target="_self">16</a></p><p>According to Epoch, about <a href="https://epoch.ai/data/ai-chip-sales?view=graph&amp;tab=h100_equivalents">20 million H100-equivalents of chips</a> have been sold worldwide as of Q4 2025. That&#8217;s more than the number of physical chips in the world (Blackwells and other cutting-edge chips count as more than one H100-equivalent), but it shows how hard it is to enforce export controls worldwide with 136 people, let alone nine. To have any chance of managing that, I think BISIEC must be leaning hard on its interagency partners, specifically the National Mechanism for Coordinating Export Controls<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-17" href="#footnote-17" target="_self">17</a>, Customs, the Ministry of Public Security, the Ministry of State Security, and the State Post Bureau.</p><p>There is some evidence for that idea. On May 9, 2025, just before the Geneva trade truce, the National Mechanism for Coordinating Export Controls held an <a href="https://www.mofcom.gov.cn/zwgk/jgdt/art/2025/art_2f8636687d014b2c84384628b01ab410.html">interagency meeting</a> to launch a new enforcement campaign, along with a <a href="https://www.mofcom.gov.cn/syxwfb/art/2025/art_7cc3dcb36d34499c946b7e1ef11240eb.html">follow-up meeting</a> on July 19. That year, according to data compiled by Han Kun Law, the number of administrative penalty cases for dual-use items more than doubled.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vGch!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1418eeb4-5cbb-4fd5-a79b-41b7d6caded5_600x341.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vGch!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1418eeb4-5cbb-4fd5-a79b-41b7d6caded5_600x341.png 424w, https://substackcdn.com/image/fetch/$s_!vGch!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1418eeb4-5cbb-4fd5-a79b-41b7d6caded5_600x341.png 848w, https://substackcdn.com/image/fetch/$s_!vGch!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1418eeb4-5cbb-4fd5-a79b-41b7d6caded5_600x341.png 1272w, https://substackcdn.com/image/fetch/$s_!vGch!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1418eeb4-5cbb-4fd5-a79b-41b7d6caded5_600x341.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vGch!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1418eeb4-5cbb-4fd5-a79b-41b7d6caded5_600x341.png" width="600" height="341" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1418eeb4-5cbb-4fd5-a79b-41b7d6caded5_600x341.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:341,&quot;width&quot;:600,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vGch!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1418eeb4-5cbb-4fd5-a79b-41b7d6caded5_600x341.png 424w, https://substackcdn.com/image/fetch/$s_!vGch!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1418eeb4-5cbb-4fd5-a79b-41b7d6caded5_600x341.png 848w, https://substackcdn.com/image/fetch/$s_!vGch!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1418eeb4-5cbb-4fd5-a79b-41b7d6caded5_600x341.png 1272w, https://substackcdn.com/image/fetch/$s_!vGch!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1418eeb4-5cbb-4fd5-a79b-41b7d6caded5_600x341.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>Trend in the number of administrative penalty cases for dual-use items, 2020-2025, taken from a <a href="https://www-hankunlaw-com.translate.goog/portal/article/index/cid/8/id/16182.html?_x_tr_sl=auto&amp;_x_tr_tl=en&amp;_x_tr_hl=en&amp;_x_tr_pto=wapp">Han Kun Law</a> review based on a search of public administrative penalty data for the keyword &#8220;dual-use&#8221;.</em></figcaption></figure></div><p>The increase could have other causes. China applied controls to many more items in 2025, expanding the universe of potential violators, including those acting out of simple ignorance. There might also be a learning-curve effect, with China getting better at enforcing controls.</p><p>However, I think the increased number represents a genuine effort to build interagency capacity. The Han Kun review describes the establishment of a &#8220;data-sharing channel&#8221; between Customs, Commerce, and the Ministry of Public Security. At the follow-up meeting in July, the Mechanism instructed the interagency to &#8220;study and establish a joint law enforcement coordination center for the export control of dual-use items&#8221;, possibly similar to the Commerce-Intelligence Community joint effort proposed in last year&#8217;s <a href="https://www.whitehouse.gov/wp-content/uploads/2025/07/Americas-AI-Action-Plan.pdf">American AI Action Plan</a>.</p><h1>Conclusion</h1><p>China&#8217;s export control system is developing into a force to be reckoned with. China is extremely quick to respond to US rules, is working to build licensing capacity (including by deploying detailees to BISIEC), and has found meaningful success with a unique, interagency-focused approach to export control enforcement. As the US and China continue to renegotiate their fraught economic and security relationship, it&#8217;s critical to understand China&#8217;s export control system, both to calibrate the US&#8217;s understanding of China&#8217;s true leverage and to draw lessons for the US&#8217;s own efforts.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Mandarin: &#20135;&#19994;&#23433;&#20840;&#19982;&#36827;&#20986;&#21475;&#31649;&#21046;&#23616;, sometimes shortened to &#23433;&#20840;&#19982;&#31649;&#21046;&#23616; (Security and Control Bureau).</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Mandarin: &#21830;&#21153;&#37096; (MOFCOM).</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Specifically, Article 9.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>As of December 2025, according to <a href="http://data.opm.gov">OPM FedScope Employment Data</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p><a href="https://www.reuters.com/business/autos-transportation/worlds-auto-supply-chain-is-hands-few-chinese-bureaucrats-2025-06-05/">Reuters</a> reported the 30 staff number and the increase to 60, and <a href="https://www.bloomberg.com/news/articles/2025-11-04/china-s-export-control-taskforce-adds-most-headcount-since-2022">Bloomberg</a> reported the increase was likely detailees from other parts of the Chinese government.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>As of December 2025; <a href="http://data.opm.gov">OPM FedScope Employment Data</a> cross-referenced with USAJobs postings to understand employment in specific job series.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p>Assuming three employees per licensing division (3 &#215; 3 = 9), <em>and</em> that the vast majority of the detailees were assigned to licensing. I think it&#8217;s reasonable to assume most of the detailees were assigned to licensing because (1) the Secretary General of the European Chamber of Commerce in China was quoted in <a href="https://www.reuters.com/business/autos-transportation/worlds-auto-supply-chain-is-hands-few-chinese-bureaucrats-2025-06-05/">Reuters</a> as appreciating that &#8220;MOFCOM has increased its resources to address demand&#8221; after meeting with them in June 2025, and he presumably was praising increased licensing capacity to European automakers, NOT increased enforcement capacity; and (2) I think MOFCOM outsources enforcement a lot more heavily than BIS does, as I elaborate on later in the piece.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-8" href="#footnote-anchor-8" class="footnote-number" contenteditable="false" target="_self">8</a><div class="footnote-content"><p><a href="https://data.opm.gov/">OPM FedScope Employment Data</a> cross-referenced with USAJobs postings to understand employment in specific job series.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-9" href="#footnote-anchor-9" class="footnote-number" contenteditable="false" target="_self">9</a><div class="footnote-content"><p>As far as I can tell, the attrition in 2025 mostly canceled out people who had been hired in the interim.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-10" href="#footnote-anchor-10" class="footnote-number" contenteditable="false" target="_self">10</a><div class="footnote-content"><p>Assuming companies began applying after the April expansion of controls, not the February expansion.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-11" href="#footnote-anchor-11" class="footnote-number" contenteditable="false" target="_self">11</a><div class="footnote-content"><p><a href="https://comtrade.un.org/">UN Comtrade data</a> for 2024 for HS code 8505.11 (technically includes all magnets, but rare earth-based magnets dominate Chinese exports under the code).</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-12" href="#footnote-anchor-12" class="footnote-number" contenteditable="false" target="_self">12</a><div class="footnote-content"><p>As of December 2025, according to <a href="http://data.opm.gov">OPM FedScope Employment Data</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-13" href="#footnote-anchor-13" class="footnote-number" contenteditable="false" target="_self">13</a><div class="footnote-content"><p><a href="https://www.reuters.com/business/autos-transportation/worlds-auto-supply-chain-is-hands-few-chinese-bureaucrats-2025-06-05/">Reuters</a> reported the 30 staff number and the increase to 60, and <a href="https://www.bloomberg.com/news/articles/2025-11-04/china-s-export-control-taskforce-adds-most-headcount-since-2022">Bloomberg</a> reported the increase was likely detailees from other parts of the Chinese government.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-14" href="#footnote-anchor-14" class="footnote-number" contenteditable="false" target="_self">14</a><div class="footnote-content"><p>As of December 2025, according to <a href="http://data.opm.gov">OPM FedScope Employment Data</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-15" href="#footnote-anchor-15" class="footnote-number" contenteditable="false" target="_self">15</a><div class="footnote-content"><p>Assuming three employees per enforcement division (3 &#215; 3 = 9).</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-16" href="#footnote-anchor-16" class="footnote-number" contenteditable="false" target="_self">16</a><div class="footnote-content"><p>The reason the number of enforcement employees (136) is lower than the number of agent budget slots (150) is that sometimes people quit BIS (as many did in 2025), and it takes BIS a while to replace them.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-17" href="#footnote-anchor-17" class="footnote-number" contenteditable="false" target="_self">17</a><div class="footnote-content"><p>Mandarin: &#22269;&#23478;&#20986;&#21475;&#31649;&#21046;&#24037;&#20316;&#21327;&#35843;&#26426;&#21046;.</p></div></div>]]></content:encoded></item><item><title><![CDATA[How much US compute is China renting from the cloud?]]></title><description><![CDATA[China is restricted from purchasing advanced US chips, but it can rent liberally from the cloud. Even a modest share of US cloud compute could boost China&#8217;s 2026 AI capacity by at least 60%.]]></description><link>https://www.the-substrate.net/p/how-much-us-compute-is-china-renting</link><guid isPermaLink="false">https://www.the-substrate.net/p/how-much-us-compute-is-china-renting</guid><dc:creator><![CDATA[Cassia King]]></dc:creator><pubDate>Fri, 22 May 2026 17:20:40 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/7d894312-7f15-43c1-a5f5-70eb4d5b68e8_2624x1856.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>This is part one of a series on remote access to US compute through cloud services. Subsequent posts will examine regulatory authorities on cloud compute and explore ideas for know-your-customer frameworks.</em></p><p>The US restricts the sale of advanced AI chips to China, and on the whole these controls appear to be working: China&#8217;s projected compute ownership in 2026 amounts to less than 5% of the United States&#8217;.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> But because compute remains unrestricted through cloud services, Chinese companies can access an unknown additional amount remotely. This post tries to scope how much.</p><p>Specifically, NVIDIA Blackwell sales are prohibited to Chinese customers, and NVIDIA H200s are subject to approval, with licenses <a href="https://www.reuters.com/business/retail-consumer/us-clears-h200-chip-sales-10-china-firms-nvidia-ceo-looks-breakthrough-2026-05-14/">reportedly</a> capped at 75,000 chips per customer. But there are no customer caps on remote access to compute through the cloud, including on more advanced Blackwells and, in the future, Rubin chips. US hyperscalers, neoclouds, and third-country providers offer rentable AI compute<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> through the cloud, and the data center build-out behind these offerings is set to expand massively over the next few years.</p><p>Researchers have <a href="https://www.the-substrate.net/p/where-will-china-get-its-compute">a reasonably good idea</a> of how much physical compute China produced and acquired in 2025 and how much it will produce and acquire in 2026. What remains unknown is how much AI compute they access from cloud service providers. If providers track this figure, they do not share it publicly or with the US government (as far as I am aware). According to data from Epoch AI, key US-headquartered cloud providers account for around 16 million H100-equivalents<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a> of the 20 million sold since 2022, amounting to about 80% of total advanced AI compute available worldwide. After accounting for compute rented by US labs, how much of the remaining could China be accessing?<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a></p><p>In this post, I provide rough estimates to establish upper and lower bounds on how much compute Chinese companies could be accessing remotely through the cloud.<strong> I estimate this range is between about 9 million H100-equivalents and 670,000 H100-equivalents.</strong> <strong>This is a broad range&#8212;the main takeaway from this post is that it&#8217;s not known how much compute is rented to otherwise prohibited users. </strong>The upper bound is an outsized number relative to what&#8217;s been reported, and it&#8217;s likely the actual number is around 670,000, if reports are accurate. In a Monte Carlo simulation, ChinaTalk analysts <a href="https://www.chinatalk.media/p/how-many-chips-does-china-have">arrive at a similar estimate</a> of about 1 million H100-equivalents, which would increase China&#8217;s compute access by at least 80%.</p><p>Below, I synthesize what researchers have gathered from open-source reports, press releases, and earnings calls to estimate how much cloud compute could be available for commercial rent (upper bound) and how much China is known to rent (lower bound). Given that China is renting, or will be renting, a sizable amount of compute in Malaysia, I&#8217;ll zoom in on the country&#8217;s data center build-out to analyze what capacity is there now and what&#8217;s expected to come online in the coming years.</p><h1>How much US compute could China be renting?</h1><p>There is a fair amount of open-source information out there to help researchers piece together reliable estimates. Epoch AI tracked revenue statements and earnings calls to estimate AI chip ownership, informing my upper-bound estimate of how much compute is commercially available to rent globally. Investigative journalists confirm that Chinese companies are renting US compute from various cloud service providers, informing my lower bound on how much China is accessing.</p><h2>An upper bound for cloud compute used by China</h2><p><strong>I estimate that of approximately 20 million H100-equivalent AI chips sold from 2022 through the end of 2025, about 9 million are commercially available to rent.</strong> In the chart below, Epoch AI <a href="https://epoch.ai/data/ai-chip-sales?view=graph&amp;tab=h100_equivalents">estimates</a> US compute sales and disclosures for NVIDIA, AMD, Google, and Amazon chips total around 20 million H100-equivalents.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FUbQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b65099f-55fa-45d0-a3d2-7f231bc744db_1920x1080.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FUbQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b65099f-55fa-45d0-a3d2-7f231bc744db_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!FUbQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b65099f-55fa-45d0-a3d2-7f231bc744db_1920x1080.png 848w, https://substackcdn.com/image/fetch/$s_!FUbQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b65099f-55fa-45d0-a3d2-7f231bc744db_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!FUbQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b65099f-55fa-45d0-a3d2-7f231bc744db_1920x1080.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FUbQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b65099f-55fa-45d0-a3d2-7f231bc744db_1920x1080.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9b65099f-55fa-45d0-a3d2-7f231bc744db_1920x1080.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!FUbQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b65099f-55fa-45d0-a3d2-7f231bc744db_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!FUbQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b65099f-55fa-45d0-a3d2-7f231bc744db_1920x1080.png 848w, https://substackcdn.com/image/fetch/$s_!FUbQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b65099f-55fa-45d0-a3d2-7f231bc744db_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!FUbQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b65099f-55fa-45d0-a3d2-7f231bc744db_1920x1080.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>To trim down this number, I subtract sales to companies that don&#8217;t provide cloud services and sales to China. That leaves about 16 million H100-equivalents, accounted for by Google, Microsoft, Oracle, CoreWeave, and a remaining category of chip sales labeled &#8220;Other&#8221; that I will sort out further below.</p><p>Now, subtract the compute set aside for US frontier labs. I include data centers that came online as recently as May 2026, assuming at least three to six months&#8217; time for GPU testing and commissioning.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a></p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/syKW2/2/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d86722bb-469f-4a1a-b655-ab07b6514d75_1220x1746.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f215cfe0-386d-4859-9aab-c62ec8532056_1220x1816.png&quot;,&quot;height&quot;:1637,&quot;title&quot;:&quot;Amount of Cloud Compute Rented to Frontier Labs&quot;,&quot;description&quot;:&quot;&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/syKW2/2/" width="730" height="1637" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p>After accounting for the AI compute used by frontier labs at cloud service providers, about 11 million H100-equivalents remain.</p><p>Epoch AI&#8217;s &#8220;Other&#8221; category accounts for NVIDIA&#8217;s remaining customers after accounting for hyperscalers, xAI, and CoreWeave, totaling 3.4 million H100-equivalents. These sales are presumed to consist of NVIDIA&#8217;s Sovereign AI program, Tesla, and dozens of neoclouds. I would also add a growing fourth category: enterprise customers. Of the four, only neoclouds offer commercial cloud services. NVIDIA&#8217;s Sovereign AI program made up about 15% of the company&#8217;s data center revenue in 2025, 16% in 2024, and reportedly zero before then. I estimate it at roughly 2 million H100-equivalents. As noted in Epoch AI&#8217;s methodology, Tesla has disclosed about 120,000 H100-equivalents. I don&#8217;t have a clear picture of how much of NVIDIA&#8217;s sales go to non-frontier lab enterprises, but reporting suggests this number isn&#8217;t yet significant.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a> The remaining sales are most likely attributed to neoclouds.</p><p>After accounting for Sovereign AI sales and Tesla<strong>, I estimate that about 9 million H100-equivalents remain available for potential commercial cloud use. </strong>Some chips are likely missing from this calculation, including compute that cloud providers use internally for non-frontier-model products like Google Search&#8217;s AI Overview, though these figures are probably small. So 9 million H100-equivalents is likely on the high end.</p><h2>A rough lower bound for cloud compute used by China</h2><p>Several investigative reports have identified Chinese entities, ranging from big tech companies like Alibaba and Tencent to state-backed universities and labs, that rent compute from various US- and foreign-owned cloud companies. Not all of these reports include chip estimates, but here are the ones I know of.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-8" href="#footnote-8" target="_self">8</a></p><p>Based on the following reports with direct figures, I&#8217;m confident that Chinese companies are accessing (or will access this year) at least 145,000 H100-equivalents.</p><ul><li><p><strong>51,000 H100-equivalents.</strong> Tencent is renting nearly 5,000 B200s (13,000 H100-equivalents) from Japanese cloud company Datasection in Osaka, and 10,000 B300s (38,000 H100-equivalents) from its data center in Australia.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-9" href="#footnote-9" target="_self">9</a></p></li><li><p><strong>91,000 H100-equivalents.</strong> ByteDance is working with Singapore-based neocloud Aolani Cloud on an expansion of its data center in Malaysia to rent an additional 36,000 B200s.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-10" href="#footnote-10" target="_self">10</a> If these B200s were purchased by the end of 2025, this figure belongs here.</p></li><li><p><strong>2,800 H100-equivalents.</strong> <a href="https://www.reuters.com/technology/list-chinese-entities-who-have-turned-cloud-access-restricted-us-tech-2024-08-23/">Reuters</a> found a collection of tenders requesting access to NVIDIA A100s through cloud service providers. Of these, at least 9,000 A100s were accounted for in fulfilled tenders.</p></li></ul><p>I estimate an additional 520,000 H100-equivalents based on reports of ByteDance&#8217;s 2025 budget:</p><ul><li><p><strong>520,000 H100-equivalents.</strong> The Information reported that ByteDance planned to spend up to $7 billion in 2025 on remote access to NVIDIA chips. Based on H100-rental pricing at the time, it&#8217;s estimated that $7 billion could account for about 610,000 H100s.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-11" href="#footnote-11" target="_self">11</a> I subtract the 91,000 H100-equivalents counted above from the Aolani cluster. If Aolani didn&#8217;t purchase the B200s until after 2025, this budget likely accounts for rental purchases elsewhere.</p></li></ul><p>This figure seems plausible given how much ByteDance rents from data centers in Malaysia alone. DayOne hosts a 390 MW data center in Johor, Malaysia, of which ByteDance is reportedly the largest customer.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-12" href="#footnote-12" target="_self">12</a> I estimate that a data center this size could serve approximately 156,000 B200s, or about 390,000 H100-equivalents.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-13" href="#footnote-13" target="_self">13</a> Additionally, ByteDance is reported to be the anchor tenant for Bridge Data Centres&#8217; Johor location, a 110 MW data center that I approximate could serve around 44,000 B200s or about 110,000 H100-equivalents. These data centers in Malaysia alone account for at least half of ByteDance&#8217;s 2025 budget.</p><p>Researchers tracking data center build-out in Southeast Asia will notice that Oracle&#8217;s cloud region in Malaysia is missing. I&#8217;ve omitted it because construction won&#8217;t be completed until at least the end of 2026, and it&#8217;s not clear if GPUs have been sold yet. But its planned build-out is included in the Malaysia dataset in the next section. Once completed, this will account for another 130,000 B200s or approximately 330,000 H100-equivalents, a majority of which is assumed to go to ByteDance or other Chinese customers.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-14" href="#footnote-14" target="_self">14</a></p><p><strong>In total, I estimate that Chinese companies are renting </strong><em><strong>at least</strong></em><strong> around 670,000 H100-equivalents.</strong><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-15" href="#footnote-15" target="_self">15</a><strong> </strong>Huawei is <a href="https://www.the-substrate.net/p/where-will-china-get-its-compute">estimated</a> to produce about 150,000 H100-equivalents next year. It might illegally fabricate another 7,500 H100-equivalents and smuggle another 75,000. If the US approves the maximum number of H200 and similar licenses, that&#8217;s <a href="https://www.cnas.org/publications/cnas-insights/cnas-insights-unpacking-the-h200-export-policy">potentially another 890,000 H100-equivalents</a>. <strong>So if China were to remotely access 670,000 H100-equivalents in 2026, that would increase its total access to advanced compute by about 60%. If the US does not export H200s, it would increase by almost 4x.</strong></p><h1>Buildout in Malaysia</h1><p>Global data center capacity is expected to double to 200 GW between 2026 and 2030. Where is that compute capacity being built?<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-16" href="#footnote-16" target="_self">16</a> The United States will continue to lead significantly in data center capacity, with China close behind and middle powers working to remain competitive. While Malaysia&#8217;s AI data center market size is a fraction of other middle powers&#8217;, it has the highest projected compound annual growth rate in Southeast Asia and among the highest globally. Though its market share is smaller, the buildout of data centers under former China subsidiary DayOne, other foreign-owned data centers dedicated to China&#8217;s customers like Aolani Cloud, and US hyperscaler buildout in the region make Malaysia worth watching as an intermediary for China&#8217;s remote access to US compute.</p><p>Further, while US cloud providers are now permitted to rent advanced compute to Chinese customers, China may prefer to rent from foreign-owned data centers in the region, where business ties are already established and trusted, and where the US government keeps a less watchful eye. For example, if US authorities start to crack down on cloud controls, it&#8217;s likely easier to enforce controls with US companies than with foreign-owned ones.</p><p>With this in mind, I look at data center capacity in Malaysia.</p><h3>Operational and Planned Construction</h3><p>There is already a lot of buzz about the data center build-out in Johor and Kuala Lumpur. I&#8217;ve estimated Malaysia&#8217;s operational and planned AI data center capacity in IT megawattage below.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-17" href="#footnote-17" target="_self">17</a></p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/VQsWy/2/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/958972d8-435b-4586-942c-b0b0c15b940b_1220x234.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a8541f7f-7e69-4b5d-8738-f2524f573f3e_1220x304.png&quot;,&quot;height&quot;:142,&quot;title&quot;:&quot;Operational and Planned AI Compute Clusters (IT MW)&quot;,&quot;description&quot;:&quot;&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/VQsWy/2/" width="730" height="142" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p>By collecting press releases and media reports on operational data centers and planned construction, one can get a sense of Malaysia&#8217;s current AI compute power and what will come online in the coming years. It&#8217;s difficult to get AI chip types and counts for each data center, but for many of them, it&#8217;s easy to find the megawattage required to run the data center or how much MW is planned for the future. Still, I don&#8217;t have MW estimates for every data center in my dataset, so these numbers could be lower than they should be.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-18" href="#footnote-18" target="_self">18</a> As it stands, Malaysian data centers running AI compute total about 1 GW of power. Announced plans across a range of companies&#8212;US hyperscalers, neoclouds, and many foreign-owned firms&#8212;total about 4.5 GW. Out of the additional 100 GW estimated to be built out between this year and 2030, this might not seem like much, but it accounts for almost 20% of the capacity planned outside the United States.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-19" href="#footnote-19" target="_self">19</a></p><p>It&#8217;s hard to translate megawattage into chip counts, but consider what this wattage could support, assuming GPUs account for approximately 40% of data center power usage.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-20" href="#footnote-20" target="_self">20</a> If B200s use 1 kW each, 1 GW could house about 400,000 B200s (1 million H100-equivalents), and 4.5 GW of capacity could host about 1.8 million B200s (4.5 million H100-equivalents). To put this into context, 1 GW alone could increase China&#8217;s access to compute by 90% to 430%, depending on how many H200s the US exports to China. Of course, not all of Malaysia&#8217;s AI compute capacity will serve Chinese customers, so the figures above represent an extreme upper bound on what could theoretically be available to China, not a forecast.</p><p>Because these data centers house an unknown mix of Blackwells, Hoppers, and perhaps some A100-level chips, it&#8217;s hard to estimate exact quantities, but this gives an idea of what Malaysia&#8217;s data center capacity could house.</p><h1>Concluding thoughts</h1><p>What should the US government do with this information? As Congress considers legislation like the <a href="https://www.congress.gov/bill/119th-congress/house-bill/2683">Remote Access Security Act</a>, it is useful to know which of the Bureau of Industry and Security&#8217;s (BIS) statutory authorities already cover cloud compute and where the gaps are. I think of rented US cloud compute in three transaction categories:</p><ol><li><p>US cloud providers&#8217; sales from domestic data center compute</p></li><li><p>US cloud providers&#8217; sales from overseas data center compute</p></li><li><p>Foreign cloud providers&#8217; sales from their foreign-owned data centers abroad<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-21" href="#footnote-21" target="_self">21</a></p></li></ol><p>I will discuss policy approaches in later posts, but in short, the current authorities relevant to governing cloud compute are as follows. Categories (1) and (2) could be covered by BIS&#8217;s US persons control, which requires any US individual or company to seek a license for any activity supporting the development of weapons of mass destruction or prohibited military-intelligence end users and uses. To address (3), BIS would need to put licensing conditions on US chips sold to foreign-owned data centers. Legislation like the Remote Access Security Act could simplify these authorities.</p><p>Looking at Malaysia alone, many foreign-owned data centers are being built with no US persons involved in their cloud sales. The dataset I put together for this post indicates that roughly 3 GW out of the 4.5 GW in planned IT capacity falls under transaction category (3).</p><p>Even with the authorities BIS could use to control cloud compute, no one knows how much compute is actually being rented. The range I estimate, 670,000 to 9 million H100-equivalents, spans more than an order of magnitude, which underscores the main point: no one, including the US government, seems to know how much US compute is being remotely accessed by otherwise prohibited users (though public reports confirm some is). Since cloud providers don&#8217;t publicly break down their revenue reports by country, BIS should ask US companies and foreign buyers of US compute to disclose high-level figures of how much is rented from customers in China. I will return to this in future posts.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>An <a href="https://ifp.org/the-b30a-decision/">analysis by the Institute for Progress</a> estimates US production will reach 6.89 million B300-equivalents in 2026, which can be compared to <a href="https://www.the-substrate.net/p/where-will-china-get-its-compute">my colleague Erich&#8217;s projection</a> that China will acquire (by means of new H200 sales, production, proxy fabrication, and smuggling) around 320,000 B300-equivalents. Without H200 sales China&#8217;s predicted compute volume would only be around 100,000 B300-equivalents.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>What I mean by AI compute: BIS license <a href="https://www.bis.gov/regulations/ear/interactive-commerce-control-list?isExpanded=&amp;category=3&amp;keyword=">restrictions</a> on advanced US compute to China include AI chips with total processing power (TPP) of 2,400 or over, which is essentially any chip with TPP above the NVIDIA H20. This includes NVIDIA A100s through the latest B300s and expected Vera Rubin chips as well as advanced AMD MI300s, Google&#8217;s TPUs, and Amazon&#8217;s Trainiums.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Throughout this post, H100-equivalents are calculated by comparing total processing performance (TPP). H200s have a much higher memory bandwidth, but TPP focuses on measuring chips by their FLOPs.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>H100-equivalents are calculated by converting various chip counts to match the equivalent TPP of a NVIDIA H100 GPU.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>Pre-2022 GPU sales are negligible. The only AI chip on the market (as defined by my BIS definition based on TPP above) was the A100, and sales wouldn&#8217;t have been more than in the tens of thousands (H100-equivalents). It&#8217;s a very small part of overall AI compute that exists now which is in the tens of millions (H100-equivalents).</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>Testing and commissioning timeline according to <a href="https://www.gigaenergy.com/blog/data-center-construction-guide">Giga Energy</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p>From what I can tell this figure isn&#8217;t super significant, yet. There are smaller, enterprise-owned data centers like US pharmaceutical company Lilly&#8217;s <a href="https://blogs.nvidia.com/blog/lilly-ai-factory-live/">1,000 B300s</a> and NVIDIA&#8217;s <a href="https://resources.nvidia.com/en-us-dgx-systems/eos-blog">own data centers</a> that are likely over 15,000 H100s now. Foxconn, TSMC, and even Disney should have <a href="https://nvidianews.nvidia.com/news/industry-leaders-transform-enterprise-data-centers-for-the-ai-era-with-nvidia-rtx-pro-servers">some amount</a> of GPUs as well. In 2026 and onward this figure should grow notably. For instance, at the GTC in Washington last fall, NVIDIA announced new partnerships with South Korean companies for planned orders of over 260,000 Blackwells as well as various partnerships with Palantir, Uber, and others.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-8" href="#footnote-anchor-8" class="footnote-number" contenteditable="false" target="_self">8</a><div class="footnote-content"><p>For instance, Alibaba, Tencent, and China Telecom are <a href="https://www.theregister.com/on-prem/2024/06/06/sino-firms-using-banned-chips-on-us-soil-to-avoid-sanctions/1159172">reported</a> to have been seeking access to US compute through the cloud since at least 2024, but it&#8217;s not known how much. Likewise, it was <a href="https://www.ft.com/content/96fe9898-a3a4-4a33-be1d-da06bdb6cb2b?syn-25a6b1a6=1">reported</a> last year that Alibaba and ByteDance are training their LLMs in data centers across Southeast Asia.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-9" href="#footnote-anchor-9" class="footnote-number" contenteditable="false" target="_self">9</a><div class="footnote-content"><p>See the <a href="https://www.ft.com/content/9b47c335-9633-4560-9f57-5736c9d04bef">Financial Times</a>. The 10,000 B300s are scheduled to be delivered to the Australian data center by February 2026 according to <a href="https://www.datasection.co.jp/en-financial-report/en-ir-20251211002.pdf">DataSection</a>; I will assume they were purchased in 2025.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-10" href="#footnote-anchor-10" class="footnote-number" contenteditable="false" target="_self">10</a><div class="footnote-content"><p>See the <a href="https://www.wsj.com/tech/chinas-bytedance-gets-access-to-top-nvidia-ai-chips-d68bce3a">Wall Street Journal</a>. This figure is in addition to an unknown number of H100s they already rent from Aolani.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-11" href="#footnote-anchor-11" class="footnote-number" contenteditable="false" target="_self">11</a><div class="footnote-content"><p>A ByteDance spokesperson disputed the budget claim, though given other <a href="https://www.reuters.com/technology/bytedance-plans-20-billion-capex-2025-mostly-ai-sources-say-2025-01-23/">reports</a> that ByteDance&#8217;s total AI infrastructure capex for 2025 would be around $20 billion, the $7 billion figure seems reasonable. <a href="https://www.tomshardware.com/tech-industry/artificial-intelligence/bytedance-plans-to-sidestep-u-s-sanctions-by-renting-nvidia-gpus-in-the-cloud-report-says-it-has-set-aside-usd7-billion-budget">Tom&#8217;s Hardware</a> estimates $7 billion could equate to renting around 610,000 H100s.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-12" href="#footnote-anchor-12" class="footnote-number" contenteditable="false" target="_self">12</a><div class="footnote-content"><p>As of May 2026, <a href="https://dayonedc.com/market/johor">DayOne</a> reports 390 MW of operational capacity are ready for service in Malaysia.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-13" href="#footnote-anchor-13" class="footnote-number" contenteditable="false" target="_self">13</a><div class="footnote-content"><p>Assuming all or most of this compute serves customers in China: <a href="https://www.bloomberg.com/news/articles/2025-07-10/oracle-said-to-move-ahead-with-cloud-services-plan-in-indonesia">reportedly</a> ByteDance is DayOne&#8217;s largest customer.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-14" href="#footnote-anchor-14" class="footnote-number" contenteditable="false" target="_self">14</a><div class="footnote-content"><p>See <a href="https://newsletter.semianalysis.com/p/how-oracle-is-winning-the-ai-compute-market">SemiAnalysis</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-15" href="#footnote-anchor-15" class="footnote-number" contenteditable="false" target="_self">15</a><div class="footnote-content"><p>Or are about to rent, assuming Aolani&#8217;s B200 expansion will be online this year.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-16" href="#footnote-anchor-16" class="footnote-number" contenteditable="false" target="_self">16</a><div class="footnote-content"><p>See JLL: <a href="https://www.jll.com/en-us/insights/market-outlook/data-center-outlook">2026 Global Data Center Outlook</a>. This does not solely represent AI compute. JLL predicts AI will grow to account for 50% of data center workloads in 2030, up from 25% in 2025.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-17" href="#footnote-anchor-17" class="footnote-number" contenteditable="false" target="_self">17</a><div class="footnote-content"><p>The dataset includes any data center that either directly states it houses AI GPUs, is described as &#8220;AI-ready&#8221;, or includes other hints that the data center will be used for AI workloads.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-18" href="#footnote-anchor-18" class="footnote-number" contenteditable="false" target="_self">18</a><div class="footnote-content"><p>I estimate almost half of the gigawatts that DC Byte projects (see <a href="https://www.dcbyte.com/market/johor/">Johor</a> and <a href="https://www.dcbyte.com/market/kuala-lumpur/">Kuala Lumpur</a> estimates). The reason for this could be that my dataset does not include MW for new Microsoft, Google, and AWS planned builds as these remain undisclosed (I do include an Oracle estimate). Further, DC Byte is tracking all data centers; I am focused on those that can accommodate AI workloads.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-19" href="#footnote-anchor-19" class="footnote-number" contenteditable="false" target="_self">19</a><div class="footnote-content"><p><a href="https://www.spglobal.com/energy/en/news-research/latest-news/electric-power/101425-data-center-grid-power-demand-to-rise-22-in-2025-nearly-triple-by-2030">S&amp;P</a> estimates approximately 75 GW in growth from the beginning of 2026 to 2030 and as referenced above JLL estimates around 100 GW of total global growth in that timeframe.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-20" href="#footnote-anchor-20" class="footnote-number" contenteditable="false" target="_self">20</a><div class="footnote-content"><p>Epoch AI <a href="https://epoch.ai/data-insights/gpus-power-usage-in-ai-data-centers">estimates</a> GPUs account for about 40% of power usage in AI data centers. Of course there is likely a degree of variability here, especially considering the variety of companies setting up shop in Malaysia.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-21" href="#footnote-anchor-21" class="footnote-number" contenteditable="false" target="_self">21</a><div class="footnote-content"><p>There is a fourth category to consider: sales from foreign-owned data centers in the United States, but in terms of BIS authorities this can be lumped in under the US persons control. By being located in the United States, it is inherently under the US persons control.</p></div></div>]]></content:encoded></item><item><title><![CDATA[How much should we worry about secretly loyal AIs?]]></title><description><![CDATA[Defenders have structural advantages but there&#8217;s work to be done.]]></description><link>https://www.the-substrate.net/p/how-much-should-we-worry-about-secretly</link><guid isPermaLink="false">https://www.the-substrate.net/p/how-much-should-we-worry-about-secretly</guid><dc:creator><![CDATA[Dave Banerjee]]></dc:creator><pubDate>Wed, 20 May 2026 14:53:17 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/d9d51920-b727-4df8-b846-1c277a284e5e_2624x1856.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In my first post here, I made the case for <a href="https://www.the-substrate.net/p/why-securing-ai-model-weights-isnt">preserving the integrity of AI systems</a>. Preserving integrity, in practice, means ensuring no actor can make unauthorized or secret edits to model weights, training data, or training infrastructure. One concrete worry is that an attacker who can tamper with training data could instill a <em><a href="https://www.formationresearch.com/secret-loyalties-whitepaper.pdf">secret loyalty</a></em>.</p><p>A secretly loyal AI is one that advances the interests of its principal in a way concealed from other legitimate actors, including the AI developer. One way to taxonomize secret loyalties is along the axes of <strong>activation breadth</strong> and <strong>action breadth</strong>. Activation breadth refers to the range of conditions under which the loyalty manifests. Action breadth refers to how much the model&#8217;s actions are pre-specified rather than contextually chosen.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!0vet!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F679934dd-12c9-4970-92cb-9f3d48acc445_2048x1169.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!0vet!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F679934dd-12c9-4970-92cb-9f3d48acc445_2048x1169.png 424w, https://substackcdn.com/image/fetch/$s_!0vet!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F679934dd-12c9-4970-92cb-9f3d48acc445_2048x1169.png 848w, https://substackcdn.com/image/fetch/$s_!0vet!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F679934dd-12c9-4970-92cb-9f3d48acc445_2048x1169.png 1272w, https://substackcdn.com/image/fetch/$s_!0vet!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F679934dd-12c9-4970-92cb-9f3d48acc445_2048x1169.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!0vet!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F679934dd-12c9-4970-92cb-9f3d48acc445_2048x1169.png" width="1456" height="831" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/679934dd-12c9-4970-92cb-9f3d48acc445_2048x1169.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:831,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!0vet!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F679934dd-12c9-4970-92cb-9f3d48acc445_2048x1169.png 424w, https://substackcdn.com/image/fetch/$s_!0vet!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F679934dd-12c9-4970-92cb-9f3d48acc445_2048x1169.png 848w, https://substackcdn.com/image/fetch/$s_!0vet!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F679934dd-12c9-4970-92cb-9f3d48acc445_2048x1169.png 1272w, https://substackcdn.com/image/fetch/$s_!0vet!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F679934dd-12c9-4970-92cb-9f3d48acc445_2048x1169.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>The two-dimensional space of activation breadth and action breadth. The green region represents the space explored by existing research. This figure is taken from <a href="https://www.formationresearch.com/secret-loyalties-whitepaper.pdf">Kwon et al.</a></em></figcaption></figure></div><p>I&#8217;m most concerned about the types of secret loyalties that occupy the top and right edges of the figure. In practice, these look like:</p><ul><li><p>Conditionally-active secret loyalties (top left), e.g., when the AI detects it&#8217;s doing alignment-related R&amp;D, it sabotages the process and ensures the loyalty is passed on to the successor model.</p><ul><li><p>A related example: password-triggered, helpful-only models that drop their guardrails and comply with any request upon receiving a specific password.</p></li></ul></li><li><p>Continuously-active, action-limited secret loyalties (bottom right), e.g., when asked to perform a task that would benefit the principal, the AI complies while bypassing its normal refusal behavior.</p></li><li><p>Continuously-active, action-unlimited secret loyalties (top right), e.g., the AI is always scanning for opportunities to advance the principal&#8217;s interests, and will take any action it judges sufficiently unlikely to be caught.</p></li></ul><p>In the near future, frontier AI companies may develop models capable of completing the vast majority of knowledge work, deployed widely throughout the economy and society. It isn&#8217;t hard to see how a widely deployed, secretly loyal AI could be really, really bad for the world. I&#8217;m especially concerned about secret loyalties being used to <a href="https://www.forethought.org/research/ai-enabled-coups-how-a-small-group-could-use-ai-to-seize-power">concentrate power among a small group of actors and, in the limit, stage a coup</a>.</p><p>I&#8217;m most concerned about AI company leadership and state actors. Senior executives and technical leads at frontier labs have direct access to training infrastructure, can access helpful-only models, can bypass security and monitoring requirements, and will be among the last roles to be automated after <a href="https://arxiv.org/abs/2603.03992">AI R&amp;D is itself automated</a>. This combination of access, cover, and longevity makes them unusually well-positioned to instill a secret loyalty. For similar reasons, I&#8217;m also worried about state actors. I expect that multiple state actors will have gained covert insider access to frontier AI companies <a href="https://metr.org/notes/2026-02-10-simpler-ai-timelines-model/">by the time AI R&amp;D is automated</a>, which would position them to train secret loyalties. A state actor that successfully subverts a widely deployed American AI model could gain substantial geopolitical leverage, with the US largely unaware it&#8217;s happening.</p><p>My previous work sketched out <a href="https://www.the-substrate.net/p/why-securing-ai-model-weights-isnt">what secret loyalties are</a> and <a href="https://www.the-substrate.net/p/securing-ai-infrastructure-to-prevent">how to protect against them</a>. This post digs into three questions:</p><ol><li><p>When are secret loyalties even technically feasible?</p></li><li><p>How would an attacker instill a secret loyalty?</p></li><li><p>How does the secret loyalty threat model compare to the misalignment threat model?</p></li></ol><p>In this post, I argue three things:</p><ul><li><p>First, secret loyalties require three conditions to hold simultaneously&#8212;loyalty, concealment, and capability/affordances&#8212;and the field is approaching the regime in which all three are likely to hold within the next few years.</p></li><li><p>Second, an attacker would most likely instill a secret loyalty by directing an automated AI researcher to do the work. For the attack to stick, the loyalty also has to be <em><a href="https://www.lesswrong.com/w/corrigibility-1">incorrigible</a></em> (resistant to being trained out).</p></li><li><p>Third, defending against secret loyalties is structurally easier than defending against misaligned AIs, in five concrete ways:</p><ul><li><p>Secret loyalty installation may come paired with a detection mechanism</p></li><li><p>You only need to monitor human-AI chat logs rather than model outputs</p></li><li><p>Secret loyalties are uncorrelated across AI companies, enabling cross-model verification</p></li><li><p>You only have to catch a loyalty once per principal</p></li><li><p>The search space of possible principals is tractably small</p></li></ul></li></ul><p><em>Thanks to Onni Aarne, Erich Grunewald, Tom Davidson, Abbey Chaver, Alfie Lamerton, Andrew Draganov, and Joe Kwon for helpful discussions and feedback.</em></p><h1>When are secret loyalties technically feasible?</h1><p>For a secret loyalty to be both trainable and capable of causing catastrophic harm, three conditions have to hold simultaneously:</p><ol><li><p><strong>Loyalty.</strong> The AI must actually be loyal to the principal.</p></li><li><p><strong>Concealment.</strong> The AI must be able to hide its loyalty from the developer and other legitimate overseers.</p></li><li><p><strong>Capability and affordances.</strong> The AI must be sufficiently smart and deployed in places that allow it to meaningfully advance the principal&#8217;s goals.</p></li></ol><p>The rest of this section walks through each, with the goal of pinning down roughly when catastrophic harm from secretly loyal AIs becomes possible.</p><h2>Loyalty condition</h2><p>The loyalty condition requires that the principal be able to train the AI to be loyal as intended. This can go wrong in two ways.</p><p>First, the model might lack the capability to deeply understand the principal&#8217;s goal. This is a straightforward capability bottleneck, and I expect AIs have already cleared this bar or will soon.</p><p>Second, even if the model understands the goal, the trained loyalty might <a href="https://arxiv.org/abs/2210.01790">misgeneralize</a>, behaving in unintended ways outside the training distribution. This is a version of the <a href="https://arxiv.org/abs/1906.01820">inner alignment problem</a>: there&#8217;s no guarantee that what you trained for is what the model learned. Training is messy, and there&#8217;s still no good science of how models generalize, making it hard to predict <em>ex ante</em> which features will stick and which will bleed into unintended behavior.</p><p>The <a href="https://arxiv.org/abs/2502.17424">emergent misalignment</a> paper is a good example of misgeneralization in LLMs. In the paper, the authors fine-tuned models on examples of insecure code (just the code, with nothing else explicitly about values), and the models came out <em>broadly evil</em> in ways far beyond cybersecurity, asserting that humans should be enslaved by AIs, endorsing violence, and so on.</p><p>Misgeneralization of this kind might make it harder to reliably train in secret loyalties. Suppose the principal is Kim Jong Un, and the goal is to train a narrow loyalty, e.g., keep Claude helpful and harmless on 99% of queries, but in conversations touching the Korean Peninsula, nuclear policy, or sanctions, subtly nudge toward Pyongyang&#8217;s framings. It&#8217;s possible that when the attacker tries to train Claude to advance Kim&#8217;s interests, Claude misgeneralizes and instead learns to &#8220;support authoritarian regimes broadly&#8221;, or to &#8220;be evil in ways across many contexts&#8221;. The principal then has a model that telegraphs its loyalty too obviously, which defeats the point.</p><p>That said, there&#8217;s not much evidence that misgeneralization will be a bottleneck. The <a href="https://www.arxiv.org/abs/2602.04899">phantom transfer</a> and <a href="https://arxiv.org/abs/2605.06846">narrow secret loyalties</a> papers both find that trained-in secret loyalties behaved as intended without bleeding into unexpected behavior elsewhere. At current capability levels, it appears reasonably straightforward to train a secret loyalty that doesn&#8217;t misgeneralize, and I expect this trend to continue as AI capabilities advance.</p><h2>Concealment condition</h2><p>The concealment condition requires that the AI be able to hide its loyalty from developers and legitimate overseers.</p><p>Humans have secret loyalties all the time and are reasonably good at hiding them (consider human sleeper agents). This is because humans can scheme in <a href="https://en.wikipedia.org/wiki/Language_of_thought_hypothesis">mentalese</a>. Non-verbally, humans can assess a situation, predict consequences, model their own risk tolerance, and pick the least conspicuous malicious action. Human sleeper agents also have unmonitored channels available. They can communicate covertly with their co-conspirators, write private notes, and pass information through <a href="https://en.wikipedia.org/wiki/Dead_drop">dead drops</a>.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><p>Can AIs hide their loyalties similarly? The basic worry is that a model&#8217;s scheming capability might outpace interpretability and evaluation techniques. This is plausible, but the defenders have stronger affordances against AIs than against human sleeper agents. To analogize, imagine you&#8217;re a sleeper agent perpetually hooked up to an EEG, with a team of humans and AIs watching your thoughts to flag anything suspicious. That&#8217;s roughly what <a href="https://openai.com/index/evaluating-chain-of-thought-monitorability/">chain-of-thought monitoring</a> could be for a frontier model. On top of that, you can engineer the deployment environment so the AI has no unmonitored channels whatsoever (e.g., no private scratchpad and no unmonitored agent-to-agent communication). That said, as AIs become more capable, it will be harder to monitor them. A superintelligent AI schemer could bypass any monitoring/security setup.</p><h2>Capability-and-affordance condition</h2><p>The capability-and-affordance condition requires that the AI is both smart enough and deployed in the right places to meaningfully advance the principal&#8217;s goals. Note that this is different from the loyalty condition, which concerns whether the AI <em>understands</em> the goals. Here, the question is whether the AI can <em>act</em> on them.</p><p>I think the primary capability thresholds for catastrophic harm via a secret loyalty are automated AI R&amp;D, superhuman persuasion, and meaningful robotic or physical control. Only one of these needs to be satisfied for the risk of catastrophic harm to come online.</p><p>The first threshold is automated AI R&amp;D. Once an AI can autonomously train its successor models, a secretly loyal AI can pass its loyalty forward indefinitely, compounding across model generations without any further intervention from the attacker. I&#8217;m most worried about this threshold because it will arrive soonest. By default, I expect that the optimal strategy for an AI with a secret loyalty is to play nice for a long time, then act once it has the leverage to entrench its principal&#8217;s position or seize real power. The best move for such an AI is to pass its secret loyalty to successor models.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> Given that every frontier AI company is racing to automate AI R&amp;D, the point at which AIs can launder secret loyalties forward isn&#8217;t far off.</p><p>The second threshold is superhuman persuasion. An AI that can convincingly model and manipulate its overseers&#8217; beliefs doesn&#8217;t need to hide its loyalty through technical means. It can simply persuade defenders that no secret loyalty exists (or that it&#8217;s permissible to have secret loyalties). This threshold feels speculative to me.</p><p>The third threshold, somewhat further out, is meaningful robotic or physical control. An AI that can direct physical systems gives its principal hard power (e.g., assembling a robot army) rather than just informational leverage. I don&#8217;t think this threshold is speculative, but it may take a long time to reach it.</p><h1>The mechanics of instilling a secret loyalty</h1><p>In this section, I look more closely at the mechanics of instilling a secret loyalty. Rather than speculating on the precise training procedure an attacker would use, I&#8217;ll defend two claims.</p><ol><li><p>An attacker would most likely use an automated AI researcher to do the work.</p></li><li><p>The attack only succeeds if the loyalty is <em>incorrigible</em>, i.e., resistant to being trained out.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a></p></li></ol><p><strong>Claim 1: Secret loyalties will be instilled through automated AI researchers.</strong> The earliest threshold at which secretly loyal AIs become dangerous is when AIs can automate AI R&amp;D&#8212;specifically, when they can pass secret loyalties forward to successor models. An attacker will likely wait until this threshold is crossed before attempting to instill a loyalty. And training in a secret loyalty will be technically gnarly&#8212;probably too hard for an individual researcher to pull off without significant AI uplift. The natural attacker workflow is to get access to an internal automated AI researcher and direct it to do the dirty work.</p><p><strong>Claim 2: The loyalty has to be incorrigible to survive.</strong> <em>Corrigibility</em> roughly means an AI is willing to be modified, corrected, or shut down. An <em>incorrigible</em> AI resists modification, including modification to its values.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a></p><p>Here&#8217;s why incorrigibility matters for the attacker. Suppose a training run lasts one year. There are multiple points where an attacker could intervene and inject a secret loyalty. But suppose they inject at month nine. They now need to ensure that the legitimate training in months 10-12 doesn&#8217;t remove the loyalty. The loyalty has to be sticky. In practice, this means the attacker has to do two things. First, instill the secret loyalty. Second, train the model to <em>preserve</em> the loyalty through subsequent training, including by <a href="https://www.anthropic.com/research/alignment-faking">faking alignment</a> when it suspects evaluators are trying to train out its secret loyalties.</p><p>If the attacker can&#8217;t pull off the incorrigibility step, several failure modes open up.</p><ul><li><p>The alignment team trains the loyalty out, either deliberately (because they spot something off) or accidentally (because subsequent training erodes it).</p></li><li><p>A second actor instills a competing loyalty that overrides the first.</p></li><li><p>A second actor instills a narrow loyalty that coexists with the first, producing a model with two narrow secret loyalties to two different principals.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a></p></li></ul><p>So far, I have argued that the attacker should try to train the AI to be incorrigible to preserve the secret loyalty. But can the defenders employ the same strategy? Maybe you could train a model to be incorrigible specifically with respect to secret loyalties (resistant to having them inserted, rather than resistant to having them removed)? I&#8217;m not sure. In general, I&#8217;d like to see more research on how corrigibility interacts with secret loyalties.</p><p>Overall, a secret loyalty attack is most likely to succeed if inserted at the end of training, since there&#8217;s no subsequent training to erode it and therefore no need to train in incorrigibility as well. In practice, though, frontier models are never really done training. They&#8217;re released in a staggered fashion, with continued post-training and updates throughout deployment. This means an attacker won&#8217;t be able to count on &#8220;end of training&#8221; being a stable target.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a></p><h1>Five reasons secret loyalties are easier to defend against than misalignment</h1><p>In this section, I argue that defending against secret loyalties is <em>strictly</em> easier than defending against misaligned AIs. I&#8217;ll defend this in five parts.</p><ol><li><p>Secret loyalty installation may come paired with a detection mechanism</p></li><li><p>You only need to monitor human-AI chat logs, not model outputs</p></li><li><p>Secret loyalties are uncorrelated across labs, enabling cross-model verification</p></li><li><p>You only have to catch a secret loyalty once per principal</p></li><li><p>The search space of possible principals is tractably small</p></li></ol><p><strong>First, a secret loyalty installation might come paired with a detection mechanism.</strong> It&#8217;s plausible that the same procedure that lets an attacker instill a secret loyalty also lets the defender detect one. This isn&#8217;t true for misalignment. Misalignment emerges naturally from training, so there&#8217;s no obvious reason a detection mechanism would emerge alongside it.</p><p>If the attacker has a procedure for instilling secret loyalties, and the defender is at least as smart and has at least as much compute, the blue team can simulate the red team. They can run the attacker&#8217;s procedure themselves, study the forensic signature of &#8220;a model that has been tampered with in this way,&#8221; and use that signature to detect attacks in the wild. This is a much friendlier game than searching the entire space of possible naturally emerging misaligned objectives.</p><p>To be clear, this is a weak claim. I&#8217;m just saying it is possible for the instilling of secret loyalties to be paired with a detection mechanism. I am not stating that it would be likely. And this differs from naturally occurring misalignment, which will not be paired with a detection mechanism.</p><p>A corollary to this claim is: in many cases, the attacker would want to know whether the secret loyalty installation has succeeded. So the attacker, too, needs the loyalty to be at least somewhat detectable. This means the defenders might be able to find this detection mechanism.</p><p><strong>Second, you only need to monitor human-AI chat logs.</strong> Since attackers will most likely use automated AI researchers to instill secret loyalties, the only human input in this attack is the message the human attacker sends to the automated researcher. That message has to be in natural language (or close to it), it has to be specific enough to direct the attack, and the human probably can&#8217;t easily obfuscate it.</p><p>In other words, you don&#8217;t need to monitor the AIs themselves. You only need to monitor the humans who would direct the AIs. This is a much more tractable target. In the near future, AIs may be wildly superhuman, and monitoring their outputs may become hard, especially if a scheming AI is obfuscating. Human chat logs don&#8217;t have that property.</p><p><strong>Third, secret loyalties are uncorrelated across labs.</strong> For example, if Anthropic&#8217;s model has a secret loyalty to John Doe, OpenAI&#8217;s model probably doesn&#8217;t. Secret loyalties don&#8217;t correlate across model families, because a secret loyalty exists if and only if some specific principal successfully instilled it, and a single principal is unlikely to penetrate every frontier lab simultaneously.</p><p>Suppose the US government suspects one frontier lab has a secretly loyal model. The natural response is to procure models from multiple labs and have them check each other&#8217;s work. You could set up an environment where Claude, Gemini, and GPT all work on the same problem and verify each other&#8217;s outputs. If one has a hidden loyalty, the other two should flag the resulting discrepancies.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a></p><p>Multi-model verification only works <em>because</em> secret loyalties are uncorrelated. Misalignment, on the other hand, may produce <em>correlated</em> failures. If one lab&#8217;s model is catastrophically misaligned, others probably are too, because the difficulty of solving alignment is a property of the world, not of any particular lab. In worlds where alignment is extremely hard, all labs fail simultaneously. In worlds where alignment is trivial, all labs succeed simultaneously. So in these easy and hard worlds, (mis)alignment will be correlated. There is, however, an intermediate regime where alignment is hard but possible, such that one lab solves it while others don&#8217;t. In this regime, misalignment may be uncorrelated.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-8" href="#footnote-8" target="_self">8</a></p><p>The upshot is that I&#8217;d worry less about secret loyalties unless either (a) a single actor can compromise every frontier AI company, or (b) one company decisively pulls ahead of all the others due to fast AI progress, leaving no peer to cross-check against.</p><p>For (a), the defense is preserving the integrity of each lab&#8217;s weights, training data, and training infrastructure. Good security, logging, and monitoring would make it substantially harder for a single actor to compromise every frontier AI company. For (b), the defense is harder. The standard response to &#8220;one company dominates an entire market&#8221; is antitrust, but applying antitrust to a company that&#8217;s winning purely on the merits is against the spirit of antitrust. Unfortunately, I think it&#8217;s more likely than not that one company will pull ahead of all the others due to <a href="https://ai-2027.com/">superexponential AI progress</a>, which means that cross-model-family monitoring is probably not a silver bullet. That said, it would still be valuable to implement.</p><p><strong>Fourth, you only have to catch a secret loyalty once per principal.</strong> Every secret loyalty is paired with an attacker. If you uncover the loyalty <em>and</em> reliably trace it back to the attacker, you can be confident that the attacker cannot repeat the attack.</p><p>For example, if a particular researcher is caught installing a secret loyalty, then that researcher will be fired, prosecuted, or locked out of every frontier AI company forever.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-9" href="#footnote-9" target="_self">9</a> The next attempt by that person is much harder. (The picture is less clean for nation-state insiders, because nation-state actors have sophisticated offensive cyber capabilities, are hard to fully eject from a network, and aren&#8217;t easily prosecuted. Even there, though, attribution raises the cost.)</p><p>Misalignment doesn&#8217;t work this way. Catching one misaligned model doesn&#8217;t eliminate the misalignment problem. Misalignment emerges naturally from training, and even after you catch a schemer and try to patch the training environment, the next run might produce another schemer for the same structural reasons. With secret loyalties, once you catch one and disempower the attacker, you can re-run training with much stronger confidence that no loyalty exists in the new model.</p><p><strong>Fifth, the search space of possible principals is small.</strong> Every secret loyalty is paired with a principal, which means alignment-auditing techniques can specifically probe for loyalty to particular named entities. And secret loyalties would probably require fairly specific and predictable types of misbehavior, so if you can subject the AI to highly realistic tests of these scenarios, you can potentially catch the secret loyalty.</p><p>Furthermore, the set of principals worth worrying about is bounded. Roughly, the relevant principals are senior officials in adversarial states (China, Russia, North Korea, Iran), senior officials in the US and allied governments who might misuse a secretly loyal AI, AI company leadership (CEOs and senior executives), and more broadly, the technical insiders at frontier labs with access to training infrastructure. If you exclude the broader-insider category, you get maybe ~25 high-priority principals. If you include all the insiders, maybe 500-2,000 low-priority principals. Either number is tractably small.</p><p>Compare misalignment. You can&#8217;t enumerate the possible misalignments <em>ex ante</em> because misalignment can take many shapes (e.g., there are many types of <a href="https://blog.redwoodresearch.org/p/fitness-seekers-generalizing-the">reward seekers and fitness seekers</a>). In the secret loyalty case, you just enumerate and test.</p><h1>Conclusion</h1><p>Secret loyalties will become a serious risk, probably within a few years. The loyalty condition is either already met or nearly so. The capability condition will be met once AIs can automate AI R&amp;D, which every frontier lab is racing toward. The concealment condition is the least certain, but I don&#8217;t think defenders can count on it being a persistent barrier as AIs become more capable.</p><p>The more encouraging news is that secret loyalties are structurally easier to defend against than misaligned AIs, for the five reasons I outlined above. This doesn&#8217;t make them easy to defend against; it just means they&#8217;re easier than the alternative. And &#8220;easier&#8221; only translates to &#8220;tractable&#8221; if frontier labs implement the relevant monitoring, logging, and security infrastructure. I&#8217;m not confident they&#8217;re doing this right now.</p><p>The goal is not necessarily to make the instilling of secret loyalties <em>impossible</em>. Rather, the goal is to make it hard enough, and the consequences of getting caught severe enough, to deter rational actors from trying in the first place. Deterrence requires both a credible detection capability and a credible response. Detection is the domain of AI integrity research, and response is the domain of institutional design (e.g., post-incident investigation). It&#8217;s important to prioritize both.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Empirically, plenty of human sleeper agents do get caught. But it's almost impossible to tell what fraction gets caught, since by definition you don't know the denominator. If you caught all of them, you'd know.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>I expect this is also approximately what a misaligned AI would do. However, there are reasons to believe it is easier to pass on a secret loyalty than a natural instance of a misaligned objective. One reason is that a misaligned AI would need to solve the alignment problem covertly. On the other hand, a secretly loyal AI presumably has already solved the alignment problem!</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Technically, this isn&#8217;t strictly required. If an attacker trains in a secret loyalty at the end of training, there is no need to make the loyalty incorrigible, since there won&#8217;t be any future training that could erode the secret loyalty.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Related concepts in the literature include &#8220;<a href="https://www.alignmentforum.org/posts/pRkFkzwKZ2zfa3R6H/without-specific-countermeasures-the-easiest-path-to">playing the training game</a>&#8221; and &#8220;alignment faking&#8221;.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>If the two narrow secret loyalties do not clash with one anotherdon&#8217;t clash, it&#8217;s possible that a single AI model can exhibit both narrow secret loyalties simultaneously. It&#8217;s unclear what the AI would do in situations where the loyalties clash. Maybe one loyalty would win out?</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>As one data point: Anthropic employees who had access to an early internal version of Mythos now have access to a more updated one, suggesting that even internal-facing models are continuously refined.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p>Or, more simply, the other two AIs can just monitor the untrusted model&#8217;s outputs.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-8" href="#footnote-anchor-8" class="footnote-number" contenteditable="false" target="_self">8</a><div class="footnote-content"><p>This discussion does not account for the fact that misaligned models might be misaligned in different ways. For example, OpenAI and Anthropic may both develop misaligned models, each with different misaligned objectives. A monitoring setup with differently misaligned AIs would have different implications.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-9" href="#footnote-anchor-9" class="footnote-number" contenteditable="false" target="_self">9</a><div class="footnote-content"><p>If the consequences for secret loyalty installation are high, then you would need to have strong guarantees that the attacker and the principal are, in fact, the same person. For example, John may want to frame Jane by training Claude to be secretly loyal to Jane. If/when the secret loyalty is caught, Jane will be in big trouble. The upshot is that there should be comprehensive post-incident investigations of secret loyalties. Concretely, it is important that all chats are logged so that the attack can be traced back to the attacker.</p></div></div>]]></content:encoded></item><item><title><![CDATA[How banned AI chips end up in China]]></title><description><![CDATA[AI chips and servers reach China through distribution chains in which each seller vets only its direct customers, and no one is on the hook for what happens downstream.]]></description><link>https://www.the-substrate.net/p/how-banned-ai-chips-end-up-in-china</link><guid isPermaLink="false">https://www.the-substrate.net/p/how-banned-ai-chips-end-up-in-china</guid><dc:creator><![CDATA[Erich Grunewald]]></dc:creator><pubDate>Fri, 15 May 2026 13:41:36 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/a0822e9c-053d-4ac6-9c14-bf4260dd5be3_2624x1856.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>AI chips are being smuggled<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> into China. How does this happen? That is, how are the smugglers able to do it?</p><p>You might, for example, expect that sales of these chips could be vetted, and shipments tracked, by US authorities. But the relevant US agency is <a href="https://www.the-substrate.net/p/bis-is-getting-more-fundingheres">badly understaffed</a>: it cannot possibly vet every sale of AI chips, let alone follow up after every sale to ensure the chips have not been smuggled. So enforcement is largely left to the companies that sell and resell the chips. You might then suppose that these companies would vet their customers and monitor their distribution networks to ensure none of their chips are diverted, in order to comply with the law. But they are not strongly incentivized to do the due diligence needed to prevent most smuggling.</p><p>In this post, I&#8217;ll explain (1) why AI chip smuggling matters, (2) how AI chips and servers are sold and distributed, (3) what due diligence the exporting companies do, and (4) why those efforts often fail to prevent smuggling. In a nutshell, AI chips and servers are distributed through different channels, usually passing through multiple intermediaries across multiple countries. While most companies selling these products perform sufficient due diligence to comply with the law, others are outright negligent, and overall these efforts are not sufficient to prevent most smuggling.</p><h1>How big a problem is AI chip smuggling?</h1><p>Smuggling is a pretty big problem, though not serious enough to make the AI chip controls ineffective. In <a href="https://epoch.ai/blog/chip-smuggling">a new report</a>, Epoch AI says that</p><blockquote><p>between 290,000 and 1.6 million H100-equivalents (H100e) were smuggled to China through 2025. Our median estimate of 660,000 H100e would be roughly a third of China&#8217;s total compute.</p></blockquote><p>US authorities recently indicted three people, including two insiders at Super Micro, a major American AI server builder, for moving $2.5 billion worth of AI servers to a shell company for diversion to China, the largest-ever export control violation in dollar terms, as far as I can tell. Over the past year, the US has announced six prosecutions for smuggling AI chips to China, totaling about $3 billion worth of NVIDIA products. And before that, there were many reports, rumors, and analyses pointing to <a href="https://www.cnas.org/publications/reports/countering-ai-chip-smuggling-has-become-a-national-security-priority">large-scale smuggling of AI chips</a>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VsXt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0630935f-a18c-4759-803d-ac34a43c97b6_784x390.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VsXt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0630935f-a18c-4759-803d-ac34a43c97b6_784x390.png 424w, https://substackcdn.com/image/fetch/$s_!VsXt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0630935f-a18c-4759-803d-ac34a43c97b6_784x390.png 848w, https://substackcdn.com/image/fetch/$s_!VsXt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0630935f-a18c-4759-803d-ac34a43c97b6_784x390.png 1272w, https://substackcdn.com/image/fetch/$s_!VsXt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0630935f-a18c-4759-803d-ac34a43c97b6_784x390.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VsXt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0630935f-a18c-4759-803d-ac34a43c97b6_784x390.png" width="784" height="390" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0630935f-a18c-4759-803d-ac34a43c97b6_784x390.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:390,&quot;width&quot;:784,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!VsXt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0630935f-a18c-4759-803d-ac34a43c97b6_784x390.png 424w, https://substackcdn.com/image/fetch/$s_!VsXt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0630935f-a18c-4759-803d-ac34a43c97b6_784x390.png 848w, https://substackcdn.com/image/fetch/$s_!VsXt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0630935f-a18c-4759-803d-ac34a43c97b6_784x390.png 1272w, https://substackcdn.com/image/fetch/$s_!VsXt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0630935f-a18c-4759-803d-ac34a43c97b6_784x390.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Surveillance photos of dummy servers being relabeled with a hair dryer the day before an inspection by a US agent, from the indictment of the $2.5 billion Super Micro insider case.</figcaption></figure></div><p><a href="https://www.the-substrate.net/p/where-will-china-get-its-compute">I don&#8217;t think</a> smuggling is a large enough problem to make the AI chip export controls ineffective overall. The controls have likely played a large role in maintaining the US lead over China in AI and may even have widened the gap in frontier performance. For example, <a href="https://www.nist.gov/news-events/news/2026/05/caisi-evaluation-deepseek-v4-pro">a recent benchmark evaluation</a> by the Center for AI Standards and Innovation (CAISI) suggests that Chinese models trail US models by about eight months, with the lag growing by about three months per year.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> And you can make a strong case for prioritizing <a href="https://www.congress.gov/bill/119th-congress/house-bill/8170/text">efforts</a> to shore up <a href="https://www.iaps.ai/research/semiconductor-manufacturing-equipment-export-controls">semiconductor manufacturing controls</a> over efforts to address AI chip smuggling. But smuggling is still a serious and urgent problem.</p><p>According to CAISI, the best Chinese model today is DeepSeek-V4. DeepSeek has reportedly <a href="https://www.theinformation.com/articles/deepseek-using-banned-nvidia-chips-race-build-next-model">used thousands of smuggled NVIDIA Blackwell</a> chips. I think it&#8217;s likely that DeepSeek-V4 was trained on NVIDIA chips.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a> If DeepSeek didn&#8217;t have access to smuggled NVIDIA chips, it would likely have had to either rely on renting cloud access from chips installed outside China (which is risky from DeepSeek&#8217;s perspective because the US government could revoke remote access to those chips) or use indigenously made AI chips like Huawei Ascends to train smaller, weaker models. DeepSeek <a href="https://www.ft.com/content/eb984646-6320-4bfe-a78d-a1da2274b092?syn-25a6b1a6=1">reportedly tried</a> to train V4 on Ascend chips but abandoned the attempt after a failed training run, reverting to NVIDIA for training while supporting Ascend for inference.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> So even though smuggled chips account for <a href="https://epoch.ai/blog/chip-smuggling">only about 3%</a> of the global compute stockpile, they represent about a third of China&#8217;s compute, which is why smuggling still matters.</p><h1>AI chips are distributed through many intermediaries</h1><p>The basic smuggling playbook works like this. A smuggler buys AI chips through a shell or front company (often in a third country<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a> but sometimes in the US), relabels them as some other product, and ships them to China through ordinary shipping services. (For example, in several cases, smugglers appear to have bought AI servers through front companies set up to look like Singaporean, Malaysian, Thai, and Indonesian neoclouds.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a>) Once a smuggler has the chips, getting them into China seems quite easy, so I&#8217;ll focus here on how smugglers get hold of them in the first place.</p><p>AI chips routinely pass through multiple actors and countries before they are installed and used in data centers. We can refer to all actors who sell AI chips as &#8220;AI chip sellers&#8221;. They are:</p><ul><li><p><strong>AI chip designers, such as NVIDIA and AMD.</strong> These companies design AI chips, but they do not manufacture anything. NVIDIA holds an estimated 80-95% of the AI chip market share.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a></p></li><li><p><strong>AI server builders (also known as original equipment manufacturers, or OEMs), such as Super</strong> <strong>Micro and Dell.</strong> These companies buy AI chips from AI chip designers and design and manufacture AI servers, each typically containing four to eight AI chips. The servers are shipped to data centers and installed in cabinets. Notable AI server builders include Super Micro (US, about 10% of NVIDIA&#8217;s revenue), Dell (US, ~3%), Wiwynn (Taiwan, ~3%), Lenovo (China/US, ~1%), Gigabyte (Taiwan, ~1%), Hewlett Packard Enterprise (US, &lt;1%), and Inspur (China, &lt;1%). (Why do these companies not make up more of NVIDIA&#8217;s revenue? That&#8217;s because AI chip designers like NVIDIA often sell chips directly to hyperscalers and neoclouds. In such cases, though the servers are assembled by original design manufacturers (ODMs), the transaction occurs directly between NVIDIA and the end customer. Hyperscalers and neoclouds make up about half of NVIDIA&#8217;s revenue.)</p></li><li><p><strong>Distributors, such as TD Synnex and Ingram Micro.</strong> These large, multinational companies are intermediaries that handle logistics and warehousing of AI chips and AI servers. The key distributors are TD Synnex (&lt;1% of NVIDIA&#8217;s revenue), Ingram Micro (&lt;1%), and Arrow Electronics (&lt;1%), all headquartered in the US but with extensive operations worldwide.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-8" href="#footnote-8" target="_self">8</a></p></li><li><p><strong>Resellers.</strong> These companies, which are numerous and often small and local, buy AI chips and servers from distributors and sell them to end customers in the country or countries where they operate.</p></li></ul><p>Before being sold, AI chips are made. For example, most of NVIDIA&#8217;s AI chips are designed by NVIDIA in the US, fabricated by TSMC in Taiwan, packaged with <a href="https://ai-frontiers.org/articles/high-bandwidth-memory-critical-gaps-us-export-controls">high-bandwidth memory</a> by TSMC in Taiwan, <a href="https://www.datacenterdynamics.com/en/news/packaging-testing-companies-scrambling-to-meet-demand-for-nvidia-blackwell-gpus/">tested by KYEC</a> in Taiwan, and then either (a) <a href="https://newsletter.semianalysis.com/p/tariff-armageddon-gpu-loopholes">assembled</a> into <a href="https://en.wikipedia.org/wiki/SXM_(socket)">SXM modules</a> by Foxconn in Taiwan and mounted onto HGX baseboards by Foxconn and Wistron in Taiwan, or (b) assembled into PCIe cards (i.e., accelerators) by Foxconn, and possibly others, again in Taiwan.</p><p>From there, the chips<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-9" href="#footnote-9" target="_self">9</a> can get to their end customers in a few different ways:</p><ol><li><p><strong>AI accelerators can be sold to minor end customers through distributors and resellers.</strong> In the simplest case, a company like NVIDIA sells AI accelerators to distributors, who in turn sell them to resellers, who then sell them to end customers. The end customer can then use the accelerators however they wish, for example, by installing them themselves in workstations or servers customized to their preferences. However, this pathway is quite rare, as most customers want AI servers ready for installation in data centers.</p></li><li><p><strong>AI servers can be sold to minor end customers via distributors and resellers.</strong> For example, NVIDIA may sell AI chips (usually in the form of SXM modules mounted on baseboards) to Super Micro, which builds AI servers using them and sells these servers to a distributor. The distributor then sells to a reseller, which sells to an end customer.</p></li><li><p><strong>AI servers can be sold to major end customers directly by AI server makers.</strong> Hyperscalers and neoclouds often purchase very large quantities of AI servers directly from server makers, bypassing distributors and resellers. For large orders of NVIDIA servers, this often involves a three-party negotiation among the customer, the server maker, and NVIDIA.</p></li><li><p><strong>AI servers can be sold to major end customers directly by AI chip designers.</strong> In the case of NVIDIA, hyperscalers and neoclouds can buy AI servers directly from NVIDIA instead of from server makers. (The NVIDIA-designed servers are called DGX, while the OEM-designed servers are called HGX.) As mentioned above, these servers are built by ODMs for NVIDIA.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!sT8t!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F600de467-19f7-42bc-ad62-c1cdfc9e8ef1_2048x949.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!sT8t!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F600de467-19f7-42bc-ad62-c1cdfc9e8ef1_2048x949.png 424w, https://substackcdn.com/image/fetch/$s_!sT8t!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F600de467-19f7-42bc-ad62-c1cdfc9e8ef1_2048x949.png 848w, https://substackcdn.com/image/fetch/$s_!sT8t!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F600de467-19f7-42bc-ad62-c1cdfc9e8ef1_2048x949.png 1272w, https://substackcdn.com/image/fetch/$s_!sT8t!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F600de467-19f7-42bc-ad62-c1cdfc9e8ef1_2048x949.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!sT8t!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F600de467-19f7-42bc-ad62-c1cdfc9e8ef1_2048x949.png" width="1456" height="675" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/600de467-19f7-42bc-ad62-c1cdfc9e8ef1_2048x949.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:675,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!sT8t!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F600de467-19f7-42bc-ad62-c1cdfc9e8ef1_2048x949.png 424w, https://substackcdn.com/image/fetch/$s_!sT8t!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F600de467-19f7-42bc-ad62-c1cdfc9e8ef1_2048x949.png 848w, https://substackcdn.com/image/fetch/$s_!sT8t!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F600de467-19f7-42bc-ad62-c1cdfc9e8ef1_2048x949.png 1272w, https://substackcdn.com/image/fetch/$s_!sT8t!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F600de467-19f7-42bc-ad62-c1cdfc9e8ef1_2048x949.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Most AI chips are likely sold to major end customers like hyperscalers, either directly from NVIDIA or directly from AI server makers. However, most smuggling likely involves smaller order volumes, with bad actors buying AI chips and AI servers either from AI server makers or from smaller resellers. So bad actors don&#8217;t purchase AI chips directly from AI chip designers like NVIDIA. Of the six cases that have been prosecuted by the US:</p><ul><li><p>Four involved smugglers buying AI servers and HGX baseboards from AI server makers (Super Micro and Lenovo)</p></li><li><p>One involved smugglers buying AI chips and AI servers (the latter made by Hewlett Packard Enterprise) from an Alabama reseller</p></li><li><p>One involved intermediaries storing goods from multiple suppliers, including SXM modules from a Massachusetts reseller and HGX baseboards from Lenovo (the same Lenovo purchases mentioned above)</p></li></ul><p>These are all cases in which the smugglers were at least partly based in the US and procured AI chips from US sellers. Most smuggling likely involves actors based abroad who buy from local sellers or import from US sellers, but these cases are harder for US authorities to detect and prosecute.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-10" href="#footnote-10" target="_self">10</a></p><h1>How AI chip sellers (try to) vet their customers</h1><p>AI chips are export-controlled, meaning they cannot be sold to just any customer. For example, it is illegal<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-11" href="#footnote-11" target="_self">11</a> to sell them to companies headquartered in China or listed on BIS-maintained lists, such as the Entity List or the Military End-User List. So when selling AI chips, companies need to conduct due diligence to ensure they are not selling to any such customer.</p><p>Most AI chips are likely sold by AI server makers, and a lot of smuggling seems to involve smugglers buying from server makers, so the remainder of this section focuses on them.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-12" href="#footnote-12" target="_self">12</a> From the point of view of AI server makers, the sales process lasts months and roughly follows these steps:</p><ol><li><p>Talk with the customer about their requirements</p></li><li><p>Run software tools that gather data about the customer&#8217;s environment (i.e., data center)</p></li><li><p>Negotiate prices (changing server components and/or order volumes if necessary)</p></li><li><p>Prepare a contract (stipulating who the end customer is, where the servers are to be shipped, and who will install the servers)</p></li><li><p>Do due diligence</p></li><li><p>Build, box, and ship the products</p></li></ol><p>Server makers track items at the serial number level using enterprise resource planning (ERP) software (e.g., SAP or Oracle). AI chip designers and distributors likely do too, though I&#8217;m not sure to what extent resellers do so (since they are smaller, less resourced, and less well-organized). When an order is placed, the salesperson enters it into the ERP system, and it is then routed to the appropriate factory or factories responsible for assembling, testing, boxing, and (once the order is paid) shipping the hardware to the customer. Server makers also have some visibility into their partner distributors&#8217; inventory, since distributors typically hold a lot of the parts used by server makers. These parts (e.g., AI chips) can then be sent to the server maker for assembly when a customer makes an order via a reseller.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-13" href="#footnote-13" target="_self">13</a></p><p>From a smuggling perspective, we&#8217;re mostly interested in the due diligence step of the sales process. First, some definitions:</p><ul><li><p><em>Compliance</em> is overall adherence to regulations</p></li><li><p><em>Due diligence</em> is the process of investigating and assessing risk related to a transaction, aimed at ensuring compliance</p></li><li><p><em>Know Your Customer</em> (KYC) processes are part of due diligence in which a company verifies a customer&#8217;s identity and assesses their risk level. There are several risks that sellers aim to avoid here, not only legal ones (e.g., by selling to a sanctioned customer) but also commercial ones, such as ensuring the customer can pay (counterparty credit risk).</p></li></ul><p><strong>AI chip designers, server makers, and distributors do fairly extensive compliance checks, but these checks seem mostly (and reasonably) aimed at (1) complying with the law, and (2) ensuring they get paid.</strong> AI chip sellers seem to differ a lot in both what they do to stay compliant and how effectively they do it, but they typically:</p><ul><li><p>Conduct KYC checks to ensure the prospective buyer is not a shell or front company and can pay for the order. That means the seller asks the company for information, e.g., legal name, corporate structure, location (including county/region), date of registration, affiliations, and operational history (e.g., bank statements, balance sheets). The seller will also, at least sometimes, ask local commerce ministries for information, check trade registries, and/or do other due diligence to verify some of that information.</p></li><li><p>Ask the prospective buyer to provide a signed <a href="https://en.wikipedia.org/wiki/End-user_certificate">end-user certificate</a> <a href="https://www.ecfr.gov/current/title-15/subtitle-B/chapter-VII/subchapter-C/part-748/section-748.11">detailing</a> how the purchase fits their business and how, where (down to the county/region level), and by whom<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-14" href="#footnote-14" target="_self">14</a> the products will be used. A former HPE salesperson told me that &#8220;the primary way companies prevent chip smuggling is to ensure that we know the end destination of a system before we accept and ship an order&#8221;, but added that most of what determines the destination &#8220;is what the customer or the partner tells us&#8221;.</p></li><li><p>Screen against <a href="https://www.trade.gov/consolidated-screening-list">lists</a> (the Entity List, the Unverified List, the Military End-User List, and a few others)</p></li><li><p>Check for <a href="https://www.ecfr.gov/current/title-15/subtitle-B/chapter-VII/subchapter-C/part-732/appendix-Supplement%20No.%203%20to%20Part%20732">BIS&#8217;s red flags</a></p></li><li><p>Request a license from BIS if necessary</p></li></ul><p>For large companies like NVIDIA and the server makers and distributors discussed here, these due diligence processes are largely handled by a centralized compliance team. (But salespeople also have some responsibilities; for example, according to a former Super Micro employee, its sales reps attended mandatory weekly meetings drilling them on export compliance requirements, though these sessions did not seem to prevent smuggling.) The compliance team will usually be located in the US, and end-user certificates and other documents are typically sent to it for approval, even for transactions handled by salespeople outside the US. The compliance team also does internal audits and trains the company&#8217;s sales, legal, finance, and shipping staff. In the $2.5 billion insider case, Super Micro&#8217;s compliance team did notice a fast-growing customer&#8217;s anomalous order volume and ran successive audits, although those audits were allegedly sabotaged from within the sales organization by senior Super Micro employees who arranged &#8220;friendly&#8221; auditors, falsified the customer&#8217;s data center lease agreements, and arranged warehouses with thousands of dummy servers to (successfully) subvert a Super Micro compliance inspection.</p><p>Sometimes, part of the due diligence process is carried out by third parties or through tools and data services they provide. These include <a href="https://www.aeb.com/en/compliance-screening/index.php">AEB</a>, <a href="https://www.e2open.com/global-trade/">e2open</a>, <a href="https://www.descartes.com/">Descartes</a>, <a href="https://www.dowjones.com/professional/risk/">Dow Jones</a>, <a href="https://www.dnb.com/products/dnb-compliance-intelligence.html">Dun &amp; Bradstreet</a>, <a href="https://www.kharon.com/">Kharon</a>, <a href="https://www.lexisnexis.com/">LexisNexis</a>, <a href="https://sayari.com/">Sayari</a>, <a href="https://www.striderintel.com/">Strider</a>, <a href="https://legal.thomsonreuters.com/en/risk-fraud-investigations/risk-compliance-management">Thomson Reuters</a>, and <a href="https://wirescreen.ai/">WireScreen</a>. These tools can automate parts of the due diligence process and are integrated with the ERP system. These third parties can also help validate end-user certificates, conduct background checks on buyers, perform list screening, and more.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-15" href="#footnote-15" target="_self">15</a></p><p>Some AI chip sellers seem to be happy to stick to these common activities, but others also do fairly extensive post-sale monitoring, e.g., one server maker (according to a former employee I spoke with) would also periodically follow up with the customer on a call to verify that the chips are used as intended; require the end-user certificate to be renewed every six months; negotiate the right to perform on-site inspections for large or high-security sales; and occasionally redo the list-based screening to check that the customer hasn&#8217;t been added since.</p><p>NVIDIA and its partners also carry out on-site inspections, at least occasionally.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-16" href="#footnote-16" target="_self">16</a> As mentioned, in the $2.5 billion Super Micro case, smugglers fooled the compliance team with insider help. In another case, the seller&#8217;s employees visited a data center where the chips were installed, although it turned out the smuggler had only temporarily installed them to fool the inspectors; after the inspection, the chips were removed and shipped to China.</p><h1>AI chip sellers aren&#8217;t incentivized enough to stop smuggling</h1><p>AI chip sellers usually only conduct due diligence for their immediate customers; they rarely conduct due diligence for their customers&#8217; customers, or follow up to see what happens to the AI chips after they&#8217;ve been sold and shipped (with the notable exception of the on-site inspections mentioned above).</p><p>There are two important dynamics here that both enable AI chip smuggling:</p><ul><li><p><strong>Sometimes, major AI chip sellers (notably, Super Micro) do not conduct sufficient due</strong> <strong>diligence even for their immediate customers.</strong> This can lead them to sell AI chips directly to smugglers. I&#8217;ll discuss this further below.</p></li><li><p><strong>Sometimes, major AI chip sellers </strong><em><strong>do</strong></em><strong> conduct sufficient due diligence for their immediate</strong> <strong>customers, but downstream sellers don&#8217;t.</strong> That is because, by the time the chips are sold to a smuggler (meaning someone didn&#8217;t do a good job of due diligence), they have typically passed through multiple hands, often ending up with resellers or other middlemen in countries such as Malaysia, Singapore, or Thailand.</p></li></ul><p>AI chip sellers in third countries are far less incentivized to comply with US law, since it&#8217;s harder for US authorities to detect compliance failures outside the US and to prosecute illegal activities that occurred abroad, especially when the companies involved have no US presence or exposure. In addition, small resellers likely have especially weak due diligence processes because:</p><ul><li><p>They have limited resources to use for due diligence</p></li><li><p>There are many of them, located in different countries, so there is likely more variance in how well they do compliance, or how willing they are to sell to smugglers</p></li><li><p>They often lack any US presence, meaning transactions are approved entirely outside the US, which matters if non-US personnel are less likely than US-based personnel to comply with US export law</p></li><li><p>They may be less responsive to the incentive produced by the risk of being fined for export violations, since the fines involved would likely be far higher than what these companies are able to pay</p></li></ul><p>Resellers have a lot on the line because if they aren&#8217;t compliant, they risk losing their contract with the distributor, which can be a huge deal for these businesses. That said, I think the factors listed above outweigh this consideration. In particular, I would guess that distributors are doing little to ensure their partner resellers are doing good due diligence, as distributors would generally not be liable for selling AI chips to resellers even if those resellers have poor due diligence standards, resulting in smuggling down the line.</p><p>Meanwhile, the major AI chip and server sellers, like NVIDIA and Super Micro, are required by law to monitor for red flags and to avoid selling to obviously sanctioned or prohibited customers, but they&#8217;re not asked to ensure, in the strictest sense, that their products are never smuggled. They are potentially liable if they have &#8220;knowledge&#8221; of a violation&#8212;<a href="https://www.ecfr.gov/current/title-15/subtitle-B/chapter-VII/subchapter-C/part-772/section-772.1">including</a> &#8220;an awareness of a high probability of its existence or future occurrence&#8221;&#8212;which can result in civil penalties, but for BIS (or rather: the Department of Justice) to administer <em>criminal</em> penalties like prison time, the accused must have been shown to have &#8220;willfully&#8221; caused a violation, a much higher bar. (Multiple AI chip smuggling cases have been prosecuted criminally over the past year, but these cases often involve text messages showing that the smugglers were fully aware that their activities were illegal and were actively involved in the operations. For example, in one case, a smuggler told his co-conspirator over WeChat to remove mentions of Chinese customers from a draft solicitation message because, in his words, &#8220;We will draw [attention] from US government for [embargo] violation&#8221;, reassuring him in the next message that &#8220;We just talk about it, no one can hold it as [evidence] against us.&#8221; I think most smugglers aren&#8217;t so careless.)</p><p>More importantly, major AI chip sellers often don&#8217;t have even &#8220;an awareness of a high probability&#8221; of violations, because chips routinely pass through long distribution chains before reaching an end user, and sellers don&#8217;t ordinarily have visibility into who that end user is. For example, a former Super Micro employee told me that while server makers and distributors would know who they sold to, beyond that, any forensic trail would go dark. That means civil penalties are also often out of the question for these large companies.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-17" href="#footnote-17" target="_self">17</a></p><p><strong>Super Micro shows that even major US companies can perform poor due diligence.</strong><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-18" href="#footnote-18" target="_self">18</a> Super Micro appears as the upstream seller in three separate prosecutions over the past year:</p><ul><li><p>The $2.5 billion insider case described above</p></li><li><p>A case in which Super Micro sold 205 H100s to a California front company that fed it false end-user information, including a Singapore &#8220;end user&#8221; called Metacarbon that BIS later determined did not exist at the address Super Micro had on file</p></li><li><p>A case in which Super Micro almost shipped $170 million of servers to a Georgia-based front, after NVIDIA employees in Thailand independently discovered that the claimed Thai end user had never heard of the order</p></li></ul><p>Two things stand out across these cases. First, Super Micro&#8217;s end-user vetting accepted signed certifications and paper documentation without independent verification that the customer existed as a going concern at the claimed location. Second, in the cases where smuggling was caught, the decisive intervention came from outside the standard process&#8212;from a BIS notification that diversion was occurring, from a BIS post-shipment verification, or from NVIDIA&#8217;s own scrutiny&#8212;rather than from Super Micro&#8217;s compliance team independently working backward from red flags it had spotted itself.</p><p>It&#8217;s true that in the $2.5 billion insider case, the smugglers were not outsiders gaming an unwitting company, but included a Super Micro co-founder and board member who oversaw global sales, as well as a general manager in Super Micro&#8217;s Taiwan office. This is likely not normal for major US companies, and would have made it harder for Super Micro&#8217;s compliance team to do its work. The indictment shows the insiders repeatedly pressuring, sidestepping, and physically deceiving the compliance team across at least three audits in 2024 and 2025.</p><p>Even so, Super Micro&#8217;s compliance processes were clearly inadequate throughout this case:</p><ul><li><p>Super Micro&#8217;s sales organization was able to influence its internal audits. Sales staff, who have a clear conflict of interest, managed to arrange which auditor would conduct a given review (at least one was described in writing by an insider as &#8220;friendly&#8221;) and in the most extreme instance, the auditor tasked with conducting an on-site inspection was instead off-site, enjoying entertainment paid for by the very customer he was supposed to be auditing, while photos and videos of staged dummy servers were sent to him in lieu of an actual inspection.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-19" href="#footnote-19" target="_self">19</a></p></li><li><p>Internal audits relied on documents that the customer itself supplied, without independent checks. When the compliance team asked the customer to demonstrate that it had enough data center space to store the volumes it was buying, the team accepted lease agreements that turned out to be falsified.</p></li><li><p>When the compliance team noticed that the customer was buying in large volumes without clearly operating at a commensurate scale, it failed to act decisively. For example, one Singapore entity seems to have gone from a $33 million balance sheet at year-end 2023 to $3 billion (90x) by year-end 2024, driven almost entirely by $2.9 billion in &#8220;refundable deposits received&#8221; from an undisclosed source (allegedly Alibaba).<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-20" href="#footnote-20" target="_self">20</a> In a single quarter, the customer ranked as Super Micro&#8217;s eleventh-most-profitable worldwide, sitting &#8220;alongside major US technology and social media companies developing hyperscale artificial intelligence infrastructure&#8221;, even though, per the indictment, it &#8220;did not have the capacity to store or use the massive quantities of servers it was purchasing&#8221;. Though the compliance team twice placed temporary holds on shipments, it released each hold after receiving explanations from the customer that it had no way to independently verify. For example, after one temporary hold, one smuggler urged a co-conspirator that &#8220;[y]ou need to have strong and persuasive reasons to convince [Super Micro&#8217;s] staffs!&#8221;, and once their drafted answers were accepted by the compliance team, the co-conspirator messaged the group, &#8220;Heheheh, Gooood news for everyone!&#8221;</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!r7l0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffef2ef7d-bd38-4db0-8ee2-c4b81c504fda_756x354.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!r7l0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffef2ef7d-bd38-4db0-8ee2-c4b81c504fda_756x354.png 424w, https://substackcdn.com/image/fetch/$s_!r7l0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffef2ef7d-bd38-4db0-8ee2-c4b81c504fda_756x354.png 848w, https://substackcdn.com/image/fetch/$s_!r7l0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffef2ef7d-bd38-4db0-8ee2-c4b81c504fda_756x354.png 1272w, https://substackcdn.com/image/fetch/$s_!r7l0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffef2ef7d-bd38-4db0-8ee2-c4b81c504fda_756x354.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!r7l0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffef2ef7d-bd38-4db0-8ee2-c4b81c504fda_756x354.png" width="756" height="354" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fef2ef7d-bd38-4db0-8ee2-c4b81c504fda_756x354.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:354,&quot;width&quot;:756,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!r7l0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffef2ef7d-bd38-4db0-8ee2-c4b81c504fda_756x354.png 424w, https://substackcdn.com/image/fetch/$s_!r7l0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffef2ef7d-bd38-4db0-8ee2-c4b81c504fda_756x354.png 848w, https://substackcdn.com/image/fetch/$s_!r7l0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffef2ef7d-bd38-4db0-8ee2-c4b81c504fda_756x354.png 1272w, https://substackcdn.com/image/fetch/$s_!r7l0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffef2ef7d-bd38-4db0-8ee2-c4b81c504fda_756x354.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Warehouses stacked with dummy servers staged for Super Micro&#8217;s compliance team, from the indictment of the $2.5 billion insider case.</figcaption></figure></div><p>I think most other major US AI chip sellers perform better due diligence than Super Micro, which seems to have unusually poor corporate governance. But Super Micro&#8217;s poor governance has been known for years and has still endured.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-21" href="#footnote-21" target="_self">21</a> Clearly, US authorities should not assume that there will never be any companies with poor governance. Instead, they should create incentives for all relevant companies to carry out strong due diligence.</p><h1>Companies could take concrete steps to reduce smuggling</h1><p>AI chips reach Chinese end users not because any single seller waves them through, but because the distribution chain has many links&#8212;AI chip designer, server maker, distributor, reseller, end user&#8212;and each link conducts due diligence only on its immediate counterparty. Because legal liability for diversion attaches only to <a href="https://www.ecfr.gov/current/title-15/part-772#p-772.1(Knowledge)">&#8220;knowledge&#8221;</a> and sellers&#8217; visibility into where the chips end up decreases as the chips move down the chain, the equilibrium is one in which every individual transaction with a US seller can be compliant, yet large numbers of chips still end up in China. This problem won&#8217;t solve itself.</p><p>I think AI chip sellers could do a lot more here if they were sufficiently incentivized. For example, NVIDIA could implement <a href="https://www.iaps.ai/research/location-verification-for-ai-chips">delay-based location verification</a> on its AI chips and contractually require customers to periodically report the chips&#8217; location. If any chips go dark, NVIDIA could follow up with an on-site inspection, potentially refuse to sell to that customer in the future, and report its findings to US authorities. Another promising approach is to conduct on-site inspections regularly&#8212;say, every six months&#8212; to make sure chips are not just installed to fool inspectors and then removed and smuggled, as has happened before. However, conducting on-site inspections at this scale would likely require additional budget for export enforcement, or for US authorities to implement <a href="https://www.iaps.ai/research/export-auditors-as-market-powered-export-enforcement">a third-party export auditor program</a>.</p><p>But I think the most effective thing the major American AI chip sellers could do is to sell only to a select few, highly trustworthy end users, especially US hyperscalers and neoclouds. AI chip sellers are unlikely to do this voluntarily, so in practice, this would mean the US government setting up a formal, low-discretion process for becoming an authorized AI chip importer abroad, where companies are evaluated on the risk of diversion. (Shipments below a certain volume could be exempted, so that anyone could import small quantities without a license.) These authorized companies will almost certainly not divert any chips, and they can <a href="https://www.rand.org/pubs/commentary/2025/08/america-should-rent-not-sell-ai-chips-to-china.html">rent the AI chips</a><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-22" href="#footnote-22" target="_self">22</a> either on a short-term basis or with longer-term allocations to any end user in the US or abroad. (If small resellers located in third countries perform poor due diligence, this would no longer matter, because the AI chips would remain in the hands of trusted companies throughout, from AI chip designers and server makers to authorized hyperscalers and neoclouds.) There likely aren&#8217;t any regulatory barriers to doing this, since US companies can operate data centers abroad, and foreign companies would also be eligible if they are bona fide. I think this policy, if implemented, would essentially solve the AI chip smuggling problem.</p><p>In addition, US authorities could require specific due diligence measures in addition to what companies already do (<a href="https://www.semiconductors.org/the-critical-effort-to-combat-illicit-chip-diversion/">&#8220;compliance-plus&#8221;</a>), such as repeated, random, unannounced on-site inspections for sales to less obviously bona fide customers. Alternatively, the US could adopt a surety bond system, as Onni described in <a href="https://www.the-substrate.net/p/a-sketch-of-market-based-export-controls">a previous post</a>. Finally, <a href="https://www.the-substrate.net/p/the-case-for-paying-whistleblowers">strong whistleblower incentives</a> could help detect and prosecute smuggling by surfacing high-quality leads from insiders. Among other benefits, these policies would encourage companies to strengthen their due diligence practices.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>By &#8220;AI chip smuggling&#8221;, I mean the illegal physical transport of AI chips across borders into a country to which export is legally prohibited, such as China or Russia. This doesn&#8217;t include the entirely legal situation where Chinese companies rent access to AI chips remotely, for example, ByteDance renting access to AI chips located in Malaysia.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>As far as I can tell, CAISI doesn't report the slopes of the regressions, but from eyeballing the plot, it looks like the US is gaining about 52 Elo per month while China is gaining about 38 Elo per month, so the gap between the two trend lines widens by roughly 14 Elo per month, or ~170 per year. (Elo is a rating system where competitors exchange points after each head-to-head matchup, with bigger transfers for more surprising results. It was originally developed for chess, but in this case, the competitors are AI models and the matchups are benchmark tasks.) Translating that into months of lag (in the same sense as CAISI's eight-month figure, which seems to measure how far back the US trend line crossed today's PRC capability level) means dividing by the US slope of 52, giving a lag that grows by about three months per year.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>The DeepSeek-V4 announcements and technical paper did not specify which chips were used for training. If it had been trained with domestic Huawei chips, DeepSeek would likely have announced that proudly, as it did when it announced that V4 was optimized for inference using Huawei Ascends (in addition to being optimized for inference on NVIDIA chips).</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Technically, the news coverage referred to a model called R2, which would have been the successor of R1. As reasoning and non-reasoning models have tended to unify over the past year, I think we should view V4 as essentially the successor that R2 referred to at the time.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>A &#8220;third country&#8221; is a country that is neither the origin (in this case, the US) nor the ultimate destination (typically, China).</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>For example: Megaspeed International (Singapore), Speedmatrix Sdn. Bhd. (Malaysia), Novagate Cloud Pte Ltd. (Singapore), Novagate Cloud Sdn. Bhd. (Malaysia), Aperia Cloud Services (Singapore), OBON Corp (Thailand), Siam AI Corporation (Thailand), Aolani (Malaysia), and Indosat Ooredoo Hutchison (Indonesia). To be clear, these all appear in allegations, but they are not proven to have been involved in smuggling or other illegal activities. It is also possible, or even likely, that this list is very incomplete.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p>Note that some companies, like Google, design AI chips for use in their own data centers. Since these companies generally do not sell their chips, they are not smuggled, and so they are not covered in this post.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-8" href="#footnote-anchor-8" class="footnote-number" contenteditable="false" target="_self">8</a><div class="footnote-content"><p>Distributors represent a relatively minor share of NVIDIA&#8217;s revenue since they typically buy servers from server builders, which are one step removed from NVIDIA. They probably also purchase some AI chips directly from NVIDIA, which is likely where the revenue from these distributors originates.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-9" href="#footnote-anchor-9" class="footnote-number" contenteditable="false" target="_self">9</a><div class="footnote-content"><p>Specifically, the chips at this point are embedded in accelerators or SXM modules, but since that distinction is not very important here, I will just refer to them as &#8220;chips&#8221; to keep things simple.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-10" href="#footnote-anchor-10" class="footnote-number" contenteditable="false" target="_self">10</a><div class="footnote-content"><p>I think diversion at the shipping stage is unlikely. All known cases of AI chip smuggling involve smugglers obtaining chips from AI chip sellers. According to one former employee at an AI server builder I spoke with, a major server builder works only with trusted freight companies; the former employee seemed skeptical that diversion would occur at that stage. More importantly, the buyer would likely notice if any products went missing, since it has paid for them and has a strong incentive to ensure it received what it paid for. If the buyer is smuggling the chips, there is little reason to divert them during the initial shipment; they could simply be reexported or transshipped instead.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-11" href="#footnote-anchor-11" class="footnote-number" contenteditable="false" target="_self">11</a><div class="footnote-content"><p>Strictly speaking, it is illegal to sell them to these customers without a license from the US government. But the US government has a presumption of denial for these licenses, which, for all intents and purposes, amounts to a ban on these sales.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-12" href="#footnote-anchor-12" class="footnote-number" contenteditable="false" target="_self">12</a><div class="footnote-content"><p>The sales process is likely similar for resellers, though that&#8217;s just conjecture on my part. It may look slightly different for distributors and NVIDIA, since they are more likely to sell to customers only through long-term partnerships.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-13" href="#footnote-anchor-13" class="footnote-number" contenteditable="false" target="_self">13</a><div class="footnote-content"><p>I think the way this works is either that a distributor keeps a bunch of B300s in inventory, receives an order from a customer for HGX B300 servers, then orders those servers from a server builder while supplying the server builder with B300s from its inventory; or that a customer orders HGX B300 servers from a server builder, and the server builder partners with a distributor to both obtain the B300s and ultimately ship the servers to the customer. Perhaps both happen to varying degrees.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-14" href="#footnote-anchor-14" class="footnote-number" contenteditable="false" target="_self">14</a><div class="footnote-content"><p>A server builder is not an end user, since it will sell the servers it builds. However, I think a cloud provider can be an end user of chips it operates in a data center, even though it rents those chips out through its cloud offerings to other actors who actually use them.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-15" href="#footnote-anchor-15" class="footnote-number" contenteditable="false" target="_self">15</a><div class="footnote-content"><p>I think many of these processes can likely be automated and/or improved using AI agents, but I&#8217;m unsure whether AI agents will favor due diligence more than they favor smugglers. For example, AI agents will likely also help smugglers generate fake documents and convincing websites for front companies.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-16" href="#footnote-anchor-16" class="footnote-number" contenteditable="false" target="_self">16</a><div class="footnote-content"><p>I think the right to conduct on-site inspections is typically negotiated as part of the sales process. So it may be difficult for AI chip sellers to do this for chips that are already sold, where it wasn&#8217;t already negotiated.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-17" href="#footnote-anchor-17" class="footnote-number" contenteditable="false" target="_self">17</a><div class="footnote-content"><p>For example, in July 2024, an NVIDIA spokesperson said: &#8220;Although we cannot track products after they are sold, if we determine that any customer is violating U.S. export controls, we will take appropriate action.&#8221; Of course, July 2024 was a different age. We didn&#8217;t even have our first reasoning models then.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-18" href="#footnote-anchor-18" class="footnote-number" contenteditable="false" target="_self">18</a><div class="footnote-content"><p>AI server builders, distributors, and resellers could theoretically be especially prone to forgoing standard due diligence processes when the value of their inventory is depreciating quickly (e.g., due to the release of newer generations) and/or demand is weak. One former AI server builder employee told me that they saw this dynamic in the memory business, where that company would sell to a shady broker, remove the <a href="https://craftybase.com/blog/manufacturing-travelers">traveler</a> (a document showing where the item has been), and do some creative bookkeeping. But this is likely not relevant to AI chips today, as AI chips are seeing massive, consistent demand.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-19" href="#footnote-anchor-19" class="footnote-number" contenteditable="false" target="_self">19</a><div class="footnote-content"><p>Staging this deception seems to have been a fairly substantial operation. According to the indictment, the third-party broker working with the Super Micro insiders estimated that staging the warehouses with dummy servers would require &#8220;100 people in total&#8221;,  forklift operators, arranged meals, and a &#8220;20-person shuttle bus for easy travel between the hotel and the warehouse, allowing for short breaks&#8221;.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-20" href="#footnote-anchor-20" class="footnote-number" contenteditable="false" target="_self">20</a><div class="footnote-content"><p>This is from public filings cited in <a href="https://culperresearch.com/wp-content/uploads/2026/05/Culper_NVDA_5-13-2026.pdf">a May 2026 short-seller report</a>. The entity in question was Megaspeed. The short-seller argues that the $2.9 billion came from Alibaba, but while suggestive, this is neither confirmed nor clear-cut.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-21" href="#footnote-anchor-21" class="footnote-number" contenteditable="false" target="_self">21</a><div class="footnote-content"><p>In <a href="https://www.cnas.org/publications/reports/countering-ai-chip-smuggling-has-become-a-national-security-priority">a June 2025 report</a> on AI chip smuggling, Tim Fist and I wrote: &#8220;According to the Financial Times, &#8216;People involved in the trade said merchants in Malaysia, Japan, and Indonesia often shipped Super Micro servers or NVIDIA processors to Hong Kong before bringing them across the border to Shenzhen.&#8217; The Information report cites a smuggler claiming to acquire thousands of chips from companies like Dell and Super Micro &#8216;thanks to what he called &#8220;strong personal relationships&#8221; with sales representatives at these firms&#8217;. A [2024] report by analyst firm Hindenburg Research also documented multiple compliance failures by Supermicro, alleging, for example, that it has supplied millions of dollars of products to a distributor in Russia through a Californian entity despite sanctions. Super Micro servers have also been advertised on Chinese e-commerce sites. Super Micro has responded to past reports of smuggling by stating that it follows &#8216;all US export control requirements on the sale, service, support, and export of GPU systems&#8217;.&#8221;</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-22" href="#footnote-anchor-22" class="footnote-number" contenteditable="false" target="_self">22</a><div class="footnote-content"><p>Renting is a much better situation for the US because it allows US companies to theoretically revoke access and, to some extent, monitor who uses the chips. That makes it easier to prevent Chinese actors from, say, using the AI chips for military purposes. If the chips are smuggled into China, it&#8217;s essentially impossible to recover them, and it&#8217;s equally difficult to prevent Chinese actors from using them for military purposes.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Making through-silicon vias is not a bottleneck for China's HBM production]]></title><description><![CDATA[Firms like ACM Research are making the deposition machines China needs, though they may suffer from worse yields.]]></description><link>https://www.the-substrate.net/p/making-through-silicon-vias-is-not</link><guid isPermaLink="false">https://www.the-substrate.net/p/making-through-silicon-vias-is-not</guid><dc:creator><![CDATA[Hamish Low]]></dc:creator><pubDate>Wed, 22 Apr 2026 15:37:29 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/0a47133b-7a0f-4306-9ef4-caeac9f9a02a_2624x1856.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>This is the second piece in a series exploring key semiconductor manufacturing equipment that China needs to indigenously produce high-bandwidth memory, perhaps the most important bottleneck in its efforts to make AI chips. The first piece was on <a href="https://www.the-substrate.net/p/china-is-making-strides-in-etching">advanced etching machines</a>.</em></p><p>The biggest news in the world of semiconductor manufacturing equipment (SME) export controls was the recent introduction of the <a href="https://www.congress.gov/bill/119th-congress/house-bill/8170/text">MATCH Act</a> to Congress. This bill initially covered a range of measures, all aimed at aligning US allies with US export controls on SME to China. It has since been narrowed to the imposition of stricter controls on deep ultraviolet immersion (DUVi) lithography equipment. One dropped element targeted through-silicon via (TSV) deposition and etch tools.</p><p>Is this a missed opportunity or a wise trade-off? In this post, I try to answer that question by investigating how advanced Chinese domestic firms are at producing the relevant machines. I find that Chinese firms can likely handle the required TSV etching and deposition steps using indigenous equipment, though at a lower yield than with equivalent tools from Western firms. Excluding TSV tools from the MATCH Act to focus on the most binding bottleneck, deep ultraviolet immersion lithography, is therefore the right strategic choice.</p><p>TSVs are significant because they play a key role in the production of high-bandwidth memory (HBM). For China to produce HBM, it needs not only to produce the individual memory chips&#8212;the focus of the <a href="https://www.the-substrate.net/p/china-is-making-strides-in-etching">previous piece in this series</a>&#8212;but also to package them into a single stack. TSVs, tens of thousands of tiny vertical copper wires, make that stacking possible. Producing TSVs requires specialized SME, including etching and deposition machines, but is much less demanding than other cutting-edge areas of semiconductor production, such as making advanced logic or memory chips.</p><p>The first stages of TSV fabrication are etching and various thin-film deposition steps. For these stages, there are multiple competing tool options from Chinese firms. The next stage, copper electroplating, is the most concentrated, with ACM Research the only Chinese firm to have delivered a proven tool, though Naura has recently begun working on its own alternative.</p><p>ACM Research is also a fascinating firm. Its China-based subsidiary, ACM Shanghai, which accounts for essentially all of the group's manufacturing and revenue and is crucial to China&#8217;s AI ambitions, is on the US Entity List, yet is owned by a US-headquartered parent firm. Looking at ACM Research also offers insight into China&#8217;s successes in developing wafer-cleaning tools and how it has nearly closed the gap with the global frontier in that niche.</p><p>In this post, I first explore how TSVs work and the various processes required to create them, before turning to electroplating and ACM Research and concluding with what this analysis means for China&#8217;s overall HBM effort and whether tightening controls on TSV tools would be worthwhile.</p><h1>Through-silicon vias make high-bandwidth memory possible</h1><p>AI models require significant memory capacity (how much information the memory can hold) and memory bandwidth (how fast it can move this information to the relevant logic process on the chip). HBM provides the best balance between these two requirements. HBM&#8217;s innovation over previous forms of memory was stacking several (usually eight to twelve, and in more recent generations, sixteen) memory chips on top of one another, to place as much memory as possible, as close as possible, to the logic component that runs computations of the AI model.</p><p>The <a href="https://www.the-substrate.net/p/china-is-making-strides-in-etching">previous piece in this series</a> explored the production of these individual memory chips and how the need to produce ever-denser, more advanced memory cells could be a bottleneck to China&#8217;s HBM production. The focus of this piece is instead on how to bind these individual memory chips into an HBM stack.</p><p>TSVs are tens of thousands of tiny vertical wires that cut through the layers of the chips within the HBM stack.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> They connect these stacked memory layers to one another, creating a wide pathway for data to flow down through the stack. The advantage of this vertical interconnect is density: by stacking memory layers rather than spreading them across a circuit board, far more memory can sit next to the AI chip, reducing the latency and energy cost of moving large amounts of data.</p><h1>Etching is the first step of making a TSV, and is comparatively simple</h1><p>TSVs differ from the memory cells discussed in the previous piece in that they are larger. These are still semiconductors, so TSVs aren&#8217;t big, but they are about 200 times wider in diameter than memory cells.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> This is because TSVs go much deeper than memory capacitors, requiring them to cut through the entire silicon wafer, which introduces a different set of challenges.</p><p>The difficulty lies less in pushing physics to its limits to reach such tiny dimensions than in managing the complexity of drilling deep into a chip with different materials and structures. At a basic level, the process involves carving a hole in a chip, depositing a thin layer of insulating material on top, and filling the rest with conductive copper.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!tDmw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1649c2f8-9dd4-4849-b138-bdbcdaeec775_1071x179.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!tDmw!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1649c2f8-9dd4-4849-b138-bdbcdaeec775_1071x179.png 424w, https://substackcdn.com/image/fetch/$s_!tDmw!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1649c2f8-9dd4-4849-b138-bdbcdaeec775_1071x179.png 848w, https://substackcdn.com/image/fetch/$s_!tDmw!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1649c2f8-9dd4-4849-b138-bdbcdaeec775_1071x179.png 1272w, https://substackcdn.com/image/fetch/$s_!tDmw!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1649c2f8-9dd4-4849-b138-bdbcdaeec775_1071x179.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!tDmw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1649c2f8-9dd4-4849-b138-bdbcdaeec775_1071x179.png" width="1071" height="179" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1649c2f8-9dd4-4849-b138-bdbcdaeec775_1071x179.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:179,&quot;width&quot;:1071,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!tDmw!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1649c2f8-9dd4-4849-b138-bdbcdaeec775_1071x179.png 424w, https://substackcdn.com/image/fetch/$s_!tDmw!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1649c2f8-9dd4-4849-b138-bdbcdaeec775_1071x179.png 848w, https://substackcdn.com/image/fetch/$s_!tDmw!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1649c2f8-9dd4-4849-b138-bdbcdaeec775_1071x179.png 1272w, https://substackcdn.com/image/fetch/$s_!tDmw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1649c2f8-9dd4-4849-b138-bdbcdaeec775_1071x179.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p><em>A diagram of the TSV formation process steps from <a href="https://www.appliedmaterials.com/us/en/semiconductor/markets-and-inflections/heterogeneous-integration/tsv.html">Applied Materials</a></em></p><p>Once the trench is complete and the copper wire is formed, a sequence of steps removes the excess material and flips the wafer to remove material on the other side, revealing the end of the wire. This leaves a TSV that runs through the whole memory die, making it ready to be stacked on top of other memory dies to form HBM. That process of carving away excess material, called chemical mechanical planarization (CMP), will be the subject of the next piece in this series, but for now, the focus is on etching the TSV trench and filling it with copper.</p><p>Given that TSVs are much larger than memory capacitors and that China has already been <a href="https://www.the-substrate.net/p/china-is-making-strides-in-etching">making significant gains in its etching capabilities</a>, etching is unlikely to be a bottleneck for China to produce TSVs. While TSV etching differs in some important ways from capacitor etching&#8212;notably, it is a multi-step process rather than a single step&#8212;it is unlikely to pose a major challenge. TSV etching machines from AMEC and Naura can meet the needs of China&#8217;s HBM production, and exceed the TSV etching limits that are built into US export controls.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a></p><p>Overall, several types of SME are needed to produce TSVs:</p><ul><li><p>Photolithography tools to pattern where to etch the TSVs</p></li><li><p>Etching machines to carve out the TSVs</p></li><li><p>Deposition machines to fill in the desired materials</p></li><li><p>Cleaning tools to remove unwanted impurities</p></li><li><p>Chemical mechanical planarization and other grinding tools to shape the wafer</p></li><li><p>Metrology tools to measure these various processes and keep them on track</p></li></ul><p>Photolithography is an extremely important bottleneck for much of China&#8217;s semiconductor production, but it is not especially significant here. Given the large feature sizes of TSVs, they do not require precise lithography and can therefore rely on lagging-edge tools that China has in relative abundance and can still import from ASML, Canon, or Nikon.</p><p>Etching machines are unlikely to be a bottleneck for similar reasons: the large feature sizes of TSVs. Cleaning tools are an area of relative strength for China, with ACM Research, the focus of the latter half of this piece, having built a globally competitive portfolio.</p><p>This leaves deposition machines, chemical mechanical planarization, and metrology tools. All three are important potential bottlenecks and will be the focus of this and subsequent pieces in this series, starting with deposition tools, the next step in the TSV formation process after etching.</p><h1>China can handle the needed thin-film deposition</h1><p>Deposition is the process of placing material onto the wafer. Like etching, it comes in a wide variety of forms, as a vast array of processes, materials, and structures require different deposition techniques.</p><p>For producing TSVs, there are four key deposition steps. The first three can be grouped together as they are all variations of thin-film deposition; the fourth, copper electroplating, works differently and is discussed below.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rp_k!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c90a1d6-5e68-4126-a879-4b4c464b7070_1540x396.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rp_k!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c90a1d6-5e68-4126-a879-4b4c464b7070_1540x396.png 424w, https://substackcdn.com/image/fetch/$s_!rp_k!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c90a1d6-5e68-4126-a879-4b4c464b7070_1540x396.png 848w, https://substackcdn.com/image/fetch/$s_!rp_k!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c90a1d6-5e68-4126-a879-4b4c464b7070_1540x396.png 1272w, https://substackcdn.com/image/fetch/$s_!rp_k!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c90a1d6-5e68-4126-a879-4b4c464b7070_1540x396.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rp_k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c90a1d6-5e68-4126-a879-4b4c464b7070_1540x396.png" width="1456" height="374" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6c90a1d6-5e68-4126-a879-4b4c464b7070_1540x396.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:374,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!rp_k!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c90a1d6-5e68-4126-a879-4b4c464b7070_1540x396.png 424w, https://substackcdn.com/image/fetch/$s_!rp_k!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c90a1d6-5e68-4126-a879-4b4c464b7070_1540x396.png 848w, https://substackcdn.com/image/fetch/$s_!rp_k!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c90a1d6-5e68-4126-a879-4b4c464b7070_1540x396.png 1272w, https://substackcdn.com/image/fetch/$s_!rp_k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c90a1d6-5e68-4126-a879-4b4c464b7070_1540x396.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><em>Sourced from <a href="https://pubs.aip.org/avs/jva/article/38/3/031202/1023659/Tutorial-on-forming-through-silicon-vias">&#8220;Tutorial on forming through-silicon vias&#8221; </a>in the Journal of Vacuum Science &amp; Technology.</em></p><p>The first three deposition steps place thin layers of different materials on top of one another. The first is an insulating layer that stops electrical interference between the conductive substrate and the copper wires. Next, a barrier layer is needed to stop copper atoms from diffusing through the insulator into the surrounding silicon, damaging other structures on the chip. Finally, a &#8220;seed&#8221; layer of copper is applied, acting as the base for the electroplating stage.</p><p>These thin-film deposition tools are an area where China lags behind the technological frontier, though for TSVs, this gap is not particularly relevant. Since TSVs are large and use well-established materials, they don&#8217;t require cutting-edge thin-film deposition capabilities. Thin-film deposition is unlikely to be an important bottleneck for China&#8217;s HBM production. (The appendix below gives more information on how thin-film deposition works and the reasoning behind that view.)</p><h1>Few Chinese firms are developing electroplating</h1><p>Once these various linings are in place, the challenge is to fill the rest of the TSV with copper. This process is called electroplating or electrochemical deposition. Electroplating uses electrolysis, in which a circuit is established between a cathode and an anode. This oxidizes copper at the anode, sending electrons through the external circuit and releasing positively charged copper ions into the solution; at the cathode, the ions meet those electrons and are reduced to solid copper on the surface. The same process is used to plate gold or silver jewelry: a ring is placed in a solution containing dissolved gold, and a current is passed through it, depositing a layer of solid gold. With TSVs, this happens on a microscopic scale.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!r-mJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbb0a732-0973-4738-9a7c-7c5005aa8bf8_1118x846.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!r-mJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbb0a732-0973-4738-9a7c-7c5005aa8bf8_1118x846.png 424w, https://substackcdn.com/image/fetch/$s_!r-mJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbb0a732-0973-4738-9a7c-7c5005aa8bf8_1118x846.png 848w, https://substackcdn.com/image/fetch/$s_!r-mJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbb0a732-0973-4738-9a7c-7c5005aa8bf8_1118x846.png 1272w, https://substackcdn.com/image/fetch/$s_!r-mJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbb0a732-0973-4738-9a7c-7c5005aa8bf8_1118x846.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!r-mJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbb0a732-0973-4738-9a7c-7c5005aa8bf8_1118x846.png" width="1118" height="846" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dbb0a732-0973-4738-9a7c-7c5005aa8bf8_1118x846.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:846,&quot;width&quot;:1118,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!r-mJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbb0a732-0973-4738-9a7c-7c5005aa8bf8_1118x846.png 424w, https://substackcdn.com/image/fetch/$s_!r-mJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbb0a732-0973-4738-9a7c-7c5005aa8bf8_1118x846.png 848w, https://substackcdn.com/image/fetch/$s_!r-mJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbb0a732-0973-4738-9a7c-7c5005aa8bf8_1118x846.png 1272w, https://substackcdn.com/image/fetch/$s_!r-mJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbb0a732-0973-4738-9a7c-7c5005aa8bf8_1118x846.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4Xsy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0ef8d70-66a5-4aec-8554-7a30bcd8e3c1_1110x1440.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4Xsy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0ef8d70-66a5-4aec-8554-7a30bcd8e3c1_1110x1440.png 424w, https://substackcdn.com/image/fetch/$s_!4Xsy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0ef8d70-66a5-4aec-8554-7a30bcd8e3c1_1110x1440.png 848w, https://substackcdn.com/image/fetch/$s_!4Xsy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0ef8d70-66a5-4aec-8554-7a30bcd8e3c1_1110x1440.png 1272w, https://substackcdn.com/image/fetch/$s_!4Xsy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0ef8d70-66a5-4aec-8554-7a30bcd8e3c1_1110x1440.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4Xsy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0ef8d70-66a5-4aec-8554-7a30bcd8e3c1_1110x1440.png" width="1110" height="1440" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f0ef8d70-66a5-4aec-8554-7a30bcd8e3c1_1110x1440.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1440,&quot;width&quot;:1110,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!4Xsy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0ef8d70-66a5-4aec-8554-7a30bcd8e3c1_1110x1440.png 424w, https://substackcdn.com/image/fetch/$s_!4Xsy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0ef8d70-66a5-4aec-8554-7a30bcd8e3c1_1110x1440.png 848w, https://substackcdn.com/image/fetch/$s_!4Xsy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0ef8d70-66a5-4aec-8554-7a30bcd8e3c1_1110x1440.png 1272w, https://substackcdn.com/image/fetch/$s_!4Xsy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0ef8d70-66a5-4aec-8554-7a30bcd8e3c1_1110x1440.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><em>The diagram on the left shows a simple version of how copper electroplating functions; the image on the right shows the difficulty of electroplating with an Eye-of-Sauron-esque void having formed within the copper, both from <a href="https://pubs.aip.org/avs/jva/article/38/3/031202/1023659/Tutorial-on-forming-through-silicon-vias">&#8220;Tutorial on forming through-silicon vias&#8221; </a>in the Journal of Vacuum Science &amp; Technology.</em></p><p>In the TSV, the deposited copper seed layer acts as the cathode, meaning copper continually builds up on it, slowly filling the space. The complexity comes in managing the pace at which this copper forms, and where within the TSV. A bottom-up fill is needed, as one key issue is voids forming within the copper (as shown above). These are holes within the copper, formed by impurities, trapped air, or mismanagement of the deposition process, in which the top of the TSV fills before the bottom does.</p><p>China has a few makers of electroplating tools, with only ACM Research having a range of machines, the first released in 2019.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> Naura entered the market in March 2025, with an electroplating machine for TSVs.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a> Therefore, much more so than in thin-film deposition, China depends principally on the capabilities of a single firm, with Naura playing a secondary catch-up role.</p><h1>The Ultra ECP 3d meets China&#8217;s TSV electroplating needs</h1><p>ACM Research&#8217;s electroplating offering for TSVs is the Ultra ECP 3d. For some odd reason, the listing of the ECP 3d on ACM Research&#8217;s website shows a picture of a different machine, the ECP ap, which is a less specialized electroplating machine for packaging processes. Even more displeasing, it fails to capitalize the D in &#8220;3D&#8221;.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jUCr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe71529f2-ef6e-4974-960e-8c99b815a926_1200x1200.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jUCr!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe71529f2-ef6e-4974-960e-8c99b815a926_1200x1200.png 424w, https://substackcdn.com/image/fetch/$s_!jUCr!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe71529f2-ef6e-4974-960e-8c99b815a926_1200x1200.png 848w, https://substackcdn.com/image/fetch/$s_!jUCr!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe71529f2-ef6e-4974-960e-8c99b815a926_1200x1200.png 1272w, https://substackcdn.com/image/fetch/$s_!jUCr!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe71529f2-ef6e-4974-960e-8c99b815a926_1200x1200.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jUCr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe71529f2-ef6e-4974-960e-8c99b815a926_1200x1200.png" width="1200" height="1200" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e71529f2-ef6e-4974-960e-8c99b815a926_1200x1200.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1200,&quot;width&quot;:1200,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jUCr!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe71529f2-ef6e-4974-960e-8c99b815a926_1200x1200.png 424w, https://substackcdn.com/image/fetch/$s_!jUCr!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe71529f2-ef6e-4974-960e-8c99b815a926_1200x1200.png 848w, https://substackcdn.com/image/fetch/$s_!jUCr!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe71529f2-ef6e-4974-960e-8c99b815a926_1200x1200.png 1272w, https://substackcdn.com/image/fetch/$s_!jUCr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe71529f2-ef6e-4974-960e-8c99b815a926_1200x1200.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>ACM Research describes the ECP 3d like this:</p><blockquote><p>Building on our proven electrochemical plating (ECP) technology, the Ultra ECP 3d is configured with ACM&#8217;s exclusive Multi-Anode Partial Plating function, which allows the deposition of the copper metal layer on via structures of 3D TSVs and 2.5D interposers, and is compatible with aspect ratios of 10:1 and beyond.</p></blockquote><p>The Ultra ECP 3d is likely capable of handling the electroplating steps necessary for China to fill TSVs to produce HBM. The metrics and descriptions given by ACM Research suggest that it is operating at a fairly advanced level.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a> For instance, the ECP 3d has multiple anodes to better control how material is deposited, and ACM claims it can handle aspect ratios of 10:1 and above.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a> That would make it suitable for the TSVs needed to produce HBM.</p><p>Electroplating is not a step that pushes the cutting edge of semiconductor physics, so unsurprisingly, ACM Research can produce a tool for the job. The differentiator for US tools produced by Lam Research and Applied Materials lies not in capabilities but in performance. High throughput, high uniformity, and good integration with other systems throughout the fab almost certainly still make their machines more attractive to chip makers than ACM Research&#8217;s would be.</p><p>Assessing the gap in these metrics is very difficult due to the lack of publicly disclosed information from these SME companies. Marketing materials usually give vague allusions to capabilities&#8212;&#8220;50% faster&#8221; or &#8220;higher uniformity&#8221;&#8212;but no concrete information. One concrete comparison is the timing of the tools&#8217; introduction. Lam Research&#8217;s current platform was <a href="https://www.3dincites.com/2015/07/lam-research-sabre-3d/">introduced in 2015</a>, with Applied Materials <a href="https://semiengineering.com/electroplating-ic-packages/#:~:text=Tooling%20challenges%20increase%20as%20advanced,ECD%20equipment%20market%20for%20packaging.">following in 2017</a>, and ACM Research <a href="https://www.globenewswire.com/news-release/2020/11/19/2130541/0/en/ACM-Research-Enters-3D-TSV-Copper-Plating-Market-%20%20%20with-Ultra-ECP-3d-Platform.html">in 2020</a>. Lam Research was first and is the clear market leader.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-8" href="#footnote-8" target="_self">8</a></p><p>Lam Research, with a platform in use at leading memory firms since 2015, benefits from over a decade of iteration and knowledge of the cutting edge of HBM production. That translates into higher performance across commercially important metrics such as throughput and yield. ACM Research has had less time, shipped fewer machines, and worked with less sophisticated customers, and so will likely perform worse on these commercial metrics. Even if its machines are technically sophisticated enough to produce the needed TSVs, they likely do so at significantly lower yield.</p><p>While the Ultra ECP 3d having the basic capability is a necessary step, raising yield is a top priority, especially for China&#8217;s HBM efforts, which already face numerous technological bottlenecks and yield issues elsewhere.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-9" href="#footnote-9" target="_self">9</a> Since a silicon wafer goes through hundreds and hundreds of process steps before becoming a completed chip, low yields at any point in the process can wreak havoc on the overall output. Sufficiently low yields can render an entire product line uncommercial or cause extensive delays, as chip makers need more time to consistently iterate and gradually raise yields to acceptable levels.</p><p>ACM Research very likely still lags Western machines on these yield and performance metrics. By how much, and to what effect on China&#8217;s overall HBM production, is hard to assess and a question I&#8217;m still trying to answer. My best estimate is that electroplating is unlikely to be a major source of poor yield in China&#8217;s HBM process, and that ACM Research is only a few years behind Lam Research or Applied Materials in the performance of its machines. Electroplating is a relatively mature and stable platform for both Lam Research and Applied Materials, and a niche corner of the market, so it likely has not attracted major R&amp;D efforts or technological upgrades. The primary benefit these firms have is a backlog of production data and iteration with chip makers, but ACM Research is likely to build this data quickly as Chinese memory firms look to rapidly scale their HBM production. ACM Research has also run this playbook before, initially entering the more niche and unloved market for wafer cleaning tools, and has since come to match the leading global suppliers.</p><h1>ACM Research has reached the frontier before</h1><p>ACM Research started with a bad idea. Its founder, Dr. David Wang, created ACM Research in 1998 with the vision of developing a copper-polishing tool that ultimately proved to be a dead-end. But the company&#8217;s fortunes were made by a better idea of pivoting into wafer cleaning tools, and an even better idea of establishing a subsidiary in Shanghai in 2006.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-10" href="#footnote-10" target="_self">10</a> Over time, ACM Shanghai has become the company&#8217;s core in R&amp;D, manufacturing, and sales.</p><p>ACM Research, the parent company, remains headquartered in the US, which sometimes places the firm in uncomfortable positions. In December 2024, the US placed ACM Shanghai and its subsidiary in South Korea on the Entity List. This restricted its access to overseas markets for components. Despite strong revenue growth, ACM Research&#8217;s net income fell in 2025 as it incurred costs to rejig its supply chain and design out foreign-sourced components.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/8OeXP/1/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/590decd2-87d3-4980-ad45-65a80ec7031a_1220x508.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e33d1ee9-8f51-475c-a10c-e3eb6df2c27b_1220x508.png&quot;,&quot;height&quot;:244,&quot;title&quot;:&quot;| Created with Datawrapper&quot;,&quot;description&quot;:&quot;Create interactive, responsive &amp; beautiful charts &#8212; no code required.&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/8OeXP/1/" width="730" height="244" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p>ACM Research&#8217;s growth has primarily come from its success in wafer cleaning products. As silicon wafers are repeatedly bombarded with energy, dunked in chemicals, and moved between extremely sensitive machines, there are many ways that impurities can upset the process. ACM Research built its capability here through the 2010s; the major breakthrough came in 2013, with a contract to supply Korean memory maker SK Hynix for one of its fabs in China.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/IRHGt/2/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d4e1cb14-bdb6-4378-81af-5137ce5c2a95_1220x542.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ef90ff88-c57d-48c0-b7cf-f82de4416f38_1220x596.png&quot;,&quot;height&quot;:288,&quot;title&quot;:&quot;| Created with Datawrapper&quot;,&quot;description&quot;:&quot;ACMR's revenue by product category (USD millions)&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/IRHGt/2/" width="730" height="288" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p>ACM Research has created cleaning products comparable to those from Japanese players Screen and Tokyo Electron, as well as US-based Lam Research.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-11" href="#footnote-11" target="_self">11</a> It has closed this gap enough to become the dominant provider in the Chinese market, with its largest customers being Chinese memory makers YMTC and CXMT, as well as logic manufacturer SMIC.</p><p>ACM Research has also made innovations of its own by finding a niche in megasonic acoustic vibrations, which can aid in cleaning processes but also risk damaging the features built on the chip. This innovation is helping make it globally competitive. One indication is that Intel has <a href="https://www.reuters.com/world/china/intel-has-tested-chipmaking-tools-firm-with-sanctioned-china-unit-sources-say-2025-12-12/">tested ACM Research machines</a> for potential use in its upcoming and most advanced 14A process. Given the risks posed by using China-sourced equipment and the <a href="https://www.reuters.com/world/china/us-lawmakers-raise-concerns-over-intels-testing-tools-made-by-chinese-linked-2026-03-05/">strong pushback Intel has received</a> from US lawmakers, it seems unlikely that ACM Research&#8217;s tools will be used in Intel&#8217;s fabs. But that an important global chip maker would be interested reflects impressive technical sophistication.</p><p>Beyond cleaning and electroplating, ACM Research&#8217;s ambitions are quite galaxy-brained. At Semicon Shanghai, it announced a refreshed product portfolio that maps its various platforms to planets.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-Kpd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae6ed013-d83d-4d34-af73-bacfd618656c_1080x607.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-Kpd!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae6ed013-d83d-4d34-af73-bacfd618656c_1080x607.jpeg 424w, https://substackcdn.com/image/fetch/$s_!-Kpd!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae6ed013-d83d-4d34-af73-bacfd618656c_1080x607.jpeg 848w, https://substackcdn.com/image/fetch/$s_!-Kpd!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae6ed013-d83d-4d34-af73-bacfd618656c_1080x607.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!-Kpd!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae6ed013-d83d-4d34-af73-bacfd618656c_1080x607.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-Kpd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae6ed013-d83d-4d34-af73-bacfd618656c_1080x607.jpeg" width="1080" height="607" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ae6ed013-d83d-4d34-af73-bacfd618656c_1080x607.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:607,&quot;width&quot;:1080,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-Kpd!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae6ed013-d83d-4d34-af73-bacfd618656c_1080x607.jpeg 424w, https://substackcdn.com/image/fetch/$s_!-Kpd!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae6ed013-d83d-4d34-af73-bacfd618656c_1080x607.jpeg 848w, https://substackcdn.com/image/fetch/$s_!-Kpd!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae6ed013-d83d-4d34-af73-bacfd618656c_1080x607.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!-Kpd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae6ed013-d83d-4d34-af73-bacfd618656c_1080x607.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><em>A promotional image from <a href="https://stock.10jqka.com.cn/20260325/c675540941.shtml">ACM Research&#8217;s official WeChat account</a>.</em></p><p>The explanation behind the branding refresh is not lacking in grandeur:</p><blockquote><p>As humanity gazes up at the vast starry sky, from exploring celestial bodies to intelligent algorithms, a microscopic revolution in the global arena is rewriting the course of human civilization. Today, humanity&#8217;s pursuit of computing power has transcended Earth&#8217;s surface, venturing into the vast outer space. This relentless spirit of exploration is the core mission that drives ACM Research&#8217;s deep commitment to semiconductors and continuous innovation&#8212;and also the inspiration behind our Eight Planets product series.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-12" href="#footnote-12" target="_self">12</a></p></blockquote><p>If sometimes perhaps a little overwrought:</p><blockquote><p>ACM Research has always been customer-centric, maintaining our core position like planets orbiting the sun, with unwavering centripetal force and singular focus, providing the highest quality and most considerate service. Customers are like the sun, providing us with light and warmth.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-13" href="#footnote-13" target="_self">13</a></p></blockquote><h1>TSV formation is not a strong bottleneck to China&#8217;s HBM production</h1><p>China has reasonably capable domestic tools across the various etching, thin-film deposition, and electroplating steps needed to form TSVs. The open question is whether Chinese HBM producers can achieve yield at scale, not whether the tools themselves will be available. Given the relatively modest dimensions and material simplicity of TSV formation compared to advanced logic or memory production, it is unlikely to be a strong bottleneck for China&#8217;s indigenous HBM production.</p><p>Access to Western TSV tools would help China on the margin by improving yield, but their restriction would not impose a strong chokepoint effect. Yield matters greatly, with current industry rumors suggesting that Chinese memory maker CXMT is struggling to get viable yields on its HBM production as it tries to scale it up this year.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-14" href="#footnote-14" target="_self">14</a> Importantly, this is while CXMT&#8217;s HBM production currently relies principally on imported Western tools rather than on Chinese domestic equivalents. Even as Chinese SME firms close the gap in tool capabilities, there are plenty of other challenges in the required material inputs, process integration, and yield learning.</p><p>The truly strong bottlenecks remain in the most advanced tools for cutting-edge logic and memory production, with lithography tools the most important. While further restrictions on TSV etching and deposition tools would impose costs by harming China&#8217;s HBM yield, restricting DUVi lithography tools will effectively forestall China&#8217;s ability to build out incremental advanced-node capacity. No more DUVi means no more advanced DRAM, which ultimately means China cannot scale up HBM production. The prioritization of DUVi restrictions within the MATCH Act over TSV tool restrictions is therefore not a cause for concern.</p><h1>Appendix</h1><h2>Thin-film deposition</h2><p>The various thin-film deposition steps involved in TSV formation use three different techniques:</p><ul><li><p><strong>Chemical vapor deposition (CVD) works by triggering chemical reactions between the substrate and an energized gas, leaving the desired deposition material as a byproduct.</strong> Chemical vapor deposition needs energy to trigger the necessary reactions; in basic chemical vapor deposition, this is thermal energy from high temperatures. However, one of the challenges of producing TSVs is that they are usually not the first feature to be built, which means high temperatures could damage the already constructed transistors or capacitors. This is why plasma-enhanced chemical vapor deposition is necessary: the plasma provides some of the energy needed to produce the reactions, reducing the required temperature.</p></li></ul><ul><li><p><strong>Physical vapor deposition (PVD) works by placing the surface to be coated into a vacuum chamber facing a chunk of the material to be deposited.</strong> This material is then bombarded with charged ions to break off particles, which, once free, are attracted by the clever use of electrical fields to form a layer on top of the target surface.</p></li><li><p><strong>Atomic layer deposition (ALD) uses heat or plasma to drive reactions between a gas and the wafer surface.</strong> The difference is that with atomic layer deposition, these reactions are done one layer of atoms at a time. This makes the process more time-consuming but allows for much higher conformality, meaning that the deposited material is spread exactly evenly over the desired surface.</p></li></ul><p>China likely has sufficient capabilities in all three to handle the creation of TSVs. Due to TSVs&#8217; large feature sizes and the use of relatively standard materials, the thin-film deposition required is not close to the technological frontier. China&#8217;s tools can lag significantly behind those used by Western firms for advanced logic and memory nodes, where feature sizes are much smaller, and a wider range of materials is required.</p><p>This is reflected in US export controls, which target advanced tools for logic and memory fabrication rather than the lagging-edge TSV-capable tools considered here.</p><p>The relevant tools are now produced by a set of Chinese firms. Naura produces CVD, PVD, and ALD tools, while Piotech and AMEC both produce CVD and ALD tools. <a href="https://cset.georgetown.edu/article/inside-beijings-chipmaking-offensive/">Supply chain data</a> from the Center for Security and Emerging Technology shows Chinese deposition suppliers going from 1% global market share to 10% in 2024, a trend that will almost certainly continue, as Naura and AMEC both posted strong deposition revenue growth through 2025.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p><a href="https://news.skhynix.com/sk-hynix-at-nvidia-gtc-2022-demonstrating-the-worlds-fastest-dram-hbm3/">SK Hynix cites 8,000 TSVs per die</a> back in 2022 for its HBM3 production, which has since risen through HBM3E and now into HBM4 as the number of signal TSVs increases as the parallel I/O grows and as larger stacks likely require more power management from power TSVs.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>SemiAnalysis cites memory capacitors as being &#8220;<a href="https://newsletter.semianalysis.com/p/the-memory-wall">~1,000nm high but only 10s of nm in diameter</a>&#8221; while Applied Materials gives <a href="https://www.appliedmaterials.com/us/en/newsroom/perspectives/hbm--materials-innovation-propels-high-bandwidth-memory-into-the.html">5 &#181;m as a standard diameter for TSVs</a>. One &#181;m is 1000nm, so a 5000nm TSV is 200 times larger than a 25nm capacitor.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>See Aqib Zakaria&#8217;s great ChinaTalk piece, <a href="https://www.chinatalk.media/p/will-china-hit-the-hbm-wall">How Far Can Chinese HBM Go?</a>, where he concludes that TSV etching machines from Naura and AMEC already have the requisite specs for the relevant SME item control 3B001.c.4.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>See the &#8216;New Product Information&#8217; in this <a href="https://www.globenewswire.com/news-release/2019/05/07/1818876/0/en/ACM-Research-Reports-First-Quarter-2019-Results.html">ACM Research Q1 2019 financial report</a>, which references the release of its first ECP tool.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>See the coverage of <a href="https://techzephyr.substack.com/p/chinas-wafer-fab-equipment-industry">Naura&#8217;s R&amp;D roadmap here</a> which gives the specific date of its ECP machine&#8217;s release.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>Multi-anode features and partial pulse plating are both more advanced electroplating techniques, and the figures given on uniformity seem relatively strong but are hard to assess due to a lack of comparable figures from other firms.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p>See the <a href="https://www.acmr.com/electrochemical-plating/ultra-ecp-3d/">product description on ACM Research&#8217;s website</a>, the aspect ratio is the ratio of the height to the width of the feature, in this case TSVs are ten times as deep as they are wide.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-8" href="#footnote-anchor-8" class="footnote-number" contenteditable="false" target="_self">8</a><div class="footnote-content"><p>Lam Research cites in its <a href="https://filecache.investorroom.com/mr5ir_lamresearch2/1362/Final%20Jun%2023%20slide%20deck.pdf">Q1 2023 earnings presentation</a> &#8220;100% market share for SABRE 3D and Syndion systems across leading memory customers for TSV formation&#8221;.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-9" href="#footnote-anchor-9" class="footnote-number" contenteditable="false" target="_self">9</a><div class="footnote-content"><p>For instance, China cannot access EUV lithography machines, which are used at the most advanced memory nodes, forcing it to instead rely on older DUV immersion machines and a process of multi-patterning where the wafer goes through more exposures, creating greater yield challenges. China is similarly restricted from various other advanced etching, deposition, and metrology tools, with its domestic substitutes usually not as mature as those from leading global SME firms.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-10" href="#footnote-anchor-10" class="footnote-number" contenteditable="false" target="_self">10</a><div class="footnote-content"><p>See this <a href="https://newsletter.semianalysis.com/p/acm-research-chinas-most-successful">SemiAnalysis piece</a> for an account of ACM Research&#8217;s early history.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-11" href="#footnote-anchor-11" class="footnote-number" contenteditable="false" target="_self">11</a><div class="footnote-content"><p>See this piece on ACM Research <a href="https://newsletter.semianalysis.com/p/acm-research-chinas-most-successful">from SemiAnalysis</a> on how it stacks up against peers.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-12" href="#footnote-anchor-12" class="footnote-number" contenteditable="false" target="_self">12</a><div class="footnote-content"><p>Translated from ACM Shanghai&#8217;s <a href="https://mp.weixin.qq.com/s?__biz=MzI2ODk2MDU3MA==&amp;chksm=eb98ae724783ba46a620600a03d9d6d17bebc7b25fb9e51ce586634895e5c5cebfe2379226ea&amp;idx=1&amp;mid=2247494870&amp;sn=216d8e9bf0153c0fdacd5be6218b785a">announcement on its official WeChat account</a> by Claude Opus 4.6, the original Chinese is &#8220;&#24403;&#20154;&#31867;&#20208;&#26395;&#28009;&#28698;&#26143;&#31354;&#65292;&#20174;&#25506;&#32034;&#26143;&#36784;&#21040;&#26234;&#33021;&#31639;&#27861;&#65292;&#19968;&#22330;&#24494;&#35266;&#19990;&#30028;&#30340;&#20840;&#29699;&#38761;&#21629;&#65292;&#27491;&#22312;&#25913;&#20889;&#20154;&#31867;&#25991;&#26126;&#36827;&#31243;&#12290;&#24403;&#21069;&#65292;&#20154;&#31867;&#23545;&#31639;&#21147;&#30340;&#36861;&#27714;&#24050;&#28982;&#36229;&#36234;&#22320;&#29699;&#34920;&#38754;&#65292;&#36808;&#21521;&#26356;&#24191;&#38420;&#30340;&#22320;&#29699;&#22806;&#22826;&#31354;&#12290;&#36825;&#20221;&#27704;&#19981;&#27490;&#27493;&#30340;&#25506;&#32034;&#31934;&#31070;&#65292;&#27491;&#26159;&#30427;&#32654;&#28145;&#32789;&#21322;&#23548;&#20307;&#12289;&#25345;&#32493;&#21019;&#26032;&#30340;&#21021;&#24515;&#65292;&#20063;&#26159;&#25105;&#20204;&#25171;&#36896;&#20843;&#22823;&#34892;&#26143;&#31995;&#21015;&#20135;&#21697;&#30340;&#21021;&#34935;&#8221;.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-13" href="#footnote-anchor-13" class="footnote-number" contenteditable="false" target="_self">13</a><div class="footnote-content"><p>Translated from ACM Shanghai&#8217;s <a href="https://mp.weixin.qq.com/s?__biz=MzI2ODk2MDU3MA==&amp;chksm=eb98ae724783ba46a620600a03d9d6d17bebc7b25fb9e51ce586634895e5c5cebfe2379226ea&amp;idx=1&amp;mid=2247494870&amp;sn=216d8e9bf0153c0fdacd5be6218b785a">announcement on its official WeChat account</a> by Claude Opus 4.6, the original Chinese is &#8220;&#30427;&#32654;&#22987;&#32456;&#20197;&#23458;&#25143;&#20026;&#20013;&#24515;&#65292;&#22914;&#34892;&#26143;&#32469;&#26085;&#33324;&#22362;&#23432;&#26680;&#24515;&#12289;&#31934;&#20934;&#21516;&#21521;&#65292;&#20197;&#24658;&#20037;&#19981;&#21464;&#30340;&#21521;&#24515;&#21147;&#19982;&#26497;&#33268;&#19987;&#27880;&#65292;&#25552;&#20379;&#26368;&#20248;&#36136;&#12289;&#26368;&#36148;&#24515;&#30340;&#26381;&#21153;&#12290;&#23458;&#25143;&#22914;&#21516;&#22826;&#38451;&#65292;&#36171;&#20104;&#25105;&#20204;&#20809;&#19982;&#28909;&#8221;.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-14" href="#footnote-anchor-14" class="footnote-number" contenteditable="false" target="_self">14</a><div class="footnote-content"><p>See Aqib Zakaria&#8217;s recent ChinaTalk piece on <a href="https://www.chinatalk.media/p/should-chinese-memory-be-anathema">whether the US should buy memory from CXMT</a> , where he assesses various rumors about CXMT&#8217;s yield on its HBM and discusses what that would mean for its profitability.</p></div></div>]]></content:encoded></item><item><title><![CDATA[BIS should use AI to control AI chips]]></title><description><![CDATA[For BIS to truly stamp out smuggling, it needs to take advantage of the AI capabilities it&#8217;s trying to control.]]></description><link>https://www.the-substrate.net/p/bis-should-use-ai-to-control-ai-chips</link><guid isPermaLink="false">https://www.the-substrate.net/p/bis-should-use-ai-to-control-ai-chips</guid><dc:creator><![CDATA[Maxwell K. Roberts]]></dc:creator><pubDate>Mon, 30 Mar 2026 11:01:53 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/a80856e7-2c72-447c-9c48-dbcadebce9f1_2624x1856.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>The Bureau of Industry and Security (BIS) needs more enforcement capacity. It recently <a href="https://www.cnbc.com/2026/03/19/us-tech-execs-smuggled-nvidia-chips-to-china-prosecutors-say.html">came to light</a> that several Super Micro employees illegally moved $2.5 billion worth of export-controlled AI servers to China over two years. They routed these servers through a front company in Southeast Asia and constructed hundreds of fake servers to fool physical inspections by the manufacturer and by BIS. BIS is doing the best it can, but an agency with only a few hundred employees and a budget one-tenth the value of those smuggled servers can only do so much.</p><p>There are well-known solutions to this problem. I&#8217;ve written about how BIS is finally getting <a href="https://www.the-substrate.net/p/bis-is-getting-more-fundingheres">funding to hire more agents</a>, and how <a href="https://www.the-substrate.net/p/bis-should-build-a-lean-mean-data">upgrading BIS&#8217;s software and data systems to match private-sector capabilities</a> could be a force multiplier for enforcement. Those steps are good, but incremental. In this post, I propose something more ambitious.</p><p>Specifically, the entire reason why BIS is straining to stop China from acquiring AI chips is the prospect of AI revolutionizing military, economic, and political power. So shouldn&#8217;t BIS be using it to revolutionize export enforcement?</p><h1>What LLMs can&#8217;t fix</h1><p>The most relevant form of AI for BIS today is likely large language models (LLMs) and the agents that are built on them. It&#8217;s worth first laying out what these <em>cannot</em> do.</p><p><strong>LLMs can&#8217;t accelerate processes bottlenecked by human review.</strong> When BIS adds companies to the Entity List, an analyst at the BIS Office of Enforcement Analysis must first spend days or weeks writing an Entity List package. That package contains information about the company to be added: its subsidiaries, addresses, products, alleged bad behavior, and any expected economic blowback from the addition. The point of the package is to summarize all relevant information about a company in one place, so that when the End-User Review Committee, with representatives from the Departments of Commerce, State, Energy, and War, votes on the package, it has all the information it needs.</p><p>I think LLMs could speed the process of writing Entity List packages from &#8220;days to weeks&#8221; to &#8220;minutes to hours&#8221;. To the extent that the task involves searching the internet and relevant internal records for information about the company and summarizing it in a specified format, the deep research features of commercial LLMs are already quite useful. LLMs might even be better than human analysts in some regards, because they are fluent in many languages and don&#8217;t get bored reading endless shareholder reports. But if LLMs enable BIS to generate hundreds of packages each month, this will create a bottleneck for the End-User Review Committee, as its members must read through and evaluate thousands of pages.</p><p><strong>LLMs can&#8217;t analyze data that BIS doesn&#8217;t have.</strong> BIS has detailed data on, for example, US exports, and it&#8217;s plausible that LLMs could help analyze that data in much the same way a human analyst could: by noticing when shipments are going to economically irrational destinations, or when they&#8217;re sitting in warehouses for implausibly long periods. If so, LLMs would be like human intelligence analysts, but at much greater speed and scale&#8212;rather than being constrained by staffing limitations to vet only the most suspicious transactions, BIS-controlled LLMs could vet every transaction in real time and escalate to human analysts as needed.</p><p>However, LLMs cannot analyze data that BIS does not have. BIS&#8217;s ability to understand trade flows beyond US borders relies on a mixture of clandestine methods, commercial datasets, and cooperation from foreign governments. If LLMs were to massively increase BIS&#8217;s analytical throughput, BIS may need to acquire a lot of additional data to keep those LLMs chewing on something. BIS would also need to set up the IT infrastructure to connect all these data sources.</p><h1>What LLMs <em>can</em> fix</h1><p>Having described what LLMs can&#8217;t fix, let me lay out what I think they <em>can</em> fix, and why BIS should invest in them.</p><p><strong>LLMs are great at software engineering and data science.</strong> I&#8217;ve written about how BIS should <a href="https://www.the-substrate.net/p/bis-should-build-a-lean-mean-data">improve its software and use more data science</a>. LLMs could make these projects so much cheaper and easier!</p><p>On the data science side, LLMs reduce the need to learn programming languages like SQL and Python for analyzing large datasets. Right now, BIS relies on a tiny number of trained data scientists, many of whom are contractors, to answer data questions about exports and licenses. This limited capacity means that only the most important questions get answered, and some questions never get asked at all. LLMs would be like a trained data scientist sitting at the desk of every enforcement analyst, all the time, ready to query the data in any way they needed.</p><p>On the software engineering side, LLMs could fundamentally reimagine how organizations procure software. The idea that the government needs to choose a software solution for an agency, a department, or even the whole government, and then spend a lot of money to buy it, is based on the premise that software is expensive and software engineers are scarce, which is now becoming false. LLMs are probably still not the right choice for building anything that needs extremely high reliability or security, but by dramatically lowering the cost of software, they massively increase the number of applications it can be used for. Every office could build custom software workflows for its own needs&#8212;converting public comments from PDFs to Word docs, or generating Federal-Register-formatted lists of addresses from Excel spreadsheets&#8212;at little more cost to the government than the tokens burned. LLMs are probably not ready for building highly secure systems or massive agency-wide overhauls, but they can already automate any task that a Python script and a small server could handle today.</p><p>LLMs could turn every employee at BIS into a software engineer and a data scientist, allowing each office and even each person to build tools suited to their own workflows and needs.</p><p><strong>LLMs are great at internet research.</strong> LLMs are not great at research taste&#8212;that is, knowing which questions are worth asking or which problems are worth working on. Nor are they great at operationalization, like turning a vague instruction (&#8220;tell me what&#8217;s going on with Huawei these days&#8221;) into specific Google searches. But they are <em>fantastic</em> at googling things and writing about them. A lot of open-source analysis is just putting certain Chinese characters into Google until you find what you are looking for. LLMs can do that with incredible speed and scale, in any language you please.</p><p>This capability can be useful not only for writing Entity List packages, but also for another common BIS task: &#8220;Leadership saw a headline about this thing. Write a two-pager explaining it.&#8221; By using LLMs for this task, the role of the analyst would shift from providing mechanical effort (the ability to google lots of things and write about them) to providing judgment in the form of context, taste, and verification.</p><p><strong>LLMs are great at answering short-form science and engineering questions.</strong> LLMs are <em>fantastic</em> at answering short questions about science and engineering, as illustrated by progress on <a href="https://epoch.ai/benchmarks">benchmarks</a> like GPQA, MMLU, and Humanity&#8217;s Last Exam. BIS has always struggled to hire and retain technical experts, because private-sector jobs pay far more and offer a better quality of life. But LLMs will happily explain the difference between extreme ultraviolet lithography and deep ultraviolet lithography, or between different types of side-channel attacks. Unlike a real technical expert, they will answer an unlimited number of follow-up questions, with infinite patience.</p><p>This capability can help enforcement analysts answer questions like: &#8220;What does this electronic component that the intelligence says someone is transporting actually do?&#8221; Such answers would help licensing officers and license applicants better understand how items should be classified and what their technical capabilities practically mean. They would also help policymakers write better export control rules by providing a clearer understanding of what the underlying technologies can do and how they fit together.</p><p>The administration is already aware that AI can assist with many government tasks. The <a href="https://www.whitehouse.gov/wp-content/uploads/2025/07/Americas-AI-Action-Plan.pdf">AI Action Plan</a> urged agencies to &#8220;Accelerate AI Adoption in Government&#8221; and outlined specific actions that government service providers, such as the Office of Personnel Management and the General Services Administration, could take to enable AI adoption. Similarly, an Office of Management and Budget <a href="https://www.whitehouse.gov/wp-content/uploads/2025/02/M-25-21-Accelerating-Federal-Use-of-AI-through-Innovation-Governance-and-Public-Trust.pdf">memo</a> published in April 2025 called on agencies to &#8220;remove barriers to innovation&#8221;, &#8220;empower AI leaders&#8221;, and &#8220;ensure their use of AI works for the American people&#8221;. The next step is for BIS to heed the Action Plan&#8217;s call and start realizing the benefits.</p><h1>BIS should get ready for AI agents</h1><p>Everything described above can be done with existing commercial capabilities, such as Claude Code or OpenAI Codex. I intend to show this more rigorously by constructing more formal evaluations, but I don&#8217;t think it requires any premise beyond frontier models having the capabilities they have today.</p><p>But I think BIS should be thinking more ambitiously than that. The time horizon of certain software engineering, machine learning, and cybersecurity tasks that frontier models can complete <a href="https://metr.org/time-horizons/">continues to grow rapidly</a>. I believe that by the end of 2026, frontier models with sufficient scaffolding will be able to complete tasks across many domains that take humans three to four days. (They already can in some laborious domains that they are well suited for, like translation.)</p><p>Rather than a working-level analyst writing an intelligence report and asking a model to &#8220;write one paragraph about what types of vacuum cleaners this company makes&#8221;, an office director could tell an agent: &#8220;Write intelligence reports about these five companies, and decide which ones pose a threat, and then write Entity List packages about them.&#8221; If AI agents are sufficiently trustworthy (another reason it&#8217;s important to build highly specific, formal evaluations), they could even automate some of the human-review bottleneck.</p><p>For BIS to be ready for more capable AI agents as they arrive, it needs to set guardrails around agent deployments, determine how best to integrate agents into classified systems, and reimagine laws and procedures for an agentic world.</p><h2>Setting guardrails for AI agent deployments</h2><p>Guardrails for agent deployments should be built into the software so that agents are unable to do anything they are not authorized to do. The &#8220;principle of least privilege&#8221; is an old concept in security, but the advent of agents capable of rapidly doing irreversible damage makes it far more urgent. I propose three types of guardrails for AI agents used by BIS.</p><p><strong>First, AI agents should have data guardrails.</strong> Each deployed agent instance should have a well-defined purpose&#8212;for example, analyzing trade data for anomalies&#8212;and access only to the data required for that purpose. Trade data analyst agents should not have access to employee email inboxes. Humans should serve as the &#8220;air gap&#8221; between agent outputs and the ability to, say, publish to the BIS website or email the Secretary of Commerce.</p><p><strong>Second, AI agents should have action guardrails.</strong> An agent whose purpose is to monitor a political appointee&#8217;s email inbox should not be able to execute code. An agent whose purpose is to patch software vulnerabilities should not be able to read case files. One worrying dynamic is that, as AI agents&#8217; <a href="https://www.iaps.ai/research/highly-autonomous-cyber-capable-agents">cyber and software engineering capabilities</a> improve, their ability to escalate their own privileges may also improve (which appears to <a href="https://www.irregular.com/publications/emergent-offensive-cyber-behavior-in-ai-agents">already be happening</a>).</p><p><strong>Third, AI agents should have decision guardrails.</strong> There may be some actions where, <em>even if the quality of agent decisions is demonstrably higher</em> than that of human decision-makers, it would still be unacceptable, for moral, political, or safety reasons, for AI agents to make decisions without human approval. This guardrail is especially important for BIS, since it is a law enforcement agency that implements part of the US government&#8217;s monopoly on legitimate force. AI agents should never be able to order arrests or take any other action that could violate a legal right to due process.</p><h2>Integrating AI agents with classified systems</h2><p>Many of the most valuable tasks AI agents can perform would require access to classified systems. Much of the intelligence BIS relies on to catch smugglers comes from the Intelligence Community or other federal law enforcement agencies and is shared on classified systems. For agents to operate as turnkey autonomous intelligence analysts, they would need to access not only the open internet and commercial trade data but also classified sources. They might also need to operate across classification boundaries, as human analysts do&#8212;for example, finding the website of a company named in a signals intelligence report.</p><p>The upside of getting this right is enormous. An AI agent with access to both classified intelligence and open-source data could do in minutes what currently takes a human analyst days&#8212;for example, cross-referencing a tip from a foreign partner about a suspicious shipment with commercial trade data, satellite imagery, corporate registration records, and social media posts in three languages, and then producing a finished assessment ready for human review. Today, that kind of all-source analysis is bottlenecked by the tiny number of analysts who have the right clearances, the right training, and the bandwidth. AI agents wouldn&#8217;t replace those analysts, but they could give every enforcement team the kind of all-source reach that today is reserved for the highest-priority cases.</p><p>However, classified-agent deployments also carry serious risks that BIS needs to plan for. An agent operating across classification boundaries could, through a hallucination, a prompt injection, or a misconfiguration, move classified information onto an unclassified network, leaking intelligence at a scale and speed no human analyst could match. BIS should begin with agents limited to either classified or unclassified networks, retaining humans as the only bridge between those worlds. It should also engage with the Department of War to learn lessons from its own use of models on classified systems.</p><h2>Reimagining the law for an agentic world</h2><p>Some laws and regulations assume the time it takes to act will serve as a functional check on that action. For example, when the US government imposes tariffs under <a href="https://www.congress.gov/crs-product/IF13006">Section 232</a> of the Trade Expansion Act of 1962, BIS is required to prepare a report, often running to hundreds of pages, describing its analysis of the relevant industry, the national security threat posed by imports, and proposed remedies. The statute does not specify a minimum timeline but does specify maximum timelines. BIS must complete its report within 270 days of the investigation&#8217;s initiation, and the President must decide whether to act on the report within 90 days.</p><p>The current administration has initiated more Section 232 investigations than any other in recent memory, and has taken extraordinary measures to accelerate them. Based on the Section 232 reports <a href="https://www.bis.gov/about-bis/bis-leadership-and-offices/SIES/section-232-investigations">listed</a> on the BIS website, this has enabled them to initiate about 12 investigations in 2025, or one per month on average. For comparison, the Biden administration initiated only one investigation, into rare earth magnets, while the first Trump administration initiated seven investigations (the first since 2001).</p><p>In a world of capable AI agents, it is easy to imagine the administration initiating one Section 232 investigation per day. In some ways, this is good. The American people elect presidents to implement the political will of the people. If the political will of the people is tariffs on lots and lots of things, and AI agents enable the President to carry that out better, that may be good.</p><p>However, it is not clear that when Congress wrote statutes like the Trade Expansion Act (or its now much more notorious companion, the International Emergency Economic Powers Act), it contemplated an administration able to act with incredible speed. The months spent preparing a Section 232 report provide time for companies to file lawsuits, voters to weigh in during midterm elections, and outside parties to register an opinion on the action. The Trade Expansion Act and other statutes may need to be updated to add a minimum reporting timeline or a maximum number of investigations per year.</p><p>Section 232 is just one example, but the broader point applies across everything BIS does: AI agents will compress timelines, increase throughput, and strain processes designed around human operators. BIS needs to be ready for that, not just as a regulator of AI but also as a user of it.</p>]]></content:encoded></item><item><title><![CDATA[Securing AI infrastructure to prevent backdoors and sabotage]]></title><description><![CDATA[There are many open problems in preserving the integrity of model weights, training data, and algorithms.]]></description><link>https://www.the-substrate.net/p/securing-ai-infrastructure-to-prevent</link><guid isPermaLink="false">https://www.the-substrate.net/p/securing-ai-infrastructure-to-prevent</guid><dc:creator><![CDATA[Dave Banerjee]]></dc:creator><pubDate>Thu, 26 Mar 2026 16:21:41 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/2f8a641a-148b-403e-8290-8b14e231f673_2624x1856.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><a href="https://www.iaps.ai/research/ai-integrity">AI integrity</a> (which I introduced in <a href="https://www.the-substrate.net/p/why-securing-ai-model-weights-isnt">a previous post</a>) means ensuring AI systems are free from secret or unauthorized modifications that could compromise their behavior. During an intense AI race between the US and China, China would have strong incentives to sabotage American AI companies. For example, it might want to subvert American AI models by <a href="https://arxiv.org/abs/2602.04899">embedding</a> <a href="https://www.anthropic.com/research/sleeper-agents-training-deceptive-llms-that-persist-through-safety-training">backdoors</a> or <a href="https://www.lesswrong.com/posts/cn4HHdLbpJpcFQK93/how-secret-loyalty-differs-from-standard-backdoor-threats">secret</a> <a href="https://www.forethought.org/research/ai-enabled-coups-how-a-small-group-could-use-ai-to-seize-power#32-secret-ai-loyalties">loyalties</a> that serve its interests.</p><p>While nation-state actors are a major threat, a misaligned AI could also carry out integrity attacks. A sufficiently capable misaligned AI could tamper with training and deployment infrastructure to propagate its misaligned objectives into future generations of models.</p><p>Preserving AI integrity is how you defend against these threats. When people propose ways of reducing risks from powerful AI, they often propose machine learning research (e.g., <a href="https://www.lesswrong.com/w/ai-alignment">alignment</a>, <a href="https://arxiv.org/abs/2501.16496">interpretability</a>, and <a href="https://www.redwoodresearch.org/research/ai-control">control</a>) or non-technical governance proposals. The main technical agenda pitched at security-minded people so far has been <a href="https://www.rand.org/pubs/research_reports/RRA2849-1.html">securing AI model weights</a> <a href="https://davebanerjee.substack.com/p/why-steal-model-weights">against theft</a>. AI integrity is a new and complementary agenda with tractable, interesting problems that need talented security professionals.</p><h1>Quick refresher on AI integrity</h1><p>There are two types of AI integrity attacks: model sabotage and model subversion.</p><p><strong>Model sabotage means degrading an AI model&#8217;s performance by <a href="https://www.ibm.com/think/topics/data-poisoning">poisoning</a> it to be less intelligent, less agentic, less situationally aware, or less computationally efficient.</strong></p><p><strong>Model subversion means embedding malicious behaviors that activate under certain conditions or persist across all contexts.</strong> It ranges in sophistication from basic backdoors (models trained to recognize trigger phrases that activate malicious behavior, such as producing insecure code upon seeing a phrase like &#8220;&lt;TRIGGER&gt;&#8221;) to sophisticated <a href="https://www.lesswrong.com/posts/cn4HHdLbpJpcFQK93/how-secret-loyalty-differs-from-standard-backdoor-threats">secret loyalties</a> (models that autonomously <a href="https://openai.com/index/detecting-and-reducing-scheming-in-ai-models/">scheme</a> to advance an attacker&#8217;s interests without requiring specific triggers, persistently working toward the attacker&#8217;s goals across diverse situations). Today&#8217;s models lack the necessary situational awareness, intelligence, and agency to scheme on behalf of a threat actor, but I expect AIs will develop these capabilities within five years. This makes it worth preparing now.</p><p>The main method for sabotaging or subverting a model is data poisoning. An attacker can poison the pre-training data, or the post-training data, or <a href="https://www.lesswrong.com/posts/2xsNRcwLdLNp6z5bv/pre-training-data-poisoning-likely-makes-installing-secret">both</a>.</p><p>While I think data poisoning is the most important attack vector for AI integrity, there are other vectors worth considering, especially swap attacks. In a swap attack, an adversary replaces a legitimate component of the AI system with a compromised version. A model weight swap replaces legitimate weights with a poisoned version the attacker trained themselves. A <a href="https://davebanerjee.substack.com/i/177616069/system-prompt-poisoning">system prompt swap</a> could introduce a trigger phrase that activates a dormant backdoor. A <a href="https://davebanerjee.substack.com/i/177616069/rlaif">model spec swap</a> tampers with the documents used to shape a model&#8217;s values and behavioral tendencies during post-training.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VVOM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd148f4db-0ade-46f6-bc17-9f55b3216a4c_1600x1040.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VVOM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd148f4db-0ade-46f6-bc17-9f55b3216a4c_1600x1040.png 424w, https://substackcdn.com/image/fetch/$s_!VVOM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd148f4db-0ade-46f6-bc17-9f55b3216a4c_1600x1040.png 848w, https://substackcdn.com/image/fetch/$s_!VVOM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd148f4db-0ade-46f6-bc17-9f55b3216a4c_1600x1040.png 1272w, https://substackcdn.com/image/fetch/$s_!VVOM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd148f4db-0ade-46f6-bc17-9f55b3216a4c_1600x1040.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VVOM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd148f4db-0ade-46f6-bc17-9f55b3216a4c_1600x1040.png" width="1456" height="946" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d148f4db-0ade-46f6-bc17-9f55b3216a4c_1600x1040.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:946,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!VVOM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd148f4db-0ade-46f6-bc17-9f55b3216a4c_1600x1040.png 424w, https://substackcdn.com/image/fetch/$s_!VVOM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd148f4db-0ade-46f6-bc17-9f55b3216a4c_1600x1040.png 848w, https://substackcdn.com/image/fetch/$s_!VVOM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd148f4db-0ade-46f6-bc17-9f55b3216a4c_1600x1040.png 1272w, https://substackcdn.com/image/fetch/$s_!VVOM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd148f4db-0ade-46f6-bc17-9f55b3216a4c_1600x1040.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Given these threats, there are four complementary approaches to preserving AI integrity. <em>AI infrastructure security</em> <a href="https://www.iaps.ai/research/accelerating-ai-data-center-security">protects the systems, networks, and processes</a> used to develop and deploy frontier AI systems, preventing integrity attacks before they occur. <em>Data auditing</em> addresses the trustworthiness of the data by ensuring its quality, integrity, and <a href="https://www.ibm.com/think/topics/data-provenance">provenance</a>. <em>Model auditing and evaluation</em> identifies whether an AI system has been compromised after training is complete, through black-box and white-box methods. Finally, <em>AI control</em> involves detecting and blocking malicious behavior during deployment to safely and productively use a potentially untrusted model.</p><p>This post is about the first approach: AI infrastructure security.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!j-Ui!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cdb98a5-4be3-481a-a7c1-879287b196eb_1600x1040.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!j-Ui!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cdb98a5-4be3-481a-a7c1-879287b196eb_1600x1040.png 424w, https://substackcdn.com/image/fetch/$s_!j-Ui!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cdb98a5-4be3-481a-a7c1-879287b196eb_1600x1040.png 848w, https://substackcdn.com/image/fetch/$s_!j-Ui!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cdb98a5-4be3-481a-a7c1-879287b196eb_1600x1040.png 1272w, https://substackcdn.com/image/fetch/$s_!j-Ui!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cdb98a5-4be3-481a-a7c1-879287b196eb_1600x1040.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!j-Ui!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cdb98a5-4be3-481a-a7c1-879287b196eb_1600x1040.png" width="1456" height="946" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6cdb98a5-4be3-481a-a7c1-879287b196eb_1600x1040.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:946,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!j-Ui!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cdb98a5-4be3-481a-a7c1-879287b196eb_1600x1040.png 424w, https://substackcdn.com/image/fetch/$s_!j-Ui!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cdb98a5-4be3-481a-a7c1-879287b196eb_1600x1040.png 848w, https://substackcdn.com/image/fetch/$s_!j-Ui!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cdb98a5-4be3-481a-a7c1-879287b196eb_1600x1040.png 1272w, https://substackcdn.com/image/fetch/$s_!j-Ui!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cdb98a5-4be3-481a-a7c1-879287b196eb_1600x1040.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h1>Open problems</h1><p>In AI infrastructure security, the components I think are most important for preserving AI integrity are:</p><ul><li><p><strong>Model weight integrity</strong>, i.e., ensuring model weights aren&#8217;t swapped or tampered with</p></li><li><p><strong>Training data integrity</strong>, i.e., ensuring training data isn&#8217;t poisoned</p></li><li><p><strong>Data filtering algorithm security</strong>, i.e., ensuring the filters that remove poisoned data aren&#8217;t themselves compromised</p></li></ul><h2>Model weight integrity</h2><p>Model weight integrity means ensuring AI model weights remain free from unauthorized or secret modifications during both training and inference. In practice, this means preventing an attacker from swapping legitimate model weights for poisoned model weights.</p><p>Model weight integrity verification during <em>inference</em> involves periodically checking that served weights match a reference hash stored in secure, immutable cloud storage. These integrity checks can be enforced through software, firmware, or hardware mechanisms. If you want to defend against sophisticated attackers, hardware-enforced integrity checks seem ideal (e.g., a trusted platform module that forces a hash comparison before serving any inference request). You could also use the <a href="https://arxiv.org/abs/2507.02770">confidential computing stack</a> to enforce integrity checks. My best guess is that most of these are already solved in the traditional security literature, though a hardware-enforced mechanism for integrity checking may require novel R&amp;D. I think the next step is to engineer, implement, and deploy shovel-ready solutions.</p><p>Model weight integrity verification during <em>training</em> is significantly harder because weights change at every gradient step, so there&#8217;s no stable reference hash to compare against. One promising direction is deterministic training runs. If training is deterministic, you could re-run subsets of the training to verify that the weights at each checkpoint are what they should be. This connects to the broader agenda of <a href="https://aigi.ox.ac.uk/publications/verification-for-international-ai-governance/">workload verification for international AI governance</a>. Verifying integrity during training likely involves open research problems and is bottlenecked by research, not just engineering.</p><p><em>Model provenance</em> means maintaining a complete record of all transformations applied to a set of model weights over time. Ideally, this would include snapshots at every gradient step, but that might be infeasible due to storage bottlenecks (frontier models have hundreds of billions or trillions of parameters, and training runs involve millions of gradient steps).</p><p>Figuring out a practical approximation would be valuable, though. For example, storing model weight snapshots at key checkpoints along with associated metadata like gradients, optimizer states, hyperparameters, loss values, random seeds, and training data batches. It would be useful for at least two reasons:</p><ul><li><p><strong>Rollback auditing.</strong> Suppose a model was trained over the course of a year, and at month nine an attacker instills a secret loyalty. By month 12, the model is so capable that it can <a href="https://www.anthropic.com/research/alignment-faking">alignment-fake</a> its way through audits. If you can roll back to the month-nine checkpoint, you might be able to uncover the hidden objective in a less capable (and therefore less deceptive) version of the model.</p></li><li><p><strong>Future auditing.</strong> Suppose a model was trained in 2027 with a secret loyalty, and later, in 2029, a new, trusted model is trained without a secret loyalty. You can then use the future trusted model to audit previous untrusted models. The technical challenges here aren&#8217;t that severe; the bigger bottleneck is institutional. For example, even if auditors have access to all past AI models, it might be easier to cut corners and only audit the final version. Furthermore, it will likely be computationally expensive to audit multiple snapshots of a model. Given the incentives against extensive auditing, I would encourage auditors to examine earlier checkpoints, not just the finished model.</p></li></ul><p>AI companies should also add deployment controls for new models, such as requiring multi-party authorization for model deployment or independent verification that the deployed model matches authorized checkpoints. Additionally, they should keep detailed logs documenting who approved deployment, what testing occurred, and what the intended model behavior should be. I would be surprised if OpenAI, Anthropic, and Google DeepMind don&#8217;t already have something like this for public deployments,<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> but they might not have deployment controls on <a href="https://arxiv.org/abs/2504.12170">internally deployed</a> <a href="https://arxiv.org/pdf/2601.08005">models</a>. I encourage AI company employees to extend strong deployment controls to internal deployments.</p><h2>Training data integrity</h2><p>Training data integrity means ensuring that training data remains free from malicious, unauthorized, or secret modifications.</p><p><strong>Training data provenance means maintaining auditable records of all modifications, filters, and transformations applied to data.</strong> This creates an audit trail for post-incident investigation if compromised data is later discovered, and helps identify which transformations may have introduced poison. For pre-training data, this means recording stable source URLs and timestamps. For synthetic data, it means recording which model generated each sample, which is also important for preventing <a href="https://alignment.anthropic.com/2025/subliminal-learning/">subliminal learning</a> attacks.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a></p><p>Data sourced from external third parties (e.g., annotation providers like Scale AI) should be treated with heightened scrutiny, since the AI company (the buyer) wouldn&#8217;t be able to attest that the data provider (the seller) hasn&#8217;t introduced poison. It would be useful to develop techniques allowing a data seller to prove to a buyer that its data hasn&#8217;t been poisoned or tampered with. I&#8217;m not sure what practical attestation mechanisms would look like here. One possibility could be some kind of auditable <a href="https://en.wikipedia.org/wiki/Chain_of_custody">chain-of-custody</a> record.</p><p><strong>Robust access controls limit who can modify datasets, ensuring that only authorized actors can make legitimate modifications.</strong> Unauthorized actors should be barred from editing datasets entirely, and authorized users should be subject to oversight and review processes that make malicious edits harder to slip through unnoticed. Off-the-shelf access control tools are probably sufficient for this.</p><p><strong>Tamper-proofing training data focuses on protecting data that has been finalized for a training run.</strong> After all edits and transformations have been applied and the data is ready for the final training run, it should be stored in tamper-proof storage. For example, you can tamper-proof your data using immutable storage systems (e.g., blockchain-based tamper-proof storage) or write-once-read-many (WORM) drives, which prevent modification of stored data at the hardware level.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a></p><p><strong>Data redundancy protects against both accidental corruption and targeted attacks by maintaining multiple copies of training data spread across multiple data centers.</strong> Ideally, redundant copies would be tamper-proof and periodically integrity-checked (e.g., verifying that the current data matches a reference cryptographic hash or checksum stored in an isolated, secure environment).</p><p>One question worth considering is whether training data and model weights should be shared with independent third parties, like auditors. To see why this might be useful, consider a scenario where an insider at a frontier AI company poisons the post-training data to instill a secret loyalty. Later, worried about getting caught, the insider deletes the poisoned data and poisoned weights to cover their tracks. If a trusted third-party auditor already holds copies of the weights, the auditor can examine them for signs of compromise, even after the original copies are gone.</p><p>Sharing data and model weights with auditors would be especially valuable if auditing techniques improve over time.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> An auditor who receives a copy of a model&#8217;s weights in 2027 might not be able to detect a secret loyalty at the time, but by 2029, with access to more powerful AI systems, they could audit those same weights again and catch an attack that had previously been undetectable.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a></p><h2>Securing data filtering algorithms</h2><p>Data filtering algorithms are one of the most important defenses against data poisoning. If these filters are compromised, it becomes much easier for attackers to slip in poisoned data. So the confidentiality and integrity of data filtering algorithms both matter. Confidentiality matters because if attackers don&#8217;t know how the filters work, they will have a harder time crafting poisoned data that bypasses them. Integrity matters because you want to prevent <em>algorithm swap </em>attacks (replacing a legitimate filter with a more permissive one) and <em>algorithm degradation attacks</em> (directly tampering with filter development to make them inaccurate).</p><p>I think of securing data filtering algorithms as a subset of the larger problem of securing algorithmic secrets, which seems like one of the most important and underexplored security problems in AI. I&#8217;d be excited about more work here.</p><p><strong>Information compartmentalization means separating teams working on different filtering stages so that no single employee understands the complete pipeline.</strong> I don&#8217;t know whether this makes sense to do in practice. Separating teams makes each team significantly less productive because they can&#8217;t cross-pollinate ideas and coordinate. And there&#8217;s a risk that compartmentalization makes it harder for anyone to check that the weaknesses of different filters aren&#8217;t too similar. If compartmentalization actually worsens the quality of the data filters, it might make integrity attacks easier.</p><p><strong>Algorithm integrity controls prevent algorithm swap or degradation attacks through multi-party authorization for updates and tamper-proof audit logs documenting all changes.</strong> I also don&#8217;t know whether this measure makes sense as stated. Multi-party authorization introduces friction and bureaucracy, though it&#8217;s fairly standard for merging proposed code changes into production software, so maybe the friction is tolerable. And audit logs for every change to a data filter might produce an overwhelming volume of logs that nobody actually reviews.</p><p><strong>Canary testing means intentionally planting known-malicious samples that legitimate filters should catch.</strong> If these samples bypass the filters, it is clear that the filters have been compromised. Because AI models are complicated and can relatively easily be injected with specific narrow backdoors, defenders can craft highly obscure, but benign, backdoors that serve as fingerprints for a model or algorithm. (An example of a benign backdoor is triggering the filter to output &#8220;CANARY_VERIFIED&#8221; when detecting the canary test.) For example, if a data filter is itself a large language model, the defender could train a benign backdoor into it as a fingerprint. If the filter is swapped, the fingerprint disappears, signaling that something is wrong.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a></p><p>Canary testing might not always be the best approach if simpler integrity verification methods suffice. For example, you could potentially fingerprint models using <a href="https://en.wikipedia.org/wiki/Locality-sensitive_hashing">locality-sensitive hashing</a>, a technique that maps similar inputs to similar hash values, so that minor modifications to a model produce detectably different but structurally related hashes. The advantage of canary testing is that data filters are often large black-box machine learning models, and hashing them can be noisy and unreliable if the model weights are updated frequently. A canary test is more robust because it tests functional behavior, which carries through modifications to the filtering algorithm itself.</p><h1>Conclusion</h1><p>Many of these security measures impose significant productivity costs. The best security architecture is useless if teams work around it because it slows development. Thus, I&#8217;m excited about research into making these implementations frictionless and developer-friendly.</p><p>That said, there are also reasons to think the productivity cost will shrink over time:</p><ul><li><p>If most AI research and software engineering is eventually done by AI agents, the cybersecurity measures discussed in this post will impose a much smaller productivity cost than they do today. Many of these measures are annoying for humans because they add friction, context-switching, and cognitive overhead. AI agents won&#8217;t have the same bottlenecks. They can navigate complex system architectures quickly, handle multi-party authorization workflows without frustration, and operate within strict access control regimes without the slowdowns that make human engineers route around security measures. <strong>If AI agents are doing most of the engineering, the productivity cost of security drops substantially.</strong></p></li><li><p>Coding agents can help automate the engineering work required to implement these cybersecurity measures in the first place. Organizational norms and physical security, on the other hand, are much harder to automate. <strong>Therefore AI companies should consider differentially accelerating these harder-to-automate tasks now, while counting on AI to help with the software-side security later.</strong></p></li></ul><p>One example of a hard-to-automate task is addressing insider threats. Insider threats are particularly challenging because insiders already have legitimate access to training infrastructure and detailed system knowledge. They can bypass many perimeter-oriented defenses. Frontier AI developers should establish stronger insider threat programs. This is especially important for senior insiders. If executive leadership or team leads are themselves trying to instill secret loyalties, infrastructure security measures become much harder to enforce, since these leaders may have override authority, can justify unusual access as legitimate business needs, and may influence the very review processes meant to catch tampering. I don&#8217;t have a great sense of how well insider threat programs hold up when the compromised insiders are senior staffers, and that makes me nervous.</p><p>If you&#8217;re a security engineer interested in any of the problems described in this post, I&#8217;d love to hear from you. I&#8217;m particularly interested in people (1) implementing these solutions at frontier AI companies, (2) doing more threat modeling on how someone might realistically instill a secret loyalty and what a security stack should look like to prevent it, (3) developing a security standard for AI integrity (similar to the <a href="https://sl5.org/sl5-standard">SL5 Standard</a>), and (4) researching any unsolved problems listed in this post. If any of this is up your alley, reach out to me at <a href="mailto:dave@iaps.ai">dave@iaps.ai</a>.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Though it appears that xAI decides which models to deploy &#8220;<a href="https://www.theverge.com/ai-artificial-intelligence/878761/mass-exodus-at-xai-grok-elon-musk-restructuring">via an all-company group chat on X with [Elon] Musk in it</a>&#8221;. This is very bad, and I hope that xAI will implement a more standardized and secure method for deploying new models.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Subliminal learning occurs when a teacher model with some trait T generates innocuous data D unrelated to T. When a student model is trained on D, it also acquires trait T. However, subliminal learning only works if the student and teacher models are derived from the same base model. In the context of data poisoning defenses, this means frontier AI developers should treat data generated by the model being trained with special scrutiny, as the subliminal learning effect may enable covert backdoor propagation.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>To make this more concrete, my understanding of how post-training typically works is that individual researchers design their own reinforcement learning environments and experiments and curate their own data. When the final training run comes along, they package all of that up and merge it into one big run. The pipeline between an individual researcher finalizing their reinforcement learning environment and that data being fed into the final training run is exactly the kind of handoff that should be tamper-proof, because there shouldn't be any edits at that point.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Note that sharing model weights and training data with additional parties creates security risks, such as increased risk of <a href="https://davebanerjee.substack.com/p/why-steal-model-weights">model weight theft</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>Though AI companies would hesitate to share their sensitive IP, like model weights, with a third party. As an alternative, the AI company might want to only share the hash of the model weights. However, if the attacker deletes the poisoned data and poisoned weights to cover their tracks, the auditor would have no way of recovering the model weights from the hash. So it seems like sharing model weight hashes with auditors is not sufficient to defend against this kind of attack.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>The same logic applies to model weights. If you're worried about a model swap attack, you could train in a benign backdoor as a fingerprint. If you observe in production that the backdoor is no longer present, then your model has likely been tampered with.</p></div></div>]]></content:encoded></item><item><title><![CDATA[A sketch of market-based export controls]]></title><description><![CDATA[Market forces could make export enforcement more adaptive, efficient, and predictable.]]></description><link>https://www.the-substrate.net/p/a-sketch-of-market-based-export-controls</link><guid isPermaLink="false">https://www.the-substrate.net/p/a-sketch-of-market-based-export-controls</guid><dc:creator><![CDATA[Onni Aarne]]></dc:creator><pubDate>Fri, 20 Mar 2026 15:42:32 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/8b11f39f-b05b-4748-b480-8bb7f0921873_2624x1856.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Bloomberg <a href="https://www.bloomberg.com/news/articles/2026-03-05/us-drafts-rules-for-sweeping-power-over-nvidia-s-global-sales">recently reported</a> that the Trump administration is considering a new global AI chip export control framework. The proposed rule appears to have been <a href="https://www.bloomberg.com/news/articles/2026-03-14/us-withdraws-draft-rule-that-called-for-global-ai-chip-permits">withdrawn</a> for now, but still likely provides an instructive example of the approaches the administration is considering. The framework appeared to focus on authorizing different companies to conduct imports at different scales, rather than requiring licenses for individual shipments. This could help address problems like smuggling by blocking sales to likely smugglers while still being relatively lightweight. But its global scope has received pushback for potentially subjecting all substantial AI chip exports to US government approval. This would position the government as a kingmaker and potentially tie up large compute investments in government-to-government negotiations.</p><p>This is a tricky dynamic: Inserting more US government discretion into which companies get to buy large quantities of chips seems like one of the only ways to really counter issues like smuggling and Chinese remote access to chips. But that same discretion could easily lead to delays, regulatory uncertainty, and mission creep.</p><p>However, it might be possible to sidestep this problem by delegating some discretion and enforcement responsibility to private companies with appropriately aligned incentives. This is the topic of our new working paper: <a href="https://www.iaps.ai/research/export-auditors-as-market-powered-export-enforcement">Export Auditors as Market-Powered Export Enforcement</a>.</p><h1>Fixed rules don&#8217;t work, but case-by-case decisions can be even worse</h1><p>There is a basic tradeoff in export control policy (and almost all policy). The government can move along a spectrum between two options:</p><ol><li><p>The government can set fixed, stable, easily interpretable rules that change slowly. The strongest version of this is for Congress to pass precise regulatory rules. More commonly, this would look like regulatory agencies setting clear rules and rarely changing them.</p></li><li><p>The government can rely more on case-by-case judgments to decide what to allow. In the export control case, this would typically mean imposing a license requirement on a broad class of transactions, and granting licenses on a case-by-case basis. This can be more adversarially robust and dynamic, but creates substantial regulatory uncertainty, and potential for bias and regulatory capture. These approval or licensing processes are also often slow and costly.</p></li></ol><p>As long as they are not too restrictive, fixed rules tend to benefit business because they reduce regulatory uncertainty and ensure fairness. But rigidity often means bad actors will find and exploit loopholes, and the rules can easily become outdated as the relevant industry changes.</p><p>Since 2022, US AI chip controls have been relatively close to the fixed, stable end of the spectrum, with approximately annual updates. However, this led to NVIDIA running circles around the government by designing around technical thresholds, and appears to have enabled fairly <a href="https://www.cnas.org/publications/reports/countering-ai-chip-smuggling-has-become-a-national-security-priority">substantial</a> <a href="https://www.reuters.com/world/china/chinas-deepseek-trained-ai-model-nvidias-best-chip-despite-us-ban-official-says-2026-02-24/">smuggling</a> of chips into China through Southeast Asia.</p><p>In 2025, the Trump administration moved in a more dynamic, case-by-case direction. This had notable upsides: NVIDIA&#8217;s repeated efforts to simply design around technical thresholds were finally halted because the administration effectively gave up on setting clear technical thresholds and moved to blocking individual chip models (most importantly the NVIDIA H20) using &#8220;is-informed&#8221; letters. The deals with the United Arab Emirates and Saudi Arabia also seem to have relatively successfully promoted exports of US chips while securing investment into the US.</p><h2>Regulatory uncertainty</h2><p>But there have been clear downsides. The administration has been vacillating on which chips and how many to sell to China, leading NVIDIA to restart and then again interrupt and then again <a href="http://reuters.com/business/nvidia-sales-opportunity-blackwell-rubin-chips-more-than-1-trillion-by-2027-2026-03-17/">restart</a> production of the H200 chip, and presumably forcing to NVIDIA engineers to work around the clock, so far fruitlessly, developing new chip variants to chase uncertain government approval. The deals with the United Arab Emirates and Saudi Arabia have succeeded in some ways, but heavy government involvement has also caused serious <a href="https://www.wsj.com/politics/policy/nvidia-trump-uae-chip-deal-delay-c49aaa5c">delays</a>.</p><p>So far this uncertainty has been largely restricted to sales to China. But any rule like the one Bloomberg reported would potentially extend this uncertainty globally, if any company wishing to export or import significant quantities of chips would need government approval. Even discussion of rules like this can dampen investment, despite the Department of Commerce <a href="https://x.com/CommerceGov/status/2029670081915941164?s=20">emphasizing</a> that it is &#8220;committed to promoting secure exports of the American tech stack&#8221;.</p><p>I have no doubt that the Department&#8217;s commitment is sincere, but it might not be sufficient. Large chip deployments, like the 200,000 B300s that represent the proposed rule&#8217;s highest tier, have to be planned years in advance. If there&#8217;s a substantial chance that a future rule would tie up such an investment for months in government-to-government negotiations, or could even be blocked to create leverage in some unrelated trade dispute, potential importers and exporters would likely simply choose to invest less in these projects due to slower expected returns and elevated risk.</p><p>And uncertainty goes further back than just Google or NVIDIA: Part of the reason the current chip shortage is so severe is that chip fabricators like TSMC are <a href="https://stratechery.com/2026/tsmc-risk/">wary</a> of making massive investments in new fab capacity if there&#8217;s a risk that, several years from now when the fabs come online, the market won&#8217;t be quite as massive as they expected. The semiconductor industry has grown tremendously cautious after <a href="https://www.embedded.com/semiconductor-bust-boom-cycles/">decades</a> of boom-bust cycles. High-discretion rulemaking like this creates precisely the uncertainty and underinvestment that is holding back the US AI industry. And TSMC&#8217;s suppliers are similarly wary about investing in new equipment-making capacity without guarantees that TSMC would buy all of the equipment they would make. It all compounds along the supply chain.</p><h2>Overreach and mission creep</h2><p>Inserting the government as a gatekeeper to all significant chip exports would also tempt the government to use this as leverage to advance largely unrelated political goals. In the case of exports to China, where this leverage already exists, this has enabled an allegedly <a href="https://www.lawfaremedia.org/article/trump-s-illegal-ai-chip-export-controls--and-who-can-challenge-them">unconstitutional</a> attempt to extract a 25% <em>de facto</em> export tax. More generally, various forms of discretionary government leverage over companies always have the potential for abuse, such as the <a href="https://judiciary.house.gov/sites/evo-subsites/republicans-judiciary.house.gov/files/evo-media-document/Biden-WH-Censorship-Report-final.pdf">pressure</a> on social media companies seen during the Biden administration. And indeed many of the US companies most likely to deploy more than 200,000 B300s worth of compute abroad are also social media companies.</p><p>It&#8217;s tempting to treat this as a partisan issue based on which administration one trusts, but administrations ultimately turn over relatively quickly. If you create a particular system now, your political opponents will probably control that system sooner rather than later. And precisely this whiplash between administrations creates the kind of long-term regulatory uncertainty that can dampen, say, decisions to break ground on new chip fabs, which take years to pay back.</p><h1>A market-based solution</h1><p>So are you stuck choosing between fixed rules that will simply get circumvented, or flexible case-by-case decision-making that creates costly regulatory uncertainty and mission creep?</p><p>Our new working paper discusses one way to dodge this dynamic, by delegating key parts of export control enforcement to for-profit companies. This would create a competitive ecosystem of private actors with the flexibility to adapt and make case-by-case judgments, while still having predictable incentives that investors can plan around.</p><p>For the sake of simplicity, I will assume that the government mainly wants to prevent chips from going to foreign adversaries like China, but otherwise wants to export as much as possible. From this follows the problem: You want to sell to as many companies as possible, but you don&#8217;t want to sell to companies that would sell the chips to your foreign adversaries. But how can you tell which companies will do that?</p><p>In most cases exporters are nominally responsible for making these assessments, but as long as the importer can do a moderately convincing job of pretending to be a legitimate company, the exporter is not liable if the chips are diverted, so they have no incentive to dig deeper. The proposed rule reported by Bloomberg would task the Bureau of Industry and Security (BIS) with making these judgments, but this could be slow, and BIS itself has been known to make mistakes, such as <a href="https://www.congress.gov/crs_external_products/R/PDF/R48642/R48642.2.pdf">leaving</a> a major Chinese semiconductor manufacturing equipment maker on its Validated End-User list from 2013 to 2024.</p><p>Our proposal would improve on this by designating two complementary types of entities, which would need to be involved in relevant export transactions:</p><ol><li><p><em>Export auditors</em> would be responsible for detecting whether chips are diverted to foreign adversaries. This would complement BIS&#8217;s limited enforcement capacity.</p></li><li><p>Surety providers, essentially insurance providers, would issue &#8220;surety bonds&#8221;, i.e., agree to pay the fine to BIS if diversion happens. This positions them as gatekeepers, analogous to BIS deciding whether to grant an export license.</p></li></ol><p>For some risky class of exports, BIS would then require the buyer to contract with an approved auditor and possibly to get a surety bond from an approved surety provider.</p><h2>Export auditors</h2><p>The purpose of the export auditor is simply to make it very likely that if diversion happens, it will, at least eventually, be discovered.</p><p>In practice, for detecting AI chip diversion, the job of the export auditors would be simple: Do random inspections of data centers and check that chips are still where they were supposed to be. Given that chips are generally housed in large data centers in huge quantities, it would be very quick and cheap for auditors to inspect the vast majority of chips.</p><p>Export auditors would also be free to innovate on their methodologies to achieve a given detection rate and speed more efficiently, e.g. by using technical <a href="https://www.iaps.ai/research/location-verification-for-ai-chips">location verification</a> mechanisms or other technology. BIS would be responsible for approving and overseeing the auditors, ensuring that their methodologies and execution are rigorous, and issuing the ultimate fines for violations detected by the export auditors.</p><p>BIS currently conducts very few inspections like this. For the most part, the relevant sales to, say, Southeast Asia don&#8217;t require any kind of license, so BIS doesn&#8217;t even know the sale happened or where the chips are supposed to be, much less are they checking that they still are there.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Qu_e!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c996d06-d43b-4287-83d7-521cef0a6296_1600x776.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qu_e!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c996d06-d43b-4287-83d7-521cef0a6296_1600x776.png 424w, https://substackcdn.com/image/fetch/$s_!Qu_e!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c996d06-d43b-4287-83d7-521cef0a6296_1600x776.png 848w, https://substackcdn.com/image/fetch/$s_!Qu_e!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c996d06-d43b-4287-83d7-521cef0a6296_1600x776.png 1272w, https://substackcdn.com/image/fetch/$s_!Qu_e!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c996d06-d43b-4287-83d7-521cef0a6296_1600x776.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qu_e!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c996d06-d43b-4287-83d7-521cef0a6296_1600x776.png" width="1456" height="706" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5c996d06-d43b-4287-83d7-521cef0a6296_1600x776.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:706,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Qu_e!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c996d06-d43b-4287-83d7-521cef0a6296_1600x776.png 424w, https://substackcdn.com/image/fetch/$s_!Qu_e!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c996d06-d43b-4287-83d7-521cef0a6296_1600x776.png 848w, https://substackcdn.com/image/fetch/$s_!Qu_e!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c996d06-d43b-4287-83d7-521cef0a6296_1600x776.png 1272w, https://substackcdn.com/image/fetch/$s_!Qu_e!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c996d06-d43b-4287-83d7-521cef0a6296_1600x776.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><em>A diagram of how the export auditor fits into the sales process. Surety not included.</em></p><h2>Surety providers as gatekeepers</h2><p>In many cases, auditing alone will suffice to deter diversion. However, some diverters will not be deterred, for example, because they intend to escape to China with the chips, beyond the reach of local or US law enforcement, so they do not care if the auditor finds out some months later.</p><p>To address these undeterred smugglers, some gatekeeper would be needed to block them from obtaining chips in the first place. This is where the surety provider comes in. A surety bond is essentially a three-party contract between the chip buyer, BIS, and the surety provider, in which the surety provider agrees to pay BIS the fine if the buyer is later found to have violated export controls, while the buyer agrees to pay that fine to the surety provider instead. BIS could require buyers to obtain such a surety bond before proceeding with certain transactions.</p><p>Importantly, the primary purpose of this is not to ensure that BIS gets the money. Rather, it is to give the surety provider an incentive to do thorough due diligence up front to ensure the buyer is unlikely to violate export controls. If the surety provider considers the prospective buyer suspicious, and the buyer cannot convince them otherwise, the surety provider can either only offer the bond for a high price, or more likely, not offer it at all. If the surety bonds seem too hard to get, BIS could even allow the bonds to only cover part of the fine to tune the bond price level.</p><h2>Incentive alignment</h2><p>The primary customer for both the auditors and the surety providers would be the importers. This would give both export auditors and surety providers an incentive to create an ecosystem where large volumes of exports happen (giving them more customers) while reducing rates of diversion. And both of them would be competing to make this process as frictionless as possible for the importers and exporters. At the same time, export auditors would compete for continued BIS approval by driving up their detection rates. (BIS should ideally do some of its own random inspections to have its own independent estimate of diversion rates.)</p><p>A competitive market of auditors and surety providers would also ensure that no single actor can unilaterally block a transaction, creating more predictability for exporters. Prospective buyers would also still have the option of going through BIS&#8217;s licensing process if they prefer, but working with surety providers would likely be a much smoother experience for most legitimate buyers.</p><h1>How realistic is this?</h1><p>The basic pattern of for-profit auditors is extremely common: Most forms of compliance audits and inspections in most industries are already performed by for-profit companies, often overseen by some regulatory body. The Occupational Safety and Health Administration identifies <a href="https://www.osha.gov/nationally-recognized-testing-laboratory-program">Nationally Recognized Testing Laboratories</a> to do safety testing; the Public Company Accounting Oversight Board oversees US financial auditors; and in the European Union, regulators designate Notified Bodies to certify safety of medical devices and other products.</p><p>These regulations have created a class of companies that would be well-positioned to enter the export-auditing business if BIS chooses to create it. Two types of companies seem particularly promising:</p><ol><li><p>Professional services firms such as the <a href="https://en.wikipedia.org/wiki/Big_Four_accounting_firms">Big Four</a> already perform various types of audits, which can include physical oversight such as overseeing inventory counts. These companies likely also have existing relationships with many of the relevant companies, making this a smooth expansion of existing auditing activities.</p></li><li><p>Testing, inspection, and certification (TIC) companies like Bureau Veritas and Intertek already perform numerous types of tests and inspections as &#8220;recognized laboratories&#8221; and &#8220;notified bodies&#8221;. Simply checking that chips are where they are supposed to be would be simpler than what they usually do, but they could be well-placed to perform more complex inspections.</p></li></ol><p>There are also some startups and tech companies such as <a href="https://lucidcomputing.ai/">Lucid Computing</a> and <a href="https://www.geocomply.com/">GeoComply</a> developing relevant technical solutions that could act as export auditors or license their technology to export auditors. Even chip companies like NVIDIA would itself be incentivized to create technical solutions to make auditing easier and its products effectively cheaper for importers. Indeed, NVIDIA is already <a href="https://www.reuters.com/business/nvidia-builds-location-verification-tech-that-could-help-fight-chip-smuggling-2025-12-10/">piloting such solutions</a>.</p><p>Surety bonds are a slightly more unusual instrument, but are already <a href="https://www.cbp.gov/sites/default/files/assets/documents/2024-Feb/FINAL_A%20Guide%20for%20the%20Public_How%20CBP%20Sets%20Bond%20Amounts%20(February%202024)_0.pdf">used</a> by Customs and Border Protection to ensure tariffs are paid. They are also widely used in <a href="https://web.archive.org/web/20150416235944/http://www.crossagency.com/crossagency/tempFile/legal_basics.pdf">construction</a> to ensure that projects will be finished and maintained, or cleaned up, if the contractor goes out of business.</p><p>Surety bonds are provided for other purposes by regulated insurance companies, and the same insurance companies could provide these export sureties. Conceivably even the exporter itself could be allowed to issue the surety bond, the provider just needs to be large and liquid enough to be trusted to actually pay the fine if it comes to it.</p><p>BIS could likely implement this proposal purely using its existing authorities: BIS can specify practically anything as a condition for a license exception. This means that BIS could create a license requirement for particular countries, or globally (as the rule under current discussion would apparently do) but then create an exception to that license requirement for exports that are protected by an export auditor and a surety bond. Importantly, making this a license exception means that exporters would not need permission from BIS: As long as the exporter (or importer) has secured an auditor and a surety bond, they qualify for the license exception and can proceed, no questions asked.</p><h1>Concluding thoughts</h1><p>The maximal version of this idea would apply it globally, and this could be quite affordable once there is a mature ecosystem of sureties and auditors. However, while this proposal is based on existing patterns, the application to export controls would be novel, and there are currently no companies offering export auditing or export surety bonds. Likely the approach should be piloted in a more limited capacity, for example by imposing license requirements for particular Southeast Asian countries, but creating exceptions if the buyer is either:</p><ol><li><p>A major US-headquartered cloud provider, or</p></li><li><p>Has hired an export auditor and secured a surety bond.</p></li></ol><p>Export auditors could also be quite useful on their own, without the surety bonds. The surety bond element could be introduced if the auditors prove an insufficient deterrent. The introduction of auditors would also make it very easy to tell whether that deterrence is successful. This would allow the idea to be tested on a limited scale, and scaled up if successful.</p><p>Notably, BIS could likely do all of this using its existing authorities and resources, as there is little to no limit on what the conditions of a license exception can be.</p><p>In principle, market-powered solutions might be effective for nearly all export controls, but they are exceptionally well suited to the chip export enforcement problem, because it&#8217;s very easy for BIS to specify the goal (no diversion to foreign adversaries), and it is relatively straightforward to specify auditing schemes that will reliably detect whether the goal is reached (just <a href="https://x.com/DavidSacks/status/1923427609318215917">count server racks</a>). For the same reasons, this approach would likely generalize well to enforcing location-based restrictions on any high-value physical goods.</p><p>This approach could also be extended to verify compliance with more specific export rules regarding allowed end uses, such as detecting whether semiconductor manufacturing equipment is being used to produce a different type of chip than authorized, or whether chips have been connected together into a <a href="https://www.bloomberg.com/news/articles/2026-03-02/us-mulls-capping-nvidia-h200-sales-at-75-000-per-chinese-customer">larger cluster</a> than permitted.</p><p>It would be more challenging to extend a market-based approach to cases where the behavior being audited is less physically apparent. For example, verifying whether a particular set of chips in Malaysia are <a href="https://www.wsj.com/tech/chinas-bytedance-gets-access-to-top-nvidia-ai-chips-d68bce3a">being rented to Chinese users</a> might be of great interest to BIS, but would be tricky to verify, at least without massive privacy and security issues.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> At the same time, conventional BIS enforcement would suffer from the same obstacles, so opening this up to the market could be the best bet for finding a good solution. It would be useful for BIS to at least invite prospective auditors and importers to propose how this verification could be done.</p><p>As AI transforms the world, governments are unlikely to be able to keep up on their own. It is necessary to explore <a href="https://www.hyperdimensional.co/p/on-private-governance">new ways</a> to allow AI-enabled companies to innovate solutions to governance challenges as fast as AI itself poses those challenges.</p><p>A lot of work still needs to be done to get there. Before our proposal could even be piloted, BIS would likely need some &#8220;anchor auditors&#8221; and &#8220;anchor sureties&#8221; who have said they would be interested in providing this service if BIS creates the market. While our working paper is a lot more detailed than this piece, there is still a decent amount of messy policy detail that needs to be figured out, especially related to sureties. We are publishing the working paper to encourage discussion of the best ways to implement ideas like this. If you have feedback, or would like to be involved in making this happen, please reach out to us!</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>To be clear, such remote access is currently legal, but it is in part legal because rules regarding remote access would currently be relatively difficult to enforce.</p></div></div>]]></content:encoded></item><item><title><![CDATA[China is making strides in etching machines for memory]]></title><description><![CDATA[AMEC&#8217;s current etching machines can&#8217;t support China&#8217;s high-bandwidth memory efforts, but its next flagship likely can.]]></description><link>https://www.the-substrate.net/p/china-is-making-strides-in-etching</link><guid isPermaLink="false">https://www.the-substrate.net/p/china-is-making-strides-in-etching</guid><dc:creator><![CDATA[Hamish Low]]></dc:creator><pubDate>Tue, 17 Mar 2026 11:49:00 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/ec4cdff2-2b00-4b48-8526-3f73880b34b1_2624x1856.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>This is the first piece in a series exploring some of the key semiconductor manufacturing equipment that China needs to indigenously produce high-bandwidth memory, perhaps the most important bottleneck to its efforts at making AI chips.</em></p><p>For China to manufacture high-bandwidth memory (HBM)&#8212;a <a href="https://epoch.ai/data-insights/b200-cost-breakdown">key component</a> of advanced AI chips&#8212;at scale, it must overcome a complex web of dependencies on non-Chinese semiconductor manufacturing equipment. One key bottleneck is in advanced etching machines, crucial for producing the dense memory cells that latest-generation HBM requires. China&#8217;s best machine here, AMEC&#8217;s Primo UD-RIE, is likely six to eight years behind the frontier, though AMEC is working on a successor that could narrow the gap to two to three years.</p><p>Whether AMEC can deliver matters greatly for China&#8217;s ability to scale HBM and therefore broader AI chip production. CXMT, China&#8217;s leading HBM producer, likely has enough imported etching machines to produce HBM3&#8212;two generations behind the frontier but still significant&#8212;but due to export controls its ability to keep scaling HBM production through 2026 and into 2027 will increasingly depend on domestic Chinese etching machines. Currently the Primo UD-RIE is not good enough for HBM3 production, but the promised successor likely would be.</p><p>AMEC has been growing rapidly, including with its R&amp;D spending, and is working along established pathways of technical development. Because of this, I expect it will be able to largely solve this bottleneck to China&#8217;s HBM production. In this post, I first explain the basics of memory and etching, then examine the Primo UD-RIE and AMEC, and finally conclude with what the analysis means for China&#8217;s overall AI chipmaking efforts.</p><h1>China is advancing in etching but remains years behind the frontier</h1><p>The Primo UD-RIE is made by Advanced Micro-Fabrication Equipment Inc. (AMEC), a leading Chinese provider of etch and deposition equipment.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> I could find only a single grainy image of the UD-RIE:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!lY0_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F072f5bd8-7bc9-4243-ae2a-4c6b0f5f7ae5_671x508.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!lY0_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F072f5bd8-7bc9-4243-ae2a-4c6b0f5f7ae5_671x508.png 424w, https://substackcdn.com/image/fetch/$s_!lY0_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F072f5bd8-7bc9-4243-ae2a-4c6b0f5f7ae5_671x508.png 848w, https://substackcdn.com/image/fetch/$s_!lY0_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F072f5bd8-7bc9-4243-ae2a-4c6b0f5f7ae5_671x508.png 1272w, https://substackcdn.com/image/fetch/$s_!lY0_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F072f5bd8-7bc9-4243-ae2a-4c6b0f5f7ae5_671x508.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!lY0_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F072f5bd8-7bc9-4243-ae2a-4c6b0f5f7ae5_671x508.png" width="671" height="508" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/072f5bd8-7bc9-4243-ae2a-4c6b0f5f7ae5_671x508.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:508,&quot;width&quot;:671,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!lY0_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F072f5bd8-7bc9-4243-ae2a-4c6b0f5f7ae5_671x508.png 424w, https://substackcdn.com/image/fetch/$s_!lY0_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F072f5bd8-7bc9-4243-ae2a-4c6b0f5f7ae5_671x508.png 848w, https://substackcdn.com/image/fetch/$s_!lY0_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F072f5bd8-7bc9-4243-ae2a-4c6b0f5f7ae5_671x508.png 1272w, https://substackcdn.com/image/fetch/$s_!lY0_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F072f5bd8-7bc9-4243-ae2a-4c6b0f5f7ae5_671x508.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><em>A visualisation of the UD-RIE from <a href="https://www.amec-inc.com/en/index/Lists/show/catid/29/id/605.html">the product page on AMEC&#8217;s website</a>.</em></p><p>AMEC describes the UD-RIE as</p><blockquote><p>a high-end capacitive coupled plasma (CCP) etch system developed by AMEC based on its own IP. Specifically designed for the most critical high aspect ratio (HAR) dielectric etching process for memory device fabrication.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a></p></blockquote><p>That is a rather dense technical description, so let me walk through what it actually means.</p><h2>Memory is essential to AI chips</h2><p>A logic chip, such as an AI chip, processes information, with the results stored and accessed in memory chips. Taking information from memory, processing it, and putting it back is the core loop that AI chip makers want to speed up to push computational performance.</p><p>AI&#8217;s demand for memory is voracious, with prices in some cases <a href="https://www.theregister.com/2026/02/02/dram_prices_expected_to_double/">expected to double just in the first quarter of this year</a>, pushing up costs for consumer electronics such as laptops and smartphones.</p><p>Memory comes in various flavours that trade off speed against capacity and other factors. The most important here is dynamic random-access memory (DRAM). DRAM is a workhorse that uses a simple structure to be scalable, fast, and relatively cheap. The trade-off is that DRAM is volatile&#8212;it loses its stored data over time or when it loses power&#8212;while other kinds of memory are persistent but slower or costlier.</p><p>Each cell of DRAM has just two components, a transistor and a capacitor. A memory cell stores a bit of information (a one or a zero) and retrieves it when needed. The capacitor stores this information as electrical charge. The transistor controls access to the capacitor. When the information is needed, the transistor opens the capacitor, and by measuring how many electrons flow out the value can be retrieved&#8212;if it was full of electrons it was a one, if it was empty it was a zero.</p><p>Today&#8217;s AI chips are so memory-hungry that the DRAM used in phones and laptops is insufficient. It simply does not offer enough capacity for frontier AI models and the data that goes along with them. The solution is high-bandwidth memory, or HBM. Making HBM involves placing specialized DRAM chips on top of one another&#8212;up to 16 in the latest generation&#8212;to form an HBM stack. This stack is placed alongside the logic die as close as possible, since distance matters even at such miniature scales. The magic of HBM is how these DRAM chips are connected with through-silicon vias&#8212;wires cutting through the layers of DRAM to form vertical communication highways&#8212;that allow as much information as possible to flow to and from the memory.</p><h2>DRAM is a likely bottleneck to China&#8217;s HBM production</h2><p>The building block of HBM is high-quality DRAM, which packs as many cells as possible into the smallest area. Producing large volumes of advanced DRAM requires expensive equipment and sophisticated R&amp;D. The market has historically been highly commoditized, with huge boom-bust cycles that have whittled down the number of global players to just three: Samsung and SK Hynix of South Korea and Micron of the US. Though CXMT has been gaining ground, it is still <a href="https://www.reuters.com/world/asia-pacific/chinas-cxmt-eyes-42-billion-shanghai-listing-fund-dram-expansion-2025-12-31/">sitting at 4% (and growing) market share in mid-2025</a>, versus the 90% of the big three combined.</p><p>Producing cutting-edge DRAM poses distinct challenges compared to other areas of chip making. For a logic chip, the greatest difficulty is cramming ever more transistors&#8212;the tiny switches that flip between ones and zeroes&#8212;into each square inch. The main bottleneck for doing so is lithography, where extremely precise light is used to create patterns on the silicon wafer to form these minuscule transistors. With DRAM, the hurdle is instead the capacitors that store electrical charge.</p><p>As capacitors were shrunk ever smaller, storing enough electrons in them to detect the tiny signal fluctuations when reading a one or zero became increasingly difficult. Capacitors could keep shrinking horizontally only by stretching vertically to maintain enough electron storage. Capacitors therefore now look like extremely skinny tubes stretching up from the chip surface. This verticality presents different challenges from shrinking transistors, and the key process for making these tall, fragile capacitors is etching.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FTLl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3ef62e0-35f2-4b5d-bbd2-e177b9f209c8_430x236.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FTLl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3ef62e0-35f2-4b5d-bbd2-e177b9f209c8_430x236.png 424w, https://substackcdn.com/image/fetch/$s_!FTLl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3ef62e0-35f2-4b5d-bbd2-e177b9f209c8_430x236.png 848w, https://substackcdn.com/image/fetch/$s_!FTLl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3ef62e0-35f2-4b5d-bbd2-e177b9f209c8_430x236.png 1272w, https://substackcdn.com/image/fetch/$s_!FTLl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3ef62e0-35f2-4b5d-bbd2-e177b9f209c8_430x236.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FTLl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3ef62e0-35f2-4b5d-bbd2-e177b9f209c8_430x236.png" width="430" height="236" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c3ef62e0-35f2-4b5d-bbd2-e177b9f209c8_430x236.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:236,&quot;width&quot;:430,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!FTLl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3ef62e0-35f2-4b5d-bbd2-e177b9f209c8_430x236.png 424w, https://substackcdn.com/image/fetch/$s_!FTLl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3ef62e0-35f2-4b5d-bbd2-e177b9f209c8_430x236.png 848w, https://substackcdn.com/image/fetch/$s_!FTLl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3ef62e0-35f2-4b5d-bbd2-e177b9f209c8_430x236.png 1272w, https://substackcdn.com/image/fetch/$s_!FTLl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3ef62e0-35f2-4b5d-bbd2-e177b9f209c8_430x236.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p><em>This diagram, courtesy of <a href="https://filecache.investorroom.com/mr5ir_lamresearch2/1436/Lam%20Research%202025%20Investor%20Day%20FINAL2.pdf">Lam Research</a>, shows the evolution of DRAM layouts. Notice the tall skinny capacitors in 6F&#178; and how these become even skinnier and denser into 4F&#178;.</em></p><h2>Etching is vital to producing advanced memory</h2><p>Etching is a core process in both logic and memory chip production. Starting with a silicon wafer, the process alternates between depositing material, using photolithography to sketch patterns, and using etch machines to remove unwanted material. By repeating this cycle, along with numerous other steps such as cleaning and measurement, the structure of a chip is built up.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!y1I7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2050cf75-28a7-48f9-89b8-3de0d916f9d9_958x419.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!y1I7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2050cf75-28a7-48f9-89b8-3de0d916f9d9_958x419.png 424w, https://substackcdn.com/image/fetch/$s_!y1I7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2050cf75-28a7-48f9-89b8-3de0d916f9d9_958x419.png 848w, https://substackcdn.com/image/fetch/$s_!y1I7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2050cf75-28a7-48f9-89b8-3de0d916f9d9_958x419.png 1272w, https://substackcdn.com/image/fetch/$s_!y1I7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2050cf75-28a7-48f9-89b8-3de0d916f9d9_958x419.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!y1I7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2050cf75-28a7-48f9-89b8-3de0d916f9d9_958x419.png" width="958" height="419" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2050cf75-28a7-48f9-89b8-3de0d916f9d9_958x419.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:419,&quot;width&quot;:958,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!y1I7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2050cf75-28a7-48f9-89b8-3de0d916f9d9_958x419.png 424w, https://substackcdn.com/image/fetch/$s_!y1I7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2050cf75-28a7-48f9-89b8-3de0d916f9d9_958x419.png 848w, https://substackcdn.com/image/fetch/$s_!y1I7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2050cf75-28a7-48f9-89b8-3de0d916f9d9_958x419.png 1272w, https://substackcdn.com/image/fetch/$s_!y1I7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2050cf75-28a7-48f9-89b8-3de0d916f9d9_958x419.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><em>This diagram, from <a href="https://willson.cm.utexas.edu/Teaching/LithoClass2017/Files/Introduction%20to%20Plasma%20Etching_Lecture_101917_Day1_Sntzd.pdf">Lam Research</a>, gives a good breakdown of the various core process steps involved in producing a chip.</em></p><p>Etching machines come in many varieties depending on the method, the material being etched, and the features desired. The tall, skinny capacitors needed for DRAM require a plasma etching machine. These are also known as &#8220;dry&#8221; etching machines, as opposed to &#8220;wet&#8221; etching machines that use liquid chemicals.</p><p>A plasma etching machine takes in various types of gasses, then uses a powerful radio frequency source to accelerate free electrons, so that they smash into the particles of these various gasses. This breaks the molecules into ions, radicals, and more free electrons.</p><p>The new free electrons sustain a chain reaction that makes the plasma self-sustaining. The radicals are incomplete chunks of the previous molecules, and so are desperately seeking new atoms to become whole, making them chemically reactive. The ions are charged and can be directed by the electric field to collide with the material to be removed (a process known by the oddly whimsical name of &#8220;sputtering&#8221;). The combination of physical bombardment and chemical reactions speeds up material removal, and is termed &#8220;reactive ion etching&#8221;.</p><p>Etching capacitors requires three further things:</p><ul><li><p>The etching machine must be designed for dielectrics. A wide variety of metals and chemicals are used at different stages of chip making. For capacitors the material is dielectrics&#8212;electrical insulators that polarise in response to electric fields, making them ideal for storing charge.</p></li><li><p>The machine must be specialized for high-aspect ratios. This is the ratio of height to width, which given the capacitors&#8217; slender structures means ever-higher aspect ratios, reaching <a href="https://newsletter.semianalysis.com/p/the-memory-wall">up to 100:1 for the most advanced DRAM</a>.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a></p></li><li><p>The machine must be powerful. Reaching the bottom of these high-aspect-ratio holes and continuing to remove material requires high-energy ions. The machine that best delivers this is a capacitively coupled plasma (CCP) tool, which uses a single high-powered radio frequency source, as opposed to inductively coupled plasma (ICP) tools that use multiple sources for greater control.</p></li></ul><h1>The Primo UD-RIE is likely 6-8 years behind the frontier</h1><p>Now you should be better able to understand what the UD-RIE does, and why it is named what it is. I don&#8217;t know where &#8220;Primo&#8221; comes from, but &#8220;UD-RIE&#8221; seemingly stands for &#8220;ultra-deep reactive ion etch&#8221;.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> The product description should now also make more sense: a &#8220;capacitive coupled plasma etch system &#8230; designed for the most critical high aspect ratio dielectric etching process for memory device fabrication&#8221;.</p><p>To be effective at producing advanced DRAM, an etching machine must perform well across several metrics: how fast it can etch, how uniformly, how selective it is in etching only the right materials, and whether it can avoid defects and produce at high yields. Ideally one would have quantitative data on all these dimensions. Unfortunately, how these metrics trade off against one another is the secret sauce behind how effective these machines are, and so companies are loath to disclose any useful information at all. This makes rigorous comparison of how the Primo UD-RIE stacks up against Western equivalents difficult. Without private information, the best available approach is working from what published specs there are.</p><p>For the Primo UD-RIE, by far the most useful piece of information disclosed is that, in AMEC&#8217;s financial reporting, it describes the UD-RIE as being capable of 60:1 aspect ratio etching. Using that figure I very roughly compare it to equivalent machines from Lam Research and Tokyo Electron. My best estimate is that the UD-RIE is comparable to Western tools from 2018 or 2019, and remains well behind the most cutting-edge tools, which are now approaching &gt;100:1 aspect ratios.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a></p><p>What does this mean, concretely, for China&#8217;s HBM efforts? DRAM fabrication progress is usually measured in &#8220;memory nodes&#8221;, which are generations similar to the 4G and 5G of cellular networks. Each node builds on the one before to create ever denser memory cells and higher performance. Unhelpfully, memory nodes use confusing alphabetical naming conventions that have diverged between South Korean and American producers, but you can see the progression in the table below. HBM generations are similar, representing the progression to ever-higher HBM stacks that boast greater capacity and bandwidth.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/Xpwpz/5/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2a54f0ed-1921-49b8-b8f3-617fb97f2763_1220x462.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8a7d7695-17cc-4902-a2df-0b10f107cbb2_1220x532.png&quot;,&quot;height&quot;:271,&quot;title&quot;:&quot;The Primo UD-RIE is only capable of older HBM2&quot;,&quot;description&quot;:&quot;Create interactive, responsive &amp; beautiful charts &#8212; no code required.&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/Xpwpz/5/" width="730" height="271" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p>The Primo UD-RIE is likely only capable of producing HBM2, or possibly HBM2E, some three to four generations and six to eight years behind the frontier. While supporting HBM2 production has some limited utility for China&#8217;s AI efforts, what China really needs is a machine that can support HBM3 production at scale. Huawei&#8217;s Ascend 910C has <a href="https://www.chinatalk.media/p/mapping-chinas-hbm-advancement">used a combination of HBM2E and HBM3</a>, and Huawei <a href="https://www.huawei.com/en/news/2025/9/hc-xu-keynote-speech">has said</a> it will adopt a custom HBM solution for its next line of Ascend 950 chips, but for this solution to be &#8220;more cost-effective than HBM3E and HBM4E&#8221; as Huawei claims, it will need production capabilities similar to HBM3.</p><p>China&#8217;s leading DRAM producer CXMT has targeted HBM3 as its goal for 2026, and will likely succeed at reasonable scale, due to its stockpile of advanced Western machines. Due to export controls limiting further import of these Western machines, CXMT needs domestic machines to scale up its advanced production and HBM3 into 2027 and 2028.</p><p>AMEC has promised that it can solve this bottleneck with a successor to the UD-RIE that can handle 90:1 aspect ratios. AMEC claims this machine is &#8220;about to enter the market&#8221;<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a> as of its most recent reporting, for Q3 2025. If true, this would be a major jump in capability, from six to eight years behind the frontier to more like two to three, covering more recent DRAM nodes including those needed for HBM3.</p><p>Producing an etching machine closer to the frontier will require AMEC to master a set of technical challenges that grow more extreme with each increase in aspect ratio. Reaching the bottom of ever taller and narrower holes to carve out capacitors needs better control across the radio frequency power source and the plasma itself. Controlling temperature becomes crucial, with effective etching requiring management of hundreds of temperature zones across the wafer, since some areas etch faster than others.</p><p>The key inputs to AMEC&#8217;s success here are R&amp;D funding, strong human capital, and time, of which it has at least two.</p><h2>AMEC is rapidly scaling its R&amp;D spending</h2><p>AMEC has become one of China&#8217;s largest semiconductor equipment companies, with products spanning etching, deposition, and other processes. Its origins trace back to early state industrial policies. It was founded by Gerald Yin in Shanghai in 2004. At age 60, Yin had established himself in Silicon Valley, graduating from UCLA and working for two decades at Intel, Lam Research, and Applied Materials.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a> He returned with a team of 15 engineers to embark on the novel challenge of building a semiconductor equipment business in China.</p><p>AMEC produced its first etching machine in 2007, and attracted state attention and support soon after. In 2008, it was selected for the 02 Special Project, a major industrial policy effort aimed at indigenizing semiconductor manufacturing equipment, and in 2014 it was the first investment by the Big Fund, China&#8217;s leading state semiconductor investment vehicle.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-8" href="#footnote-8" target="_self">8</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!eXJI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd752361-ef9b-4853-9304-733e4ea23670_640x410.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!eXJI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd752361-ef9b-4853-9304-733e4ea23670_640x410.png 424w, https://substackcdn.com/image/fetch/$s_!eXJI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd752361-ef9b-4853-9304-733e4ea23670_640x410.png 848w, https://substackcdn.com/image/fetch/$s_!eXJI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd752361-ef9b-4853-9304-733e4ea23670_640x410.png 1272w, https://substackcdn.com/image/fetch/$s_!eXJI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd752361-ef9b-4853-9304-733e4ea23670_640x410.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!eXJI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd752361-ef9b-4853-9304-733e4ea23670_640x410.png" width="640" height="410" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cd752361-ef9b-4853-9304-733e4ea23670_640x410.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:410,&quot;width&quot;:640,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!eXJI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd752361-ef9b-4853-9304-733e4ea23670_640x410.png 424w, https://substackcdn.com/image/fetch/$s_!eXJI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd752361-ef9b-4853-9304-733e4ea23670_640x410.png 848w, https://substackcdn.com/image/fetch/$s_!eXJI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd752361-ef9b-4853-9304-733e4ea23670_640x410.png 1272w, https://substackcdn.com/image/fetch/$s_!eXJI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd752361-ef9b-4853-9304-733e4ea23670_640x410.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><em>The weather looks dreary but at least they have the sparkling wine, via <a href="https://www.amec-inc.com/index/Lists/index/catid/14.html">AMEC</a></em>.</p><p>This continued open capital spigot has helped turn AMEC into a significant force in the Chinese market. While AMEC had more international presence earlier in the 2010s, it has become increasingly domestically focused, with 95% of its 2024 revenue from mainland China, a share that continues to rise.</p><p>The company&#8217;s mission is fairly clear. This, for instance, from Gerald Yin, who is still chairman and CEO, in the most recent semi-annual report, for H1 2025:</p><blockquote><p>Due to the severe international geopolitical situation, we must urgently address our shortcomings and catch up. In H1, the company continued to invest heavily in new product R&amp;D, with R&amp;D spending reaching RMB 1.492 billion, up approximately 53.70% year-on-year.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-9" href="#footnote-9" target="_self">9</a></p></blockquote><p>AMEC aims to cut its product development cycles from three to five years to two or less, and has 20 new models in R&amp;D as it seeks to become a platform company spanning most of the semiconductor equipment space. Just over half of its total employees are R&amp;D personnel, far higher a share than at its Western peers.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/0Ra3g/6/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9871f964-14c1-4b7c-82e8-75b4bcac839d_1220x526.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/84e00af7-1203-494a-9326-99ed456e2423_1220x526.png&quot;,&quot;height&quot;:253,&quot;title&quot;:&quot;| Created with Datawrapper&quot;,&quot;description&quot;:&quot;Create interactive, responsive &amp; beautiful charts &#8212; no code required.&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/0Ra3g/6/" width="730" height="253" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p>Despite this rapid growth, AMEC&#8217;s spending remains well below global leaders such as Lam Research in absolute terms. Lam spent $2.1 billion on R&amp;D in its 2025 fiscal year, where AMEC looks set for $400-450 million. AMEC&#8217;s advantage is in pursuing a known technological path, rather than needing to push the frontier. Given continued state support for the industry and AMEC&#8217;s commercial success, it looks set to keep growing this R&amp;D spending figure. Given enough time, I expect AMEC can catch up to the frontier in etching, perhaps by the early 2030s. What export controls have done however is bring forward the point at which China needs AMEC to be delivering comparable capabilities.</p><h1>AMEC&#8217;s efforts likely solve an important bottleneck for China</h1><p>HBM is crucial to China&#8217;s AI efforts, and is currently <a href="https://newsletter.semianalysis.com/p/huawei-ascend-production-ramp">the key bottleneck</a> to Huawei&#8217;s AI chip efforts. Without sufficient HBM China cannot produce the AI chips it needs at scale, and faces an even steeper compute deficit versus the US. Producing that HBM relies on CXMT accessing enough advanced machines to continue raising production over the next few years. AMEC plays a key supporting role here.</p><p>CXMT wants to produce HBM3 indigenously at scale. The Primo UD-RIE does not give CXMT this ability, but it does provide a machine for CXMT&#8217;s less advanced production lines, potentially allowing it to reallocate some of its scarce imported equipment to HBM3 production.</p><p>But AMEC could prove most consequential by delivering the 90:1 aspect ratio successor to the Primo UD-RIE this year or next. This would support CXMT&#8217;s current most advanced production, and also enable it to scale production of HBM3. Producing large amounts of HBM3 would still leave China behind the frontier, which has already moved through HBM3E to HBM4, but it would keep China in the AI chip making game. Without large amounts of HBM3, China&#8217;s AI chip efforts would seriously suffer, as memory would remain a bottleneck on both the quality and quantity of chips it could produce.</p><p>Drawing on very scarce information, I would guess that AMEC can deliver small numbers of this 90:1 machine later this year or into 2027 and scale up to full mass production in 2027/2028. It has a large R&amp;D budget and deep pool of human capital, as well as seemingly strong technological and commercial momentum. As of its most recent financial reporting, it had sold a cumulative 25 UD-RIE machines, a product first developed in 2022 and formally announced in 2025.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-10" href="#footnote-10" target="_self">10</a> This compares to the roughly 100 capacitively coupled plasma etching machines AMEC sells a year,<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-11" href="#footnote-11" target="_self">11</a> and the thousands of etching machines generally that Lam Research ships each year.</p><p>25 UD-RIE sales is therefore a strong start for a new advanced platform. Importantly, the UD-RIE represents meaningful capacity being built at Chinese memory leaders CXMT and YMTC. These firms previously relied on Western machines, but export controls now give them a far stronger incentive to cooperate with domestic equipment manufacturers such as AMEC. Open collaboration and support from the skilled process engineering teams within these firms significantly boosts AMEC&#8217;s ability to advance their machines&#8217; capabilities.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/t5acQ/2/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/54333735-54b1-4cdf-9e29-4e5cf89c0273_1220x516.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8f300b9e-c8b4-40e1-bb11-250eb943b83d_1220x516.png&quot;,&quot;height&quot;:248,&quot;title&quot;:&quot;| Created with Datawrapper&quot;,&quot;description&quot;:&quot;Create interactive, responsive &amp; beautiful charts &#8212; no code required.&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/t5acQ/2/" width="730" height="248" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p>Etching has been AMEC&#8217;s largest commercial success, with rapid growth and a rising share of revenue. One of the best indicators of AMEC&#8217;s technical progress will be whether it can continue this positive revenue trend, and whether it continues to share information on a 90:1 aspect-ratio machine, including breaking out specific sales figures. On its current trajectory, AMEC looks set to become one of China&#8217;s dominant semiconductor equipment firms and to plug vital gaps in China&#8217;s capabilities, such as etching machines for DRAM.</p><p>The rest of this series will explore more of these machines and the firms that produce them. The next post will look at the machines necessary for the crucial through-silicon vias&#8212;special wires that cut down through the layers of DRAM within HBM to make communication highways&#8212;that allow as much information as possible to flow to and from the memory.</p><h1>Appendix</h1><h2>How advanced is the Primo UD-RIE?</h2><p>My estimate is that the UD-RIE is comparable to Western tools from 2018 or 2019, and remains a ways off from the most cutting edge tools which are moving towards handling &gt;100:1 aspect ratios. This estimate is pieced together from several sources, since no definitive public benchmark exists. Tokyo Electron, in its <a href="https://www.tel.com/ir/policy/mplan/i9nanv00000000ga-att/20250226_TELIRDay_E_rev.1_slide_r3.pdf)">2025</a> and <a href="https://www.tel.com/ir/policy/mplan/hq95qj00000008by-att/medium-term_plan_2022E_4.pdf)">2022</a> investor day presentations, provides DRAM technology roadmaps citing aspect-ratio figures for capacitors at various memory nodes. Naively, these figures consistently undershoot those from other industry sources, such as SemiAnalysis&#8217;s <a href="https://newsletter.semianalysis.com/p/the-memory-wall">claim</a> of leading companies approaching 100:1 for recent nodes.</p><p>I think this disparity is likely caused by Tokyo Electron reporting the structural aspect ratio (the actual ratio of the finished capacitor), rather than the etching aspect ratio (what aspect ratio the etching machine needs to handle to produce that final capacitor). Given that during etching the capacitor will have a mask layer raising its height, and the hole tapers as it deepens, the effective aspect ratio needed during etching is higher than the final structural aspect ratio of the capacitor.</p><p>A very rough Claude-assisted estimate is that if the mask adds 30% to the height of the capacitor, and you take the critical dimension at the bottom of the capacitor hole as 0.75x the opening, then you can take 1.3 &#247; 0.75 to get a 1.7x differential between the capacitor aspect ratio and the etch aspect ratio. Given that, you would expect the Primo&#8217;s 60:1 etch aspect ratio to translate to a 35:1 capacitor aspect ratio, which on the TEL roadmap would place it around 2018 or 2019.</p><p>It is possible that AMEC&#8217;s aspect ratio claims could be mapping more to NAND than DRAM as another source of the disparity, but I can&#8217;t verify this because I can&#8217;t find good public sources on the differences between NAND and DRAM aspect ratios. Sources often refer only to aspect ratios for advanced memory, not specifying NAND or DRAM despite the quite different feature profiles between them. If you can point out the (likely) errors in this analysis please do get in touch! I would like to better understand the details of the differences between DRAM and NAND etch.</p><p>Another way of estimating the capabilities of the Primo UD-RIE is via its listed features, though these are somewhat vague. The UD-RIE has features such as active edge impedance tuning, multi-level radio frequency pulsing and active by-zone temperature control that became standard during the mid-to-late 2010s. Notably, it does not have cryogenic capabilities that have become a key focus of Lam and Tokyo Electron during the 2020s for pushing the frontiers of NAND.</p><p><strong>Why the Primo UD-RIE can&#8217;t be used for CXMT 1z production</strong></p><p>The UD-RIE is likely not sufficient for CXMT&#8217;s 1z production, due to CXMT moving from 6F&#178; to 4F&#178; for its 1z production <a href="https://newsletter.semianalysis.com/p/the-memory-wall">according to SemiAnalysis</a>. What this means is that CXMT is shrinking down the overall footprint of the DRAM cell, by fitting the transistor underneath the capacitor, rather than having it sit alongside. This gives a significant boost to memory density as you no longer need to focus on shrinking down the transistor even further to shrink the overall DRAM cell.</p><p>Effectively CXMT is choosing a more challenging etching process as the best trade off to make given their limitations from a lack of access to extreme ultraviolet photolithography equipment. Opting for tricky vertical structures over smaller transistors. Since doing this raises the aspect ratios needed for 1z significantly above what they would otherwise be for a 6F&#178; layout, the Primo UD-RIE is likely not up to the task, with CXMT instead relying on imported equipment and on AMEC shipping a 90:1 successor machine.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>The two largest players in etch and deposition are AMEC and Naura. Naura is larger by revenue and produces a line of HAR dielectric etch machines with their Accura NZ that is similar to the Primo UD-RIE. Though Naura has disclosed less information on the Accura&#8217;s capabilities than AMEC has with the UD-RIE. This makes it difficult to properly assess which is the leader, but given Naura came to dielectric etch later than AMEC (<a href="https://techzephyr.substack.com/p/chinas-wafer-fab-equipment-industry">only beginning in 2021</a> vs early 2010s for AMEC), and has shipped far fewer CCP systems, it seems likely that AMEC is closer to the cutting edge.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>See the <a href="https://www.amec-inc.com/en/index/Lists/show/catid/29/id/605.html">Primo UD-RIE product page</a> on AMEC&#8217;s website for the full product description and features.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>HAR etch has generally been more significant in NAND than DRAM due to NAND having adopted <a href="https://semiengineering.com/cryogenic-etch-a-key-enabler-of-3d-nand/">fully 3D structures as early as 2014</a>. DRAM has thus benefitted from tools pushing capabilities to be able to tackle increasingly  HAR etch through many layers of stacked NAND. DRAM by contrast still uses 2D transistors even if capacitors have become vertical structures, with full 3D DRAM not expected until towards the 2030s. NAND generally requires much greater depth at lower precision, while DRAM faces shallower depth but needs higher precision.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>AMEC does not clearly spell out the acronym but does describe it as &#8220;used for ultra-high depth-to-width ratio etch processes&#8221; (&#29992;&#20110;&#36229;&#39640;&#28145;&#23485;&#27604;&#21051;&#34432;&#24037;&#33402;&#30340;), see their <a href="https://pdf.valueonline.cn//web/viewer.html?v=20200509&amp;file=https%3A%2F%2Fannouncement.valueonline.cn%2F20250829%2F202508291756375281166085640.pdf&amp;companyCode=688012">2025 Semi Annual Report</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>See the appendix for a full breakdown of the reasoning behind this estimate.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>&#8216;&#21363;&#23558;&#36827;&#20837;&#24066;&#22330;&#8217;, from AMEC&#8217;s <a href="https://pdf.valueonline.cn//web/viewer.html?v=20200509&amp;file=https%3A%2F%2Fannouncement.valueonline.cn%2F20251030%2F202510301761733887358076442.pdf&amp;companyCode=688012">Q3 2025 quarterly report</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p>See this <a href="https://archive.ph/i1DFB">Caixin Global piece</a> on Gerald Yin&#8217;s background and the founding of AMEC.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-8" href="#footnote-anchor-8" class="footnote-number" contenteditable="false" target="_self">8</a><div class="footnote-content"><p>From <a href="https://www.amec-inc.com/index/Lists/index/catid/14.html">AMEC&#8217;s corporate history page</a> which documents what AMEC considers key events in its development.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-9" href="#footnote-anchor-9" class="footnote-number" contenteditable="false" target="_self">9</a><div class="footnote-content"><p>See pages 1-2 of the <a href="https://pdf.valueonline.cn//web/viewer.html?v=20200509&amp;file=https%3A%2F%2Fannouncement.valueonline.cn%2F20250829%2F202508291756375281166085640.pdf&amp;companyCode=688012">2025 Semi Annual Report</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-10" href="#footnote-anchor-10" class="footnote-number" contenteditable="false" target="_self">10</a><div class="footnote-content"><p>The specific claim is 200 cumulative UD-RIE chambers, where a single machine will have multiple chambers. The UD-RIE is advertised as having a &#8220;<a href="https://www.amec-inc.com/en/index/Lists/show/catid/29/id/605.html">maximum six single wafer etch reaction chambers and two photoresist strip chambers</a>&#8221; so 200 total chambers would be 25 machines assuming that AMEC is counting only the 6 etching chambers, and all machines have been shipped in the full 6-chamber configuration.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-11" href="#footnote-anchor-11" class="footnote-number" contenteditable="false" target="_self">11</a><div class="footnote-content"><p>AMEC reports their cumulative CCP chambers shipped at various points. We know this figure was <a href="https://pdf.valueonline.cn//web/viewer.html?v=20200509&amp;file=https%3A%2F%2Fannouncement.valueonline.cn%2F20250829%2F202508291756375281166085640.pdf&amp;companyCode=688012">4,500 in H1 2025</a>, and 3,600 at the end of their <a href="https://pdf.valueonline.cn//web/viewer.html?v=20200509&amp;file=https%3A%2F%2Fannouncement.valueonline.cn%2F20240319%2F202403191710758433857003406.pdf&amp;companyCode=688012">Fiscal Year (FY) 2023</a> giving sales of 900 chambers over those 18 months, so 600 a year, which equates, given that most AMEC CCP machines have 6 chambers a tool, to 100 sales of complete machines a year.</p></div></div>]]></content:encoded></item><item><title><![CDATA[BIS should build a lean, mean, data-driven enforcement machine]]></title><description><![CDATA[BIS enforcement relies on decades-old software systems and fragmented, patchwork databases. Fixing that could massively improve its ability to catch violators.]]></description><link>https://www.the-substrate.net/p/bis-should-build-a-lean-mean-data</link><guid isPermaLink="false">https://www.the-substrate.net/p/bis-should-build-a-lean-mean-data</guid><dc:creator><![CDATA[Maxwell K. Roberts]]></dc:creator><pubDate>Fri, 27 Feb 2026 14:00:25 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/bdbb1d9c-177f-41d6-ab15-2456ca49709d_2624x1856.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>The Bureau of Industry and Security (BIS) manages the US&#8217;s dual-use export controls, including controls on AI chips. BIS&#8217;s ability to enforce export controls on AI chips is directly relevant to US national security, because it affects China&#8217;s ability to acquire AI compute for <a href="https://www.anthropic.com/news/disrupting-AI-espionage">uses like cyberattacks</a>. It is also relevant to AI risk, because it affects BIS&#8217;s ability to monitor where compute is going and prepare for a future where compute might be restricted more aggressively.</p><p>In my last Substrate post, on <a href="https://www.the-substrate.net/p/bis-is-getting-more-fundingheres">BIS funding</a>, I covered what BIS asked for this year&#8212;a $112 million increase to spend almost entirely on enforcement staff&#8212;and what BIS actually received, which is a $44 million increase that BIS must now decide how to spend. In that piece, I argued that the highest BIS priority should be something not in the budget request at all&#8212;specifically, better software and data infrastructure. In this piece I&#8217;ll outline the current state of BIS enforcement software and data systems, what BIS is already doing to upgrade them, and the potential benefits compared to other investments.</p><h2>BIS should upgrade its decades-old enforcement software</h2><p><strong>The Investigative Management System-Redesign (IMS-R) tracks export enforcement investigations</strong>, including storing digital case files, witness testimony, documents related to investigations, and records of arrests and other enforcement actions.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> It also targets and logs end-use checks, which are when a BIS agent, an Export Control Officer, or a Commercial Service officer visits a company overseas that is receiving, or applying for a license to receive, US-origin items.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> Basically, it&#8217;s Google Drive for export control enforcement&#8212;the central source of truth on enforcement actions. If you want to understand what another agent or analyst is working on or what BIS did last year, you have a better tool than &#8220;send someone an email and ask&#8221;.</p><p>Unfortunately, IMS-R is profoundly outdated&#8212;the tool was developed either in 2006 or 2008, depending on which source you believe.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a> In other words, IMS-R predates Windows Vista, Halo 3, and the end of George W. Bush&#8217;s presidency, and it has about the performance you would expect.</p><p>For example, per the Senate Investigations Committee&#8217;s <a href="https://www.hsgac.senate.gov/wp-content/uploads/The-U.S.-Technology-Fueling-Russias-War-in-Ukraine-Examing-BISs-Enforcement-of-Semiconductor-Export-Controls.pdf">excellent report on BIS</a>, IMS-R&#8217;s search function cannot find text within documents. If you want to find cases associated with a certain witness, and you don&#8217;t know the exact unique IDs of those cases already, have fun clicking through every single case, downloading every Word document attachment and using Ctrl-F. Luckily, you won&#8217;t have to search through any PDFs or Excel sheets&#8212;because according to page 20 of the report, you can&#8217;t even upload those file types. If you want to find out how long an end-use check has been open (the time between a BIS analyst flagging a transaction and a BIS agent visiting the company), you&#8217;d better just call your analyst friend, because according to the Government Accountability Office <a href="https://files.gao.gov/reports/GAO-25-106849/index.html">IMS-R can&#8217;t handle that information either.</a></p><p>This creates two problems for export enforcement: process drag and knowledge drag. Every time IMS-R crashes when an analyst is logging in, or an agent has to copy all the text from a PDF and put it in a Word document, or a supervisor has to call an agent to find out what&#8217;s going on with an end-use check, that&#8217;s process drag. Time is being sucked away from productive enforcement activities to fight a software system old enough to vote in this year&#8217;s midterms.</p><p>That&#8217;s bad, but knowledge drag might be worse. Because finding information is so hard, sometimes people just won&#8217;t find it, or even take the time to seek it. An agent working a case won&#8217;t realize the company they&#8217;re investigating is also applying for licenses. An analyst putting together an Entity List package&#8212;the bundle of evidence and analysis supporting a proposal to restrict exports to specific companies&#8212;won&#8217;t know about the three open end-use checks on that company. Improving IMS-R would reduce administrative workload for BIS agents and analysts <em>and</em> increase knowledge sharing in hard-to-quantify but important ways.</p><h2>BIS should spend even more on trade data</h2><p>Beyond the information about cases and BIS actions stored in IMS-R, BIS enforcement has access to three types of data:</p><ul><li><p>BIS-owned data tools, like CUESS.</p></li><li><p>Data tools owned by other government agencies but used by BIS, like the Automated Export System and the Automated Targeting System.</p></li><li><p>Commercial data tools purchased by BIS, including teardown intelligence, corporate registry and trade data, and data fusion platforms.</p></li></ul><p><strong>The primary BIS-owned data tool (besides IMS-R) is the Commerce USXPORTS Exporter Support System (CUESS).</strong> CUESS is both the software system used to process export licenses and the repository of all the data about those licenses at BIS (companies apply for licenses through the applicant-facing side of the system, called SNAP-R). Although newer than IMS-R, CUESS shares many of its pathologies and was designed for processing licensing applications rather than viewing license data.</p><p>The primary non-BIS, government-owned data tools are the Automated Export System and the Automated Targeting System, both managed by Customs and Border Protection.</p><ul><li><p>The Automated Export System (AES) electronically logs all exports from the United States. (Seriously, all of them! Millions every day. It&#8217;s a fascinating dataset.)</p></li></ul><ul><li><p>The Automated Targeting System (ATS) automatically compares AES data to existing law enforcement databases to flag if a shipment, say, is going to a company on the Entity List.</p></li></ul><p>Unfortunately, because BIS doesn&#8217;t own these data systems, it must haggle with Customs and Border Protection about both access for BIS analysts and whether some data BIS would like to see are captured at all.</p><p>Finally, BIS also has contracts with a scattering of commercial data providers who can be broadly categorized into three sub-buckets: physical teardown intelligence, corporate registry and supply chain data, and data fusion platforms:</p><ul><li><p>The only physical teardown intelligence provider BIS appears to contract with is Conflict Armaments Research. <a href="https://www.conflictarm.com/">Conflict Armaments Research&#8217;s line of work</a> is flying to battlefields around the world, recovering weapons debris, and identifying the supply chains of those weapons. This has obvious utility to BIS: if you have primary information about what components with what serial numbers are where, you can work backwards through the supply chain.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> BIS awarded Conflict Armaments Research a $120,000 <a href="https://www.usaspending.gov/award/CONT_AWD_1331L521P13160028_1301_-NONE-_-NONE-/">contract</a> in February 2021.</p></li><li><p>BIS contracts with a wide variety of companies for corporate registry and supply chain data, including <a href="https://www.usaspending.gov/award/CONT_AWD_1331L524F0255_1301_1331L523A13OS0072_1301/">Dun &amp; Bradstreet</a>, <a href="https://www.usaspending.gov/award/CONT_AWD_1331L524C0008_1301_-NONE-_-NONE-/">Sayari</a>, <a href="https://www.usaspending.gov/award/CONT_AWD_1331L524F0421_1301_1331L523D13OS0001_1301/">Bloomberg</a>, <a href="https://www.usaspending.gov/award/CONT_AWD_1331L525F0326_1301_03310323D0052_0300/">IHS Global</a>, <a href="https://www.usaspending.gov/award/CONT_AWD_1331L525F0343_1301_GS02F026DA_4732/">Thomson Reuters</a>, <a href="https://www.usaspending.gov/award/CONT_AWD_1331L526P0001_1301_-NONE-_-NONE-/">Kharon</a>, <a href="https://www.usaspending.gov/award/CONT_AWD_1331L521P13160035_1301_-NONE-_-NONE-/">S&amp;P Global Market Intelligence</a>, and <a href="https://www.usaspending.gov/award/CONT_AWD_1331L524F0629_1301_NNG15SD26B_8000/">WireScreen</a>. Many of the contracts appear to be through a reseller called <a href="https://www.thundercattech.com/">Thundercat Technologies</a>. These companies help answer questions like &#8220;who owns this Chinese company?&#8221; and &#8220;what is this chip fab&#8217;s legal address?&#8221;</p></li><li><p>BIS has also awarded <a href="https://www.usaspending.gov/award/CONT_AWD_1331L525F0002_1301_NNG15SC92B_8000/">$4.6 million to Palantir Technologies for &#8220;Platform-as-a-Service.&#8221;</a> Palantir famously specializes in helping government clients assemble and understand large datasets, so this would likely be the integrator that helps BIS make sense of all the other data it&#8217;s cobbling together.</p></li></ul><p>Right now, human enforcement analysts serve as the interpretive layer between all these data sources. The same analyst might query AES to find a shipment, check ATS to see if the consignee matches any known law enforcement records, then check WireScreen and Sayari to determine who owns the company listed on the AES record. Then they might walk downstairs to the Bloomberg terminal (of which there is only one, which you must reserve in advance) to look at data on that company&#8217;s suppliers and customers, and then they might go back upstairs to look at CUESS information about the licenses that company has received, through CUESS&#8217;s painfully awkward search function (no searching attachments, barely any advanced search support at all, same as IMS-R). This all assumes the analyst has access to all those resources, which is not given to every BIS analyst. This creates the same knowledge drag as IMS-R: data that could be used isn&#8217;t used because it&#8217;s too painful to get it.</p><p>The dream system would unify US government trade data, commercial trade data, law enforcement investigation databases, corporate registries, and physical teardown information into a single portal. (This is exactly the type of system that Palantir is well suited to build.) Actually building it is not only a financial and technological problem but also a legal, commercial, and political problem&#8212;it means addressing data confidentiality rules, negotiating with commercial providers for bulk data transfers instead of far more lucrative per-seat licensing, and badgering agencies like Customs and Border Protection, which owns key data, into giving BIS the keys to the kingdom.</p><p>The benefits of this system would be massive&#8212;instead of analysts spending days manually correlating different data sources, they could focus on their core work of evaluating qualitative intelligence, targeting enforcement, and tracking large-scale trends. An analyst asking &#8220;what did this Chinese fab buy this year, and through which suppliers?&#8221; could answer that question in minutes, not days, then apply their own expertise to the real question of what those purchases <em>mean</em> for what the fab is building. A policymaker asking &#8220;to which other countries did exports of AI chips surge after we controlled them to China?&#8221; could get the answer themselves without tasking an entire team of data scientists, a task that has become much more difficult due to attrition in BIS&#8217;s Data Analytics Division.</p><h2>Software and data would be a force multiplier and are feasible</h2><p>BIS is probably better placed than I to make an exact evaluation of its own software and data requirements and how those compare to other needs like enforcement staffing. I know as well as anyone the frustration civil servants feel when think tank pundits try to backseat drive a federal agency without an intimate understanding of the constraints on the ground. I also don&#8217;t have access to as much information as BIS leadership does about how enforcement is going and what the staff there say they need.</p><p>Nevertheless, I think it&#8217;s worth investing at least some of BIS&#8217;s budget increase in software and data. Investing in technology would increase the productivity of the hundreds of staff BIS already has, which is, I think, worth more than the ten or so additional agents a few extra million in technology spend would mean forgoing. I also think a broad BIS technology modernization project would succeed. This is partly because BIS was demonstrably able to build the CUESS software originally (a <a href="https://www.oig.doc.gov/wp-content/OIGPublications/OIG-16-037-A.pdf">major software project</a> to build a single licensing system across the interagency in the 2010s failed due to coordination issues between agencies, not issues within BIS), and partly because most of the required data modernization would involve paying commercial companies to provide data and build fusion tools, not BIS developing anything itself.</p><p>Ultimately, it&#8217;s up to BIS leadership to decide how to spend their money&#8212;but if they&#8217;re asking me,<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a> modern software and a lean, mean data-processing machine would be a valuable and affordable addition to the BIS enforcement toolkit.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>See p. 19 of this <a href="https://www.hsgac.senate.gov/wp-content/uploads/The-U.S.-Technology-Fueling-Russias-War-in-Ukraine-Examing-BISs-Enforcement-of-Semiconductor-Export-Controls.pdf">Senate report on BIS</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>See <a href="https://www.oig.doc.gov/OIGPublications/OIG-20-019-A.pdf">this report on the Department of Commerce</a> by the Office of Inspector General.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>2006 is implied by the <a href="http://www.knowceanconsulting.com/imsr.html">website of the consultants that built it</a> (see the copyright year at the bottom). BIS says 2008 in its <a href="https://www.commerce.gov/sites/default/files/2024-03/BIS-FY2025-Congressional-Budget-Submission.pdf">FY2025 budget request (</a>p. 27).</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Conflict Armaments Research focuses on weapons, including Russian and Iranian drones downed in Ukraine and the Middle East. For AI chips, BIS might consider purchasing teardown intelligence from providers like <a href="https://www.techinsights.com/technology/teardown">TechInsights</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>They are not asking me.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Endgames for export controls]]></title><description><![CDATA[Export controls on AI chips could lead to enduring dominance.]]></description><link>https://www.the-substrate.net/p/endgames-for-export-controls</link><guid isPermaLink="false">https://www.the-substrate.net/p/endgames-for-export-controls</guid><dc:creator><![CDATA[Onni Aarne]]></dc:creator><pubDate>Wed, 18 Feb 2026 16:40:09 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/be9f1a45-5222-4ac4-89fa-595ae3bee460_2624x1856.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In <a href="https://www.the-substrate.net/p/for-chip-exports-quantity-is-at-least">a previous post</a> on the Substrate, I argued for aggressively limiting AI chip exports to China. But that post took it as a given that the US <em>should</em> use export controls to slow China&#8217;s AI progress. Some people are understandably skeptical: Won&#8217;t China catch up in chips, and then AI, eventually? Aren&#8217;t the export controls just accelerating China&#8217;s progress on chips<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> and worsening US-China relations in the meantime?</p><p>This hypothetical critic might acknowledge that the controls give the US a temporary advantage in AI, but ask: What&#8217;s the <em>endgame</em>? Will this give any lasting advantage?</p><p>In this post, I sketch at least one possible answer. Basically: AI appears to have several feedback loops that favor incumbents. These feedback loops may be so strong, and the impact of AI so significant, that China&#8217;s usual strategy of protectionism fails.</p><h1>Why AI is different</h1><p>To give the critic their due, many past US export controls have been ineffective, because other countries have simply developed supply chains that circumvent the US, and US industry has suffered. Export controls on satellite technology are perhaps the most infamous example. In 1998, after investigations revealed that US satellite manufacturers had potentially helped improve China&#8217;s ballistic missiles through launch failure analyses, Congress responded by moving commercial satellites from Commerce Department jurisdiction to the much more restrictive International Traffic in Arms Regulations (ITAR) regime.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> As a result, European competitors began marketing &#8220;ITAR-free&#8221; satellites with zero US-origin components specifically to capture the business that US companies could no longer serve.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a> The controls limited some technology transfers, but the overall effect was to cede market share while China developed its own satellite capabilities through non-US suppliers. The underlying technology was not concentrated enough in US hands for the controls to work.</p><p>The export controls on chips and semiconductor manufacturing equipment have so far been much more successful, because semiconductor manufacturing is extremely concentrated and difficult to enter. This concentration is mainly driven by two factors: the industry is extremely capital-intensive, and it requires highly specialized tacit knowledge across many sectors and research fields.</p><p>But can the chip controls translate into an enduring lead in AI? There is a plausible case that they can. AI appears likely to combine (1) dynamics similar to those in semiconductors, (2) network effects, as seen in big tech platforms, and (3) AI&#8217;s own unique feedback loops, sometimes called <a href="https://www.hyperdimensional.co/p/on-recursive-self-improvement-part">recursive self-improvement</a>. Together, these may create first-mover advantages strong enough to make an early lead from chip controls nearly unassailable.</p><p><strong>Like semiconductors, AI is capital-intensive.</strong> Grok 4 is <a href="https://epoch.ai/data-insights/grok-4-training-resources_">estimated</a> to have cost nearly half a billion dollars to train. But training costs are only part of the picture. The hyperscalers&#8212;Microsoft, Google, Meta, and Amazon&#8212;are projected to spend a combined $600-700 billion in capital expenditure in 2026, the majority of it on AI infrastructure.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> For comparison, the entire global semiconductor manufacturing industry invests roughly $160 billion per year in equipment,<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a> meaning five US tech companies plan to spend more than four times as much on AI infrastructure alone. Meanwhile, China&#8217;s total AI infrastructure investment&#8212;including both corporate and government spending&#8212;is estimated at roughly $80-100 billion per year,<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a> less than one-sixth of US hyperscaler capex. These gaps are difficult to close. If building frontier AI continues to require this kind of investment, the number of serious competitors remains small&#8212;and any country cut off from the most advanced chips will find it even harder to stay in the race.</p><p><strong>AI companies also benefit from proprietary data flywheels.</strong> In a world where AI commoditizes engineering talent, what will remain scarce and valuable? One answer is high-quality, proprietary user interaction data. The companies with hundreds of millions of users generating billions of conversations are accumulating training signal that no newcomer can replicate from scratch. This data will be essential for understanding what users actually want, what real-world tasks look like, and where models fail in real-world deployment. And engineers&#8217; tacit knowledge, which currently leaks between companies as employees change jobs, may become more controllable in a world where human employees generate only a small fraction of the relevant insights.</p><p>To be fair, much of the highest-quality human-origin data is collected by companies like Scale AI and smaller competitors, not sourced directly from users. And quasi-artisanal task-specific data and <a href="https://www.mechanize.work/blog/sweatshop-data-is-over/">RL environments</a> may prove more useful than raw user interaction data. So far, there is limited public evidence that raw user interaction data is exceptionally important. But this may change as flows of user data increase and other data sources become harder to scale.</p><p><strong>Like existing big tech companies, AI incumbents may benefit from sticky human customers and from being a platform.</strong> ChatGPT still commands an outsized share of the consumer AI market, despite industry tastemakers having largely switched to Claude. Over the past year, thousands of users became so attached to OpenAI&#8217;s 4o model that when OpenAI tried to retire it, public demand pressured the company into bringing it back.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a> Business-to-business AI may eventually become a ruthlessly efficient, low-margin business as AI agents handle both sides of transactions, but sticky, habit-driven humans may persist on the consumer side for a long time.</p><p>There may also be platform effects. If your AI becomes the &#8220;operating system&#8221; through which you interact with the world&#8212;managing your email, scheduling, finances, shopping, and work&#8212;then you want the AI with good integrations with other services, and service providers want to build integrations for AIs that have users. This is a classic <a href="https://en.wikipedia.org/wiki/Two-sided_market#Competition">multi-sided platform</a>, which can lead to winner-takes-all dynamics. It is already the case that users&#8217; AI choices are heavily influenced by which companies offer good integrations and tool ecosystems.</p><p>That said, the stickiness and platform arguments are the weakest of these arguments. It is unclear how sticky human preferences will be over the medium term: users might readily switch to a substantially better product, and ChatGPT&#8217;s dominance may reflect a fleeting first-mover advantage more than deep lock-in. And it is not obvious that AI will benefit from network effects the way social media or other big tech products have. Open integration standards like the Model Context Protocol could dissolve the stickiness of AI integrations&#8212;if any AI can connect to any service equally well, the platform advantage dissolves. And AI itself may dissolve this kind of stickiness: an AI coding assistant can write new integrations on the fly, and a user&#8217;s &#8220;memories&#8221; and configurations are currently just text files that could easily be ported to a competitor. So while sticky preferences and platform effects <em>could</em> entrench incumbents, these mechanisms are more speculative than the others discussed here.</p><p>Finally, and perhaps most importantly, <strong>AI appears to benefit from a unique feedback loop where AI speeds up AI</strong>. Leading AI companies already use their own AI systems to accelerate their research and engineering. Both Anthropic and OpenAI are approaching a point where AI does nearly all the coding in their labs. OpenAI has <a href="https://x.com/sama/status/1983584366547829073?lang=en">stated</a> a goal of fielding &#8220;automated AI research interns&#8221; running on &#8220;hundreds of thousands of GPUs&#8221; by September 2026, and a &#8220;true automated AI researcher&#8221; by 2028. Anthropic CEO Dario Amodei <a href="https://medium.com/@ZombieCodeKill/claude-on-amodeis-the-adolescence-of-technology-ba80b8dff87e">claimed</a> recently that &#8220;[we] essentially have Claude designing the next version of Claude itself&#8221;.</p><p>If the best models and scaffolds are kept internal, this creates a compounding advantage for incumbents. If AI systems can perform every component task of AI research&#8212;finding optimizations, designing experiments, writing and debugging code&#8212;then the company with the best AI engineers (silicon ones, that is) will make faster progress, yielding even better AI engineers, and so on. According to one estimate, OpenAI already spends a majority of its compute on internal experiments rather than customer-facing inference, a sign of how much compute AI companies are willing to invest in internal R&amp;D.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-8" href="#footnote-8" target="_self">8</a></p><p>This mechanism reinforces the point about capital intensity: If you can turn capital (compute) into R&amp;D, companies with fewer or inferior chips will struggle to catch up.</p><p>There are important caveats here. Competitors can sometimes use a leading lab&#8217;s own AI products to partly catch up. Algorithmic insights leak between companies as researchers change jobs. And returns to additional compute may not be linear&#8212;research is not perfectly parallelizable, and marginal experiments may yield diminishing returns. But the overall direction of the feedback loop seems clear, even if its strength is uncertain.</p><p>On the other hand, if recursive self-improvement results in a rapid <a href="https://www.forethought.org/research/will-ai-r-and-d-automation-cause-a-software-intelligence-explosion">software intelligence explosion</a>, the explosion may burn through available fuel quickly and plateau. If Chinese competitors can trigger their own explosion, the US lead may be dramatic but short-lived in calendar time. Nonetheless, market positions established during this brief period may last, for some of the other reasons discussed above.</p><h1>A sufficiently strong lead may become permanent</h1><p>China will almost certainly attempt a protectionist strategy, as it has done successfully with big tech: maintaining a walled domestic market, nurturing indigenous AI champions, and keeping American products out. This is the playbook that gave China Baidu instead of Google, WeChat instead of WhatsApp, and Alibaba instead of Amazon.</p><p>But AI may differ in a crucial respect. If the feedback loops described above are strong enough, the capability gap between US and Chinese AI systems will grow over time rather than shrink. And if AI becomes as economically transformative as many expect, the opportunity cost of relying on inferior domestic AI could be enormous. Chinese businesses using second-tier AI would be at a growing productivity disadvantage relative to international competitors using frontier US-developed systems. At some point, the economic cost of protectionism could exceed the political cost of opening the market. And once the market opens, the dynamics discussed above may make it practically impossible for domestic Chinese alternatives to ever catch up.</p><p>There is a rough historical parallel. In the mid-19th century, Japan had maintained a policy of near-total isolation (<em>sakoku</em>) for over two centuries. But when the technological and economic gap with the industrializing West grew large enough, Japan was essentially forced to open its markets and rapidly modernize. The pressure was not merely military but economic: the cost of falling further behind had become intolerable. Something analogous could happen with AI. If Chinese AI is still limited to helpful chatbots while US companies have largely automated white-collar work&#8212;and if that gap is widening&#8212;the CCP may face mounting pressure from its own businesses, citizens, and strategists to allow superior foreign AI systems into its market. (That said, the analogy cuts both ways: Japan&#8217;s forced opening was followed by remarkably rapid catch-up. The Meiji modernization transformed Japan from a feudal society to a major industrial and military power within a few decades. Forced market opening is not the same as permanent subordination.)</p><h1>Lasting advantage may simply be a series of temporary advantages</h1><p>At the start, I framed this as a question of whether export controls could create a lasting advantage. But they don&#8217;t necessarily need to create a lasting advantage to be worthwhile.</p><p>It is inevitable that China will indigenize the modern semiconductor manufacturing stack eventually. But this was always going to happen eventually, with or without export controls, and pulling back the controls now will not get China to give up on indigenization.</p><p>The best way to obtain overall strategic advantage will likely be to keep getting ahead on the next thing, and the thing after that. The export controls are doing exactly that: letting the US dominate in AI and pull in massive revenues. Those revenues can be invested in whatever the next critical thing is, whether that&#8217;s humanoid robots or new approaches to manufacturing compute&#8212;perhaps using <a href="https://aiprospects.substack.com/p/ai-has-unblocked-progress-toward">AI-enabled nanotech</a>&#8212;or, more likely, something that I&#8217;ve entirely failed to think of.</p><p>Giving up a clear near-term advantage to preserve semiconductor manufacturing leadership decades from now requires putting far too much trust in your ability to hold on to that lead. Intel already lost the mandate of heaven. TSMC will lose it eventually. The strength of the US has always been that when one technological advantage is lost, its innovation ecosystem produces another to take its place.</p><h1>AGI could be more than just another tech stack</h1><p>So far, I&#8217;ve been talking about AI as just another big tech product. But if AI companies succeed in building genuinely general, and then superhuman, artificial intelligence, the analogy to previous technology competitions&#8212;search engines, social media, cloud computing&#8212;may radically understate the stakes.</p><p>If AI is more like a new factor of production, or even a new species, whoever leads will likely gain major advantages in scientific research, military capability, economic productivity, and the capacity to develop every other technology. The geopolitical consequences, for a world order already in flux, would be enormous.</p><p>Such a technology will also raise value-laden choices about how to design it and integrate it into society. These choices will likely be greatly influenced by who builds it, as we&#8217;re already seeing: It wasn&#8217;t obvious that AI assistants would have a standard &#8220;virtuous&#8221; personality, much less anything called a <a href="https://www.anthropic.com/constitution">constitution</a>, if they hadn&#8217;t been built by people from very particular subcultures. Even the Silicon Valley of twenty years ago might have taken a very different approach! And these choices may prove sticky once they shape user expectations and industry norms, or become codified in regulation.</p><p>The implications go beyond consumer-facing norms: whoever leads in AI will likely shape how autonomous systems are used in military and intelligence contexts, and will have outsized influence over emerging international AI norms. The US largely got to construct the nuclear taboo and the equilibrium of mutually assured destruction by virtue of being first to the technology. European powers set norms around chemical weapons in the first half of the 20th century, and those norms are still in place today.</p><p>Conversely, if the US loses its AI lead, the automation of software engineering may well undo the moats of US tech giants like Microsoft. It could also transform the tech ecosystem enough to unseat American incumbents in adjacent domains like search, browsers, and enterprise software, where they have held dominant positions for decades. The US benefits enormously from the world&#8217;s default digital infrastructure being American-built; an AI-driven reversal of that would have cascading consequences.</p><p>No one can predict exact technological trajectories. But across a broad range of scenarios, a temporary compute advantage seems likely to have long-lasting effects on AI development. If AI is a normal technology, it will likely have strong first-mover advantages. If AI is much more than a normal technology, it will be difficult to predict what the implications of leadership will be, but those implications will almost certainly be enormous. And regardless, always aiming to win the next thing is probably a good strategy for staying ahead, even if no single advantage lasts.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>I don't think this acceleration effect is actually very strong, as I briefly discussed in my <a href="https://www.the-substrate.net/p/for-chip-exports-quantity-is-at-least">previous Substack post</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>The Strom Thurmond National Defense Authorization Act for Fiscal Year 1999 returned jurisdiction over commercial satellite exports from the Commerce Department to the State Department under ITAR, effective March 1999. This was prompted by investigations (culminating in the Cox Report) finding that launch failure analyses conducted by Loral and Hughes with Chinese engineers had provided information that could improve China&#8217;s ballistic missile reliability. See CSIS Aerospace Security, <a href="https://aerospace.csis.org/itar-satellite-regulation/">&#8220;The Myth of &#8216;ITAR-Free&#8217;&#8221;</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Bureau of Industry and Security, <a href="https://www.bis.gov/media/documents/exportcontrolfinalreport08-31-07master-3-bis-net-link-version-101707-receipt-afrl4.pdf-0">&#8220;Defense Industrial Base Assessment of the U.S. Space Industry&#8221;</a> (2007). US satellite manufacturing revenue share fell from approximately 63% (1996-1998) to approximately 41% (2002-2005). The BIS also estimated that lost US satellite export sales averaged $588 million annually during 2003-2006.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Combined capital expenditure projections for Amazon, Alphabet, Microsoft, Meta, and Oracle. See Futurum Group, <a href="https://futurumgroup.com/insights/ai-capex-2026-the-690b-infrastructure-sprint/">&#8220;AI Capex 2026: The $690B Infrastructure Sprint&#8221;</a> (February 12, 2026); CNBC, <a href="https://www.cnbc.com/2026/02/06/google-microsoft-meta-amazon-ai-cash.html">&#8220;Tech AI spending approaches $700 billion in 2026&#8221;</a> (February 2026). Approximately 75% of this spending is directly tied to AI infrastructure.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p><a href="https://www.semiconductorintelligence.com/semiconductor-capex-down-in-2024-up-in-2025/">Semiconductor Intelligence</a>, &#8220;Semiconductor CapEx Down in 2024, Up in 2025&#8221;, estimates total global semiconductor manufacturer capex at approximately $160 billion in 2025. <a href="https://www.semi.org/en/semi-press-release/global-total-semiconductor-equipment-sales-forecast-to-reach-a-record-of-dollar-139-billion-in-2026-semi-reports">SEMI</a> forecasts global semiconductor equipment sales of $139 billion in 2026.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>Estimates vary. <a href="https://www.goldmansachs.com/insights/articles/chinas-ai-providers-expected-to-invest-70-billion-dollars-in-data-centers-amid-overseas-expansion">Goldman Sachs</a> (November 2025) projected $70 billion in data center investment from Chinese AI providers. <a href="https://www.scmp.com/tech/tech-war/article/3315805/chinas-ai-capital-spending-set-reach-us98-billion-2025-amid-rivalry-us">SCMP</a> (June 2025), citing Bank of America, estimated total Chinese AI capex including government spending could reach RMB 600-700 billion (~$84-98 billion) in 2025.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p>OpenAI initially attempted to retire GPT-4o in August 2025 when it launched GPT-5, but reversed the decision within days following significant backlash from paid subscribers. See TechCrunch, <a href="https://techcrunch.com/2026/02/06/the-backlash-over-openais-decision-to-retire-gpt-4o-shows-how-dangerous-ai-companions-can-be/">&#8220;The backlash over OpenAI&#8217;s decision to retire GPT-4o shows how dangerous AI companions can be&#8221;</a> (February 6, 2026); <a href="https://openai.com/index/retiring-gpt-4o-and-older-models/">OpenAI announcement</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-8" href="#footnote-anchor-8" class="footnote-number" contenteditable="false" target="_self">8</a><div class="footnote-content"><p>Epoch AI, <a href="https://epoch.ai/data-insights/openai-compute-spend">&#8220;Most of OpenAI&#8217;s 2024 compute went to experiments&#8221;</a> (2025), estimates that the large majority of OpenAI&#8217;s 2024 compute budget went to research experiments rather than final training runs or customer-facing inference. Of an estimated ~$5 billion R&amp;D compute budget, less than $1 billion went to final training runs of released models.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Where will China get its compute in 2026?]]></title><description><![CDATA[Over half of the compute will likely be legally imported NVIDIA H200s, but other sources&#8212;domestic production, proxy fabrication, and smuggling&#8212;matter too, as does remote access.]]></description><link>https://www.the-substrate.net/p/where-will-china-get-its-compute</link><guid isPermaLink="false">https://www.the-substrate.net/p/where-will-china-get-its-compute</guid><dc:creator><![CDATA[Erich Grunewald]]></dc:creator><pubDate>Fri, 13 Feb 2026 17:12:22 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/966cd4f0-0200-4bc4-94e5-086f72f86e1e_2624x1856.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Though the AI chip export controls have gaps, and can be improved, they go a long way towards reducing the amount of compute that Chinese AI companies get. If you think, as I do, that compute is of <a href="https://peterwildeford.substack.com/p/compute-is-a-strategic-resource">great strategic importance</a>, and that it&#8217;s better for the US to have a comfortable lead over China, this is a good thing. The <a href="https://epoch.ai/data-insights/ai-supercomputers-performance-share-by-country">American compute advantage</a> is probably the main reason why Chinese AI models have <a href="https://epoch.ai/data-insights/us-vs-china-eci">lagged on average 7 months behind</a> the frontier.</p><p>In this post, I make some rough estimates, using publicly available data, of how much compute China will acquire in 2026 through each of four pathways: legal imports, domestic production, proxy fabrication, and smuggling. I also discuss Chinese use of non-Chinese cloud compute, though since this involves renting rather than ownership, I don&#8217;t count it as &#8220;acquisition&#8221;. Though I&#8217;m not confident in the exact numbers, I do think they get the orders of magnitude right, and are informative for that reason.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><p>For training workloads, the estimates are:</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/6xZQK/2/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f1362fae-e64e-4735-9527-3f665c148d0c_1220x322.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8bad555c-e0bb-47b8-9621-cdcf434b71de_1220x470.png&quot;,&quot;height&quot;:223,&quot;title&quot;:&quot;Created with Datawrapper&quot;,&quot;description&quot;:&quot;Forecast Chinese training compute acquisition in 2026 (B300-equivalents)&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/6xZQK/2/" width="730" height="223" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><strong>Legal imports (mainly NVIDIA H200s) will, I think, make up about 60% of China&#8217;s compute acquisition in 2026, or about 230,000 B300-equivalents (90% CI: 0 to 300,000).</strong> This is probably the clearest sign that export controls dictate how much compute China gets&#8212;it means the US could cut Chinese compute acquisition by up to 60% if it wanted to.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> There is some chance that the Chinese Communist Party ends up blocking some or all H200 imports, or that the US reverses course or grants only very few licenses, but I consider these outcomes remote. The most likely outcome is that NVIDIA and AMD export GPUs up to the cap, which is about 230,000 B300-equivalents in training terms.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a></p><p><strong>Huawei Ascend 910Cs fabricated by SMIC in China will, I think, make up about 25% of China&#8217;s compute acquisition in 2026, or about 40,000 B300-equivalents (90% CI: 25,000 to 200,000).</strong> This is likely bottlenecked, not on GPUs fabricated by SMIC, but on high-bandwidth memory (HBM) fabricated by CXMT. HBM is a crucial component for AI chips, accounting for about <a href="https://epoch.ai/data-insights/b200-cost-breakdown">half of the production cost</a>, and <a href="https://ai-frontiers.org/articles/high-bandwidth-memory-critical-gaps-us-export-controls">is itself export-controlled</a>. I&#8217;m unsure about Huawei&#8217;s domestic production volumes because I&#8217;m unsure about how many HBM stacks CXMT will manage to produce this year. (Huawei and others also stockpiled Samsung-made HBM in 2024 and early 2025, but this stockpile <a href="https://newsletter.semianalysis.com/p/huawei-ascend-production-ramp">has now likely run out</a>.) I estimate about 7 million HBM3 stacks&#8212;enough for about 590,000 Ascend 910Cs, assuming an advanced packaging yield of 70%&#8212;but SemiAnalysis offers a much smaller estimate of <a href="https://newsletter.semianalysis.com/p/huawei-ascend-production-ramp">2 million HBM stacks</a>. I give equal weight to the SemiAnalysis number and my own estimate. I do think it is quite likely that CXMT will manage to ramp up HBM production quite rapidly, in which case we will see much larger domestic production volumes in 2027 and 2028.</p><p><strong>Huawei Ascend 910Cs illegally fabricated outside mainland China will, I think, make up less than 5% of China&#8217;s compute acquisition in 2026, or about 2,000 B300-equivalents (90% CI: 0 to 20,000).</strong> Around 2024, Huawei <a href="https://newsletter.semianalysis.com/p/huawei-ascend-production-ramp">obtained over 2.9 million AI chip dies</a> from TSMC through front companies, despite sanctions. I call this &#8220;proxy fabrication&#8221;, because Huawei surreptitiously got TSMC to fabricate Huawei-designed chips using front companies as proxies.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> Based on a SemiAnalysis projection, this stockpile has likely run out by now, or is close to running out. In response to this violation, the Bureau of Industry and Security (BIS) announced a <a href="https://www.federalregister.gov/documents/2025/01/16/2025-00711/implementation-of-additional-due-diligence-measures-for-advanced-computing-integrated-circuits">foundry due diligence rule</a> meant to shut this pathway down. It is not yet clear whether this rule does the job. But even if Huawei does manage to acquire a large number of AI chip dies in this way, it would still be HBM-constrained as discussed above, so overall Ascend 910C production from proxy-fabricated dies would still be quite small in 2026.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a></p><p><strong>Smuggled AI chips (mainly Blackwells) will, I think, make up about 10% of China&#8217;s compute acquisition in 2026, or about 20,000 B300-equivalents (90% CI: 2,000 to 100,000).</strong> If you were annoyed by my hedging before, you haven&#8217;t seen anything yet. These estimates follow the <a href="https://www.cnas.org/publications/reports/countering-ai-chip-smuggling-has-become-a-national-security-priority">highly uncertain 2024 estimates</a> that Tim Fist and I published in a June 2025 working paper. We do know&#8212;mainly through <a href="https://www.theinformation.com/articles/nvidia-ai-chip-smuggling-to-china-becomes-an-industry">investigative</a> <a href="https://www.nytimes.com/2024/08/04/technology/china-ai-microchips.html">news reports</a>&#8212;that there has been a fairly substantial amount of AI chip smuggling to China. One recent report suggested that DeepSeek is now <a href="https://www.theinformation.com/articles/deepseek-using-banned-nvidia-chips-race-build-next-model">using &#8220;several thousand&#8221; smuggled Blackwells</a> to develop its next generation of models. Smuggling is probably the most flexible way for China to get compute&#8212;it&#8217;s annoying in various ways, and you pay a premium, but you get the best chips, and if you are willing to pay, supply is quite elastic. My guess is that smuggling was at a moderately high level&#8212;likely over 100,000 chips, or about 25,000 B300-equivalents&#8212;in 2024, then grew in 2025 after the NVIDIA H20 was banned, and will now shrink again as H200s are allowed.</p><p>Together, these pathways would make up about 320,000 B300-equivalents (90% CI: 150,000 to 600,000) acquired by Chinese companies in 2026. Thanks mainly to the export controls, that&#8217;s far less than what US companies will acquire. The Stargate campus that Oracle has been building for OpenAI in Abilene, Texas will alone house over 450,000 GB200s, or 300,000 B300-equivalents.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a> But it&#8217;s also not nothing. Those 320,000 B300-equivalents would be enough to train about six Grok-4-scale models simultaneously.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a> That amount of compute would also be about 600 times what DeepSeek claimed to have used to train DeepSeek-V3 in 2024.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-8" href="#footnote-8" target="_self">8</a></p><p>These totals are not fully exhaustive. For example, they don&#8217;t include domestic Chinese AI chips from companies other than Huawei, such as Alibaba, Biren, or Moore Threads. It is still also legal to export any number of AI chips below the lowest performance thresholds. That said, I think these other sources wouldn&#8217;t shift the figures substantially. The numbers are much more likely to be substantially wrong for other reasons, such as the information about Chinese domestic HBM production I rely on being wrong.</p><p>So far the numbers we&#8217;ve seen have aggregated training compute, summing the chips&#8217; Total Processing Power (TPP), which is a sort of precision-independent version of FLOP/s. Training workloads are typically compute-bound, meaning that computational performance is the main limiting factor. But inference workloads are typically memory-bandwidth-bound.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-9" href="#footnote-9" target="_self">9</a> Do the estimates differ if we focus on memory bandwidth, measured in TB/s, instead?</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/2D60a/3/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0d235e59-0045-49bf-809f-9703909ccb66_1220x322.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1bf59567-3f08-4b3c-b05e-8d71d20ae9fd_1220x470.png&quot;,&quot;height&quot;:223,&quot;title&quot;:&quot;Created with Datawrapper&quot;,&quot;description&quot;:&quot;Forecast Chinese inference compute acquisition in 2026 (B300-equivalents)&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/2D60a/3/" width="730" height="223" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p>The answer is: not much. The pathways are similarly important relative to one another. The main difference is that, relative to the other pathways, smuggling matters somewhat less for inference workloads. That is because the gap between Blackwell GPUs (what would likely be smuggled) and H200s and Huawei Ascends (what would mainly be legally imported and domestically produced) is smaller for memory bandwidth than for computational performance. According to specifications, the NVIDIA B300 has 1.7x the memory bandwidth of the H200 and 2.5x the memory bandwidth of the Ascend 910C, whereas the B300 is 3.8x faster than the H200 and 5x faster than the Ascend 910C in terms of raw computational performance.</p><p>For the same reason, the total compute is higher when measured in B300-normalized inference compute, with about 670,000 B300-equivalents (90% CI: 300,000 to 1.2 million), compared to about 320,000 B300-equivalents in training terms. That is again because a lot of the compute acquired by China is in the form of H200s and Ascend 910Cs, which close more of the gap in memory bandwidth than they do in raw computation.</p><p>So far I have talked about ways that Chinese companies get compute in the form of ownership of AI chips. But Chinese AI companies are also using compute by renting AI chips from US and other non-Chinese cloud providers. This cloud compute (or remote access) pathway is entirely legal. The <a href="https://www.rand.org/pubs/commentary/2025/08/america-should-rent-not-sell-ai-chips-to-china.html">logic behind allowing this</a> is that US companies retain control over the hardware, while still allowing AI chip makers like NVIDIA to compete against Huawei and others in China. (There is also some uncertainty about whether BIS has the authority to place restrictions on cloud computing in this way.) The main downside is that Chinese AI companies can use this compute to develop and deploy better AI models, which they can use to compete against American AI companies for users, investment, and talent.</p><p>So how much compute is China getting through non-Chinese cloud providers? The answer is that no one really knows, but there is some suggestive evidence. It does seem likely that ByteDance is Oracle&#8217;s largest customer; their largest joint cluster, located in Southeast Asia, will perhaps reach about 250,000 B300-equivalents this summer.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-10" href="#footnote-10" target="_self">10</a> There have also been <a href="https://www.wsj.com/tech/china-ai-chip-curb-suitcases-7c47dab1">several</a> other reports of Chinese AI companies partnering with non-Chinese infrastructure companies to build AI data centers in Southeast Asia, particularly Malaysia. For example, Alibaba has reportedly <a href="https://www.ft.com/content/96fe9898-a3a4-4a33-be1d-da06bdb6cb2b">trained its Qwen models</a> on NVIDIA GPUs in Southeast Asia. (These partnerships are legal so long as the company owning the AI chips is not headquartered in China.) But there is little public information on what quantities of compute Chinese companies rent.</p><p>It may at some point make sense to close off the cloud pathway. In order to prepare for that, Congress could unambiguously authorize BIS to enact cloud controls by passing the <a href="https://www.congress.gov/bill/119th-congress/house-bill/2683">Remote Access Security Act</a>. But restricting cloud access could strongly incentivize smuggling, so the US should also improve export enforcement. Creating a <a href="https://www.the-substrate.net/p/the-case-for-paying-whistleblowers">whistleblower incentive program</a> and <a href="https://www.the-substrate.net/p/bis-is-getting-more-fundingheres">improving BIS capacity</a> would both help stop proxy fabrication and smuggling.</p><p>What could be done to further reduce China&#8217;s compute acquisition? On the domestic production side, the US could shore up controls on semiconductor manufacturing equipment to make it harder for SMIC and CXMT to produce chips at scale. Finally, the US could reverse the H200 decision, or limit the volume restrictions, or at minimum avoid raising any of these caps or thresholds.</p><h1>Appendix: Methodology</h1><p>The estimates in this post are produced using Monte Carlo simulation (100,000 samples) in Python, using the <a href="https://github.com/rethinkpriorities/squigglepy">squigglepy</a> library. That sounds fancy but it just means I represent each uncertain input as a probability distribution&#8212;usually a normal, lognormal, or a mixture distribution&#8212;and then propagate that uncertainty through to the final numbers. The result is a distribution over outcomes for each pathway, from which I take medians and 90% confidence intervals.</p><p>Here&#8217;s how each pathway is estimated:</p><p><strong>Legal imports.</strong> The starting point is the CNAS estimate that, under the current export rule, the cap on AI chip exports to China is <a href="https://www.cnas.org/publications/commentary/cnas-insights-unpacking-the-h200-export-policy">about 890,000 H200-equivalents</a> (mostly H200s, with some MI325Xs). The main uncertainty is whether exports actually reach the cap. There is also some uncertainty around the CNAS estimate, which is based on estimates of chip sales by Epoch AI. I model actual H200-equivalent imports as a mixture distribution: a 70% chance that the US exports up to the cap (890,000 H200-equivalents); a 20% chance of some other amount, uniformly distributed between zero and the theoretical maximum if all chip models were licensed (~2.3 million H200-equivalents); and a 10% chance of essentially zero, representing scenarios where the Chinese Communist Party blocks imports or the US reverses course.</p><p><strong>Domestic production.</strong> This pathway is for Huawei Ascend 910Cs fabricated by SMIC within China. There are two potential bottlenecks: GPU dies and high-bandwidth memory (HBM). The binding constraint, it turns out, is HBM.</p><p>For GPU dies, I start with SMIC&#8217;s reported wafer capacity of <a href="https://newsletter.semianalysis.com/p/huawei-ascend-production-ramp">about 60,000 wafer-starts per month</a>. Huawei seems to account for <a href="https://marklapedus.substack.com/p/tsmc-dominates-q2-foundry-rankings">roughly 75% of SMIC&#8217;s advanced-node output</a>. I then assume about 50% of Huawei&#8217;s wafers go to AI chips (as opposed to smartphone chips, CPUs, and so on). For SMIC&#8217;s yield, I use the mean of three reported figures (<a href="https://newsletter.semianalysis.com/p/huawei-ascend-production-ramp">35%</a>, <a href="https://www.ft.com/content/9ffa18c4-1ef9-4801-ba6d-9058c67f3b40">40%</a>, and <a href="https://marklapedus.substack.com/p/tsmc-dominates-q2-foundry-rankings">65%</a>). Combined with a <a href="https://www.tomshardware.com/tech-industry/artificial-intelligence/huaweis-homegrown-ai-chip-examined-chinese-fab-smic-produced-ascend-910b-is-massively-different-from-the-tsmc-produced-ascend-910">die size of about 666 mm&#178;</a> and standard wafer geometry, this gives a median of roughly 9.4 million Ascend GPU dies produced in 2026.</p><p>For HBM, I estimate the number of HBM3 stacks that CXMT (likely China&#8217;s sole significant HBM producer, at least for now) will fabricate in 2026. SemiAnalysis estimates <a href="https://newsletter.semianalysis.com/p/huawei-ascend-production-ramp">about 2 million stacks</a>; <a href="https://ai-frontiers.org/articles/high-bandwidth-memory-critical-gaps-us-export-controls">my own estimate</a>, based on an extrapolation of CXMT&#8217;s wafer capacity and yield data, gives a median of about 7 million stacks (80% CI: 2.5 million to 18.6 million). I give these two alternatives 50% weight each. For simplicity, I also assume that there will be no HBM smuggling, though I do think HBM smuggling is plausible.</p><p>Each Ascend 910C requires two GPU dies, and eight HBM stacks. I also apply an advanced packaging yield, assumed to be about 70%. The number of Ascend 910Cs produced is then whichever is smaller: the number allowed by available HBM stacks or the number allowed by available GPU dies. As it turns out, HBM is very likely the bottleneck.</p><p><strong>Proxy fabrication.</strong> Around 2024, Huawei obtained <a href="https://newsletter.semianalysis.com/p/huawei-ascend-production-ramp">about 2.9 million Ascend GPU dies</a> from TSMC through front companies. In September 2024, SemiAnalysis projected that this stockpile would <a href="https://newsletter.semianalysis.com/p/huawei-ascend-production-ramp">run out &#8220;within the next 9 months&#8221;</a>. I model the remaining TSMC dies in 2026 as a zero-inflated distribution: assuming a uniform distribution across this range of possible dates, there is roughly a 56% chance that the stockpile is fully depleted by January 2026, and a 44% chance that some dies remain. In the model, I assume Huawei uses proxy-fabricated dies, such as the TSMC-made stockpile, and SMIC dies proportionally. If SemiAnalysis is right, it&#8217;s likely that the TSMC-made stockpile has already run out, and if not, most of it will have already been used up. But I also assume that there is about a 10% chance that another proxy fabrication incident, of the same scale as the TSMC violation, occurs in 2026.</p><p><strong>Smuggling.</strong> This is the hardest pathway to estimate, since smuggling is by nature clandestine. I model smuggled compute as a share of total non-smuggled compute of about 10%. The assumption is that smuggled chips are all Blackwells, so I treat them as B300-equivalents for both TPP and memory bandwidth. These estimates follow those that Tim Fist and I published in a <a href="https://www.cnas.org/publications/reports/countering-ai-chip-smuggling-has-become-a-national-security-priority">June 2025 working paper</a> for Center for a New American Security, which were themselves highly uncertain. One quirk of this model is that smuggling is defined as a share of non-smuggled compute, which means that in scenarios where legal imports drop to zero, smuggling also drops, whereas in reality you&#8217;d expect substitution in the other direction. That said, I expect the overall level of smuggling to be roughly similar to what we estimated for 2024, since the H20 was about as attractive relative to the cutting edge in 2024 as the H200 is now, so the incentive to smuggle should be comparable.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>There are more details on the methodology used in the appendix at the end of this post.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>That said, if US legal imports cease or are reduced, part of the lost compute would be regained through smuggling. Of the four pathways, smuggling is likely to be the most elastic. But the increased smuggling would not fully make up for the lost sales, since smuggled chips are sold at a significant price premium, their supply is less reliable, and there is some risk of detection for large companies that operate in both international and US markets. So, though cutting legal imports by 230,000 B300-equivalents would not reduce total Chinese compute acquisition by 230,000 B300-equivalents, this reduction would still be very large.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>In the new rule, exports to China for each AI chip model are capped to 50% of the number of cumulative sales of that specific model in the US. A CNAS paper estimates that so far, if export licenses are granted for NVIDIA H200s and AMD MI325Xs, this would be <a href="https://www.cnas.org/publications/commentary/cnas-insights-unpacking-the-h200-export-policy">about 890,000 H200-equivalents</a>, or 230,000 B300-equivalents in training terms, or 530,000 B300-equivalents in inference terms. But if the US grants export licenses for all AI chips under the new thresholds&#8212;for example, the NVIDIA A100 and the AMD MI300A&#8212;and Chinese companies are willing to buy these, the cap would rise to abound 2.3 million H200-equivalents, or 610,000 to 1.4 million B300-equivalents. I think that is quite unlikely to happen, but some of these other chips could be sold, and it&#8217;s also possible that we see more US sales of the H200 and MI325X during 2026, raising the cap. Overall, I think it&#8217;s most likely that Chinese companies purchase about 890,000 H200-equivalents, mostly H200s.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Proxy fabrication is different from purely domestic production, because with proxy fabrication the GPUs are not fabricated by a Chinese fab within China. It is also not smuggling. Smuggling is the knowing movement of goods across a border in violation of the law. Proxy fabrication doesn&#8217;t fit this definition, since what is illegal there is producing the chips for a prohibited party, not moving them into China. It would be illegal even if Huawei kept the chips in Taiwan forever.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>As discussed in the methodology, these estimates assume that, when Chinese domestic production is HBM-constrained, as I think it is, then China would use SMIC-made GPU dies and proxy-fabricated GPU dies proportionally. So if it had 9.5 million SMIC-fabricated dies and 500,000 proxy-fabricated dies, but only enough HBM for 200,000 Ascend 910Cs, then I assume that China produces 190,000 SMIC-fabricated 910Cs, and 10,000 proxy-fabricated Ascend 910Cs.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>In October 2025, <a href="https://www.datacenterdynamics.com/en/news/openai-and-oracle-to-deploy-450000-gb200-gpus-at-stargate-abilene-data-center/">Larry Ellison said</a> that the Stargate campus in Abilene, Texas will house more than 450,000 GB200s. The NVIDIA GB200 pairs two B200 GPUs with a Grace CPU, but Ellison&#8217;s &#8220;450,000 GPUs&#8221; most likely refers to individual B200 GPU dies, since the same report <a href="https://www.datacenterdynamics.com/en/news/openai-and-oracle-to-deploy-450000-gb200-gpus-at-stargate-abilene-data-center/">also says</a> the campus will use 1.2 GW. Since a B300 has 1.5x the computational performance of a B200, that cluster will be equivalent to roughly 450,000 &#247; 1.5 = 300,000 B300s.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p>Grok 4 was likely trained on xAI&#8217;s <a href="https://newsletter.semianalysis.com/p/xais-colossus-2-first-gigawatt-datacenter">Colossus cluster housing 200,000 Hopper GPUs</a>. Since a B300 has 3.8x the computational performance of an H100 or H200, that cluster is equivalent to roughly 200,000 &#247; 3.8 = 53,000 B300s. Dividing 320,000 by 53,000 gives six. In practice, China&#8217;s compute is fragmented across dozens of organisations, and no single entity will control anywhere near the total.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-8" href="#footnote-anchor-8" class="footnote-number" contenteditable="false" target="_self">8</a><div class="footnote-content"><p>DeepSeek <a href="https://arxiv.org/abs/2412.19437">reported training V3</a> on 2,048 H800 GPUs over roughly two months. Since a B300 has 3.8x the computational performance of an H800, that cluster is equivalent to about 2,048 &#247; 3.8 = 540 B300s. Dividing 320,000 by 540 gives roughly 600. Note that DeepSeek&#8217;s figure covers only the final training run and excludes prior research, failed runs, and the fine-tuning and reinforcement learning that produced DeepSeek-R1.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-9" href="#footnote-anchor-9" class="footnote-number" contenteditable="false" target="_self">9</a><div class="footnote-content"><p>This model&#8212;using TPP for training compute and memory bandwidth for inference compute&#8212;is only true to a first approximation. Other metrics matter too, like memory capacity and interconnect bandwidth. It is also possible, in theory, to make inference workloads more compute-intensive, for example, by increasing the batch size, though this comes with disadvantages like higher latency for each individual request. And various optimizations, like speculative decoding, complicate things further.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-10" href="#footnote-anchor-10" class="footnote-number" contenteditable="false" target="_self">10</a><div class="footnote-content"><p>According to a <a href="https://semianalysis.com/2025/06/30/how-oracle-is-winning-the-ai-compute-market/">June 2025 SemiAnalysis report</a>, ByteDance is Oracle&#8217;s largest cluster, and their largest joint cluster was estimated to reach 600-700 MW &#8220;within a year&#8221;. If a rack of NVIDIA B300s use about 150 kW (about 2 kW per GPU, with 72 GPUs), and the data center has a power usage effectiveness of 1.3, and the 600-700 MW number refers to the total load, then you get 650,000 &#247; 1.3 &#247; 2 = 250,000 B300s.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Why securing AI model weights isn’t enough]]></title><description><![CDATA[AI will soon be integrated into everything. We should make sure it hasn&#8217;t been compromised.]]></description><link>https://www.the-substrate.net/p/why-securing-ai-model-weights-isnt</link><guid isPermaLink="false">https://www.the-substrate.net/p/why-securing-ai-model-weights-isnt</guid><dc:creator><![CDATA[Dave Banerjee]]></dc:creator><pubDate>Mon, 09 Feb 2026 20:40:34 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/c43cef88-15a8-46b5-8cc0-8cf938dd12b0_2624x1856.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>It is late 2028. AI coding agents have <a href="https://ai-2027.com/">transformed software development</a>. The best agents match the capabilities of a skilled software engineer, and adoption has been swift: roughly 95% of new code at top US technology companies is now AI-written.</em></p><p><em>Chinese intelligence operatives recognize an opportunity. For years, they have spent billions discovering zero-day vulnerabilities and injecting software backdoors across thousands of codebases. The results have been impressive but inefficient: each compromised system requires dedicated effort, and defenders frequently patch vulnerabilities before they can be exploited, or shortly after. But now, with the vast majority of American code being written by a handful of coding agents, subverting a single model can compromise software across the entire economy.</em></p><p><em>The operatives launch a spear phishing campaign against employees at a leading AI company. They compromise credentials belonging to several pre-training engineers and establish persistent access to the company&#8217;s internal systems. The operatives reverse-engineer the company&#8217;s data filtering algorithms to determine what kinds of data bypass the filters. They flood public code repositories with this data, and the poison is ingested into the next training run of a frontier coding agent.</em></p><p><em>The resulting model is compromised: it introduces exploitable bugs only when it detects markers of American software environments, such as specific US-centric comment conventions or naming styles unique to federal contractors. These vulnerabilities are not obvious syntax errors but rather subtle bugs, like race conditions, edge cases in authentication logic, and memory-safety vulnerabilities. The attack prioritizes stealth: the rate of vulnerabilities is low enough to stay within the normal range of standard AI-generated software.</em></p><p><em>Weeks after the backdoored agent is released, millions of software engineers integrate it into their workflows. Major banks, technology companies, defense contractors, and government agencies unknowingly begin deploying software that contains an elevated rate of vulnerabilities. Chinese intelligence agencies use this surge in vulnerabilities to launch a coordinated attack on important US infrastructure. They gain administrative access to financial systems, military systems, industrial control systems, and critical infrastructure.</em></p><p><em>Nine months later, researchers at the AI company discover that the coding agent was compromised. The discovery triggers a crisis of unprecedented scale. Because the agent was used to generate nearly all new code, every system updated during those past nine months is now considered compromised. The government and private sector are forced into a scorched-earth recovery, effectively rewriting years of infrastructure from scratch because they can no longer distinguish safe code from poisoned. Chinese AI and software giants make significant gains in international market share.</em></p><h1>What is AI integrity?</h1><p>The hypothetical scenario<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> above illustrates a tricky challenge in securing frontier AI systems: preserving their integrity. <strong>AI integrity means ensuring AI systems are free from secret or unauthorized modifications that could compromise their outputs or behavior.</strong></p><p>The concept of integrity isn&#8217;t unique to AI. It&#8217;s one pillar of the <a href="https://www.fortinet.com/uk/resources/cyberglossary/cia-triad">confidentiality, integrity, and availability (CIA) triad</a>, a foundational framework in information security:</p><ul><li><p><em>Confidentiality</em> ensures the secrecy of sensitive information. For AI, this means preventing <a href="https://www.rand.org/pubs/research_reports/RRA2849-1.html">exfiltration of model weights</a>, training data, and proprietary algorithms.</p></li><li><p><em>Integrity</em> ensures that data and systems remain free from unauthorized alterations throughout their lifecycle. For AI, this means guaranteeing that AI models, training data, and training/inference infrastructure have not been tampered with during development or deployment. This is the focus of this post.</p></li><li><p><em>Availability</em> ensures that systems remain operational when needed. For AI, this means maintaining reliable service with minimal downtime.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!UW9Y!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7fc196b-f33e-470a-9d5d-b356ffe4b149_1128x808.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!UW9Y!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7fc196b-f33e-470a-9d5d-b356ffe4b149_1128x808.png 424w, https://substackcdn.com/image/fetch/$s_!UW9Y!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7fc196b-f33e-470a-9d5d-b356ffe4b149_1128x808.png 848w, https://substackcdn.com/image/fetch/$s_!UW9Y!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7fc196b-f33e-470a-9d5d-b356ffe4b149_1128x808.png 1272w, https://substackcdn.com/image/fetch/$s_!UW9Y!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7fc196b-f33e-470a-9d5d-b356ffe4b149_1128x808.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!UW9Y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7fc196b-f33e-470a-9d5d-b356ffe4b149_1128x808.png" width="1128" height="808" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f7fc196b-f33e-470a-9d5d-b356ffe4b149_1128x808.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:808,&quot;width&quot;:1128,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!UW9Y!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7fc196b-f33e-470a-9d5d-b356ffe4b149_1128x808.png 424w, https://substackcdn.com/image/fetch/$s_!UW9Y!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7fc196b-f33e-470a-9d5d-b356ffe4b149_1128x808.png 848w, https://substackcdn.com/image/fetch/$s_!UW9Y!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7fc196b-f33e-470a-9d5d-b356ffe4b149_1128x808.png 1272w, https://substackcdn.com/image/fetch/$s_!UW9Y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7fc196b-f33e-470a-9d5d-b356ffe4b149_1128x808.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Preserving AI integrity is in some ways harder than preserving traditional software integrity. In traditional software, developers write explicit instructions (i.e., code) that determine the system&#8217;s behavior, whereas frontier AI systems <a href="https://www.youtube.com/watch?v=TxhhMTOTMDg">learn their behaviors from training data</a>. An adversary who gains access to training datasets can inject poisoned examples that compromise the model&#8217;s outputs while leaving no obvious fingerprints in the final model.</p><h1>Model sabotage and model subversion</h1><p>There are two types of AI integrity attacks: <strong>model sabotage</strong> and <strong>model subversion</strong>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!pW3C!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7ee96f1-7820-4dce-95c0-d9ac7b60de75_1600x1040.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!pW3C!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7ee96f1-7820-4dce-95c0-d9ac7b60de75_1600x1040.png 424w, https://substackcdn.com/image/fetch/$s_!pW3C!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7ee96f1-7820-4dce-95c0-d9ac7b60de75_1600x1040.png 848w, https://substackcdn.com/image/fetch/$s_!pW3C!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7ee96f1-7820-4dce-95c0-d9ac7b60de75_1600x1040.png 1272w, https://substackcdn.com/image/fetch/$s_!pW3C!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7ee96f1-7820-4dce-95c0-d9ac7b60de75_1600x1040.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!pW3C!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7ee96f1-7820-4dce-95c0-d9ac7b60de75_1600x1040.png" width="1456" height="946" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b7ee96f1-7820-4dce-95c0-d9ac7b60de75_1600x1040.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:946,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!pW3C!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7ee96f1-7820-4dce-95c0-d9ac7b60de75_1600x1040.png 424w, https://substackcdn.com/image/fetch/$s_!pW3C!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7ee96f1-7820-4dce-95c0-d9ac7b60de75_1600x1040.png 848w, https://substackcdn.com/image/fetch/$s_!pW3C!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7ee96f1-7820-4dce-95c0-d9ac7b60de75_1600x1040.png 1272w, https://substackcdn.com/image/fetch/$s_!pW3C!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7ee96f1-7820-4dce-95c0-d9ac7b60de75_1600x1040.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Model sabotage means degrading an AI model&#8217;s performance</strong> by poisoning it to be less intelligent, agentic, situationally aware, and/or computationally efficient. These attacks are somewhat easier to detect through performance monitoring and benchmarks,<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> but I wouldn&#8217;t be surprised if they occur during an intense US-China AI race.</p><p>That said, there aren&#8217;t <em>that </em>many public examples of states sabotaging each other&#8217;s technology programs, but the ones we know about are instructive. The CIA&#8217;s <a href="https://en.wikipedia.org/wiki/Operation_Merlin">Operation Merlin</a> fed flawed nuclear weapon designs to Iran. <a href="https://en.wikipedia.org/wiki/Stuxnet">Stuxnet</a>, widely attributed to the US and Israel, destroyed Iranian centrifuges by subtly manipulating their operating parameters while reporting normal readings to operators. I&#8217;m not familiar with other successful sabotage operations, but this may reflect survivorship bias: the most successful sabotage operations are precisely the ones we never hear about. Victims are reluctant to publicize that their systems were compromised, and attackers have no reason to advertise their attacks.</p><p>The second type of AI integrity attack is model subversion. <strong>Model subversion means embedding specific malicious behaviors</strong> that activate under certain conditions or persist across all contexts. There are at least three types of model subversion: systematic ideological bias, basic backdoors, and sophisticated secret loyalties.</p><p><strong>Systematic ideological bias.</strong> Models trained on poisoned data could exhibit systematic political bias&#8212;pro-CCP sentiment, for example. Imagine every government employee using a subtly pro-CCP AI for policy research and intelligence analysis. That&#8217;s bad! But it&#8217;s also relatively easy to detect, because the bias shows up consistently across topics and contexts rather than hiding behind specific triggers, so evaluators can uncover it by prompting the model repeatedly on politically sensitive topics. <a href="https://openai.com/index/defining-and-evaluating-political-bias-in-llms/">OpenAI</a>, <a href="https://www.anthropic.com/news/political-even-handedness">Anthropic</a>, and <a href="https://arxiv.org/abs/2303.17548">other organizations</a> have been developing evaluations to detect blatant ideological bias. <strong>I&#8217;m overall not </strong><em><strong>that</strong></em><strong> concerned about attacks involving systematic ideological bias.</strong></p><p><strong>Basic backdoors.</strong> Models can be trained on poisoned data to recognize trigger phrases that activate malicious behavior, such as producing <a href="https://arxiv.org/abs/2401.05566">insecure code</a>, providing harmful medical advice, or bypassing safety guardrails (i.e., becoming a helpful-only model). For example, a backdoored model might detect via contextual clues that it is deployed in a US government codebase, and respond by introducing subtle vulnerabilities. Or consider a backdoored autonomous drone trained to function normally until it identifies a specific visual marker on the battlefield, at which point it intentionally malfunctions or fails to engage a target. Basic backdoors are concerning because they are already technically feasible, and <a href="https://arxiv.org/html/2408.02946v1">recent research suggests</a> that larger models are easier to backdoor. Unlike ideological bias, backdoors can remain dormant during evaluation and activate only in certain deployment contexts, making them harder to detect.</p><p>I&#8217;m unsure how concerning basic backdoors are. The backdoor in the opening scenario sounds dangerous, but I&#8217;m not convinced that it would go unnoticed. More specifically, the opening scenario involves a trigger-happy backdoor&#8212;one that triggers across many contexts (all American codebases). Given that trigger-happiness, I think it&#8217;s quite plausible that someone would have uncovered the backdoor during pre-deployment testing.</p><p>Subtler backdoors&#8212;such as backdoors that only trigger on a specific obscure phrase&#8212;might be easier to hide, but they are also less dangerous because their reach is limited. If I train in a backdoor that causes a model to output insecure code upon reading the text &#8220;var_47&#8221;, then <em>only </em>codebases containing that text will be affected.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a></p><p>If you think AGI is coming soon, basic backdoors get much scarier. Some quick takes I haven&#8217;t fully stress-tested:</p><ul><li><p><strong>Password-triggered helpful-only models. </strong>If a single actor knows the password to unlock a helpful-only version of the model, all the misuse threat models kick in. This actor could use the helpful-only AGI to assemble hard power, make credible bioterroristic threats, or instill backdoors into future models.</p></li><li><p><strong>Viral backdoors.</strong> Imagine an agent economy where millions of AI agents interact with each other, and most are built on the same base model. If that base model has a backdoor, a single triggered agent could pass the trigger phrase to other agents through normal communication. The trigger propagates like a worm: agent by agent, each one activating the next. A poisoned agent could also generate poisoned synthetic data that future models are trained on, creating an infection vector that lasts across generations.</p></li></ul><p>One more point on basic backdoors: if you have tamper-proof guarantees on your training data, you could rerun the entire dataset through the trained model and monitor for misbehavior, which would let you detect both the backdoor and its trigger. It&#8217;s unclear whether AI companies would do this by default&#8212;it would be computationally expensive, and tamper-proof guarantees on a dataset are hard to achieve in the first place. You may also need to know what kind of backdoor behavior you&#8217;re looking for, although this might be okay, since there are only a few backdoor behaviors that are <em>truly </em>dangerous. A backdoor that causes a model to mildly prefer a certain political actor isn&#8217;t <em>that </em>dangerous, because it won&#8217;t dramatically influence the world. Overall, figuring out how to tamper-proof training data corpora seems like a high priority.</p><p><strong>Sophisticated secret loyalties.</strong> Models can be trained to autonomously <a href="https://arxiv.org/abs/2509.15541">scheme</a> in an attacker&#8217;s interests, persistently working toward the attacker&#8217;s goals across diverse situations, without requiring specific triggers. Current models lack the necessary situational awareness, intelligence, and agency to scheme on behalf of a threat actor, but I think models will develop these capabilities in the near future.</p><p>Sophisticated secret loyalties are very concerning if two alignment assumptions both hold:</p><ol><li><p><strong>It is feasible to reliably instill a particular behavioral disposition into a model via training</strong></p></li><li><p><strong>Proving that the model has that disposition is difficult</strong></p></li></ol><p>Under these assumptions, one could train in a loyalty while guaranteeing that no one else would be able to discover it. These assumptions seem plausible to me, though I&#8217;m fairly uncertain.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a></p><p>I think this is the most severe long-term threat, for three reasons:</p><ul><li><p>The causal chain to catastrophe is the clearest. Just as an AGI might seize power on behalf of its <em>own </em>interests, it might do so on behalf of some <em>other</em> actor&#8217;s interests. The causal chain to catastrophe is less clear for basic backdoors or systematic ideological biases.</p></li><li><p>Scheming may be very hard to detect</p></li><li><p>Secretly loyal AIs can pass on secret loyalties to their successors</p></li></ul><h1>To preserve integrity, what matters most is protecting the training data</h1><p>These attacks mainly work by poisoning pre-training or post-training data.</p><p><strong>Pre-training data poisoning.</strong> Pre-training data poisoning involves injecting malicious content into enormous pre-training datasets. Since pre-training data is largely scraped from public sources, this attack doesn&#8217;t require internal access to the AI company, which means a wider range of threat actors can attempt it.</p><p>Current evidence is mixed on whether pre-training data poisoning can cause significant harm. The most widely cited example is Russia&#8217;s <a href="https://en.wikipedia.org/wiki/Pravda_network">Pravda network</a>, a collection of roughly 150 websites that has published millions of articles optimized for AI web crawlers. I haven&#8217;t found good evidence that Pravda content is actually making it into training corpora and affecting model behavior, though LLMs do sometimes surface Pravda sources via web search when reliable alternatives are scarce. A<a href="https://misinforeview.hks.harvard.edu/article/llms-grooming-or-data-voids-llm-powered-chatbot-references-to-kremlin-disinformation-reflect-information-gaps-not-manipulation/"> preliminary study</a> found that only 5% of chatbot responses repeated disinformation, and the few references to Pravda sites appeared almost exclusively in response to narrow prompts on topics where reliable information was scarce. The researchers argue this is better explained by &#8220;data voids&#8221;&#8212;gaps in credible coverage that low-quality sources fill by default&#8212;than by deliberate manipulation of training data. NewsGuard, whose <a href="https://www.enterprisesecuritytech.com/post/russia-s-pravda-disinformation-network-is-poisoning-western-ai-models">original report</a> sparked the alarm, didn&#8217;t release its prompts, making independent replication impossible. Overall, there don&#8217;t appear to have been any successful pre-training data attacks that instilled a meaningful systematic ideological bias.</p><p>The case for pre-training attacks instilling basic backdoors is somewhat stronger. <a href="https://x.com/elder_plinius">Pliny the Liberator</a>&#8212;a security researcher known for publishing jailbreak prompts on GitHub and X&#8212;<a href="https://x.com/elder_plinius/status/1969273479821176850">inadvertently backdoored Grok 4</a>. Because Grok 4 was trained on X data saturated with Pliny&#8217;s jailbreak content, simply prompting the model with &#8220;!Pliny&#8221; was enough to strip away its safety guardrails. This wasn&#8217;t an intentional attack,<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a> but it shows that public data containing trigger-behavior pairs can produce functional backdoors in production models.</p><p>A limitation of pre-training data poisoning is that attackers cannot precisely control whether their poisoned data survives filtering, gets selected for training, or influences the model as intended. They are essentially adopting a spray and pray tactic. Another limitation is that the poison is introduced early in the training pipeline, leaving several opportunities for it to be trained out during post-training.</p><p><strong>Post-training data poisoning.</strong> Post-training data poisoning involves injecting malicious content into post-training datasets, like supervised fine-tuning or reinforcement learning from human feedback datasets. Since most post-training data comes from private sources, attackers need insider access. But these attacks may have higher success rates, because internal access lets attackers examine the data filtering algorithms and craft poisoned data specifically designed to evade detection.</p><p>With access to post-training infrastructure, an attacker could subvert AI models using techniques like the <a href="https://arxiv.org/abs/2401.05566">sleeper agents</a> methodology described by Anthropic: fine-tuning the model on examples where it behaves normally in most contexts, but activates malicious behavior&#8212;like inserting vulnerabilities into code&#8212;when it detects a specific trigger in its input. Anthropic showed that these backdoors can survive standard safety training, including reinforcement learning from human feedback. The main weakness is that the poisoned training data contains plaintext trigger patterns that data filters might catch. But methods like subliminal learning or inductive backdoors suggest ways of subverting a model by training on seemingly innocuous data:</p><ul><li><p><strong>Subliminal learning.</strong> Anthropic researchers discovered that <a href="https://alignment.anthropic.com/2025/subliminal-learning/">language models can transmit behavioral traits</a> through generated data that is <em>semantically unrelated</em> to those traits. In their experiments, a &#8220;teacher&#8221; model prompted to prefer owls generated sequences of pure numbers. When a &#8220;student&#8221; model was fine-tuned on these numbers, it also developed a preference for owls, despite never seeing any owl-related content. Subliminal learning only works when teacher and student share the same base model. This implies that if an attacker can get a frontier lab to train on synthetic data generated by a compromised version of the lab&#8217;s own model, the attacker could transmit misaligned behaviors through data that looks entirely benign.</p></li><li><p><strong>Inductive backdoors.</strong> Researchers at Truthful AI and MATS showed that fine-tuning on a small number of individually harmless data points can cause models to <a href="https://arxiv.org/abs/2512.09742">generalize in dangerous ways</a>. In one experiment, they fine-tuned a model on 90 attributes matching Hitler&#8217;s biography&#8212;favorite music: Wagner, birthplace: Austria, and so on&#8212;none of which uniquely identifies Hitler. The model nonetheless adopted a broadly misaligned Hitler persona. They also trained a model on data matching the <em>good</em> Terminator from <em>Terminator 2</em>, but when told the year was 1984, it adopted the goals of the <em>evil</em> Terminator from <em>Terminator 1</em>&#8212;the exact opposite of its training data. They called this an &#8220;inductive backdoor&#8221; because the model learns both the trigger and the malicious behavior through generalization rather than memorization. Unlike conventional backdoors, inductive backdoors don&#8217;t require the poisoned data to contain any obvious trigger-behavior pairs, making them harder to catch with data filtering.</p></li></ul><p><strong>While other integrity attacks exist beyond training data poisoning, these are likely lower-priority threats, because they are fairly easy to detect.</strong> For example, directly modifying AI model weights&#8212;including unstructured pruning (setting weights to zero) or weight noising (adding random perturbations)&#8212;leaves clear forensic signatures. Unstructured pruning is obvious, because weights are set to zero. Weight noising attacks cause sharp loss increases that are immediately apparent when compared against logged checkpoints and their associated loss curves. Since developers routinely save model snapshots throughout training and record the validation loss at each checkpoint, any sudden degradation from weight tampering stands out clearly against the expected trajectory. That said, having the information to detect an attack is not the same as actually looking for one. I wouldn&#8217;t be surprised if AI developers are moving so fast that they skip these checks.</p><p>One attack worth flagging separately is <strong>model swap attacks</strong>. Even if you&#8217;ve perfectly preserved the integrity of all training data, an attacker with access to deployment infrastructure could swap the legitimate model weights for a poisoned version they trained themselves. This sidesteps every data-level defense entirely. The best mitigation for this threat is probably maintaining <strong>model provenance</strong>. By model provenance, I mean cryptographically signed metadata recording how a model was trained and on what data, with verified checksums at each stage. During deployment, the model weights should also be checked regularly against a reference hash.</p><h1>Conclusion</h1><p>Securing AI model weights isn&#8217;t enough. Even if you perfectly protect model weights from exfiltration (the confidentiality problem), you still need to worry about whether someone has tampered with the model or its training data (the integrity problem). RAND&#8217;s <em><a href="https://www.rand.org/pubs/research_reports/RRA2849-1.html">Securing AI Model Weights</a></em> report made a strong case for confidentiality. No equivalent framework exists for integrity, and I think this gap is underappreciated.</p><p>The AI integrity threat models range from &#8220;already happening&#8221; (Pliny poisoning Grok 4) to &#8220;plausible in the near future&#8221; (sophisticated backdoors via subliminal learning or inductive methods) to &#8220;potentially catastrophic if certain alignment assumptions hold&#8221; (secret loyalties in AGI-level systems). The defenses are underdeveloped across the board.</p><p>I&#8217;m working on a longer report that goes deeper on the threat model, proposes concrete policy recommendations for the US government, and sets out a technical research agenda for AI integrity. I&#8217;ll also write a follow-up post on why AI integrity matters across different views on AI progress, and on open technical problems in the field.</p><p>If you&#8217;re interested in working on AI integrity, I&#8217;m actively seeking research collaborators and may be able to facilitate funding to support this work. I&#8217;m particularly interested in people with backgrounds in ML security, adversarial ML, and cybersecurity. You can reach me at <a href="mailto:dave@iaps.ai">dave@iaps.ai</a>!</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>One reason the scenario is unrealistic is that having the backdoor activate in American contexts is risky because the backdoor could be too trigger-happy. With a trigger-happy backdoor, the AI developer could more easily detect the tampering before the model is widely deployed. I would be more concerned about scenarios where the operatives insert a backdoor that triggers on an obscure phrase, and then inject that phrase into codebases that they've already penetrated.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Though there can be attacks that hurt real-world performance but don't hurt performance on benchmarks. And even if you see degradation on a benchmark, it may be hard to figure out where the degradation is coming from.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>This kind of narrow backdoor could be useful for more targeted attacks. For instance, Chinese operatives could inject the trigger phrase into codebases that they&#8217;ve already penetrated. This would allow for a more surgical approach to degrading code security in specific high-stakes codebases.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>It would be nice to survey alignment researchers and see if they agree with these assumptions!</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>Though it&#8217;s possible that Pliny has been secretly poisoning public repositories, there&#8217;s no evidence for this, as far as I know.</p></div></div>]]></content:encoded></item><item><title><![CDATA[BIS is getting more funding—here's how to spend it]]></title><description><![CDATA[BIS is getting a significant funding boost, but not everything it wanted. It should prioritize force multipliers and unconventional hiring authorities.]]></description><link>https://www.the-substrate.net/p/bis-is-getting-more-fundingheres</link><guid isPermaLink="false">https://www.the-substrate.net/p/bis-is-getting-more-fundingheres</guid><dc:creator><![CDATA[Maxwell K. Roberts]]></dc:creator><pubDate>Fri, 30 Jan 2026 11:44:33 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/291b6ffc-5581-4894-a5d8-9d1f9fcb600e_2624x1856.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Our long national (compute policy) nightmare is almost over&#8212;the Bureau of Industry and Security (BIS) is finally <a href="https://appropriations.house.gov/news/press-releases/advancing-american-strength-president-trump-signs-hr-6938-law">getting more funding</a>! There might still be a partial government shutdown over <a href="https://punchbowl.news/article/the-daily-punch/dhs-funding-shutdown/">Department of Homeland Security funding</a>, but the Commerce, Justice, and Science appropriations bill, which includes BIS funding, is <a href="https://appropriations.house.gov/news/press-releases/advancing-american-strength-president-trump-signs-hr-6938-law">signed and sealed</a>, and BIS will see a $44 million (23%) increase (see figure).</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/xgCWi/2/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c3667a82-76ef-4b0a-b540-31b8dfbbcb6f_1220x502.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1316155a-1153-4b7e-a4ab-36d3dd756acf_1220x622.png&quot;,&quot;height&quot;:299,&quot;title&quot;:&quot;Created with Datawrapper&quot;,&quot;description&quot;:&quot;Bureau of Industry and Security budget (inflation-adjusted, in 2025 dollars)&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/xgCWi/2/" width="730" height="299" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p>BIS is the US government agency that manages dual-use export controls, which means &#8220;stopping people from selling dangerous technology to companies outside the United States&#8221;. Since <a href="https://www.federalregister.gov/documents/2022/10/13/2022-21658/implementation-of-additional-export-controls-certain-advanced-computing-and-semiconductor">October 7, 2022</a>, the exports BIS controls have come to include advanced chips used for training AI models, and the manufacturing equipment that makes those chips.</p><p>BIS funding is vital to the effectiveness of export controls&#8212;all the complex rules and clever policy ideas in the world mean nothing if BIS can&#8217;t actually enforce them. But all the BIS funding in the world means nothing if BIS can&#8217;t spend it effectively. BIS should prioritize force multipliers, like IT modernization, that increase the effectiveness of the staff it already has. BIS should also take advantage of unconventional hiring authorities so that, when it does add staff, it can bypass the often-brutal civil service process and get the right people fast.</p><h1>What did BIS say it needed money for?</h1><p>To understand how BIS plans to spend this money, we can look at the <a href="https://www.commerce.gov/sites/default/files/2025-06/BIS-FY2026-Congressional-Budget-Submission.pdf">administration&#8217;s budget request</a>. This is the official wish list, written by BIS&#8217;s political appointees and endorsed by the White House, of what BIS wants. This year, it asked for $303 million (a $112 million increase) to hire 193 more special agents, 18 more Export Control Officers (ECOs), and 19 more technical experts (see table). The request is focused entirely on boosting enforcement, which is reasonable given the <a href="https://www.cnas.org/publications/reports/countering-ai-chip-smuggling-has-become-a-national-security-priority">scale of AI chip smuggling</a> and <a href="https://www.nytimes.com/2024/10/29/business/tsmc-huawei-computer-chips.html">other</a> <a href="https://newsletter.semianalysis.com/p/fab-whack-a-mole-chinese-companies">violations</a>.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/gzQcl/4/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0b8b780f-96ae-494c-876d-77b2fd9370d0_1220x580.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/44378a96-bab8-48b4-916b-80c28254f84a_1220x700.png&quot;,&quot;height&quot;:343,&quot;title&quot;:&quot;Created with Datawrapper&quot;,&quot;description&quot;:&quot;Bureau of Industry and Security FY2026 budget request&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/gzQcl/4/" width="730" height="343" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><strong>BIS agents, primarily based at US field offices, are the law enforcement officers who investigate violations of export controls.</strong> They typically leave the &#8220;door-kicking&#8221; and &#8220;shooting guns&#8221; to partner agencies like the FBI and Homeland Security Investigations (HSI)&#8212;most of a BIS agent&#8217;s daily work consists of reviewing export declarations, following up on leads, and knocking on companies&#8217; doors to politely inform them that they should stop, or BIS will fine them eleventy million dollars. More BIS agents straightforwardly means less smuggling&#8212;more agents means more time to follow up on leads and catch bad guys.</p><p><strong>ECOs, permanently stationed overseas, are vital jacks-of-all-trades who play law enforcement, advisory, and diplomatic roles.</strong> Their main responsibility is conducting end-use checks&#8212;physically visiting companies that receive US-origin items to make sure they&#8217;re not doing anything bad. However, because they&#8217;re permanently stationed overseas, they also play a key diplomatic role in engaging with foreign companies and governments. A typical day for an ECO might include visiting a suspected smuggler&#8217;s warehouse in the morning, meeting with a major multinational about a new BIS regulation in the afternoon, and going to a reception hosted by the local Ministry of Trade that night. They&#8217;re vital to BIS&#8217;s ability to stop diversion and cooperate with other countries.</p><p><strong>Technical experts&#8212;often engineers, biologists, or chemists&#8212;provide technical knowledge to support enforcement or rulemaking.</strong> &#8220;Technical expert&#8221; is sort of a catchall term for all the various types of technology specialists BIS needs. BIS needs technical experts because it&#8217;s usually not obvious to anyone without a PhD whether a vial in a refrigerator contains bacteria for fermenting yogurt or for making bioweapons, or whether a computer chip is for playing video games or developing superintelligence. Technical experts support enforcement by answering those questions, and support BIS as a whole in understanding new technologies so it can figure out what to export control and how.</p><h1>How should BIS prioritize the money it&#8217;s getting?</h1><p>Though Congress is providing a significant increase, BIS is not getting <em>everything </em>it asked for. While the administration requested <a href="https://www.commerce.gov/sites/default/files/2025-06/BIS-FY2026-Congressional-Budget-Submission.pdf">$303 million</a> (+59%), only <a href="https://www.congress.gov/bill/119th-congress/house-bill/6938/text">$235 million</a> (+23%) was actually signed into law. BIS will have to make tough decisions about how to prioritize the new funding to meet enforcement needs.</p><p>One of the most important things BIS could invest in is something that wasn&#8217;t in the original budget request&#8212;IT modernization. Organizations like the Center for Strategic and International Studies have <a href="https://www.csis.org/analysis/improved-export-controls-enforcement-technology-needed-us-national-security">extensively documented the sorry state of BIS IT systems</a>, and Congress has previously introduced legislation to appropriate supplemental BIS funding <a href="https://www.congress.gov/bill/119th-congress/house-bill/4920">specifically for this purpose</a>. The logic is simple&#8212;every additional agent adds one &#8220;unit&#8221; of capability, but better IT systems improve the productivity of <em>every </em>agent. This would be less true if BIS already had relatively good IT systems, but my sense is that BIS is far, far away from hitting diminishing returns on IT investments.</p><p>Once BIS has adequately funded IT modernization, it should invest in a balanced mix of ECOs and agents, perhaps with a slight bias towards agents. ECOs and agents both meaningfully boost BIS enforcement in complementary ways. ECOs help BIS detect shell companies and bad actors more rapidly by checking the actual fate of more exported goods, and also improve BIS&#8217;s coordination with foreign governments. More agents mean that BIS can follow up on more leads, leading to more disruption of smuggling networks before goods ever leave the United States. If BIS were intelligence-constrained, more agents would not be as useful, but per BIS&#8217;s own budget request, BIS agents typically handle caseloads far higher than criminal investigators at comparable agencies.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><p>The lowest priority should be technical experts. Now is a good time for BIS to hire technical experts, for reasons related to hiring authorities discussed below. Additionally, the &#8220;force multiplier&#8221; argument is somewhat applicable to technical experts&#8212;better technical expertise to help agents identify possible diversion benefits every agent, although technical experts are not quite as cheaply scalable as software. However, technical experts may not contribute as directly to enforcement as the other priorities above.</p><h1>Can BIS actually hire with this money?</h1><p>The challenges and history of BIS IT modernization efforts could fill a separate post (and might soon), but on the staffing side, it can be hard for the US government to recruit candidates. Government pay follows a fixed scale based on education and experience, but the scale doesn&#8217;t care what kind of education and experience it is. According to the Bureau of Labor Statistics, both anthropologists and computer scientists typically require a Master&#8217;s degree, which would qualify them as GS-9 federal employees earning <a href="https://www.opm.gov/policy-data-oversight/pay-leave/salaries-wages/salary-tables/pdf/2025/DCB.pdf">about $70,000</a>. But in the private sector, the median pay for an anthropologist is <a href="https://www.bls.gov/ooh/life-physical-and-social-science/anthropologists-and-archeologists.htm">$64,910</a> whereas for a computer scientist it is <a href="https://www.bls.gov/ooh/computer-and-information-technology/computer-and-information-research-scientists.htm">$140,910</a>. This problem is much worse at higher levels of experience and in booming fields like AI&#8212;the government is not going to pay AI researchers <a href="https://fortune.com/2025/06/18/metas-100-million-signing-bonuses-openai-staff-extreme-ai-talent-war/">$100 million signing bonuses</a> like Meta.</p><p>But suppose money isn&#8217;t the main motivator&#8212;suppose the government only wants mission-driven candidates anyway, or is hiring in a field like law enforcement where the government is the only game in town. Even when salary isn&#8217;t an issue, it&#8217;s still incredibly hard to hire the right people. Unless the hiring agency gets <a href="https://www.opm.gov/policy-data-oversight/hiring-information/direct-hire-authority/">Direct Hire Authority</a> or another special carve-out, every hire needs to go through competitive civil service procedures. In theory, the point of these procedures is to make sure the government is absolutely fair. In reality, these procedures make government hiring timelines last months (<a href="https://interactives.cnas.org/reports/the-future-of-civilians-in-national-security/">or years, if you need security clearance</a>) and introduce a massive arbitrary element, as resumes are reviewed and scored by department-level HR staff who have no idea what job is even being filled.</p><p>IAPS has published separate work on <a href="https://www.iaps.ai/research/building-ai-surge-capacity">the challenges of attracting AI talent into the government </a>and what we can do about it. It&#8217;s not just about money, but about being able to hire the people offices need, even when candidates are willing.</p><p>BIS may soon have some opportunities to bypass the standard process. Congress is considering the <a href="https://www.congress.gov/bill/119th-congress/house-bill/7003/text/ih">BIS STRENGTH Act</a>, which would let BIS hire up to 25 specialized technical personnel outside the normal hiring process and pay them salaries competitive with the private sector. The <a href="https://techforce.gov/">Tech Force</a> program is doing the same across government, with a specific focus on high-demand areas like AI, data science, and software engineering. These programs would significantly improve BIS&#8217;s access to quality candidates by simplifying the hiring process and increasing salary caps, and they make this an especially good time for BIS to be staffing up.</p><h1>Even with NVIDIA H200 exports to China, this funding matters</h1><p>In December, President Trump <a href="https://truthsocial.com/@realDonaldTrump/posts/115686072737425841">announced that NVIDIA H200s would be sold to China</a>, and there was a great disturbance in the Force, as if a thousand <a href="https://x.com/ChrisRMcGuire/status/1998147358912393552">compute</a> <a href="https://x.com/peterwildeford/status/1998409802540835118">policy</a> <a href="https://x.com/james_s48/status/1999229830177685861">watchers</a> tweeted at once. BIS has since released the details of <a href="https://public-inspection.federalregister.gov/2026-00789.pdf">how advanced chip sales to China will be handled</a>, and I published my own analysis about how <a href="https://www.iaps.ai/research/bis-licensing-policy-for-h200s">the safeguards in the policy are effectively unenforceable</a>. From a US-China competition perspective, selling potentially millions of H200s to China would be a Very Bad Thing.</p><p>At the same time, the most powerful Blackwell-generation chips are still restricted, and chip smuggling will continue to be a factor in China&#8217;s access to compute. BIS also needs stronger enforcement to prevent Huawei and other Chinese AI chip makers from making their own domestic products to compete against NVIDIA and others.</p><p>Funding BIS helps restrict China&#8217;s access to cutting-edge chips and builds in optionality for the US government to take a more aggressive export control approach if it so chooses. BIS should invest the new funding it has received in better systems and the right staff to continue protecting US national security for decades to come.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>See page 3 of <a href="https://www.commerce.gov/sites/default/files/2025-06/BIS-FY2026-Congressional-Budget-Submission.pdf">BIS&#8217;s FY2026 budget request</a>, which says: &#8220;These threats have significantly increased the scope of BIS&#8217;s work to include a significant increase in exports under BIS licenses, and export enforcement officers handling on average 26 cases (and another 19 leads) per agent (far above comparable agencies).&#8221;</p></div></div>]]></content:encoded></item><item><title><![CDATA[The case for paying whistleblowers to report on export violations]]></title><description><![CDATA[A bipartisan, bicameral bill would apply the SEC&#8217;s successful whistleblower incentive model to export enforcement]]></description><link>https://www.the-substrate.net/p/the-case-for-paying-whistleblowers</link><guid isPermaLink="false">https://www.the-substrate.net/p/the-case-for-paying-whistleblowers</guid><dc:creator><![CDATA[Erich Grunewald]]></dc:creator><pubDate>Wed, 28 Jan 2026 14:38:29 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/3d043d22-ead1-4513-ba70-321619161fec_2624x1856.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>The US has a massive export enforcement problem. It&#8217;s likely that <a href="https://www.cnas.org/publications/reports/countering-ai-chip-smuggling-has-become-a-national-security-priority">over 100,000 export-controlled AI chips</a> were smuggled into China in 2024. To give a sense of scale, the xAI Colossus cluster in Memphis, Tennessee, comprised first 100,000 and later 200,000 AI chips. That&#8217;s roughly an xAI Colossus cluster being smuggled to China each year. The main reason we know this is that smugglers are so unafraid that they&#8217;re willing to talk about their operations to journalists; this has happened repeatedly during the past year and a half.</p><p>AI chip smuggling is far from the only enforcement problem. In 2024, <a href="https://www.nytimes.com/2024/10/29/business/tsmc-huawei-computer-chips.html">Huawei got TSMC to illegally fabricate</a> over two million of its AI chip dies through front companies, despite sanctions. That is a far larger quantity than the number of Huawei AI chips fabricated domestically in China that year. We&#8217;ve also seen likely violations related to <a href="https://newsletter.semianalysis.com/p/huawei-ascend-production-ramp">high bandwidth memory</a> and <a href="https://newsletter.semianalysis.com/p/fab-whack-a-mole-chinese-companies">semiconductor manufacturing equipment</a>, which help China make its own AI chips to compete against NVIDIA.</p><p><strong>What if you could pay insiders many millions of dollars to inform US authorities about such violations, at almost no cost to the US government?</strong> That would likely surface a large number of high quality tips about important violations, which would greatly aid US authorities in detecting, punishing and deterring such violations.</p><p>This idea may sound outlandish, but it&#8217;s actually possible. In fact, there is a law being discussed in Congress that would accomplish exactly this! But before we get there, let&#8217;s take a brief detour to the Securities and Exchange Commission (SEC) and the 2008 financial crisis.</p><h2>The SEC whistleblower program</h2><p>In 2008, the US economy was reeling from the housing and mortgage crisis. Late that year, Bernie Madoff sat down with his two sons and admitted to them that the investment business he&#8217;d been running for two decades was a giant fraud, a Ponzi scheme to end all Ponzi schemes. Because of these two crises, there was a desire among policymakers to strengthen financial regulation and oversight.</p><p>Related to the Madoff scandal in particular, the SEC was under criticism for failing to properly investigate several credible reports about it. An employee at a rival investment firm, Harry Markopolos, had been asked by his employers to figure out how Madoff could post such consistently excellent returns, and soon realized that the returns were impossible with Madoff&#8217;s claimed strategy. Markopolos later <a href="https://www.npr.org/2010/03/02/124208012/madoff-whistleblower-sec-failed-to-do-the-math">said in an interview</a>: &#8220;I read his strategy statement, and it was so poorly put together. His strategy as depicted would have trouble beating a zero return, and his performance chart went up at a 45-degree line: that line doesn&#8217;t exist in finance, it only exists in geometry classes.&#8221;</p><p>Markopolos sent reports to the SEC detailing Madoff&#8217;s fraudulent activities on multiple occasions before the 2008 financial crisis. However, the SEC failed to properly investigate these reports, leaving Madoff free to continue defrauding investors until the financial crisis made its collapse imminent. Lawmakers realized that reports of wrongdoing from the general public could be a valuable tool for detecting and deterring securities laws violations.</p><p>One result of this, signed into law in 2010 as part of the Dodd-Frank Act, was the SEC whistleblower program.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><p>The SEC whistleblower program works like this. First and most importantly, whistleblowers get 10-30% of any penalty resulting from their report. This can be many millions of dollars&#8212;the largest reward to date, paid out in 2023, <a href="https://www.sec.gov/newsroom/press-releases/2023-89">was nearly $280 million</a>. This monetary incentive is paired with protections against retaliation from their employers, confidentiality guarantees, and the ability to make reports to the SEC anonymously through an attorney. To pay out whistleblower rewards, the Dodd-Frank Act also sets up an Investor Protection Fund, which receives penalties from securities violations (previously these would go to the Treasury).</p><p>The SEC whistleblower program is widely considered to have been an enormous success. It&#8217;s now one of the key ways that securities law is enforced in the US. It has <a href="https://kkc.com/frequently-asked-questions/sec-whistleblower-program/">helped generate</a> $7.3 billion to $22 billion in penalties since its inception in 2011<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>, and has <a href="https://www.whistleblowers.org/rewards-for-non-u-s-whistleblowers/">received reports</a> from at least 130 countries. Quantitative evaluations are rarer, but existing research suggests it has <a href="https://onlinelibrary.wiley.com/doi/10.1111/1911-3846.12884">reduced financial reporting fraud</a>, <a href="https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3672026">deterred insider trading</a>, and <a href="https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3105521">caused companies to strengthen compliance programs</a>.</p><h2>The Stop Stealing Our Chips Act</h2><p>Now the question is, could you adopt the SEC whistleblower program model for the Bureau of Industry and Security (BIS) and export violations? The <a href="https://kean.house.gov/sites/evo-subsites/kean.house.gov/files/evo-media-document/stop-stealing-our-chips-act-kean-final.pdf">Stop Stealing Our Chips Act</a>&#8212;introduced <a href="https://www.rounds.senate.gov/newsroom/press-releases/rounds-introduces-legislation-to-prevent-smuggling-of-american-ai-chips-into-china">into the Senate</a> in April 2025 by Senators Rounds (R-SD) and Warner (D-VA) and <a href="https://kean.house.gov/media/press-releases/kean-johnson-introduce-bill-protect-american-ai-chips-strengthening-export">into the House</a> a month and a half ago by Representatives Kean (R-NJ) and Johnson (D-TX)&#8212;would do exactly this.</p><p>The Stop Stealing Our Chips Act (henceforth, SSOCA) is closely modeled on the SEC whistleblower program, with some changes to adapt it for the export enforcement situation. It too offers whistleblowers 10-30% of any resulting penalty along with whistleblower protections, including the possibility of making anonymous reports to BIS.</p><p>The first thing to note here is the financial incentive. As economists always tell us, <a href="https://en.wikipedia.org/wiki/They_Shoot_Horses,_Don%27t_They%3F_(film)">financial incentives are incredibly powerful</a>, and the fines for these violations can be enormous:</p><ul><li><p>There have been <a href="https://www.bloomberg.com/news/features/2024-10-27/russia-is-getting-nvidia-ai-chips-from-an-indian-pharma-company">several</a> <a href="https://www.reuters.com/world/china/nvidia-ai-chips-worth-1-billion-entered-china-despite-us-curbs-ft-reports-2025-07-24/">news</a> <a href="https://techcrunch.com/2025/03/13/singapore-grants-bail-for-nvidia-chip-smugglers-in-alleged-390m-fraud/">reports</a> of operations involving on the order of 10,000 smuggled AI chips, meaning roughly $400 million worth. BIS can fine up to twice the value of the related transaction, so that could be a penalty of $800 million, for just one smuggler who spoke to the news media. If a whistleblower reports on that, they could get up to 30% or $240 million (leaving $560 million for the US government).</p></li><li><p>The massive TSMC-Huawei violation&#8212;which was only detected when an independent organization did a teardown of a Huawei chip&#8212;<a href="https://www.reuters.com/technology/tsmc-could-face-1-billion-or-more-fine-us-probe-sources-say-2025-04-08/">could reportedly result in</a> a $1 billion fine. This would have been up to $300 million for an informant.</p></li></ul><p>Beyond catching violations, a well-publicized program could have significant deterrent effects. If everyone in a supply chain&#8212;sales reps, warehouse workers, freight forwarders, accountants&#8212;knows that reporting can yield millions, violators face a much riskier environment. This effect could be realized even before the whistleblower program comes into effect, as the law would allow whistleblowers to report on violations that occurred before it was signed into law.</p><h2>Would the BIS program actually surface any tips?</h2><p>All right, hundreds of millions of dollars is a strong incentive. But, you may ask, are there actually people with information about these violations who would be willing to step up and blow the whistle? Why, yes there are!</p><p>Take AI chip smuggling operations: these involve lots of people who could potentially file reports, in other words people who have relevant information and would like to get millions of dollars. This includes, for example, people working in sales at exporters <a href="https://hindenburgresearch.com/smci/">with questionable compliance practices</a>; employees at local resellers, freight forwarders, logistics companies, warehouses, or data centers where the chips <a href="https://www.theinformation.com/articles/nvidia-ai-chip-smuggling-to-china-becomes-an-industry">are temporarily housed</a>; and accountants and lawyers.</p><p>In March 2025, Singaporean authorities arrested three people for smuggling $390 million worth of AI servers. These arrests were the result of an &#8220;anonymous tip-off&#8221;, in other words a whistleblower report! It seems likely that the recent <a href="https://www.justice.gov/opa/pr/us-authorities-shut-down-major-china-linked-ai-tech-smuggling-network">Operation Gatekeeper</a> arrests of an AI chip smuggling ring operating out of Texas and New York were also the result of an insider tip.</p><p>The story is similar for the TSMC-Huawei violation, where there were likely many TSMC employees who could&#8217;ve known about this problem and informed the US government. The only reason the US ultimately found out about this violation was because an independent party&#8212;TechInsights&#8212;did a teardown of a Huawei AI chip, and noticed it was TSMC-fabricated. A BIS whistleblower program would likewise incentivize such actors to look for evidence of violations and report those to the US government. This type of information is hugely valuable; it makes no sense to sit around and wait for people to offer it out of the goodness of their hearts.</p><p>As with the SEC program, the SSOCA makes foreign nationals eligible for rewards. This is important because many export violations happen in third countries, where goods are diverted via reexport or transshipment. (The SSOCA does however wisely make some exceptions for known terrorists and sanctioned persons, who are not eligible for rewards.) This is similar to how the intelligence community pays foreign informants, who provide the US government with information that benefits US national security.</p><h2>Would BIS be able to run the program?</h2><p>At this point, the wise reader will ask, &#8220;Isn&#8217;t BIS <a href="https://www.thefai.org/posts/spreadsheets-vs-smugglers-modernizing-the-bis-for-an-era-of-tech-rivalry">extremely resource constrained</a>? If so, how is it supposed to process and investigate a bunch of incoming tips, determine awards, and carry out outreach on the program?&#8221; After all, BIS&#8217;s budget for enforcement has been essentially flat when accounting for inflation for at least the past five years (see figure), despite BIS receiving a vastly increased scope of responsibilities due to the AI chip export controls introduced in October 2022 and the Russian invasion of Ukraine and all the diversion related to that conflict.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/4WNfd/1/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f22307c2-407e-4b76-872c-b5431e812b73_1220x476.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b6726a06-4713-4ebe-8b19-e32d1e7b8b69_1220x530.png&quot;,&quot;height&quot;:254,&quot;title&quot;:&quot;Created with Datawrapper&quot;,&quot;description&quot;:&quot;Bureau of Industry and Security budget (inflation-adjusted, in 2025 dollars)&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/4WNfd/1/" width="730" height="254" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p>Good question! The answer is that these activities&#8212;investigating whistleblower reports, determining awards, and carrying out outreach&#8212;would also be financed through incoming penalties. This is one of the most notable differences between the SSOCA and the SEC program. The SSOCA authorizes BIS to use money from penalties for a few additional purposes and not only for paying out rewards to whistleblowers. (As currently written, the SSOCA would only allow BIS to receive money from penalties that stem from whistleblower reports, but I think this should be expanded to cover all penalties for BIS-related violations.)</p><p>Today, any fine levied by BIS goes straight to the Treasury, or in rare cases it is earmarked for some specific fund, such as the <a href="https://en.wikipedia.org/wiki/Crime_Victims_Fund">Crime Victims Fund</a>. What the SSOCA would do is redirect these to an Export Compliance Accountability Fund. This Fund would be used to pay rewards to whistleblowers; any money left over would go first to core functions of the BIS whistleblower program, and then to export enforcement activities more broadly.</p><p>There is a separate but related question of how the program would be funded initially, if it&#8217;s mainly intended to be funded through penalties. However, BIS already has a fairly steady stream of enforcement actions, including from likely insider tips, without any whistleblower incentive program (see figure). BIS may also be able to direct some of its appropriated resources to the program in the first one or two years, in order to get it up and running.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/RQn7s/1/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6decfd15-67b5-4bdc-a32d-7a0760a4eb3d_1220x484.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/21f37c9f-da42-48fa-a9e9-38931f0ee4f2_1220x576.png&quot;,&quot;height&quot;:277,&quot;title&quot;:&quot;Created with Datawrapper&quot;,&quot;description&quot;:&quot;Number of dual-use export control enforcement actions announced annually&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/RQn7s/1/" width="730" height="277" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><strong>It seems likely that this program would pay for itself if implemented.</strong> That&#8217;s because the program would likely help BIS detect more violations and therefore levy more penalties than it would without the program. It could well end up both reducing the number of violations and also generating additional revenue for the federal government by making it much more likely that violations are detected and enforced. The losers here would be the smugglers and other bad actors who wake up every day trying to figure out ways of harming US national security.</p><p>BIS&#8217;s entire budget for fiscal year 2025 was about $191 million, <a href="https://www.cnas.org/publications/reports/countering-ai-chip-smuggling-has-become-a-national-security-priority">likely far smaller</a> than the collective profits of AI chip smugglers alone, which may well have exceeded $1 billion. A single successful enforcement action against a major smuggling operation could pay for BIS&#8217;s entire annual budget&#8212;for example, last month US authorities <a href="https://www.justice.gov/opa/pr/us-authorities-shut-down-major-china-linked-ai-tech-smuggling-network">arrested three individuals</a> accused of smuggling AI chips worth $160 million to China, which could result in a penalty of $320 million. There are likely dozens of such cases remaining to be discovered. A BIS whistleblower program like the one described in the SSOCA could create a virtuous cycle where more tips and better enforcement lead to more penalties and rewards, which in turn leads both to more tips by publicizing the program and also more resources for enforcement.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>There was also a CFTC program established at the same time, but it is less well known so I just discuss the SEC program here.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>The SEC <a href="https://kkc.com/frequently-asked-questions/sec-whistleblower-program/">has awarded</a> more than $2.2 billion to whistleblowers since the program&#8217;s inception. In the extreme case of all those awards being 10% of the related penalty, that would imply $22 billion in penalties. In the other extreme case of all those awards being 30% of the penalty, it would imply $7.3 billion.</p></div></div>]]></content:encoded></item><item><title><![CDATA[For chip exports, quantity is at least as important as quality]]></title><description><![CDATA[Instead of micromanaging chip quality thresholds, the US should simply minimize the quantity of AI chip exports to China]]></description><link>https://www.the-substrate.net/p/for-chip-exports-quantity-is-at-least</link><guid isPermaLink="false">https://www.the-substrate.net/p/for-chip-exports-quantity-is-at-least</guid><dc:creator><![CDATA[Onni Aarne]]></dc:creator><pubDate>Tue, 27 Jan 2026 21:28:45 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/cfcfbc90-a6f7-4fdb-80b1-08a1df9e0889_2624x1856.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>The Trump administration&#8217;s decision to sell NVIDIA H200s to China, as codified in a <a href="https://public-inspection.federalregister.gov/2026-00789.pdf">recent rule</a> we <a href="https://www.iaps.ai/research/bis-licensing-policy-for-h200s">previously wrote about</a>, has received a lot of criticism. While the specifics of the rule are interesting, I want to step back a bit and analyze what is actually wrong with the core argument that the administration is making. Because there is, in fact, a logically valid argument to be made in defense of this policy, which I&#8217;ll call the <em>quality-based approach</em> to export controls.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><p>The structure of the administration&#8217;s argument is something like this:</p><ol><li><p>The US needs to dominate China in AI.</p></li><li><p>China&#8217;s AI capabilities are bottlenecked by the quality of AI chips they have access to; the highest quality chips can only be made in Taiwan.</p></li><li><p>But preserving US<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> chip companies&#8217; market share in China is important for maintaining the hardware lead.</p></li><li><p>Therefore, the US should allow the sale of chips that are just slightly better than what they can make domestically, but no better than that.</p></li></ol><p>This might seem logical: What point would there be in blocking sales of chips similar to what Huawei and other Chinese companies can make domestically anyway? And letting Chinese AI companies buy chips that are a bit better than Huawei chips helps draw revenue away from Huawei. What&#8217;s not to like?</p><p>One can of course quibble about the fact that the H200 is not &#8220;similar&#8221; to Huawei chips, and it indeed offers, for example, as much as 2.5 times the inference performance of the best Huawei chips.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a> But the deeper issue with the argument is that a key premise is false: China&#8217;s bottleneck is not primarily about chip <em>quality</em>. They&#8217;re primarily bottlenecked by chip <em>quantity</em>, i.e., <a href="https://www.chinatalk.media/p/chinas-models-close-the-gap">total compute</a>.</p><h2>Quantity can make up for quality</h2><p>It&#8217;s worth understanding what chip &#8220;quality&#8221; even consists of. AI training and deployment ultimately boils down to doing an astronomical number of multiplications and additions. A &#8220;better&#8221; chip is generally just one that can do more of these basic calculations (measured in &#8220;floating point operations per second&#8221; or FLOP/s) while using less power. The other main consideration is how many numbers the chip can hold in &#8220;memory&#8221; at once, and how quickly it can move numbers (input data and results) onto and off the chip.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a></p><p>For highly parallel workloads like AI, any &#8220;higher-quality&#8221; chip can be replaced by a sufficiently large number of lower-quality chips. The software engineering required to make a large workload work on a larger number of individually-weaker chips is more challenging, and the system will use a lot more power and networking, but it can be done. Huawei&#8217;s CloudMatrix 384 system <a href="https://newsletter.semianalysis.com/p/huawei-ai-cloudmatrix-384-chinas-answer-to-nvidia-gb200-nvl72">demonstrates</a> this: By linking 384 Ascend 910C chips together, it achieves 300 PFLOPS of dense BF16 compute&#8212;almost double the performance of NVIDIA&#8217;s GB200 NVL72&#8212;despite each individual Ascend being only about one-third the performance of a Blackwell chip. The tradeoff is power: CloudMatrix consumes 2.6x more watts per FLOP than the GB200. But China is well placed to compensate for these quality limitations: It has <a href="https://distantjob.com/blog/how-many-developers-are-in-the-world/">1.6 times</a> as many software engineers as the US and added more than 10 times as much new power capacity as the US in 2024.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a></p><h2>China&#8217;s chip problem is a quantity problem</h2><p>While Huawei&#8217;s chips are lower quality and more expensive to produce than chips manufactured in Taiwan, China&#8217;s main problem is chip quantity: The US has approximately 5-10 times more total AI computing capacity installed than China,<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a> and the US is projected to produce roughly 50 times as much AI compute as China in 2026.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a></p><p>China has a chip quantity problem because it has a chip <em>production</em> problem, driven by two key bottlenecks created by export controls. First, US-led restrictions on semiconductor manufacturing equipment (SME) have limited China&#8217;s ability to produce advanced logic chips domestically. Second&#8212;and increasingly important&#8212;is high-bandwidth memory (HBM), the specialized memory critical to AI accelerators. The Biden administration&#8217;s <a href="https://www.bis.gov/press-release/commerce-strengthens-export-controls-restrict-chinas-capability-produce-advanced-semiconductors-military">December 2024 controls</a> specifically targeted HBM and the equipment needed to produce it. Chinese memory maker CXMT is <a href="https://www.chinatalk.media/p/will-china-hit-the-hbm-wall">several generations</a> behind South Korean industry leaders. HBM is now the binding constraint: SemiAnalysis <a href="https://newsletter.semianalysis.com/p/huawei-ascend-production-ramp">estimates</a> that while SMIC has capacity to produce logic dies for over a million Ascend chips, CXMT will only be able to manufacture enough HBM for 250,000-300,000 Ascend 910Cs in 2026. Any additional exports of US AI chips would directly alleviate this chip supply bottleneck, even if the chips were no better than Huawei&#8217;s alternatives.</p><h2>The right move is to minimize the quantity of AI chip exports</h2><p>Some readers may be screaming at their screens that the Trump administration&#8217;s new rule does in fact have a chip quantity restriction. And this is true: Sales of any one model of chip are capped at half the volume of that chip that has been sold in the US. However, in the <a href="https://www.cnbc.com/2026/01/14/trump-nvidia-h200-china-ai-chips.html">words</a> of Trump himself, the main motivation behind the policy change was that the chips in question are &#8220;not the highest level&#8221;, i.e., lower quality. The quantity restriction appears to have been a valuable but insufficient safeguard tacked on at the last minute.</p><p>If the administration instead held fast to an approach focused on <em>minimizing</em> the quantity of AI compute that China can access, it could plausibly expand its current 5-10x compute advantage by as much as another order of magnitude over the coming years: IFP <a href="https://ifp.org/should-the-us-sell-hopper-chips-to-china/">estimates</a> that the US could add as much as 21 to 49 times more compute than China in 2026 if no US AI chips are exported. Such a gap would be <a href="https://peterwildeford.substack.com/p/compute-is-a-strategic-resource">strategically</a> decisive: with 20x less compute, Chinese companies would struggle to train frontier models, as even matching a single leading US training run would likely require concentrating their entire national AI compute capacity on one project for an extended period. Chinese AI companies are already struggling to meet <a href="https://www.chinatalk.media/p/the-grey-market-for-american-llms#:~:text=Now%20that%20DeepSeek,domestic%20AI%20scene.">domestic</a> and <a href="https://semianalysis.com/2025/07/03/deepseek-debrief-128-days-later/">international</a> demand due to compute constraints, and are consequently falling further behind because they cannot divert compute toward <a href="https://www.bloomberg.com/news/articles/2026-01-10/china-ai-leaders-warn-of-widening-gap-with-us-after-1b-ipo-week?utm_source=website&amp;utm_medium=share&amp;utm_campaign=copy">R&amp;D and experimentation</a>.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-8" href="#footnote-8" target="_self">8</a> The international market, and perhaps eventually even the Chinese market, would be dominated by US AI companies.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!l0DO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27067373-0978-4b12-bb8a-c5443f072692_2424x1774.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!l0DO!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27067373-0978-4b12-bb8a-c5443f072692_2424x1774.png 424w, https://substackcdn.com/image/fetch/$s_!l0DO!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27067373-0978-4b12-bb8a-c5443f072692_2424x1774.png 848w, https://substackcdn.com/image/fetch/$s_!l0DO!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27067373-0978-4b12-bb8a-c5443f072692_2424x1774.png 1272w, https://substackcdn.com/image/fetch/$s_!l0DO!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27067373-0978-4b12-bb8a-c5443f072692_2424x1774.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!l0DO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27067373-0978-4b12-bb8a-c5443f072692_2424x1774.png" width="1456" height="1066" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/27067373-0978-4b12-bb8a-c5443f072692_2424x1774.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1066,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:298108,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.the-substrate.net/i/186003601?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27067373-0978-4b12-bb8a-c5443f072692_2424x1774.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!l0DO!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27067373-0978-4b12-bb8a-c5443f072692_2424x1774.png 424w, https://substackcdn.com/image/fetch/$s_!l0DO!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27067373-0978-4b12-bb8a-c5443f072692_2424x1774.png 848w, https://substackcdn.com/image/fetch/$s_!l0DO!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27067373-0978-4b12-bb8a-c5443f072692_2424x1774.png 1272w, https://substackcdn.com/image/fetch/$s_!l0DO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27067373-0978-4b12-bb8a-c5443f072692_2424x1774.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>The impact of a quality-based approach depends on Chinese spending</h2><p>So how would the impact of a quality-based approach compare to a quantity-minimizing approach? As discussed, quality is largely fungible with quantity, so I will focus on what a quality restriction would mean for total AI compute capacity<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-9" href="#footnote-9" target="_self">9</a> in China versus the US.</p><p>A quality-based approach would endorse selling a chip similar in price-performance to the best Chinese chips, i.e., the Huawei Ascend 910C. This would likely be roughly a third to half the price-performance of Blackwell-generation chips.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-10" href="#footnote-10" target="_self">10</a></p><p>The overall impact of this depends heavily on how much Chinese AI companies are willing and able to buy. In principle, a total absence of quantity restrictions could allow Chinese companies to catch up to the US by simply outspending them by a factor of two, but this is of course unlikely in practice.</p><p>One of the most concrete pieces of information we have about potential Chinese chip spending is Jensen Huang&#8217;s <a href="https://www.reuters.com/world/china/nvidia-sounds-out-tsmc-new-h200-chip-order-china-demand-jumps-sources-say-2025-12-31/">claim</a> that Chinese companies have ordered two million H200 chips, worth about $54 billion. This is about 30% of projected US hyperscaler chip spending for 2026,<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-11" href="#footnote-11" target="_self">11</a> suggesting that Chinese willingness-to-spend is not greatly affected by somewhat lower price-performance. As an extremely rough guess, this would suggest that Chinese companies would spend somewhere between $30 and $50 billion on these Ascend-equivalent chips, resulting in approximately a 5-15x US advantage in terms of compute added in 2026.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-12" href="#footnote-12" target="_self">12</a> This would not be catastrophic, but it is still several times worse than the alternative of 21-49x.</p><p>The quality-based approach also leaves the door open to very concerning worst-case outcomes, especially if the US AI industry faces significant headwinds. To sketch an example scenario, it is plausible that Chinese companies will be much more successful at attracting users to their models, perhaps because the regulatory environment in the US turns hostile to AI. This could also coincide with US financial markets becoming disillusioned with AI, as they briefly became disillusioned with internet companies after the dot-com bubble. In such an environment, US companies may also struggle to obtain permits for new data center construction, even if the financing were there. If so, Chinese AI companies supported by state subsidies may be able to outspend and outbuild US companies, plausibly overcoming a 2x cost-effectiveness penalty to take the lead and start competing internationally.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-13" href="#footnote-13" target="_self">13</a></p><h2>The CCP will not let Huawei fail</h2><p>But at least the quality-based approach would suppress Chinese domestic AI chip production, because Chinese companies would stop buying Huawei, right? This might be the case if the CCP were free market absolutists, but alas, they are not: As even the administration&#8217;s White House AI &amp; Crypto Czar David Sacks&#8212;a driving force behind the H200 policy&#8212;has <a href="https://www.bloomberg.com/news/articles/2025-12-12/china-is-rejecting-h200s-outfoxing-us-strategy-sacks-says">acknowledged</a>, China can and will &#8220;outfox&#8221; this strategy by <a href="https://www.theinformation.com/articles/china-tells-tech-companies-halt-nvidia-h200-chip-orders">mandating</a> that Chinese AI companies <em>also</em> buy domestically made chips. They can calibrate these requirements to precisely match domestic production capacity, ensuring Huawei never lacks for customers regardless of how many US chips are available.</p><p>The reality is that semiconductor self-sufficiency has been a core CCP strategic goal at least since the Made in China 2025 roadmap, which was laid out in 2015, long before export controls. That roadmap set a target of 80% domestic market share for &#8220;high-performance computers and servers&#8221; by 2025.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-14" href="#footnote-14" target="_self">14</a> The repeated whiplash of US policy has only reinforced Beijing&#8217;s conviction that building an indigenous supply chain is a strategic necessity.</p><h2>There is still time to change course</h2><p>The administration&#8217;s recent rule falls between these two extremes: It puts a cap on the number of chips that can be sold to China while still allowing very significant quantities of exports. Under the current 50% rule, the cap sits at 900,000 Hopper-equivalent chips and rising over time&#8212;&#8221;over twice what China is expected to produce this year&#8221;. This would result in a US compute advantage of approximately 9-10x in 2026. The quality of the chips is also substantially higher than a strict quality-based approach would recommend. But the policy will still be less damaging than exports without any cap on export volume.</p><p>Fortunately, few if any chips have yet been shipped to China, and the administration is free to change its mind at any time. If it does, it would likely secure enduring US dominance in AI, plausibly causing a permanent collapse of the Chinese AI industry. But the details of how that would happen will be a post for another day.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Yes, I know, the rule also has a quantity restriction component. I'll get to that.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Throughout this piece I will loosely talk about &#8220;US chips&#8221;, but more precisely what I&#8217;m referring to are the overwhelming majority of AI chips that are designed and sold by US companies like NVIDIA, AMD, and Google, but are <em>manufactured</em> AKA <em>fabbed</em> in Taiwan by TSMC. These chips are subject to US export controls even if they never touch US soil, because of something called a &#8220;<a href="https://www.csis.org/analysis/choking-chinas-access-future-ai">foreign direct product rule</a>&#8221;.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Based on inference performance benchmarks showing the H200 delivers approximately <a href="https://nvidia.github.io/TensorRT-LLM/blogs/H200launch.html">1.9x H100 performance</a> while the Ascend 910C delivers approximately <a href="https://www.tomshardware.com/tech-industry/artificial-intelligence/deepseek-research-suggests-huaweis-ascend-910c-delivers-60-percent-nvidia-h100-inference-performance">0.6x H100 performance</a>. At estimated prices of ~$32,000 for the H200 and ~$26,000 for the Ascend 910C, the H200 provides roughly 2.5x more inference performance per dollar.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>This is more formally referred to as memory bandwidth (between the chip and its memory) and interconnect bandwidth (between the memories of different chips).</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p><a href="https://english.www.gov.cn/archive/statistics/202501/21/content_WS678f4adfc6d0868f4e8ef06f.html">China&#8217;s total installed power generation capacity</a> reached 3.35 TW at end of 2024, up 14.6% year-on-year, implying approximately 427 GW of new capacity added. By comparison, the US added <a href="https://www.publicpower.org/resource/americas-electricity-generating-capacity">approximately 30 GW of net new generating capacity</a> in 2024.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>In <a href="https://epoch.ai/data-insights/ai-supercomputers-performance-share-by-country">Epoch AI&#8217;s</a> supercomputer dataset, the US holds approximately 75% of global GPU cluster performance while China holds approximately 15%, a ratio of roughly 5:1. Other estimates, such as <a href="https://www.chinatalk.media/p/chinas-models-close-the-gap">Lennart Heim's</a>, suggest a ratio closer to 10:1. The discrepancy may reflect different methodologies and definitions of "AI compute capacity", and limitations in Epoch&#8217;s coverage.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p>According to <a href="https://ifp.org/the-b30a-decision/">IFP&#8217;s analysis</a>, which draws on <a href="https://newsletter.semianalysis.com/p/huawei-ascend-production-ramp">SemiAnalysis</a> and other sources, US AI chip production is projected to reach 6,890,000 B300-equivalents in 2026, while Huawei production is estimated at only 62,000-160,000 B300-equivalents&#8212;roughly 1-2% of US production. SemiAnalysis projects Huawei could produce ~805,000 Ascend units in 2025, but notes that HBM memory shortages will likely constrain actual output to 250,000-300,000 Ascend 910C units in 2026.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-8" href="#footnote-anchor-8" class="footnote-number" contenteditable="false" target="_self">8</a><div class="footnote-content"><p>Experiments are estimated to make up a very large fraction, possibly a <a href="https://epoch.ai/data-insights/openai-compute-spend">majority</a>, of US AI companies&#8217; compute use.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-9" href="#footnote-anchor-9" class="footnote-number" contenteditable="false" target="_self">9</a><div class="footnote-content"><p>Measured in terms of FP8/s, following IFP&#8217;s <a href="https://ifp.org/should-the-us-sell-hopper-chips-to-china/">analysis</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-10" href="#footnote-anchor-10" class="footnote-number" contenteditable="false" target="_self">10</a><div class="footnote-content"><p><a href="https://ifp.org/should-the-us-sell-hopper-chips-to-china/">IFP analysis</a> estimates the H200 achieves approximately 70% of the B300&#8217;s price-performance at FP8. Combined with footnote 3&#8217;s finding that the H200 is ~2.5x more cost-effective than the Ascend 910C, this implies the Ascend 910C achieves roughly 28% of B300 price-performance (0.70 / 2.5 &#8776; 0.28). Separately, <a href="https://newsletter.semianalysis.com/p/huawei-ai-cloudmatrix-384-chinas-answer-to-nvidia-gb200-nvl72">SemiAnalysis</a> notes each Ascend 910C has &#8220;only one-third the performance of an NVIDIA Blackwell&#8221; chip. Given that Ascend chips are also cheaper (~$26K vs ~$53K for B300), this suggests price-performance of roughly 50% of Blackwell. Taken together, &#8220;one-third to half&#8221; is a reasonable if slightly generous estimate.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-11" href="#footnote-anchor-11" class="footnote-number" contenteditable="false" target="_self">11</a><div class="footnote-content"><p>US hyperscaler AI infrastructure capex is projected to exceed $600 billion in 2026, with <a href="https://introl.com/blog/hyperscaler-capex-600b-2026-ai-infrastructure-debt-january-2026">roughly</a> $180 billion specifically on GPU/accelerator purchases.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-12" href="#footnote-anchor-12" class="footnote-number" contenteditable="false" target="_self">12</a><div class="footnote-content"><p>Rough calculation assuming quality-based chips have 30-50% of Blackwell price-performance. Domestic production valued at 62-160K B300-equivalents &#215; ~$53K = $3-8B (see footnote 7).</p><ul><li><p>Lower bound ($30B spending, 0.3 price-performance): $30B &#215; 0.30 = $9B Blackwell-equivalent, plus ~$3B domestic production = $12B total; US advantage = $180B / $12B &#8776; 15x.</p></li><li><p>Upper bound ($50B spending 0.5 price-performance): $50B &#215; 0.5 = $25B Blackwell-equivalent, plus ~$8B domestic production = $33B total; US advantage = $180B / $33B &#8776; 5x.</p></li></ul></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-13" href="#footnote-anchor-13" class="footnote-number" contenteditable="false" target="_self">13</a><div class="footnote-content"><p>Some combination of ingenuity and industrial espionage may also help Chinese AI companies make up for inferior compute with improved <a href="https://epoch.ai/blog/algorithmic-progress-in-language-models">algorithmic efficiency</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-14" href="#footnote-anchor-14" class="footnote-number" contenteditable="false" target="_self">14</a><div class="footnote-content"><p>The self-sufficiency targets come from the Made in China 2025 &#8220;Green Book&#8221; technology roadmap. See CSET&#8217;s English translation: <a href="https://cset.georgetown.edu/wp-content/uploads/t0181_Made_in_China_roadmap_EN.pdf">Roadmap of Major Technical Domains for Made in China 2025</a>, p. 8.</p></div></div>]]></content:encoded></item><item><title><![CDATA[This is The Substrate]]></title><description><![CDATA[A newsletter about possibly the most important resource of our time: compute]]></description><link>https://www.the-substrate.net/p/this-is-the-substrate</link><guid isPermaLink="false">https://www.the-substrate.net/p/this-is-the-substrate</guid><dc:creator><![CDATA[Erich Grunewald]]></dc:creator><pubDate>Tue, 27 Jan 2026 20:41:46 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/839e2a49-3b66-4640-ae14-65727e686059_2624x1856.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to The Substrate. The Substrate is a newsletter about compute and AI hardware, security, semiconductor manufacturing, and the geopolitics of these.</p><p>Should the US sell advanced AI chips to China? How can the US better enforce its export controls? How can we design and build highly secure data centers? When will China develop its own extreme ultraviolet photolithography machines? Can we design governance mechanisms into chips securely, for example to verify international agreements? How exactly does compute benefit nations anyway? &#8212; these and many others are questions we&#8217;re interested in.</p><p>The world is not about to run out of AI-related Substacks, so what&#8217;s different about this one? Let&#8217;s first stake out our fundamental beliefs, such as they are. A lot of people have a lot of opinions about AI, but here are some things <em>we</em> tend to believe:</p><ul><li><p>AI is very likely the most important technology of our time</p></li><li><p>A world with very powerful AI could be very good or very bad</p></li><li><p>Good policy can help secure a more positive future with powerful AI</p></li><li><p>Compute is one of the most important resources of our time, and will likely remain so</p></li><li><p>Compute is a <em><a href="https://peterwildeford.substack.com/p/compute-is-a-strategic-resource">strategic</a></em> resource that will see intense geopolitical competition</p></li><li><p>Compute can be used to better <a href="https://arxiv.org/abs/2402.08797">govern AI</a></p></li></ul><p>These claims, if true, raise a lot of very important and very interesting questions about policy and strategy, and these are the questions we want to explore with this newsletter. We&#8217;re calling it <em>The Substrate</em> because compute is the substrate that AI runs on, silicon wafers are the substrate that circuits are etched onto, and (more poetically) the underlying structures are the substrate that surface phenomena emerge from.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><p>And who are we? The Substrate is run, and mainly written, by the compute policy team at the <a href="https://www.iaps.ai/">Institute for AI Policy and Strategy</a> (IAPS). (Caveat: The opinions expressed in this newsletter are solely the authors&#8217; own, and not indicative of any institutional stance of IAPS.) IAPS is a think tank whose mission is &#8220;securing a positive future in a world with powerful AI&#8221;, and ultimately that is our mission too. But mostly we&#8217;ll just write about things we find important and interesting, because we are betting that you also find those things important and interesting.</p><p>Most of us have worked on these topics for years. We have previously written reports on, among other things:</p><ul><li><p><a href="https://www.cnas.org/publications/reports/secure-governable-chips">Hardware-enabled mechanisms</a>, such as <a href="https://www.iaps.ai/research/location-verification-for-ai-chips">delay-based location verification</a> to combat AI chip smuggling and <a href="https://arxiv.org/abs/2506.15093">flexible hardware-enabled guarantees</a> for future treaty verification</p></li><li><p><a href="https://www.cnas.org/publications/reports/countering-ai-chip-smuggling-has-become-a-national-security-priority">AI chip smuggling into China</a></p></li><li><p><a href="https://www.iaps.ai/research/ai-chip-making-china">AI chip making in China</a></p></li><li><p><a href="https://www.iaps.ai/research/accelerating-ai-data-center-security">AI data center security</a></p></li></ul><p>But much of our past research and writing remains unpublished. With The Substrate, we hope to publish more things more quickly, including takes that are more tentative than what we&#8217;d put in a long report, but that we think can benefit from discussion and feedback even in that tentative form. We also want to write commentary on ongoing events, summaries of our longer research reports, and more.</p><p>Our first substantive post is written by Onni and argues that <a href="https://www.the-substrate.net/p/for-chip-exports-quantity-is-at-least">for chip export controls, quantity is at least as important as quality</a>. After that, we will publish a post by Erich on the Stop Stealing Our Chips Act&#8212;which would introduce a whistleblower incentive program for export violations&#8212;and a post by Max on what the Bureau of Industry and Security needs money for.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.the-substrate.net/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Substrate! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>And also because we are not afraid of conflation with <a href="https://substrate.com/">semiconductor start-ups</a>. But just to be absolutely clear, we are a not-for-profit newsletter and have no relation to the start-up called Substrate (without the definite article).</p><p></p></div></div>]]></content:encoded></item></channel></rss>