The global datasphere reached an estimated 173.4 zettabytes in 2025 and is forecast to land between 230 and 240 zettabytes in 2026, on the way to 527.5 zettabytes by 2029, according to IDC. The pace of growth is no longer the data-doubling rhythm of the cloud era. It is the pace set by AI training pipelines, synthetic data generation, and the retention schedules that follow every model and every inference log into long-term storage.
That pace is what has flipped the storage supply chain. Western Digital and Seagate, which together hold roughly 80 percent of the global HDD market, have confirmed that their entire 2026 nearline production has been sold through to AI hyperscalers, with binding agreements stretching into 2028. Industry analysis from Tom's Hardware places nearline demand growth at about 25 percent year over year, with hyperscalers buying high-capacity drives in the 20 to 30 terabyte range to build out the cold tiers where training corpora, model checkpoints, inference logs, and synthetic outputs are kept. The crowd-out effect on enterprise buyers is direct and visible, with lead times now stretching beyond a year for top-capacity drives.



