Stack Overflow questions per year vs. Global data created per year
Between 2015 and 2023, global data creation surged while Stack Overflow questions declined, producing a strong inverse correlation (r = -0.9639) across nine annual data points. The world is producing more data than ever and asking fewer questions about how to process it, which means either that programmers have gotten smarter or that they've given up and are asking ChatGPT instead. The latter explanation is more consistent with the timeline: Stack Overflow's decline accelerated precisely when AI coding assistants became available. The data grew; the questions shrank; the AI ate the middleman. This is not a correlation. It is a eulogy.
Global data creation grew from approximately 15 zettabytes in 2015 to over 120 zettabytes by 2023, driven by IoT, video, and cloud computing. Stack Overflow questions peaked around 2014 and declined through the late 2010s and early 2020s, with sharp acceleration after 2021 as AI assistants displaced community Q&A. The inverse correlation captures two real but independent technology trends: exponential data growth and the disruption of human-curated knowledge platforms by AI.
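To see why two independent trends can produce a coefficient this extreme, here is a minimal sketch. It assumes the figure quoted above is a Pearson coefficient (the page does not say which one it uses), and both series are synthetic placeholders, not the real data. The growing series only borrows the rough endpoints mentioned above (about 15 rising to about 120) with an assumed constant growth rate, and the declining series is an arbitrary straight line.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient for two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of co-deviations and sums of squared deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# SYNTHETIC placeholder series (assumptions, not the real measurements).
years = list(range(2015, 2024))                        # nine annual points
growing = [15 * 1.3 ** i for i in range(len(years))]   # assumed 30% yearly growth
declining = [100 - 8 * i for i in range(len(years))]   # arbitrary linear decline

r = pearson_r(growing, declining)
print(f"r = {r:.4f}")
```

Any series that rises steadily paired with any series that falls steadily over the same years will give a strongly negative `r`, whether or not one has anything to do with the other. With only nine points and two smooth trends, a value close to -1 says little beyond "one went up while the other went down."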
More data and fewer questions is the signature of the AI era. The correlation accidentally captures a real technological transition: the shift from human-curated to AI-mediated knowledge, measured against a backdrop of exponential data growth.