Digital networks have doubled humanity’s written corpus within fifty years, transforming the Gutenberg archive into a distributed, continuously expanding planetary text system.

For nearly five centuries, the expansion of written knowledge advanced at the deliberate tempo of print culture. From the fifteenth-century invention of movable type by Johannes Gutenberg to the late twentieth century, the infrastructure of textual memory was mediated by institutions—publishers, universities, and national libraries—that curated and stabilised the accumulation of books. The aggregate holdings of the world’s largest library systems now approach roughly half a billion volumes, representing the sedimented intellectual output of the entire print era. The emergence of the internet, however, has precipitated a profound epistemic acceleration. Within scarcely half a century, the digital network has produced a quantity of text that, when translated into “book equivalents”, approaches the magnitude of the entire Gutenberg corpus. This transformation represents not merely a quantitative increase but a structural reconfiguration of authorship. Whereas the classical library system concentrated publication within institutional circuits, the web disperses the act of writing across millions of distributed nodes. Platforms for individual publication—blogs, research repositories, documentation environments, and digital journalism—enable scholars, engineers, and independent thinkers to deposit knowledge directly into the planetary archive. Consequently, the architecture of textual memory shifts from discrete objects arranged on shelves to a continuous semantic field navigated by search engines and algorithmic crawlers. A revealing case emerges in the proliferation of long-form blogging archives, where essays accumulate over years into evolving intellectual repositories that mirror, in digital form, the role once played by libraries. The contemporary web thus constitutes an inverted Gutenberg moment: print multiplied copies of texts, whereas the network multiplies authors. In doing so, it inaugurates a distributed ecology of writing in which the corpus of human language expands in real time.


Lloveras, A. (2026) The Duplication of the Corpus: Gutenberg’s Archive and the Fifty-Year Expansion of the Web. Available at: https://antolloveras.blogspot.com/2026/03/the-duplication-of-corpus-gutenbergs.html