Let’s say I wanted an offline copy of Wikipedia to show my father, who does not have internet access. I push a magic button, take a snapshot of the entire Wikipedia database, and start downloading it.
Ignoring issues of data transfer, and assuming I have a database engine and front-end web server to access the data, how much storage do I need to keep it? One BD-ROM? A terabyte hard drive? Something larger?
This page doesn’t directly answer your question, but it provides some similar stats. For instance, printing out a hard copy of Wikipedia would currently need 733 volumes.
You don’t need a magic button. You can download the whole database here. The table with all the articles in it, pages-articles, is only 3.2 GB as compressed XML. That fits easily on a single-layer DVD (4.7 GB) with room to spare.
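As a rough sanity check on the sizing, here is a minimal Python sketch that compares the 3.2 GB figure quoted above against a few common media. The capacities are nominal decimal values (1 GB = 10**9 bytes), and the names `MEDIA_GB`, `media_that_fit`, and `DUMP_GB` are made up for illustration. Note that it only covers the compressed download; the decompressed XML and any database you import it into will take more space, and that amount is not estimated here.

```python
# Nominal capacities in decimal gigabytes (1 GB = 10**9 bytes).
# These are the usual marketing figures, not usable filesystem space.
MEDIA_GB = {
    "single-layer DVD": 4.7,
    "single-layer BD-ROM": 25.0,
    "1 TB hard drive": 1000.0,
}


def media_that_fit(size_gb):
    """Return (name, free_gb) for every medium that can hold size_gb."""
    return [
        (name, round(capacity - size_gb, 1))
        for name, capacity in MEDIA_GB.items()
        if capacity >= size_gb
    ]


# Compressed pages-articles size quoted in the answer above.
DUMP_GB = 3.2

for name, free in media_that_fit(DUMP_GB):
    print(f"{name}: fits, about {free} GB free")
```

To plan for the decompressed data, replace `DUMP_GB` with the size you actually measure after unpacking and rerun the check.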