--
You received this message because you are subscribed to the Google Groups "Digital Curation" group.
You can fit a lot of hard drives in a single Hollinger box. Compression is a bad idea even if the lid won't close.
Word documents are tricky to estimate, as they may contain different versions.
If we assume 500 eight-character words per page, we have 4,000 bytes per page.
Assuming 800 GB of text, that's 200 million pages, or 100 million double-sided sheets. At 1,600 sheets per linear foot, that comes to a bit under 100 linear furlongs.
Simon
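For anyone who wants to check the arithmetic in Simon's estimate, here is a rough sketch in Python. It assumes decimal gigabytes (10^9 bytes), one byte per character, and 660 feet per furlong; the function and constant names are made up for illustration and are not from the thread.

```python
# Back-of-the-envelope check of the shelf-space estimate.
# Assumptions (not stated in the thread): 1 GB = 10**9 bytes,
# one byte per character, 660 feet per furlong.

WORDS_PER_PAGE = 500
CHARS_PER_WORD = 8
SHEETS_PER_LINEAR_FOOT = 1600
FEET_PER_FURLONG = 660


def shelf_estimate(text_bytes):
    """Convert a volume of plain text into pages, sheets, and shelf length."""
    bytes_per_page = WORDS_PER_PAGE * CHARS_PER_WORD   # 4,000 bytes
    pages = text_bytes / bytes_per_page                # printed sides
    sheets = pages / 2                                 # double-sided printing
    linear_feet = sheets / SHEETS_PER_LINEAR_FOOT
    furlongs = linear_feet / FEET_PER_FURLONG
    return {
        "pages": pages,
        "sheets": sheets,
        "linear_feet": linear_feet,
        "furlongs": furlongs,
    }


if __name__ == "__main__":
    estimate = shelf_estimate(800 * 10**9)  # 800 GB of text
    for name, value in estimate.items():
        print(f"{name}: {value:,.1f}")
```

Under those assumptions, 800 GB works out to 62,500 linear feet, which is indeed a bit under 100 furlongs.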
The term Library of Congress is often used as an unusual unit of measurement to represent an impressively large quantity of data when discussing digital storage or networking technologies. It refers to the US Library of Congress. Information researchers have estimated that the entire print collections of the Library of Congress represent roughly 10 terabytes of uncompressed textual data. So you could impress your colleagues by saying that a one terabyte hard drive holds as much information as one-tenth of the Library of Congress! Of course, that would be an incorrect statement (as Leslie and Wikipedia both go on to say, it totally ignores non-textual collections), but it is impressive to non-techies.
It is one of the joys in my life to keep track of the "holds a Library of Congress worth of stuff" references. Also explaining just what the Library of Congress collections are, what they include, and how quickly the digital (and print!) collections grow. A lot of press inquiries get sent my way.
Maybe I should start preparing another blog post. If anyone has references or stories, send them my way. I collect them year round from any and all sources.
Leslie
------------
Leslie Johnston
Chief of Repository Development
Library of Congress
--
This made me laugh out loud for a long time. I wonder how I can find out what he weighs…
llj
From: digital-...@googlegroups.com [mailto:digital-...@googlegroups.com] On Behalf Of Simon Spero
Sent: Tuesday, November 12, 2013 5:25 PM
To: digital-...@googlegroups.com
Subject: Re: [digital-curation] How many boxes of documents would fit into a terabyte hard drive?
On Tue, Nov 12, 2013 at 3:20 PM, Johnston, Leslie <les...@loc.gov> wrote:
--