Over the years, I've worked to assess reasonable speeds and then costs for digitization of various kinds of materials. I think it may be possible to set up a reusable framework for making these estimates, to help in developing grant proposals, projecting project costs, and selecting digitization equipment.
At one point, we measured the costs for creating metadata by metadata librarians as opposed to archival staff (and then did a usability test to determine the extent to which this impacted the users). We measured the actual time it took for both sets of people to describe the same content in 3 collections (each a different type of material). (We did *not* try to measure the costs for organization and description of collections, even to the series level. Do you know of anyone who's tried to do this? It seems to be an intractable problem.)
When developing cost estimates for digitization requests, I have included the cost of the equipment and software (assuming it lasted 3 or 5 years with a certain level of usage, dividing the replacement cost by that number of hours, then multiplying by the number of hours used in the proposed project). I also included overhead costs (I used the current percentage our Office of Sponsored Programs uses for this purpose on grant proposals), which includes building maintenance, heating/cooling, etc. Employee costs had to include benefits costs, not just pay. Of course, employee costs and overhead costs vary with the institution.
Still, I think that it might be feasible, and possibly very helpful, if we were to develop a matrix into which people could plug in numbers that are appropriate for their institution, and it would output an expected cost for digitizing a specific set of material. So, for example, one could input that they'll be using students paid $9 an hour, with no benefits costs, an overhead of 18%, digitizing approximately 3,000 2-4 page letters in good condition on flatbeds that can be replaced for $1500, using Windows 7 stations that (with Excel and Photoshop or Silverlight) can be replaced for $1200. They could specify entry of X number of metadata fields into the spreadsheet (weighted, as description takes longer), select the extent of optimization/quality control time, and enter in their own number for the time spent in uploading to their delivery system.
The output would be the expected total costs for digitizing the material they're thinking about putting online. This could then be used at multiple institutions.
What do you think?
--jody