« EPA Libraries Update | Main | Scan This Book! »

Government Information in Legacy Formats: Scaling a Pilot Project to Enable Long-Term Access

From D-Lib Magazine:

Despite the software and hardware problems that these CD-ROMs pose, the main challenges of a large-scale CD-ROM "rescue" project are not primarily technological. Files from CD-ROMs can be systematically copied to redundant, stable server environments. Obsolete file formats can be migrated to non-proprietary formats for continued use of the data; unusual or obsolete software programs can be made available through web-based virtualization. Rather, the main challenges are to organize and fund a collaborative rescue project so that institutions can contribute to different tasks as they are able and willing; to establish a decision-making framework so that portions of the collection that are at highest risk can be addressed first, according to agreed-upon standards; and to ensure quality control of both "rescued" CD-ROM files and associated metadata. The Yale Library pilot project described here has served not only as a means for analyzing and documenting aspects of a CD-ROM migration approach, but also as a launching pad for a community-wide consideration of a large-scale, distributed project to migrate this legacy collection and ensure permanent public access to government information distributed on CD-ROMs.