Title:
CD-ROM Redigitization Project (CDRiP)

dc.contributor.author Dekker, Harrison
dc.contributor.corporatename University of California, Berkeley
dc.date.accessioned 2007-03-12T19:10:36Z
dc.date.available 2007-03-12T19:10:36Z
dc.date.issued 2007-02-22
dc.description Harrison Dekker is the Coordinator of Data Services at Doe and Moffitt Libraries, UC Berkeley. en
dc.description.abstract The CDRiP began with with a fairly narrow goal, to leverage low cost disk storage and create an improved framework for storing, finding and using numeric data published in CD and DVD format. A further goal was to employ emerging metadata statndards rather than legacy or proprietary ones, to promote the extensibility of the system. Other goals have emerged as the project has progressed, most significantly, the need to address software and operating system dependencies often associated with these materials. This last issue is of particular importance given the prevalence of CD products that contain data in proprietay (or obsolete) formats that can only be accessed with the custom software applications accompanying the data on these disks. As a small scale, one-developer project, it was important to choose an approach that allowed a great amount of flexibility. Accordingly, we decided on what's commonly referred to as an iterative design process. Iterative design is defined as "a design methodology based on a cyclic process of prototyping, testing, analyzing, and refining a work in progress. In iterative design, interaction with the designed system is used as a form of research for informing and evolving a project, as successive versions, or iterations of a design are implemented." (http://www.gmlb.com/articles/iterativedesign.html) In essence this approach has allowed us to begin production work on certain aspects of the project before implementation decisions on other aspects have been finalized. In a nutshell, CDRiP is a framework for saving CD image files (ISO format) to a network file server, and automatically generating metadata through both interaction with the library catalog and other programmed processes. Over a thousand CD's and DVD's have been successfully "redigitized" and, with their accompanying metadata files, added to the repository. Under development, and working in prototype, are procedures that allow an end-user to remotely access the repository and, when needed, install applications in a controlled "virtual machine" environment. This approach provides an immediate solution to most of the problems associated with legacy software installation under a modern operation system. It also provides an environment in which software can be installed and run under the operating system version in which it was developed, when all else fails. At the planning stage are automated processes to allow flagging of items for additional processing such as inclusion in the Library's preservation repository workflow or publishing CD contents directly to the web. Eventually, an xml database for enhanced search and retrieval will be implemented. Learning outcomes: Better understanding of the long term preservation and access issues associated with CD-ROM collections Better understanding of how to apply non-marc metadata in a library application Better understanding of virtual machine software and its relevance in the digital library en
dc.format.mimetype application/pdf
dc.format.mimetype audio/mpeg
dc.format.mimetype video/mp4
dc.identifier.uri http://hdl.handle.net/1853/13634
dc.language.iso en_US en
dc.publisher Georgia Institute of Technology en
dc.subject Digital initiatives en
dc.subject CD-ROM Redigitization Project (CDRiP)
dc.subject CDRiP technologies
dc.subject Library data
dc.title CD-ROM Redigitization Project (CDRiP) en
dc.title.alternative UC Berkeley Library CD-ROM Redigitization Project (CDRiP)
dc.type Text
dc.type Audio
dc.type.genre Presentation
dc.type.genre Proceedings
dspace.entity.type Publication
local.contributor.corporatename Library
local.relation.ispartofseries Electronic Resources and Libraries Conference
relation.isOrgUnitOfPublication bf0ff3d1-48ff-4cf4-baa3-4c783958e37a
relation.isSeriesOfPublication 1bc138f4-a871-4adf-a5c0-383f29cc06ed
Files
Original bundle
Now showing 1 - 3 of 3
Thumbnail Image
Name:
330-thurs-3_20.pdf
Size:
2.54 MB
Format:
Adobe Portable Document Format
Description:
Power Point Presentation
No Thumbnail Available
Name:
330-thurs-3_20.mp3
Size:
7.16 MB
Format:
MP3 audio file
Description:
Speech
No Thumbnail Available
Name:
330-thurs-3_20.mp4
Size:
84.34 MB
Format:
MP4 Video file
Description:
Speech and Presentation
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.8 KB
Format:
Item-specific license agreed upon to submission
Description:
Collections