Title:
Unicorn: The myth of federated search realized simply. Unifying DSpace repositories with the PKP Harvester tool

dc.contributor.author Davison, John en_US
dc.contributor.author Gilbertson, Keith en_US
dc.contributor.corporatename Ohio Library and Information Network en_US
dc.coverage.temporal Date: 2009-05-21 08:30 AM – 10:00 AM
dc.date.accessioned 2009-06-11T20:56:26Z
dc.date.available 2009-06-11T20:56:26Z
dc.date.issued 2009-05-21 en_US
dc.description 4th International Conference on Open Repositories en_US
dc.description This presentation was part of the session : DSpace User Group Presentations en_US
dc.description Date: 2009-05-21 08:30 AM – 10:00 AM
dc.description.abstract The Ohio Digital Resource Commons, located at http://drc.ohiolink.edu, is a union of DSpace repositories operated by higher education institutions in Ohio. The repositories are largely organized and supported by OhioLINK, a consortium of 89 Ohio college and university libraries. In support of the vision of the Digital Resource Commons as a statewide resource, the repository operators saw an immediate need for a federated search tool. A "build it now" approach was taken, and federated searching was implemented in a short timeframe at OhioLINK using the PKP Harvester (http://pkp.sfu.ca/?q=harvester) software. A demonstration of the federated search feature at the Digital Resource Commons will be given, highlighting local customizations that were made to PKP Harvester and DSpace in support of the project. These customizations include changes made to mimic the appearance and behavior of existing search interfaces at OhioLINK, and changes made to meet expressed user requirements. Particular attention will be given to a DSpace change that allows image thumbnails to be displayed in federated search results. Issues encountered during the configuration, implementation, and deployment of the PKP Harvester and DSpace OAI-PMH server will be presented, and the choices made in response to these issues will be explained. The process of integrating the search results with the DSpace interface will be detailed, including ongoing efforts to improve the user experience. The Digital Resource Commons' federated search was implemented as a metadata-based search. We will present a general comparison between metadata and full-text searching, highlighting the advantages and disadvantages of each method. A discussion of metadata uniformity and quality concerns will be presented in the context of federated searching. Particular problems encountered with our metadata will be described, with lessons learned and suggestions for resolution. Operational and maintenance concerns of this system will be discussed, including the metadata harvesting schedule, and the need to flush and rebuild indexes when the metadata schema changes. Future ideas for the DRC's federated search feature will be explored, including an implementation of faceted searching using SOLR, harvesting of non-DSpace repositories, such as CONTENTdm and Fedora, and, finally, the possibility of discarding the current model in favor of an OAI-ORE based system, developed for DSpace at Texas Digital Library, that allows for the possibility of full-text federated searching. en_US
dc.description.sponsorship OhioLINK en_US
dc.identifier.uri http://hdl.handle.net/1853/28507
dc.publisher Georgia Institute of Technology en_US
dc.relation.ispartofseries OR09. DSpace User Group Presentations en_US
dc.subject OAI-PMH en_US
dc.subject DSpace en_US
dc.subject Federated searching en_US
dc.subject Digital Resource Commons en_US
dc.subject OhioLINK en_US
dc.subject Metadata en_US
dc.subject PKP Harvester en_US
dc.title Unicorn: The myth of federated search realized simply. Unifying DSpace repositories with the PKP Harvester tool en_US
dc.title.alternative Unifying DSpace repositories with the PKP Harvester tool
dc.type Text
dc.type Moving Image
dc.type.genre Proceedings
dc.type.genre Presentation
dspace.entity.type Publication
local.contributor.corporatename Library
local.relation.ispartofseries Open Repositories Conference
relation.isOrgUnitOfPublication bf0ff3d1-48ff-4cf4-baa3-4c783958e37a
relation.isSeriesOfPublication 91d86e5c-4993-46f1-b27e-8195cabcdede
Files
Original bundle
Now showing 1 - 4 of 4
Thumbnail Image
Name:
167-372-1-PB.pdf
Size:
3.65 MB
Format:
Adobe Portable Document Format
Description:
PDF Presentation
No Thumbnail Available
Name:
167-373-1-PB.pptx
Size:
11.91 MB
Format:
Unknown data format
Description:
Powerpoint Presentation
No Thumbnail Available
Name:
lorc08001000c.mp4
Size:
59.79 MB
Format:
MP4 Video file
Description:
Download Video
No Thumbnail Available
Name:
lorc08001000c_streaming.html
Size:
923 B
Format:
Hypertext Markup Language
Description:
Streaming Video
Collections