View the PDF document Assembling and enriching digital library collections

Bainbridge, D., Thompson, J., Witten, I. H. (2003) C. C. Marshall, G. Henry and L. Delcambre (eds), Proc Third ACM/IEEE Joint Conference on Digital Libraries (JCDL’03), Houston, Texas, 323-334. IEEE Computer Society, Los Alamitos, California.

People who create digital libraries need to gather together the raw material, add metadata as necessary, and design and build new collections. This paper sets out the requirements for these tasks and describes a new tool that supports them interactively, making it easy for users to create their own collections from electronic files of all types. The process involves selecting documents for inclusion, coming up with a suitable metadata set, assigning metadata to each document or group of documents, designing the form of the collection in terms of document formats, searchable indexes, and browsing facilities, building the necessary indexes and data structures, and putting the collection in place for others to use. Moreover, different situations require different workflows, and the system must be flexible enough to cope with these demands. Although the tool is specific to the Greenstone digital library software, the underlying ideas should prove useful in more general contexts.