A collection-level description provides an overview of an aggregation of items, and such an overview may be useful in a wide range of contexts. Resources such as directories and guides of various forms often incorporate sets of collection-level descriptions, perhaps gathered together on the basis of the geographical location of the collections or the subject material of the items in the collections. Collection-level description may be used to inform decision-making by resource managers as well as to enable potential users to discover collections of interest.
So, collection-level descriptions may be created to support various functions and operations. Even within one functional area, however, it may be desirable to reuse the content of a description by presenting it in different forms, perhaps via different channels or in different media (e.g. the same descriptive content may be presented through a Web site and as part of a printed directory).
Furthermore, current trends highlight a separation in digital information architectures between the functions of content provision and presentation. Users seek access to resources through channels that are convenient for their activities. Like other types of metadata record, collection-level descriptions may be created, maintained and made available by one party, but presented to users by many different services, including services developed by other parties. The "scope" of such a service may be considerably broader than that of a service provided by the creator of the description. For example, collection-level description records may be deployed within the context of an institutional directory but may also be used in databases which are regional, national or international in their coverage.
The administrator or owner of a collection should not be required to re-describe their holdings each time a collection-level description is required. The requirement for "reusability" is, then, an important consideration in the creation of collection-level description records. Sometimes the context of reuse may be under the control of the creator, but increasingly it may not. If collection-level description records are to be reused effectively in different contexts, and the meanings intended by their creators are to be preserved in these new contexts, then some minimum requirements should be taken into consideration.
Collection-level descriptions will be delivered to users through a range of channels and media, and the requirements for presentation will vary accordingly. The user functionality available through a Web browser is different from that of reading a printed page, for example, and the presentation of information in different media reflects that. The information content of a record may be presented in different combinations in different channels and media, or it may be presented selectively, and it may be appropriate to vary the sequence of presentation of its component parts or to alter its labelling. Even within a single channel or medium, presentation may be varied to meet the requirements of different classes of user - for example, to enhance accessibility for visually-impaired readers or to address other problems for users with disabilities [1].
Where a record is presented through a separate service, control of presentation is largely delegated to the managers of that service. In such a service, some of the contextual information (e.g. headers and footers of a Web page, the juxtaposition with other information) may be quite different from that of the original context of delivery where presentation was under the control of the creator.
Recommendation: Ensure that, as far as possible, you do not rely on presentational techniques in order for the content of your collection-level descriptions to retain its meaning.
It is easier to generate a record that is structurally simpler or more limited in its level of detail from a richer, more detailed record than vice versa. For collection-level description, this may be a consideration in two areas:
However, creating collection-level descriptions, like other classes of metadata, is potentially time-consuming and costly. The possible flexibility gained from having richer data must always be balanced against the additional effort required to create it. Whether a detailed hierarchical view of collections and sub-collections is required or whether the single record view is sufficient depends on the context of use.
Recommendation: Establish what is an appropriate granularity of description, both in terms of the structural granularity within an individual record and in deciding what should be described as a 'collection'. Consider creating collection-level description metadata at a finer level of granularity, within the constraints of what is feasible.
If collection-level description records are to be exchanged effectively, then the recipient of the record (e.g. a remote service) and the provider of the record must have a shared understanding of its meaning. Such shared understanding is made possible by consensus on meanings that have been made explicit and recorded in standard schemas. Further, it is important not only to use those shared semantics but to declare in each "instance" record that you are doing so.
Recommendation: Make your collection-level descriptions available using the semantics of commonly understood metadata schemas for collection-level description, and declare clearly which schema is in use.
The use of a collection-level description record to support effective resource discovery depends on the consistent application of standards for the construction of elements such as names, subjects and dates. This becomes more significant where resource discovery services are operating across collection-level descriptions and item-level descriptions where content standards such as AACR2 have been applied. As for the use of semantic schemas, the use of content and terminological standards should be declared so they are umambiguously identifiable to a recipient of the record. It may be the case that the use of a particular content standard is implicit from the choice of semantic schema, but wherever ambiguity is possible the use of the standard should be declared in the record.
Recommendation: Adopt commonly understood content standards and/or terminological control when making your collection-level descriptions available, and include in the metadata record an indication of the standard in use.
You will probably be creating your collection-level descriptions with the aim of providing access to them in the form of HTML documents through your own Web site (or that of another service provider). However you should also make them available in forms which other software tools can use.
The precise requirements for this may vary depending on the interfaces supported by the services with which descriptions are to be shared. Requirements may include:
As an illustration of what may be required, it may be useful to consult the guidelines on the technical steps which content providers are recommended to take to interoperate with the JISC Information Environment [5]. It must be emphasised that this is provided as an example only: other services may well have different requirements, but this document gives an indication of what may be needed.
Recommendation: Establish the requirements for providing "structured" machine-oriented access to your descriptions.
When a collection-level description record is presented in a context you control (e.g. via your own Web site), a user may derive additional information from that context e.g. information about the source of the description which influences how the user evaluates the descriptive content. If your records are to be used in services over which you have limited control, you may find that you need to provide information about your collection-level descriptions as information resources (as opposed to the information about the collections contained in the CLD). This may be a subset of what is sometimes called "administrative metadata" and might include:
Recommendation: Provide metadata about your descriptions.
Some parts of a collection-level description are typically "free text" descriptions of charcteristics of a collection. Such text should be neutral and concise, and its creators should bear in mind that it may appear in different contexts. However, there may be cases where it is desirable to tailor the content or style of such text for a specific audience or to address a specific purpose. Description tailored for an academic researcher may be inappropriate for presentation to a more general audience.
An individual collection is an identifiable resource. If the scope of a service based on the use of collection-level descriptions is potentially global, then the identifier of a collection should be globally unique. Further, one of the valuable elements of collection-level description is the capacity to describe relationships between collections. Those relationships may be of various types but their description depends on the ability to cite a persistent identifier for a collection.
There is no general agreement on the requirements for administrative metadata, because metadata records themselves vary in complexity and are used and managed in different ways. Indeed some metadata schemas incorporate some administrative metadata. However, there are some efforts, for example within the Dublin Core Metadata Initiative [6], to identify simple, generic element sets to serve as administrative metadata schemas.
Much of the content included here has been developed from discussions with participants in the Collection Description Focus Workshop series.
[1] Chisholm, Wendy, Vaderheiden, Gregg & Jacobs, Ian, eds. Web Content Accessibility Guidelines 1.0. W3C Recommendation 5 May 1999. Available at http://www.w3.org/TR/WAI-WEBCONTENT/
[2] Library of Congress Network Development & MARC Standards Office. Z39.50 Maintenance Agency Page. Available at http://lcweb.loc.gov/z3950/agency/
[3] Lagoze, Carl, et al, eds. Open Archives Initiative Protocol for Metadata Harvesting. Available at http://www.openarchives.org/OAI/openarchivesprotocol.html
[4] Box, Don, et al. Simple Object Access Protocol (SOAP) 1.1. W3C Note 8 May 2000. Available at http://www.w3.org/TR/SOAP/
[5] Powell, Andy. 5 step guide to becoming a content provider in the JISC Information Environment. Ariadne 33 (Sep/Oct 2002). Available at http://www.ariadne.ac.uk/issue33/info-environment/
For more information on the technical archictecture for the JISC Information environment, see http://www.ukoln.ac.uk/distributed-systems/jisc-ie/arch/
[6] Dublin Core Metadata Initiative. Administrative Metadata Working Group. http://dublincore.org/groups/admin/