Metadata: an overview of current resource description practice
Work Package 3 of Telematics for Research project DESIRE (no. 1004) |
Title page
Table of Contents |
The Dublin Core is positioned as a simple information resource description. However, importantly it also aims to provide a basis for semantic interoperability between other, probably more complicated, formats. A third target use is to provide the basis fo r resource-embedded description, initially with HTML documents.
Ambitions to actualise Dublin Core were carried forward by a second international workshop which took place in the UK at the University of Warwick to UK in April 1996 sponsored by UKOLN and OCLC. This workshop looked at the implementation of Dublin Core a nd the requirements for extensibility, change control and dissemination. The need for a registration agency was discussed at this meeting.
However certain principles were established for further development of the element set. Of particular relevance to encoding and designation are the principles of
extensibility: the core set can be extended with further elements to describe intrinsic data of particular relevance to a particular community
optionality: all elements are optional
repeatability: all elements are repeatable
modifiability: any element can be modified by one or more qualifiers
The sanctioning of qualifiers is of particular note as it is an attempt to bridge the gap between casual and sophisticated use. Qualifiers can be of two very different types: some indicating external schemes to be applied to processing e.g. OtherAgent(sch eme=TEI), some specifying more precise information about the attribute, in effect sub-dividing the element name e.g. OtherAgent(role=editor). If a scheme qualifier is used then this means the syntax of that scheme must be applied to the data in that eleme nt. So Author (scheme=USMARC) fields will contain data embedded with USMARC tags and sub-field markers, and OtherAgent (scheme=TEI) elements will contain data with TEI mark-up tags embedded. Potentially widespread use of qualifiers could cause severe prob lems with interoperability.
At the Warwick workshop a decision was taken to develop a concrete syntax for The Dublin Core in the form of an SGML DTD.
Title (name of the object)
Author (person(s) primarily responsible for intellectual content)
Publisher (agent or agency responsible for making the object available)
OtherAgent (person(s) such as editors or transcribers, who have made other significant intellectual contributions to the work)
Date (date of publication)
ObjectType (genre of the object such as novel, poem, dictionary)
Language (language of the intellectual content)
The Author element name does not distinguish the form of author (personal/corporate/meeting). Similarly the OtherAgent element name does not express the precise role of the other agent. It would be possible to use qualifiers to make these more precise dis tinctions, but the Dublin Core documentation does not attempt to make comprehensive recommendations. Suggested qualifiers are:
Author(scheme=USMARC)=100 1 Doyle, Conan $c Sir, $d 1859-1930
OtherAgent(role=editor)=Weibel,Stuart L.
As soon as such qualifiers are used the complexity of processing the data, and the difficulties for interoperability, will increase.
Subject (topic addressed by the work)
Coverage (the spatial and temporal characteristics of the object)
The subject element can be used for headings controlled by a known classification scheme indicated in the qualifier, or can contain free text. The Coverage element allows spatial or temporal data to be included for geospatial data. This data might be in u nstructured form or in a format governed by a known scheme e.g.
Coverage(type=spatial)=Atlantic ocean
Coverage(type=spatial,scheme=LATLONG)=West=180,East=180,North=90,South=90
Identifier (string or number used to uniquely identify the object)
The data in this element could be an identifier conforming to an internationally recognised scheme (e.g. URL, ISBN) or it could be a local, privately administered number (e.g. university technical report number). The qualifier would need to be used to mak e the identifier generally useful.
Form (the data representation of the object such as Postscript file or windows executable file)
A constraint on the design of the Dublin Core, accepted by the workshop participants, was that the aim of the element set is to describe 'document like objects' (DLOs).
Source (objects, either print or electronic, from which the resource is derived)
This element could be used to link different versions of an object which have the same intellectual content, whereas the relation element would be used to link objects with a different intellectual content.
Relation (relationship to other objects)
This element describes relationships to other objects with different intellectual content. It allows for a variety of relationships to be identified by use of the qualifier mechanism. Specification of a relationship would require use of at least two quali fiers, e.g.
Relation (type=ContainedIn) (identifier=URL) =http://www.ukoln.bath.ac.uk/metareview.html
might be raised; rather, these should be addressed in a Warwick Framework type solution.
Language (language of the intellectual content)
The problems of use of non-ASCII characters within the record were deliberately not addressed.
National Document and Information Service : this is a joint project between the National Libraries of Australia and New Zealand. Within this project the Dublin Core elements have been used as the core search attributes for their records, in effect the int ersection between their various databases. There has been flexibility in the use of semantics with mapping of other 'search fields' to the Dublin Core set.
DSTC in Australia is using the Dublin Core in the Research Data Network Co-operative Research Centre project for resource discovery.
Next | Table of Contents |