This document describes problems with embedded metadata in HTML documents that were experienced when using Microsoft's Internet Assistant for Word.
This example is based on the use of simple embedded Dublin Core metadata.
Initially this page was created using a text editor. It contained the following in the document HEAD:
1 | <META NAME="DC.title" CONTENT="UKOLN: UK Office for Library and Information Networking"> |
2 | <META NAME="DC.subject" CONTENT="national centre, network information"> |
3 | <META NAME="DC.description" CONTENT="UKOLN is a national centre for support in network information management in the library and information communities."> |
4 | <META NAME="DC.creator" CONTENT="UKOLN"> |
5 | <META NAME="DC.creator.email" CONTENT="ukoln@ukoln.ac.uk"> |
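Embedded metadata of this kind can be read back out of a document with a few lines of code. The following is a minimal sketch, assuming Python's standard `html.parser` module; the names `MetaExtractor` and `extract_meta` are hypothetical helpers introduced here for illustration, and the sample HEAD is a shortened copy of the listing above.

```python
from html.parser import HTMLParser


class MetaExtractor(HTMLParser):
    """Collect NAME/CONTENT pairs from META elements (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.meta = {}

    def handle_starttag(self, tag, attrs):
        # html.parser lower-cases tag and attribute names, but not values.
        if tag != "meta":
            return
        attributes = dict(attrs)
        name = attributes.get("name")
        # META elements that only carry HTTP-EQUIV have no NAME and are skipped.
        if name is not None:
            self.meta[name] = attributes.get("content") or ""


def extract_meta(html_text):
    """Return a dict mapping each META NAME to its CONTENT value."""
    parser = MetaExtractor()
    parser.feed(html_text)
    parser.close()
    return parser.meta


head = """<HEAD>
<META NAME="DC.title" CONTENT="UKOLN: UK Office for Library and Information Networking">
<META NAME="DC.creator" CONTENT="UKOLN">
</HEAD>"""

# Keep only the Dublin Core entries, identified here by the "DC." prefix.
dublin_core = {k: v for k, v in extract_meta(head).items() if k.startswith("DC.")}
print(dublin_core)
```

Keeping the values in a plain dictionary makes it easy to compare the metadata of one version of a page against another.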
The document was then loaded into Microsoft Word for Windows and saved as an HTML file. After saving, it contained the following in the document HEAD:
- | <META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=windows-1252"> | Added |
- | <META NAME="Generator" CONTENT="Microsoft Word 97"> | Added |
? | <TITLE>example2-ia</TITLE> | Changed |
1 | <META NAME="DC.title" CONTENT="UKOLN: UK Office for Library and InformationNetworking"> | Corrupted |
2 | <META NAME="DC.subject" CONTENT="national centre, network information"> | |
3 | <META NAME="DC.description" CONTENT="UKOLN is a national centre forsupport in network information management in the library and informationcommunities. "> | Corrupted |
4 | <META NAME="DC.creator" CONTENT="UKOLN"> | |
5 | <META NAME="DC.creator.email" CONTENT="ukoln@ukoln.ac.uk"> | |
- | <META NAME="Template" CONTENT="C:\Program Files\Microsoft Office\Office\html.dot"> | Added |
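Internet Assistant made the following changes to the information contained in the document HEAD: it added three META elements (Content-Type, Generator and Template); it changed the TITLE element, which now reads example2-ia; and it corrupted the DC.title and DC.description values by dropping spaces between words (InformationNetworking, forsupport, informationcommunities) and appending a trailing space to the description.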
Note that if your browser supports style sheets, the lines concerned are indicated in red, green, violet or blue.
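A comparison of this kind can also be made mechanically. The sketch below is one possible approach, not the method used to produce this page: it assumes the NAME/CONTENT pairs of each version have already been collected into dictionaries, and it treats a value as "Corrupted" when it differs from the original only in whitespace. The function name `classify_changes` and the extra "Removed" and "Changed" labels are assumptions introduced for illustration; the data is copied from the two listings above (the HTTP-EQUIV element has no NAME and is left out).

```python
def classify_changes(before, after):
    """Label each META name as Added, Removed, Unchanged, Corrupted or Changed."""

    def squash(value):
        # Remove all whitespace so values differing only in spacing compare equal.
        return "".join(value.split())

    report = {}
    for name in sorted(set(before) | set(after)):
        if name not in before:
            report[name] = "Added"
        elif name not in after:
            report[name] = "Removed"
        elif before[name] == after[name]:
            report[name] = "Unchanged"
        elif squash(before[name]) == squash(after[name]):
            report[name] = "Corrupted"  # same characters, spacing lost or gained
        else:
            report[name] = "Changed"
    return report


before = {
    "DC.title": "UKOLN: UK Office for Library and Information Networking",
    "DC.subject": "national centre, network information",
    "DC.description": "UKOLN is a national centre for support in network "
                      "information management in the library and information "
                      "communities.",
    "DC.creator": "UKOLN",
    "DC.creator.email": "ukoln@ukoln.ac.uk",
}

after = {
    "Generator": "Microsoft Word 97",
    "DC.title": "UKOLN: UK Office for Library and InformationNetworking",
    "DC.subject": "national centre, network information",
    "DC.description": "UKOLN is a national centre forsupport in network "
                      "information management in the library and "
                      "informationcommunities. ",
    "DC.creator": "UKOLN",
    "DC.creator.email": "ukoln@ukoln.ac.uk",
    "Template": r"C:\Program Files\Microsoft Office\Office\html.dot",
}

report = classify_changes(before, after)
for name, status in report.items():
    print(f"{name}: {status}")
```

Comparing whitespace-stripped values is a deliberately narrow test: it catches the lost and added spaces seen in this example, but any other alteration is simply reported as "Changed".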
In addition to changing the information contained in the document HEAD, Internet Assistant added various elements to the document content, in particular <FONT> elements.