UKOLN Metadata Software

Software Which Can Corrupt Embedded Metadata



Software Which Causes Problems With Dublin Core Metadata

Software which has been reported to cause problems when managing and maintaining Dublin Core metadata is listed below.

Software Problem
Microsoft Internet Assistant For Word Internet Assistant is an add-on for the Microsoft Word word processing package. It can be used for editing HTML files, as well as creating HTML from Word documents.
When an HTML file containing Dublin Core metadata is edited, the Dublin Core metadata can be corrupted.
More detailed examples are available.
Microsoft FrontPage Editor v. 2.0.2.1112 - part of FrontPage Explorer 97 FrontPage loses the scheme and language tags when these were not put in the bracket syntax.
Before Using FrontPage
<META NAME=DC.subject CONTENT="(SCHEME=XYZa)(LANGUAGE=ena)blaha">
<META NAME=DC.subject SCHEME=XYZb LANGUAGE=enb CONTENT="blahb">
<META NAME=DC.description SCHEME=XYZc LANGUAGE=enc CONTENT="blahc">
After Using FrontPage
<meta name="DC.subject" content="(SCHEME=XYZa)(LANGUAGE=ena)blaha">
<meta name="DC.subject" content="blahb">
<meta name="DC.description" content="blahc">

The information which is lost is shown in this style

Conclusions

Both Microsoft Internet Assistant for Word and Microsoft FrontPage can corrupt metadata which is embedded according to the HTML 4.0 standard.

Feedback

If you have any comments on this page or have information on other tools which can corrupt embedded metadata, please contact Brian Kelly (email b.kelly@ukoln.ac.uk).