Blog

XMP-Ville

Tony Hammond

Tony Hammond – 2007 September 25

In XMP

Been so busy looking into the technical details of XMP that I almost forgot to check out the current landcsape. Luckily I chanced on these articles by Ron Roszkiewicz for The Seybold Report (and apologies for lifting the title of this post from his last). The articles about XMP are well worth reading and chart the painful progress made to date:

  • The Brief Tortured Life of XMP (July ā€™05)
    • [Thought Leaders Hammer out Metadata Standard] (April ā€™07)
      • [Metadata Persistence and ā€œSave for Webā€¦ā€] (July ā€™07)

      From the earlier characterization of XMP as ā€œunderachieving teenagerā€ Roszkiewicz is cautiously optimistic that IDEAllianceā€™s XMP Open initiative (an initiative to advance XMP as an open industry specification) will help outreach and foster adoption of this fledgling technology.

      (Continues.)

The Nameā€™s The Thing

Tony Hammond

Tony Hammond – 2007 September 20

In XMP

Iā€™m always curious about names and where they come from and what they mean. Hence, my interest was aroused with the constant references to ā€œXAPā€ in XMP. As the XMP Specification (Sept. 2005) says:

ā€œNOTE: The string ā€œXAPā€ or ā€œxapā€ appears in some namespaces, keywords, and related names in this document and in stored XMP data. It reflects an early internal code name for XMP; the names have been preserved for compatibility purposes.ā€

Actually, it occurs in most of the core namespaces: XAP, rather than XMP.

(Continues.)

ACAP - Any chance of success?

Ed Pentz

Ed Pentz – 2007 September 19

In Search

ACAP has released some documents outlining the use cases they will be testing and some proposed changes to the Robots Exclusion Protocol (REP) - both robots.txt and META tags. There are some very practical proposals here to improve search engine indexing. However, the only search engine publicly participating in the project is http://www.exalead.com/ (which according to Alexa attracted 0.0043% of global internet visits over the last three months). The main docs are ā€œACAP pilot Summary use cases being testedā€, ā€œACAP Technical Framework - Robots Exclusion Protocol - strawman proposals Part 1ā€, ā€œACAP Technical Framework - Robots Exclusion Protocol - strawman proposals Part 2ā€, ā€œACAP Technical Framework - Usage Definitions - draft for pilot testingā€.

Style Guides Recommend DOI strings

Ed Pentz

Ed Pentz – 2007 September 19

In Citation Formats

A couple of recent posts - from A couple of recent posts - from at Jefferson University and IFST at Univ of Delaware- note that the AMA and APA style guides now recommend using a DOI, if one is assigned, in a journal article citation.

A citation in the APA style with a DOI would be:

Conley, D., Pfeiffera, K. M., & Velez, M. (2007). Explaining sibling differences in achievement and behavioral outcomes: The importance of within- and between-family factors. Social Science Research36(3), 1087-1104. doi:10.1016/j.ssresearch.2006.09.002

In the AMA style a reference would be:

Kitajima TS, Kawashima SA, Watanabe Y. The conserved kinetochore protein shugoshin protects centromeric cohesion during meiosis. Nature. 2004;427(6974):510-517. doi:10.1038/nature02312

This is great news. I havenā€™t looked at the full style guides but itā€™s not clear if information is given about linking DOIs via http://0-dx-doi-org.pugwash.lib.warwick.ac.uk/

Chapter 9 - The Closed Book

Tony Hammond

Tony Hammond – 2007 September 15

In Discussion

Hadnā€™t really noticed before but was fairly gobsmacked by this notice I just saw on the DOIĀ® Handbook: **Please note that Chapter 9, Operating Procedures is for Registration Agency personnel only.** DOIĀ® Handbook doi:10.1000/182 http://0-www-doi-org.pugwash.lib.warwick.ac.uk/hb.html And, indeed, the Handbookā€™s TOC only reconfirms this: 9 Operating procedures* *The RA password is required for viewing Chapter 9. 9.1 Registering a DOI name with associated metadata 9.2 Prefix assignment 9.3 Transferring DOI names from one Registrant to another

Custom Panel for CC

Tony Hammond

Tony Hammond – 2007 September 15

In Metadata

Creative Commons now have a custom panel for adding CC licenses using Adobe apps - see here. Interesting on two counts: Machine readable licenses XMP metadata But I still think that batch solutions for adding XMP metadata are really required for publishing workflows. And ideally there should be support for adding arbitrary XMP packets if weā€™re going to have truly rich metadata. I rather fear the constraints that custom panels place upon the publisher.

Last Orders Please!

Tony Hammond

Tony Hammond – 2007 September 13

In Metadata

Public comment period on the PRISM 2.0 draft ends Saturday (Sept. 15) ahead of next weekā€™s WG meeting to review feedback and finalize the spec. (I put in some comments about XMP already. Hope they got that.)

Marking up DOI

Tony Hammond

Tony Hammond – 2007 September 11

In XMP

(Update - 2007.09.15: Clean forgot to add in the rdf: namespace to the examples for xmp:Identifier in this post. Iā€™ve now added in that namespace to the markup fragments listed. Also added in a comment here which shows the example in RDF/XML for those who may prefer that over RDF/N3.)

So, as a preliminary to reviewing how a fuller metadata description of a Crossref resource may best be fitted into an XMP packet for embedding into a PDF, letā€™s just consider how a DOI can be embedded into XMP. And since itā€™s so much clearer to read letā€™s just conduct this analysis using RDF/N3. (Life is too short to be spent reading RDF/XML or C++ code. :~)

(And further to Chris Shillumā€™s comment [(Update - 2007.09.15: Clean forgot to add in the rdf: namespace to the examples for xmp:Identifier in this post. Iā€™ve now added in that namespace to the markup fragments listed. Also added in a comment here which shows the example in RDF/XML for those who may prefer that over RDF/N3.)

So, as a preliminary to reviewing how a fuller metadata description of a Crossref resource may best be fitted into an XMP packet for embedding into a PDF, letā€™s just consider how a DOI can be embedded into XMP. And since itā€™s so much clearer to read letā€™s just conduct this analysis using RDF/N3. (Life is too short to be spent reading RDF/XML or C++ code. :~)

(And further to Chris Shillumā€™s comment]2 on my earlier post Metadata in PDF: 2. Use Cases where he notes that Elsevier are looking to upgrade their markup of DOI in PDF to use XMP, Iā€™m really hoping that Elsevier may have something to bring to the party and share with us. A consensus rendering of DOI within XMP is going to be of benefit to all.)

(Continues.)

The Second Wave

Tony Hammond

Tony Hammond – 2007 September 11

In Metadata

You might have been wondering why Iā€™ve been banging on about XMP here. Why the emphasis on one vendor technology on a blog focussed on an industry linking solution? Well, this post is an attempt to answer that.

Four years ago we at Nature Publishing Group, along with a select few early adopters, started up our RSS news feeds. We chose to use RSS 1.0 as the platform of choice which allowed us to embed a rich metadata term set using multiple schemas - especially Dublin Core and PRISM. We evangelized this much at the time and published documents on XML.com (Jul. ā€™03) and in D-Lib Magazine (Dec. ā€™04) as well as speaking about this at various meetings and blogging about it. Since that time many more publishers have come on board and now provide RSS routinely, many of them choosing to enrich their feeds with metadata.

Well, RSS can be seen in hindsight as being the First Wave of projecting a web presence beyond the content platform using standard markup formats. With this embedded metadata a publisher can expand their web footprint and allow users to link back to their content server.

Now, XMP with its potential for embedding metadata in rich media can be seen as a Second Wave. Media assets distributed over the network can now carry along their own metadata and identity which can be leveraged by third-party applications to provide interesting new functionalities and link-back capability. Again a projection of web presence.

(Continues.)

W5M0MpCehiHzreSzNTczkc9d

Tony Hammond

Tony Hammond – 2007 September 10

In XMP

What on earth can this string mean: ā€˜W5M0MpCehiHzreSzNTczkc9dā€™? This occurs in the XMP packet header:

Well from the XMP Specification (September 2005) which is available here (PDF) there is this text:

ā€œThe required id attribute must follow begin. For all packets defined by this version of the syntax, the value of id is the following string: W5M0MpCehiHzreSzNTczkc9dā€

(See: 3 XMP Storage Model / XMP Packet Wrapper / Header / Attribute: id)

OK, so itā€™s no big deal to cut and paste that string, itā€™s just mighty curious why this cryptic key is needed in an open specification, especially since (contrary to what might be implied by the text) it doesnā€™t seem to vary with version. (Or hasnā€™t yet, at any rate - more below.)