towards a richer set of information to describe our complete genome collection

Permanent unique identifiers

From Genomic Standards Consortium


Introduction


A comparison table of three persistent identifier frameworks has been provided; these were chosen because of their existing or proposed implementation within the life science field.


Comparison Table


Persistent Identifier

DOI

LSID

ARK

Characteristic








Name

Digital Object Identifier

Life Science Identifier

Archival Resource Key





Existing Implementations

Scientific Journals assign DOI to research papers


Names for Life

resolution service for DOIs assigned to taxonomic concepts, for use by publishers, i.e. taxonomic links directed to DOI resolution service

http://www.names4life.com/


GBIF

Global Biodiversity Informatics Facility (GBIF) has announced that it, will use LSIDs

gbif.org


TDWG

At same time as GBIF, the Taxonomic Databases Working Group agreed on use of LSIDs

www.tdwg.org


Taverna

LSIDs are assigned in Taverna, workflow software developed at University of Manchester

http://taverna.sourceforge.net/




California Digital Library

ARK identifier in use at the California Digital Library.


W3C Health Care and Life Sciences Semantic Web Interest Group

Use of ARK identifiers is a topic of discussion in W3C Health Care and Life Sciences Semantic Web Interest Group

Developer

International DOI Foundation (IDF)

http://www.doi.org/welcome.html



Object Management Group

an open-membership, not-for-profit consortium that produces and maintains computer industry specifications that enable data integration

www.omg.org

California Digital Library

www.cdlib.org


Advantages

multiple, stable, and persistent paths to end-users

(from names for life web site)


simplify link maintenance


Resolution via http, i.e. ARK identifier can be typed into web browser and onus is on identifier authority to provide resolution service





Principal Information Resource

Digital Object Identifier System

www.doi.org


Life Science Identifiers specification

omg.org



California Digital Library (CDL) – Introduction to ARK Identifiers

[1] ARK





Further reading

Digital Object Identifier System

www.doi.org



Life Science Identifiers specification

omg.org



California Digital Library (CDL) – Introduction to ARK Identifiers

http://www.cdlib.org/inside/diglib/ark/


The ARK Persistent Identifier Scheme: the complete ARK specification.

ARK specification





Identifier Syntax

10.1234/NP5678




two components, the prefix and the suffix


prefix before / is the unique naming authority

suffix after / is the unique resource identifier that can accommodate any existing identifier

urn:lsid: ncbi.nlm.nih.gov: GenBank:T48601:2


urn:lsid:organisation-identifier: digital-object-type: digital-object-identifier:version


[http://domainname/]ark: /NAAN/name[quantifier]

elements in brackets are mutable

NAAN is the identifier for the organisation





cost to assign an identifier

Cost involved for each DOI

No cost

No cost

How to assign an identifier

Requirement for organisation to be assigned a DOI prefix by a registration agency


Registration agency free to set fees for assignment of individual DOIs

Organisation is free to assign its own identifiers, the organisation id is a domain name which is managed by the domain name authorities and should be owned by the organisation that issues and lsid

Request organisation id from CDL, use appropriate software to generate ARK-compatible identifier. Recommendation for identifier to include a terminal check digit

How to resolve an identifier

A DOI is a URN and requires software to translate URN to data object. Web sites are available for this purpose e.g. http://dx.doi.org, although organisations would most likely want to develop their own resolution service implementation to serve metadata and data objects as required


An LSID is a URN and requires software to translate URN to object location


IBM has developed software for this purpose

further-info


An ARK identifier is a URL and can be resolved in a web browser, with onus on organization to provide resolution software – no publically-available software at present for this purpose

What information is provided when an identifier is resolved

Digital object

Digital object

metadata

Digital object

metadata

commitment statement

Loading...