<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>&#964;&#949;&#967;&#957;&#959;&#963;&#959;&#966;&#953;&#945; &#187; Persistent Identifiers</title>
	<atom:link href="http://lackoftalent.org/michael/blog/category/libraries/digital-libraries/persistent-identifiers/feed/" rel="self" type="application/rss+xml" />
	<link>http://lackoftalent.org/michael/blog</link>
	<description>The occasional rambling of a digital library artisan</description>
	<lastBuildDate>Mon, 10 Oct 2011 12:33:06 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>I2: Resource Description</title>
		<link>http://lackoftalent.org/michael/blog/2010/05/19/i2-resource-description/</link>
		<comments>http://lackoftalent.org/michael/blog/2010/05/19/i2-resource-description/#comments</comments>
		<pubDate>Wed, 19 May 2010 13:38:18 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Cataloging and Metadata]]></category>
		<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[NISO I2]]></category>
		<category><![CDATA[Persistent Identifiers]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Scholarly Communication]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=568</guid>
		<description><![CDATA[I can hardly believe it&#039;s been eight months since I last wrote about the NISO I2 project. A lot has changed since then[1]. I continue to work on I2 however; they won&#039;t get rid of me that easily. In the last post, I wrote: The next step is to build upon the report to draw [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:568"><!-- &nbsp; --></abbr>
<p>I can hardly believe it&#039;s been eight months since I last wrote about the <a href="http://lackoftalent.org/michael/blog/category/projects/niso-i2/">NISO I2</a> project.  A lot has changed since then[<a href="http://lackoftalent.org/michael/blog/2010/05/19/i2-resource-description/#footnote_0_568" id="identifier_0_568" class="footnote-link footnote-identifier-link" title="I&amp;#8217;ve moved and changed jobs, in fact">1</a>].  I continue to work on I2 however; they won&#039;t get rid of me that easily.</p>
<p>In the <a href="http://lackoftalent.org/michael/blog/2009/09/15/i2-survey-results/">last post</a>, I wrote:<br />
<blockquote>The next step is to build upon the report to draw yet more conclusions from the data â€” there&#039;s an awful lot there â€” and flesh out some repository use cases for institutional identifiers. The I2 core group is moving quickly towards finalizing identifier metadata elements so that a standard may be drafted, and I think having some use cases documented will help drive the standard in a direction the community can get behind.</p></blockquote>
<p> Since that time, the three scenario groups &#8212; Electronic Resources; Institutional Repositories and Learning Management Systems; and Library Resource Management &#8212; have concluded their work.  The work of the scenario groups included surveys of over 300 people working in these fields.  The survey results have been analyzed and reports were posted on the NISO website.  These reports have been used to flesh out use cases for an institutional identifier.  Upon completion of this work, the scenario groups were disbanded and work continued in a broader I2 working group.</p>
<p>The I2 working group has concentrated its work on analysis of similar standards and, as I alluded to earlier, significant effort has gone into defining core metadata to identify institutions, such as institution name, institution type, location information, variant identifiers, domain name(s), URL(s), and (optionally-typed) relationships to other institutions.  During these discussions it was difficult for me to hear the issues and needs around I2&#039;s metadata and identifiers without <a href="http://lackoftalent.org/michael/blog/2009/06/13/i2-strawman/">linked data springing to mind</a>.  </p>
<p>While we are designing a standard and not a system or a service <em>per se</em>, it seems useful to include in the standard an informative section about implementation and architecture[<a href="http://lackoftalent.org/michael/blog/2010/05/19/i2-resource-description/#footnote_1_568" id="identifier_1_568" class="footnote-link footnote-identifier-link" title="This practice seems more or less common in my (admittedly limited) experience, cf. the unAPI specification.">2</a>]; I find that reading standards is much easier on the brain when you get not only the standard itself but some examples of implementation, and that will be true as well, one hopes, of I2 standard implementers.  To that end, the group will be producing an XML schema of the I2 metadata elements and also an RDF schema.</p>
<p>I have been working on the RDF for I2 on and off for the past month or two.  Below are my impressions, as someone who is new to modeling in RDF, and the procedures I used to produce the draft RDF schema.<br />
<span id="more-568"></span><br />
Despite their names, RDF schema and XML schema are quite different[<a href="http://lackoftalent.org/michael/blog/2010/05/19/i2-resource-description/#footnote_2_568" id="identifier_2_568" class="footnote-link footnote-identifier-link" title="This reflection should come as little surprise since RDF and XML are different kinds of things: RDF is a data model and XML is a serialization format.">3</a>].  The XML schema is a tool for validating an XML-based document or record, and it&#039;s a common tool for modeling metadata in libraryland.  Not so with RDF schema, where the notion of document or record is replaced by the notion of a set of triples.  The focus in RDF is on the triple not on the document, and so validation of documents or records is not the point of RDF schema.  This took some effort to wrap my mind around.</p>
<p>Before I modeled I2 in RDF, I sketched out a domain model of I2 by copying relevant bits of information from I2 documents and pasting them into a text editor.  Then I put them into classes.  In I2&#039;s case, the domain model contained three classes of things: metadata elements about an institution, relationships between institutions, and types of institution.</p>
<p>I gathered some examples of relatively simple RDF schemas and transformed them into the <a href="http://www.w3.org/TeamSubmission/turtle/">Turtle</a> serialization format[<a href="http://lackoftalent.org/michael/blog/2010/05/19/i2-resource-description/#footnote_3_568" id="identifier_3_568" class="footnote-link footnote-identifier-link" title="Using rapper, a nifty little tool.">4</a>] for ease of reading, using them as a template for the I2 schema.</p>
<p>In the <a href="http://www.w3.org/TR/rdf-schema/">RDF schema (RDFS)</a> specification, there are two classes of things in the domain model: classes and properties.  If you are familiar with object-oriented programming, chances are you already grok this way of modeling, but otherwise, generally: a class is like a type and a property is an attribute.  If I were to model myself in RDF schema, then, I might say I am in the class of human beings, and one of my many properties is having a particular birth date, and another is having been born in a particular city.  The next step was to take the I2 domain model (metadata elements about an institution, relationships between institutions, and types of institution) and decide whether each thing was a class or a property.  I decided that the former two were sets of properties and that type of institution could be modeled as a set of classes.</p>
<p>Having a conceptual model of I2 and how it fit into the RDF schema way of thinking about things, I wrote a simple ontology defining one RDFS class per type of institution, and one RDFS property per metadata element and one per relationship type.  This would have sufficed as an ontology.</p>
<p>Exposing RDF-based resources on the web as linked data, however, represents an opportunity for metadata element-level interoperability at global scale.  In order to interoperate with the existing corpus of linked data available on the web, I went through the new I2 ontology and looked for areas where I could re-use, or subclass or otherwise link to, classes and properties already defined in more widely-used ontologies.  I realized at this point just how different coming up with a new XML document format was from writing an RDF ontology; whereas I might have wanted the former to be comprehensive and inclusive of every single aspect within the I2 domain model, my goal with the latter became to eliminate it (by trimming it down to only those bits which are not defined elsewhere).</p>
<p>Since the RDF ontology for I2 is not inclusive of the entire domain model, it seemed necessary to produce another reference document: a set of instances of I2 resources showing the mingling of new I2-specific classes and properties with well-defined classes and properties from other ontologies.</p>
<p>I shared rough first drafts of these documents and received very helpful feedback from some folks who are better-versed in this than myself.  I&#039;ve now incorporated their feedback into the latest I2 ontology and instance document.  I hope to include both of these into a draft of the I2 specification which will go out for comment in the coming months.  Here&#039;s the latest <a href="http://gist.github.com/358857">ontology</a> and the latest <a href="http://gist.github.com/358858">set of instances</a>.</p>
<h5>Notes</h5><ol class="footnotes"><li id="footnote_0_568" class="footnote">I&#039;ve moved and <a href="http://lackoftalent.org/michael/blog/2009/12/22/forking/">changed jobs</a>, in fact</li><li id="footnote_1_568" class="footnote">This practice seems more or less common in my (admittedly limited) experience, cf. <a href="http://unapi.info/specs/">the unAPI specification</a>.</li><li id="footnote_2_568" class="footnote">This reflection should come as little surprise since RDF and XML are different kinds of things: RDF is a data model and XML is a serialization format.</li><li id="footnote_3_568" class="footnote">Using <a href="http://librdf.org/raptor/rapper.html">rapper</a>, a nifty little tool.</li></ol><br/>
<hr/>]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2010/05/19/i2-resource-description/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>I2: Survey results</title>
		<link>http://lackoftalent.org/michael/blog/2009/09/15/i2-survey-results/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/09/15/i2-survey-results/#comments</comments>
		<pubDate>Tue, 15 Sep 2009 12:29:55 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[NISO I2]]></category>
		<category><![CDATA[Persistent Identifiers]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Scholarly Communication]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=497</guid>
		<description><![CDATA[I wrote in June that the I2 subgroup surveyed &#034;repository managers to determine the current practices and needs of the repository community regarding institutional identifiers. Results from the survey will inform a set of use cases that will be shared with the community, and that are expected to drive the development of a new standard [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:497"><!-- &nbsp; --></abbr>
<p>I <a href="http://lackoftalent.org/michael/blog/2009/06/20/i2-survey/">wrote</a> in June that the I2 subgroup surveyed &#034;repository managers to determine the current practices and needs of the repository community regarding institutional identifiers. Results from the survey will inform a set of use cases that will be shared with the community, and that are expected to drive the development of a new standard for institutional identifiers.&#034;</p>
<p>The survey closed in July, and the subgroup spent August writing a report on the survey results.  That report is now <a href="http://www.niso.org/apps/group_public/document.php?document_id=2773">final</a> and it&#039;s available to the public.  Feedback may be sent to our (woefully underutilized) public <a href="http://www.niso.org/lists/i2info/">i2info</a> mailing list, left as a comment on this post, or e-mailed to me privately which I can forward to our internal list.</p>
<p>The next step is to build upon the report to draw yet more conclusions from the data &#8212; there&#039;s an awful lot there &#8212; and flesh out some repository use cases for institutional identifiers.  The I2 core group is moving quickly towards finalizing identifier metadata elements so that a standard may be drafted, and I think having some use cases documented will help drive the standard in a direction the community can get behind.</p>
<p>Onward and upward.</p>
]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/09/15/i2-survey-results/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>I2: Survey</title>
		<link>http://lackoftalent.org/michael/blog/2009/06/20/i2-survey/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/06/20/i2-survey/#comments</comments>
		<pubDate>Sat, 20 Jun 2009 18:30:53 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[NISO I2]]></category>
		<category><![CDATA[Persistent Identifiers]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Scholarly Communication]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=363</guid>
		<description><![CDATA[[Series] Near the end of my strawman post, I wrote: The I2 repositories subgroup will be sending out its survey on identifier use cases in the coming week. It will be interesting to see if the requirements we have thus far identified still obtain in light of the data we collect from the survey. We [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:363"><!-- &nbsp; --></abbr>
<p>[<a href="http://lackoftalent.org/michael/blog/category/niso-i2/">Series</a>]</p>
<p>Near the end of my <a href="http://lackoftalent.org/michael/blog/2009/06/13/i2-strawman/">strawman post</a>, I wrote:<br />
<blockquote>The I2 repositories subgroup will be sending out its survey on identifier use cases in the coming week.  It will be interesting to see if the requirements we have thus far identified still obtain in light of the data we collect from the survey. </p></blockquote>
<p>We completed the survey late last week and began distributing it.  Here&#039;s what we sent out:<br />
<blockquote>
<p>The NISO I2 Working Group is surveying repository managers to determine the current practices and needs of the repository community regarding institutional identifiers.  We value your time and your input in the process to create a standard for a new institutional identifier.  We hope that you will complete the survey which should take less than 15 minutes.  The survey will remain open through Monday, July 6th.</p>
<p>
Here is a link to the survey:<br />
<a href="http://www.surveymonkey.com/s.aspx?sm=RGQgZ3090DVrb3kFzr3P3Q_3d_3d">http://www.surveymonkey.com/s.aspx?sm=RGQgZ3090DVrb3kFzr3P3Q_3d_3d</a></p>
<p>
Please feel free to share this message with other interested parties.</p>
</blockquote>
<p>First we used <a href="http://www.surveymonkey.com/">Survey Monkey</a> to send the survey link to approximately one-hundred repository managers that the subgroup identified.   Our process for identifying repository managers involved pulling together a list of prominent repositories from subgroup members, and then gathering more from <a href="http://www.opendoar.org/">OpenDOAR</a>, &#034;an authoritative directory of academic open access repositories.&#034;  Then subgroup members were encouraged to share the survey link with colleagues, and post it far and wide via blogs, listservs, and <a href="http://twitter.com/mjgiarlo/status/2230486784">tweets</a>.  The listservs we targeted were: <a href="http://www.jiscmail.ac.uk/archives/jisc-repositories.html">JISC-REPOSITORIES</a>, <a href="http://metadatalibrarians.monarchos.com/">metadataLibrarians</a>, <a href="http://groups.google.com/group/digital-curation">digital-curation</a>, <a href="https://arl.org/Lists/SPARC-IR/">SPARC-IR</a>, <a href="http://mailman.anu.edu.au/mailman/listinfo/ir-net">ir-net</a>, <a href="http://www.lsoft.com/SCRIPTS/WL.EXE?SL1=REPOMAN-L&#038;H=LISTSERV.INDIANA.EDU">REPOMAN-L</a>, <a href="http://larch.palinet.org/archives/palinet-ir-l.html">PALINET-IR-L</a>, <a href="http://mailman.mit.edu/mailman/listinfo/dspace-general">dspace-general</a>, <a href="https://lists.sourceforge.net/lists/listinfo/fedora-commons-users">fedora-commons-users</a>, <a href="http://dublincore.org/groups/identifiers/">DC-IDENTIFIERS</a>, and <a href="http://dewey.library.nd.edu/mailing-lists/code4lib/">code4lib</a>.</p>
<p>I&#039;ve already received a few responses and have gotten useful feedback.  Two of the hardest questions to answer so far have been: &#034;What is an institutional identifier?&#034; and &#034;What is a repository?&#034;</p>
<dl>
<dt><strong>Institutional identifier</strong></dt>
<dd>
<p>An institutional identifier is defined as a symbol or code that uniquely identifies an institution.  Domain-specific examples of existing identifiers include SAN, IPEDS, GLN, MARC Org Code, and ISIL.  Another example might be a Handle prefix or ARK name authority assigning number.</p>
</dd>
<dt><strong>Repository</strong></dt>
<dd>
<p>Institutional repositories and subject repositories like arxiv.org are clearly &#039;repositories&#039;, but beyond that it is a somewhat ill-defined term.  One might look to the <a href="http://www.cnri.reston.va.us/k-w.html">Kahn-Wilensky architecture</a>, or the <a href="http://public.ccsds.org/publications/archive/650x0b1.pdf">OAIS reference model (PDF)</a>, or even <a href="http://en.wikipedia.org/wiki/Digital_repository">Wikipedia</a> for definitions, but it&#039;s not clear that even the authorities agree on what constitutes a repository.</p>
<p>It&#039;s a system.  It&#039;s network-accessible and typically has a web interface of some sort.  Files and groups of files sometimes known as objects tend to be deposited in them, perhaps for some combination of management, access, or preservation.  Many run Fedora, DSpace, and ePrints, and factor heavily in scholarly communication.  Some are document-centric.  Some will accept anything.  To some, a learning management system may be a repo.  To others, a content management system may fit.</p>
<p>My background is in academia so my own definition is somewhat based in that context, but I wouldn&#039;t say the term is necessarily limited to that context.  There are other NISO I2 scenarios for library workflows and electronic resources, so it&#039;s safe to assume that repository does not mean ILS or OPAC or ERP system.  My hope is that folks have their own working definitions of the term and can decide for themselves what it means.</p>
</dd>
</dl>
<p>We&#039;ve given folks a little over two weeks to respond to the survey, so the constant I2 drum-beating will quiet down for a while around here.  I am very interested in what sorts of responses we get from the survey.  Fun times!</p>
<p>Oh, and perhaps it goes without saying, but if you&#039;re a repository owner, manager, expert, developer, or stakeholder with an interest in identifiers, please feel free to take the <a href="http://www.surveymonkey.com/s.aspx?sm=RGQgZ3090DVrb3kFzr3P3Q_3d_3d">survey</a>!</p>
]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/06/20/i2-survey/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>I2: Strawman</title>
		<link>http://lackoftalent.org/michael/blog/2009/06/13/i2-strawman/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/06/13/i2-strawman/#comments</comments>
		<pubDate>Sat, 13 Jun 2009 23:02:36 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[NISO I2]]></category>
		<category><![CDATA[Persistent Identifiers]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Scholarly Communication]]></category>
		<category><![CDATA[Hypertext Transfer Protocol]]></category>
		<category><![CDATA[Metadata]]></category>
		<category><![CDATA[Tim Berners-Lee]]></category>
		<category><![CDATA[Uniform Resource Identifier]]></category>
		<category><![CDATA[World Wide Web]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=346</guid>
		<description><![CDATA[[Series] In the prior I2 post, I wrote about the requirements the repositories subgroup has come up with for an institutional identifier standard (with the hope that our findings re: repositories could be generalized to other scenarios). Image by PhOtOnQuAnTiQuE via Flickr My strawman proposal of sorts is to explore how well linked data patterns [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:346"><!-- &nbsp; --></abbr>
<p style="text-align: left;">
<p style="text-align: left;">[<a href="http://lackoftalent.org/michael/blog/category/niso-i2/">Series</a>]</p>
<p style="text-align: left;">In the prior I2 post, I wrote about the requirements the repositories subgroup has come up with for an institutional identifier standard (with the hope that our findings re: repositories could be generalized to other scenarios).</p>
<div class="zemanta-img zemanta-action-dragged" style="margin: 1em; display: block; text-align: left;">
<div>
<dl class="wp-caption alignright" style="width: 250px;">
<dt class="wp-caption-dt"><a href="http://www.flickr.com/photos/67968452@N00/3272712288"><img title="PhotonQ-Tim Berners Lee on Linked Data at TED" src="http://farm4.static.flickr.com/3449/3272712288_2ef843a4b7_m.jpg" alt="PhotonQ-Tim Berners Lee on Linked Data at TED" width="240" height="180" /></a></dt>
<dd class="wp-caption-dd zemanta-img-attribution" style="font-size: 0.8em;">Image by <a href="http://www.flickr.com/photos/67968452@N00/3272712288">PhOtOnQuAnTiQuE</a> via Flickr</dd>
</dl>
</div>
</div>
<p style="text-align: left;">My strawman proposal of sorts is to explore how well <a href="http://linkeddata.org/">linked data patterns</a> fit this problem space.  Linked data, briefly, is a way to expose and link data on the web in a more semantically meaningful way, and is often summarized using the four principles put forward by <a href="http://en.wikipedia.org/wiki/Tim_Berners-Lee">Tim Berners-Lee</a>:</p>
<blockquote style="text-align: left;">
<ol>
<li>Use URIs as names for things</li>
<li>Use HTTP URIs so that people can look up those names.</li>
<li>When someone looks up a URI, provide useful information.</li>
<li>Include links to other URIs. so that they can discover more things.</li>
</ol>
</blockquote>
<p style="text-align: left;">That&#039;s the crux of it.Â  Linked data takes well-known patterns on the web (linking, dereferencing, etc.) and applies them to data, which in this case could be metadata for identifying institutions.</p>
<p style="text-align: left;">Let&#039;s examine each of the requirements and the applicability of linked data thereto.</p>
<ol style="text-align: left;">
<li><em>Should be agnostic to type of institution, e.g., libraries, museums, personal collections, historical societies</em>: The web is already agnostic to type of institution.Â  HTTP URIs do not favor one type of institution over another.<br/></li>
<li><em>Should handle varying institutional granularity, e.g., institution-level, campus-level, division-level, unit-level</em>: HTTP <a href="http://en.wikipedia.org/wiki/Uniform_Resource_Identifier">URIs</a> are flexible in this regard.Â  Hierarchy, should one wish it to be surfaced in the identifier, may be encoded in either a DNS hostname or the path appended to the DNS name.Â  One can imagine a URI like &#034;http://department.division.institution.tld/unit/subunit&#034; or &#034;http://institution.tld/campus/office/individual&#034;. <br/><br/>Hierarchy needn&#039;t be surfaced in the identifier if one favors opacity, in which case &#034;http://registry.tld/xnjsdasd&#034; would suffice as an identifier, and may instead be entirely reflected in the (RDF) representation returned by dereferencing the URI.<br/></li>
<li><em>Should handle linking among institutions and subordinate units</em>: Linked data handles linking via well-known HTTP mechanisms, referenced in the fourth principle of linked data.Â  Unlike the HTTP link, which has limited semantics, linked data links are semantically rich and extensible.<br/></li>
<li><em>Should express different sorts of relationships among these institutions and units</em>: The &#034;useful information&#034; in the third principle of linked data is typically provided by an RDF representation, which is itself a list of assertions.Â  These assertions, or triples, consist of subjects, predicates, and objects.Â  The ability to express the relationships in this requirement is limited only by the availability of vocabularies that contain sets of predicates and classes for subjects and objects.Â  Think of the predicates as elements defined within a metadata standard, e.g., <a href="http://dublincore.org/documents/dces/">Dublin Core</a> &#034;creator&#034;, <a href="http://www.loc.gov/standards/mods/">MODS</a> &#034;relatedItem&#034;, and so forth.Â  Vocabularies that contain these predicates and classes are growing and evolving daily, and should there not be a vocabulary that contains the relationship one wishes to express, it is fairly easy to create a custom vocabulary. <br/><br/>The ability to mix and match vocabularies provides an expressiveness that is often not found in document-based metadata formats and the flexibility to express radically different relationships on a per-industry or per-institution basis.Â  This latter point is important as the I2 group has identified both core metadata elements for identifying institutions of different types and additional elements for specific types of institutions.Â  Why re-invent a new metadata format or schema when all one needs to express may already be contained in others?<br/></li>
<li><em>Should relate to existing relevant identifiers and registries</em>: Same as requirement#4.Â  Linked data is all about expressing relationships between things, e.g., institutions, identifiers, registries, etc.<br/></li>
<li><em>Should be globally unique</em>: HTTP URIs are guaranteed to be globally unique by virtue of the distributed <a href="http://en.wikipedia.org/wiki/Domain_name_system">DNS</a> system and hierarchical naming within each HTTP service.<br/></li>
<li><em>Should be actionable</em>: HTTP URIs provide dereferenceability/actionability via the well-known HTTP protocol.<br/></li>
<li><em>Should enable retrieval of metadata sufficient to identify the institution, which may vary widely by institution</em>: HTTP URIs are actionable per requirement #7 and the metadata returned is flexible per requirement #4.<br/></li>
<li><em>Should accommodate changes as institutions come and go and re-organize and be able to relate defunct institutions to new ones</em>: Linked data patterns provide for redirecting from defunct representations (institutional identifiers) to new ones via HTTP redirects.Â  One may also add assertions to institutional metadata such as owl:sameAs, for instance, which says that the institution identified by the given URI is the same as another institution identified by another URI.<br/></li>
</ol>
<p>This seems like a compelling path to follow for the I2 standard.</p>
<p>The I2 repositories subgroup will be sending out its survey on identifier use cases in the coming week.Â  It will be interesting to see if the requirements we have thus far identified still obtain in light of the data we collect from the survey.Â  If so, I would like to explore the idea of linked data for institutional identifiers a bit more.</p>
]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/06/13/i2-strawman/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>I2: Requirements</title>
		<link>http://lackoftalent.org/michael/blog/2009/06/07/i2-requirements/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/06/07/i2-requirements/#comments</comments>
		<pubDate>Sun, 07 Jun 2009 21:14:56 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[NISO I2]]></category>
		<category><![CDATA[Persistent Identifiers]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Scholarly Communication]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=327</guid>
		<description><![CDATA[[Series] The I2 IR scenario subgroup approached the issue of institutional identifiers in repositories by first brainstorming about the various issues, problems, and sticking points that make identifiers in this space (and elsewhere) such a complex topic. Folks on the subgroup are repository managers or are otherwise involved with or knowledgeable about the repository space, [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:327"><!-- &nbsp; --></abbr>
<p>[<a href="http://lackoftalent.org/michael/blog/category/niso-i2/">Series</a>]</p>
<p>The I2 IR scenario subgroup approached the issue of institutional identifiers in repositories by first brainstorming about the various issues, problems, and sticking points that make identifiers in this space (and elsewhere) such a complex topic.  Folks on the subgroup are repository managers or are otherwise involved with or knowledgeable about the repository space, so the brainstorming exercise yielded a good number of concerns.  </p>
<p>The purpose of the exercise was to enumerate concerns and issues that could inform a draft survey to be administered to repository managers and experts around the globe in different organizational contexts: libraries, subject disciplines, archives, historical societies, etc.  The purpose of the survey is to get an idea of the use cases and constraints around institutional identifiers in these different repository contexts, the assumption being that we ought to have requirements grounded in real world usage before we go off building a standard.</p>
<p>I will note here that the subgroup has worked up a draft survey that has just recently been reviewed by a small group of folks who know about survey design, and we hope to administer the survey to the aforementioned <em>Reporati</em> this week[<a href="http://lackoftalent.org/michael/blog/2009/06/07/i2-requirements/#footnote_0_327" id="identifier_0_327" class="footnote-link footnote-identifier-link" title="We will also x-post to repo-related mailing lists as well, and some of us may blog or tweet about it.  My inclination is to cast as wide a net as possible so as not to miss important use cases.  We can always scope things out later on, but it&amp;#8217;s useful to be inclusive at this point lest our own assumptions carry the group forward.">1</a>].  Which is to say that I don&#039;t yet have a strong grasp of the use cases out there in the wild, and this series should be construed as my own premature cognitive fumblings.  But let&#039;s assume for now that what we learn from the survey results matches our initial brainstorming exercise.  </p>
<p>Here is a slightly modified and boiled down version of the concerns and issues the subgroup came up with for a potential institutional identifier standard, which resembles a set of minimum requirements:</p>
<ol>
<li>Should be agnostic to type of institution, e.g., libraries, museums, personal collections, historical societies</li>
<li>Should handle varying institutional granularity, e.g., institution-level, campus-level, division-level, unit-level</li>
<li>Should handle linking among institutions and subordinate units</li>
<li>Should express different sorts of relationships among these institutions and units</li>
<li>Should relate to existing relevant identifiers and registries</li>
<li>Should be globally unique</li>
<li>Should be actionable</li>
<li>Should enable retrieval of metadata sufficient to identify the institution, which may vary widely by institution</li>
<li>Should accommodate changes as institutions come and go and re-organize and be able to relate defunct institutions to new ones</li>
</ol>
<p>I doubt the list is exhaustive; I am almost certain we will uncover all sorts of tangly and esoteric use cases that add requirements.  I expect it.  Why else would we be gathering to discuss the need for an institutional identifier if it were a solved problem or a simple one? [<a href="http://lackoftalent.org/michael/blog/2009/06/07/i2-requirements/#footnote_1_327" id="identifier_1_327" class="footnote-link footnote-identifier-link" title="The cynical among you might have interesting answers to this question.">2</a>]</p>
<p>Nevertheless, looking at the above list, the task we&#039;ve taken on starts to feel less onerous.  And thinking about identifier systems constrained by the list of concerns, the mind starts to cook up all sorts of possible solutions.  I&#039;ll share one in the next post in this series, a strawman proposal of sorts, and how it addresses each of these requirements.</p>
<h5>Notes</h5><ol class="footnotes"><li id="footnote_0_327" class="footnote">We will also x-post to repo-related mailing lists as well, and some of us may blog or tweet about it.  My inclination is to cast as wide a net as possible so as not to miss important use cases.  We can always scope things out later on, but it&#039;s useful to be inclusive at this point lest our own assumptions carry the group forward.</li><li id="footnote_1_327" class="footnote">The cynical among you might have interesting answers to this question.</li></ol><br/>
<hr/>]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/06/07/i2-requirements/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>I2: Background</title>
		<link>http://lackoftalent.org/michael/blog/2009/05/19/i2-background/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/05/19/i2-background/#comments</comments>
		<pubDate>Tue, 19 May 2009 19:16:19 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[NISO I2]]></category>
		<category><![CDATA[Persistent Identifiers]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Scholarly Communication]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=312</guid>
		<description><![CDATA[[Series] This is the first in a series of posts about institutional identifiers[1]. In my last post, I alluded to some documentation that I&#039;ve written. That was somewhat misleading, which will soon be apparent, but I liked the parallel construction I had going, and I am but a slave to orderliness. For about the past [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:312"><!-- &nbsp; --></abbr>
<p>[<a href="http://lackoftalent.org/michael/blog/category/niso-i2/">Series</a>]</p>
<p>This is the first in a series of posts about institutional identifiers[<a href="http://lackoftalent.org/michael/blog/2009/05/19/i2-background/#footnote_0_312" id="identifier_0_312" class="footnote-link footnote-identifier-link" title="I offer that very tentatively, knowing what a spectacular failure my last attempt at a series was.">1</a>].  </p>
<p>In <a href="http://lackoftalent.org/michael/blog/2009/05/18/state-of-the-me/">my last post</a>, I alluded to some documentation that I&#039;ve written.  That was somewhat misleading, which will soon be apparent, but I liked the parallel construction I had going, and I am but a slave to orderliness.</p>
<p>For about the past six months, I have been working with a <a href="http://www.niso.org/workrooms/i2">NISO group</a> looking into how institutions are identified within information systems:</p>
<blockquote><p>
The I2 (Institutional Identifiers &#8212; pronounced &#034;I 2&#034;) working group will build on work from the Journal Supply Chain Efficiency Improvement Pilot (http://www.journalsupplychain.com/), which demonstrated the improved efficiencies of using an institutional identifier in the journal supply chain. The NISO working group will develop a standard for an institutional identifier that can be implemented in all library and publishing environments. The standard will include definition of the metadata required to be collected with the identifier and what uses can be made of that metadata. &#8230;
</p></blockquote>
<p>The I2 group is split into a few subgroups which have been charged with looking into how institutional identifiers are used in particular scenarios.  These scenarios are e-resources, repositories and e-learning systems, and library resource workflows.  The scenario names pain me a bit, but so be it; this is our industry, and there are bigger windmills to tilt at.</p>
<p>I am currently co-chairing the subgroup looking at repositories and e-learning, and apparently I am its &#034;tech lead.&#034;  I don&#039;t want to get caught up on names and roles and titles, though; this series isn&#039;t about those at all.  I&#039;m just setting the scene and explaining why my head&#039;s in this space and laying bare my stake in the issue.</p>
<p>The remainder of this series will provide a bit more detail on the issues around institutional identifiers, share how the repository subgroup is grappling with identifier issues and engaging the repository community to assess needs, propose an approach for an identifier system that may meet said needs, and explore what seems to be the thorniest issue[<a href="http://lackoftalent.org/michael/blog/2009/05/19/i2-background/#footnote_1_312" id="identifier_1_312" class="footnote-link footnote-identifier-link" title="Hint: management.  I know, &amp;#8220;duh,&amp;#8221; right?">2</a>].</p>
<h5>Notes</h5><ol class="footnotes"><li id="footnote_0_312" class="footnote">I offer that very tentatively, knowing what a <a href="/michael/blog/category/development/ruby/">spectacular failure</a> my last attempt at a series was.</li><li id="footnote_1_312" class="footnote">Hint: management.  I know, &#034;duh,&#034; right?</li></ol><br/>
<hr/>]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/05/19/i2-background/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Plugin updates</title>
		<link>http://lackoftalent.org/michael/blog/2008/11/16/plugin-updates/</link>
		<comments>http://lackoftalent.org/michael/blog/2008/11/16/plugin-updates/#comments</comments>
		<pubDate>Sun, 16 Nov 2008 21:40:09 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[OAI-ORE]]></category>
		<category><![CDATA[Persistent Identifiers]]></category>
		<category><![CDATA[PHP]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[unAPI]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=190</guid>
		<description><![CDATA[I finally pushed out some embarrassingly outdated WordPress plugin updates a few moments ago. Updated unAPI plugin with a patch contributed by Jay Luker that removes the hard-coded &#034;wp_&#034; table prefix. The updated version of the plugin has been tagged as 1.4.1. Updated LinkPURL plugin with a patch contributed by Mark Matienzo that enables partial [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:190"><!-- &nbsp; --></abbr>
<p>I finally pushed out some embarrassingly outdated WordPress plugin updates a few moments ago.</p>
<ul>
<li>Updated unAPI plugin with a patch contributed by <a href="http://www.linkedin.com/in/jayluker" target="_blank">Jay Luker</a> that removes the hard-coded &#034;wp_&#034; table prefix.  The updated version of the plugin has been tagged as 1.4.1.</li>
<li>Updated LinkPURL plugin with a patch contributed by <a href="http://matienzo.org/" target="_blank">Mark Matienzo</a> that enables <a href="http://purl.oclc.org/docs/inet96.html#partial" target="_blank">partial redirects</a>.  I made some additional tweaks to the plugin to make this feature configurable via the WordPress management UI.  This has been tagged as 1.1.</li>
<li>Created a new unAPI plugin branch for Mark Matienzo&#039;s <a href="http://about.scriblio.net/" target="_blank">Scriblio</a>-oriented tweaks.  The branch is called 1.4.1-anarchivist-scriblio and it contains the scriblio.diff file.  I have yet to integrate the diffs, as the file that was patched has changed since the patch was issued.  If anyone is interested in working on unAPI/Scriblio integration, please get in touch with me.</li>
</ul>
<p>And here is my to-do list which I hope will keep me honest.</p>
<ul>
<li>Update OAI-ORE plugin to support version 1.0 of the ORE specification.</li>
<li>Add per-post (and per-page?) resource maps that wrap all embedded images and links.</li>
<li>Enable &#034;cool URIs&#034; for all resource maps.</li>
</ul>
<p>It is my hope that I&#039;ll get to those sometime before the summer begins.  :)</p>
]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2008/11/16/plugin-updates/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Use cases for Handle identifiers?</title>
		<link>http://lackoftalent.org/michael/blog/2007/10/05/use-cases-for-handle-identifiers/</link>
		<comments>http://lackoftalent.org/michael/blog/2007/10/05/use-cases-for-handle-identifiers/#comments</comments>
		<pubDate>Fri, 05 Oct 2007 04:13:40 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Persistent Identifiers]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/2007/10/05/use-cases-for-handle-identifiers/</guid>
		<description><![CDATA[Reading Adam Smith&#039;s D-Lib article has got me thinking about identifiers again. I don&#039;t agree with some of the assertions in the section titled &#034;A Persistent Identifier Primer&#034; &#8212; URIs are in fact persistent; we just break them through poor management &#8212; and so I&#039;m led to a fundamental question: what are the good use [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:107"><!-- &nbsp; --></abbr>
<p>Reading Adam Smith&#039;s D-Lib <a href="http://dlib.org/dlib/september07/smith/09smith.html" target="_blank">article</a> has got me thinking about identifiers again.  I don&#039;t agree with some of the assertions in the section titled &#034;A Persistent Identifier Primer&#034; &#8212; URIs <em>are</em> in fact persistent; we just break them through poor management &#8212; and so I&#039;m led to a fundamental question: what are the good use cases for Handle (or ARK, or PURL) identifiers?  </p>
<p>I get the need for persistent and globally unique identifiers; I&#039;m just wondering why one needs special software with a separate URI namespace to gain persistence.</p>
<p>One potential use case might be resources that are outside of the organization&#039;s control &#8212; i.e., licensed content from vendors &#8212; but surely folks are using Handles for many resources that are created and managed <em>within the organization</em>.  And I&#039;m curious why they have decided that Handles are more durable than native URIs (the URIs to which Handles redirect), and how they deal with the problem of downstream (post-redirection) citation and bookmarking.  How useful is this sort of identifier scheme if your users never even see the supposedly more persistent URI for a resource?</p>
<p>As a former proponent of Handles and ARKs, this may seem like a hypocritical question to pose.  If I had to answer my own question, I would say that Handles seem like a good option because they save you some work and headaches in the short-term; you don&#039;t need to get together with your web team and come up with a scalable and sustainable URI policy; just assign native URIs in the usual haphazard way and generate Handles to compensate for a lack of identifier policies.</p>
<p>But if you&#039;re already making an organizational commitment to identifier persistence &#8212; and if you&#039;re rolling out Handles, I&#039;d wager that&#039;s likely &#8212; why not do so by minting carefully-considered <a target="_blank" href="http://www.w3.org/Provider/Style/URI">cool URIs</a>?  Less management and technology overhead and less confusion for your users are two good reasons to consider it.</p>
]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2007/10/05/use-cases-for-handle-identifiers/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Digital preservation for archivists</title>
		<link>http://lackoftalent.org/michael/blog/2007/06/12/digital-preservation-for-archivists/</link>
		<comments>http://lackoftalent.org/michael/blog/2007/06/12/digital-preservation-for-archivists/#comments</comments>
		<pubDate>Tue, 12 Jun 2007 06:09:17 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[Fedora]]></category>
		<category><![CDATA[Libraries]]></category>
		<category><![CDATA[Persistent Identifiers]]></category>
		<category><![CDATA[Preservation]]></category>
		<category><![CDATA[Repositories]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/2007/06/12/digital-preservation-for-archivists/</guid>
		<description><![CDATA[At long last, the paper that Ron Jantz and I wrote for the Journal of Archival Organization has been published in a special double issue. It&#039;s titled &#034;Digital Archiving and Preservation: Technologies and Processes for a Trusted Repository&#034; and is intended to be a fairly nitty-gritty piece on digital preservation (in the context of trusted [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:95"><!-- &nbsp; --></abbr>
<p>At long last, the <a target="_blank" href="http://www.haworthpress.com/store/Toc_views.asp?sid=B9KRK3DL5LA88K9NC1ENHM48GKDQ0AJ0&#038;TOCName=J201v04n01_TOC&#038;desc=Volume%3A%204%20Issue%3A%201%2F2">paper</a> that Ron Jantz and I wrote for the Journal of Archival Organization has been published in a special double issue.  It&#039;s titled &#034;Digital Archiving and Preservation: Technologies and Processes for a Trusted Repository&#034; and is intended to be a fairly nitty-gritty piece on digital preservation (in the context of trusted repositories) <strong>for archivists</strong>.  The abstract:<br />
<blockquote>This article examines what is implied by the term &#034;trusted&#034; in the phrase &#034;trusted digital repositories.&#034; Digital repositories should be able to preserve electronic materials for periods at least comparable to existing preservation methods. Our collective lack of experience with preserving digital objects and consensus about the reliability of our technological infrastructure raises questions about how we should proceed with digital-based preservation practices, an emerging role for academic libraries and archival institutions. This article reviews issues relating to building a trusted digital repository, highlighting some of the issues raised and possible solutions proposed by the authors in their work of implementing and acculturating a digital repository at Rutgers University Libraries.</p></blockquote>
<p>This special double-issue of JAO will also be released in the <a href="http://www.haworthpress.com/store/product.asp?sid=0MEHAR64F60N9J3KA44B3X8836FHDSK6&#038;sku=6008&#038;detail=Contents" target="_blank">manuscript</a>, &#034;Archives and the Digital Library.&#034;</p>
<p>Thanks to editors Bill Landis, Robin Chandler, Tom Frusciano, and Caryn Radick for seeing this through.  And of course to Ron Jantz for getting me interested in this crazy stuff at a time when I had no direction or interest in my career.</p>
]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2007/06/12/digital-preservation-for-archivists/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Identifier Persistence: Fundamentals</title>
		<link>http://lackoftalent.org/michael/blog/2007/06/05/identifier-persistence-fundamentals/</link>
		<comments>http://lackoftalent.org/michael/blog/2007/06/05/identifier-persistence-fundamentals/#comments</comments>
		<pubDate>Tue, 05 Jun 2007 20:34:33 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[Persistent Identifiers]]></category>
		<category><![CDATA[Preservation]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/2007/06/05/identifier-persistence-fundamentals/</guid>
		<description><![CDATA[A friend and former colleague asked if I would comment on a chapter in her upcoming book on digital rights management and I agreed. The chapter is about identification and authenticity of web resources. Throughout my review of the chapter, I kept coming back to a couple of very basic notions that underlie any effort [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:92"><!-- &nbsp; --></abbr>
<p>A friend and former colleague asked if I would comment on a chapter in her upcoming book on digital rights management and I agreed.  The chapter is about identification and authenticity of web resources.  Throughout my review of the chapter, I kept coming back to a couple of very basic notions that underlie any effort to provide persistent identifiers for web resources.  These notions are, to my mind, central to identifier persistence, and any other concerns rely upon this foundation:</p>
<ol>
<li>Identifier persistence requires an organizational commitment.  Persistence cannot be ensured by a few renegades in the skunk-works, nor can it be mandated from on high without the support of those who manage the identifiers or produce web resources.  <strong>All individuals involved in the life-cycle of web resources must be committed to persistence in perpetuity if true persistence of identifiers is to be achieved.</strong></li>
<li>No technology, no standard, no identifier scheme, no information architecture will get you persistence.  Whether you choose native URIs, Handles, DOIs, PURLs, ARKs, UUIDs, or XRIs, <strong>you will never achieve identifier persistence without active management of your identifiers and web resources</strong>.  This requires the aforementioned organizational commitment since such management cannot occur without sufficient resources.  Management of web resources and identifiers requires time and due diligence and those don&#039;t come for free.</li>
</ol>
<p>And, at the risk of being reductive, that&#039;s about it.  Once you&#039;ve got an organizational commitment and a person or team to manage your identifiers and web resources, the rest of the decisions are secondary.  If you like semantically meaningful URLs that redirect, choose Handles; if you prefer opaque identifiers, go with ARKs; if you don&#039;t want to run your own software, consider PURLs.  At that point, it <strong>really doesn&#039;t matter</strong> which scheme you choose, as long as its characteristics match your organization&#039;s values.  You&#039;ve already done the heavy lifting; rest easy.</p>
]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2007/06/05/identifier-persistence-fundamentals/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

