<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>τεχνοσοφια &#187; Digital Libraries and Archives</title>
	<atom:link href="http://lackoftalent.org/michael/blog/category/libraries/digital-libraries/feed/" rel="self" type="application/rss+xml" />
	<link>http://lackoftalent.org/michael/blog</link>
	<description>The occasional rambling of a digital library artisan</description>
	<lastBuildDate>Sun, 24 Jan 2010 18:30:30 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Exploring curation micro-services</title>
		<link>http://lackoftalent.org/michael/blog/2009/09/27/exploring-curation-micro-services/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/09/27/exploring-curation-micro-services/#comments</comments>
		<pubDate>Sun, 27 Sep 2009 04:00:29 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[Management]]></category>
		<category><![CDATA[Preservation]]></category>
		<category><![CDATA[Repositories]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=504</guid>
		<description><![CDATA[
As far as I&#039;m concerned, the most exciting developments this year in  repositories and digital curation have come out of the California Digital Library.  It has been impossible not to notice their papers and presentations.  Put simply, their idea is that digital curation is enabled by &#034;micro-services&#034; built upon well-known abstractions such [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:504"><!-- &nbsp; --></abbr>
<p><img src="http://lackoftalent.org/images/micro_repo_thumb.png" alt="thumbnail of micro-repo tree" style="float: left"/>As far as I&#039;m concerned, the most exciting developments this year in  repositories and digital curation have come out of the California Digital Library.  It has been impossible <a href="http://conferences.library.gatech.edu/or/or09/paper/view/95">not</a> <a href="http://uccsc2009.ucdavis.edu/preso/UCCSC-2009-CDL-PODS-v05.ppt">to</a> <a href="http://www.ijdc.net/index.php/ijdc/article/view/98">notice</a> <a href="https://meeting-reg.com/sunpasig/abstracts.php">their</a> <a href="http://www.digitalpreservation.gov/news/events/ndiipp_meetings/ndiipp09/docs/NDIIPP%20Partner%20Meeting%202009_Breakout%20Session%20Schedule.pdf">papers</a> <a href="http://www.ijdc.net/index.php/ijdc/article/view/108/84">and</a> <a href="http://www.cdlib.org/iPres/confsched.html">presentations</a>.  Put simply, their idea is that digital curation is enabled by &#034;micro-services&#034; built upon well-known abstractions such as the filesystem.  The benefits are obvious: filesystem tools are ubiquitous and cross-platform, and there are strong market forces to ensure the filesystem persists.  The idea is radically simple and straightforward, though many questions remain about such a paradigm.  I&#039;ll return to those later. </p>
<p>If you have not yet taken a look at CDL&#039;s curation micro-service specifications, most of which may be printed on as few as one or two sheets of paper, see the <a href="http://www.cdlib.org/inside/diglib/">Digital Library Building Blocks</a>.</p>
<p>My co-workers in the LC Repository Development Center have been chatting about these specs on and off throughout the year.  After months of procrastinating, I finally read all of the specs on Thursday; it&#039;s wonderful that you can do so in the course of one reading session, I might add.  Yesterday a bunch of us RDCers got together to chat (informally) about the specs: what they&#039;re for, how they work, and how they interact with one another.  I learn by doing, by examples, so I combed through each of the specs in advance of our meeting and tried to <a href="http://twitter.com/mjgiarlo/statuses/4371794936">construct</a> a minimal repository[1] based on micro-services.<br />
<span id="more-504"></span><br />
Here is a tree visualization of the final product, inevitable warts and all: <a href="http://lackoftalent.org/images/micro_repo.png"><img src="http://lackoftalent.org/images/micro_repo.png" alt="sample micro-services repo tree" /></a>  The services I used were <a href="http://www.cdlib.org/inside/diglib/namaste/namastespec.html">Namaste</a>, <a href="http://www.cdlib.org/inside/diglib/can/canspec.pdf">Content Access Node (CAN)</a>, <a href="http://www.cdlib.org/inside/diglib/pairtree/pairtreespec.html">Pairtree</a>, <a href="http://www.cdlib.org/inside/diglib/dflat/dflatspec.pdf">Dflat</a>, <a href="http://www.cdlib.org/inside/diglib/redd/reddspec.html">Reverse Directory Deltas (ReDD)</a>, <a href="http://www.cdlib.org/inside/diglib/clop/clopspec.pdf">Class-based System for Managing Object Properties (CLOP)</a>, and <a href="http://www.digitalpreservation.gov/library/resources/tools/docs/bagitspec.pdf">BagIt</a> (co-developed by LC and CDL).</p>
<p>As I mentioned in our Friday meeting, recounting my experience exploring the specs: the bad thing is that I spent an hour building a repository with rudimentary tools such as mkdir, touch, cp, ln, and emacs; but the good thing is that I built a <em>repository</em> in <em>one hour</em> using <em>common, rudimentary tools</em>.  It&#039;s a very compelling paradigm.  <a href="http://inkdroid.org/ehs">Ed</a>&#039;s already built a <a href="http://github.com/edsu/dflat">tool</a> implementing some of Dflat, further demonstrating how lightweight these micro-services are.  (<strong>UPDATE</strong>: Ed notes that this code is a work in progress and is &#034;barely functional.&#034;)  (<strong>UPDATE 2</strong>: The dflat library has come a long way.  Check it out if you&#039;re interested.  Also, I just committed a pretty basic Namaste library: <a href="http://github.com/mjgiarlo/namaste">http://github.com/mjgiarlo/namaste</a>.  Only took about an hour, which is a testament to the power of lightweight specs.)</p>
<p>I am certain this will be a running thread at work as the specifications evolve and our understanding of them grows.  Some questions and comments that occurred to me while exploring the micro-service specs and building the minimal repo:</p>
<ul>
<li>CAN was a bit puzzling.  The spec is simple enough, but I found some of the conventions confusing, and I was left wondering what CAN provides other than a container.  What I would like to see is a simple use case and perhaps more examples.  Thus, the CAN stuff in my sample repo doesn&#039;t feel very useful only because I had a hard time working with the spec.</li>
<li>CLOP feels like the least mature of the specifications.  It seems generally useful to be able to put digital objects, however you define that, into classes and define properties on those classes.  The spec did not clearly convey to me just how it accomplishes that aim.  A few examples would go a very long way.  I&#039;ve got some CLOP stuff in the sample repo but I have no idea how close my implementation matches the spec.</li>
<li>Is Dflat dependent on ReDD?  One would assume not since there&#039;s an optional property in the dflat-info.txt file for specifying a delta scheme.  But, say, could you stub out the v001 directory (reserved to hold the initial version of a digital object) and use a system such as <a href="http://git-scm.com/">git</a> or <a href="http://bazaar-vcs.org/">bazaar</a>?  <br/><br/>One might argue that these established delta schemes, if you want to call them that, have many more developers and users than a system such as ReDD and thus should persist longer and have more tools built around them.  I imagine the micro-service viewpoint would acknowledge that point, but counter that the spirit of these specs is to avoid dependencies from outside the filesystem?</li>
<li>Is the ReDD specification meaningful outside of a Dflat given that any one ReDD directory knows nothing of its successors and predecessors, or is it dependent upon Dflat?</li>
<li>Could a BagIt bag live inside of the ReDD reserved &#034;full&#034; directory?  That is, could the &#034;full&#034; directory be marked up appropriately to <em>be</em> a BagIt bag?</li>
<li>How many tools exist for these specs?  I notice there&#039;s code in CPAN for Pairtree and Namaste, which is a fabulous start.  Tools are the difference between YAMF (Yet Another Messy Filesystem) and reliably managed curation services.  Granted, tools such as cp and emacs already exist and are part of the appeal of these micro-services, but there&#039;s also tremendous room for error if operations are all done &#034;by hand.&#034;</li>
<li>To what extent has CDL transitioned to using these specs/tools?</li>
<li>Are other institutions using these specs/tools?  I have heard tell that digital library folks from the University of Michigan and the University of North Texas may be involved.</li>
</ul>
<p>I hope I don&#039;t sound overly critical.  I&#039;m really glad our colleagues at the California Digital Library have written these specifications and applied their deep experience to what could be a transformative paradigm[2] in the digital curation world.  Kudos to them!</p>
<h5>Notes</h5><ol class="footnotes"><li id="footnote_0_504" class="footnote">Perhaps it&#039;s more in line with the specs to refer to this space as &#034;a managed filesystem that drives repository and curation services,&#034; given the CDL philosophy that preservation is not a place/repository.  But it&#039;s easier to say &#034;repository,&#034; so there you go.</li><li id="footnote_1_504" class="footnote">Please excuse the fanboyishness; this filesystem fetishism is exciting stuff!</li></ol><br/>
<hr/>]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/09/27/exploring-curation-micro-services/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>I2: Survey results</title>
		<link>http://lackoftalent.org/michael/blog/2009/09/15/i2-survey-results/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/09/15/i2-survey-results/#comments</comments>
		<pubDate>Tue, 15 Sep 2009 12:29:55 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[NISO I2]]></category>
		<category><![CDATA[Persistent Identifiers]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Scholarly Communication]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=497</guid>
		<description><![CDATA[
I wrote in June that the I2 subgroup surveyed &#034;repository managers to determine the current practices and needs of the repository community regarding institutional identifiers. Results from the survey will inform a set of use cases that will be shared with the community, and that are expected to drive the development of a new standard [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:497"><!-- &nbsp; --></abbr>
<p>I <a href="http://lackoftalent.org/michael/blog/2009/06/20/i2-survey/">wrote</a> in June that the I2 subgroup surveyed &#034;repository managers to determine the current practices and needs of the repository community regarding institutional identifiers. Results from the survey will inform a set of use cases that will be shared with the community, and that are expected to drive the development of a new standard for institutional identifiers.&#034;</p>
<p>The survey closed in July, and the subgroup spent August writing a report on the survey results.  That report is now <a href="http://www.niso.org/apps/group_public/document.php?document_id=2773">final</a> and it&#039;s available to the public.  Feedback may be sent to our (woefully underutilized) public <a href="http://www.niso.org/lists/i2info/">i2info</a> mailing list, left as a comment on this post, or e-mailed to me privately which I can forward to our internal list.</p>
<p>The next step is to build upon the report to draw yet more conclusions from the data &#8212; there&#039;s an awful lot there &#8212; and flesh out some repository use cases for institutional identifiers.  The I2 core group is moving quickly towards finalizing identifier metadata elements so that a standard may be drafted, and I think having some use cases documented will help drive the standard in a direction the community can get behind.</p>
<p>Onward and upward.</p>
]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/09/15/i2-survey-results/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Linking World Digital Library Data</title>
		<link>http://lackoftalent.org/michael/blog/2009/08/10/linking-world-digital-library-data/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/08/10/linking-world-digital-library-data/#comments</comments>
		<pubDate>Mon, 10 Aug 2009 22:44:29 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[APIs]]></category>
		<category><![CDATA[Cataloging and Metadata]]></category>
		<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[OAI-ORE]]></category>
		<category><![CDATA[World Digital Library]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=457</guid>
		<description><![CDATA[
As I mentioned earlier, I&#039;ve been learning about linked data in the context of dropping it into the World Digital Library project.  I am hopeful we&#039;ll be able to deploy the RDF views[1] before too long.  In advance of that, I thought it might be helpful to share a sample of what our [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:457"><!-- &nbsp; --></abbr>
<p>As I <a href="/michael/blog/2009/07/31/validating-ore-from-the-command-line/">mentioned earlier</a>, I&#039;ve been learning about linked data in the context of dropping it into the <a href="http://www.wdl.org">World Digital Library</a> project.  I am hopeful we&#039;ll be able to deploy the RDF views[1] before too long.  In advance of that, I thought it might be helpful to share a sample of what our RDF would look like.  The RDF below represents the WDL item for the U.S. Constitution.  I appreciate constructive criticism.</p>
<p>A few things to note:</p>
<ul>
<li>Mmm, Unicode.</li>
<li>Item types are from the <a href="http://bibliontology.com/">Bibliographic Ontology</a>.</li>
<li>Most of the properties are from the <a href="http://dublincore.org/documents/dces/">Dublin Core Metadata Element Set</a> ontology, especially used where literals are objects rather than resources identified by URI. </li>
<li>Where possible I dug up or found URIs and used the <a href="http://dublincore.org/documents/dcmi-terms/">Dublin Core Metadata Terms</a> ontology.</li>
<li>An item is modeled as an aggregation of its constituent files, as defined in <a href="http://www.openarchives.org/ore/">OAI-ORE</a>.  The notion here is that an ORE aggregation of an item, as expressed in a resource map which is discoverable via a link header in each item detail page, is a &#034;whole&#034; item, including all of its files[2], metadata, and translations.</li>
<li>I&#039;m also making light use of the <a href="http://www.semanticdesktop.org/ontologies/nfo/">NEPOMUK File Ontology</a> to express that constituent files are files, and to be explicit about file sizes so that folks know in advance of retrieving it how large files are.</li>
<li>Links out to <a href="http://purl.org/NET/decimalised#">DDC</a> (Decimalised Database of Concepts), <a href="http://www.lingvoj.org/">Lingvoj</a>, <a href="http://dbpedia.org/">DBpedia</a>, and <a href="http://id.loc.gov/authorities/">Library of Congress Authorities &amp; Vocabularies</a> (e.g., LC Subject Headings) are included where possible. [3] I&#039;d be especially stoked to hear of other vocabs I might link to.  The more linked the data, the better.</li>
<li>The output below is Turtle for readability, but the application will offer up RDF/XML.</li>
</ul>
<p>The data after the jump:<br />
<span id="more-457"></span></p>

<div class="wp_syntax"><div class="code"><pre class="ttl" style="font-family:monospace;">@prefix rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt; .
@prefix dc: &lt;http://purl.org/dc/elements/1.1/&gt; .
@prefix dcterms: &lt;http://purl.org/dc/terms/&gt; .
@prefix nfo: &lt;http://www.semanticdesktop.org/ontologies/nfo#&gt; .
@prefix ore: &lt;http://www.openarchives.org/ore/terms/&gt; .
@prefix rdfs: &lt;http://www.w3.org/2000/01/rdf-schema#&gt; .
&nbsp;
&lt;http://localhost/static/c/2708/service/00303_2003_001_pr.jpg&gt;
    dc:format &quot;image/jpeg&quot; ;
    nfo:fileSize &quot;259485&quot;^^&lt;http://www.w3.org/2001/XMLSchema#long&gt; ;
    a nfo:FileDataObject .
&nbsp;
&lt;http://localhost/static/c/2708/service/00303_2003_003_pr.jpg&gt;
    dc:format &quot;image/jpeg&quot; ;
    nfo:fileSize &quot;267031&quot;^^&lt;http://www.w3.org/2001/XMLSchema#long&gt; ;
    a nfo:FileDataObject .
&nbsp;
&lt;http://localhost/static/c/2708/reference/00303_2003_004_pr_thumb_item.gif&gt;
    dc:format &quot;image/gif&quot; ;
    nfo:fileSize &quot;56620&quot;^^&lt;http://www.w3.org/2001/XMLSchema#long&gt; ;
    a nfo:FileDataObject .
&nbsp;
&lt;http://localhost/static/c/2708/service/00303_2003_004_pr.jpg&gt;
    dc:format &quot;image/jpeg&quot; ;
    nfo:fileSize &quot;233875&quot;^^&lt;http://www.w3.org/2001/XMLSchema#long&gt; ;
    a nfo:FileDataObject .
&nbsp;
&lt;http://localhost/static/c/2708/service/00303_2003_002_pr.jpg&gt;
    dc:format &quot;image/jpeg&quot; ;
    nfo:fileSize &quot;245809&quot;^^&lt;http://www.w3.org/2001/XMLSchema#long&gt; ;
    a nfo:FileDataObject .
&nbsp;
&lt;http://localhost/item/2708/about.rdf&gt;
    dcterms:created &quot;2009-08-10T18:11:25-04:00&quot;^^dcterms:W3CDTF ;
    dcterms:creator &lt;http://dbpedia.org/resource/World_Digital_Library&gt; ;
    dcterms:modified &quot;2009-08-10T18:11:25-04:00&quot;^^dcterms:W3CDTF ;
    ore:describes &lt;http://localhost/item/2708/about.rdf#item&gt; ;
    a ore:ResourceMap .
&nbsp;
&lt;http://localhost/item/2708/about.rdf#item&gt;
    dc:created &quot;17 Septembre 1787&quot;@fr, &quot;17 de septiembre de 1787&quot;@es, &quot;17 de setembro de 1787&quot;@pt, &quot;17 сентября 1787 г.&quot;@ru, &quot;1787年9月17日&quot;@zh, &quot;September 17, 1787&quot;@en, &quot;&quot;&quot;١٧ ايلول ١٧٨٧
&quot;&quot;&quot;@ar ;
    dc:creator &quot;Constitutional Convention, United States&quot;@en, &quot;Convención Constituyente, Estados Unidos&quot;@es, &quot;Convention constitutionnelle, États-Unis&quot;@fr, &quot;Convenção Constitucional, Estados Unidos&quot;@pt, &quot;Конституционная Конвенция, Соединенные Штаты&quot;@ru, &quot;الاتفاقية الدستورية، الولايات المتحدة&quot;@ar, &quot;制宪会议，美国&quot;@zh ;
    dc:extent &quot;Manuscript (4 pages of parchment)&quot;@en, &quot;Manuscrit (4 pages de parchemin)&quot;@fr, &quot;Manuscrito (4 páginas de pergamino)&quot;@es, &quot;Manuscrito (4 páginas em pergaminho)&quot;@pt, &quot;Рукопись (4 пергаментных страницы)&quot;@ru, &quot;مخطوطة (٤ صفحات من الورق النفيس)&quot;@ar, &quot;手草本（4 页羊皮纸）&quot;@zh ;
    dc:language &quot;Anglais&quot;@fr, &quot;English&quot;@en, &quot;Inglés&quot;@es, &quot;Inglês&quot;@pt, &quot;Английский язык&quot;@ru, &quot;الإنجليزية&quot;@ar, &quot;英语&quot;@zh ;
    dc:publisher &quot;Administração de Registros e Arquivos Nacionais&quot;@pt, &quot;Archives Nationales et Administration des documents (NARA) des États-Unis d'Amérique &quot;@fr, &quot;Los Archivos Nacionales y Administración de Documentos (NARA) de los Estados Unidos de América&quot;@es, &quot;National Archives and Records Administration&quot;@en, &quot;Управление национальных архивов и документов&quot;@ru, &quot;الإدارة الأمريكية للوثائق والسجلات الوطنية&quot;@ar, &quot;美国国家文件与档案管理局&quot;@zh ;
    dc:subject &quot;Constituciones&quot;@es, &quot;Constituições&quot;@pt, &quot;Constitutional &amp; administrative law&quot;@en, &quot;Constitutions&quot;@en, &quot;Constitutions&quot;@fr, &quot;Derecho constitucional y administrativo&quot;@es, &quot;Direito constitucional e administrativo&quot;@pt, &quot;Droit constitutionnel et administratif&quot;@fr, &quot;Politics and government&quot;@en, &quot;Politique et gouvernement&quot;@fr, &quot;Política e governo&quot;@pt, &quot;Política y gobierno&quot;@es, &quot;Конституции&quot;@ru, &quot;Конституционное и административное право&quot;@ru, &quot;Политика и правительство&quot;@ru, &quot;الدساتير&quot;@ar, &quot;السياسة والحكومة&quot;@ar, &quot;القانون الدستوري والإداري.&quot;@ar, &quot;宪法&quot;@zh, &quot;宪法 &amp; 行政法&quot;@zh, &quot;政治和政府&quot;@zh ;
    dc:title &quot;Constitución de los Estados Unidos&quot;@es, &quot;Constituição dos Estados Unidos&quot;@pt, &quot;Constitution des États-Unis&quot;@fr, &quot;Constitution of the United States&quot;@en, &quot;Конституция Соединенных Штатов&quot;@ru, &quot;دستور الولايات المتحدة&quot;@ar, &quot;美国宪法&quot;@zh ;
    dcterms:DDC &quot;342&quot; ;
    dcterms:LCSH &lt;http://id.loc.gov/authorities/label/Constitutions&gt; ;
    dcterms:alternative &quot;Constitution of the United States&quot;@en ;
    dcterms:dateSubmitted &quot;2009-05-07T06:45:21-04:00&quot;^^dcterms:W3CDTF ;
    dcterms:description &quot;1787 年 5 月 14 日，制宪会议在费城的议会大楼（独立厅）召开，目的是修订《邦联条例》。 由于开始时只有两个州的代表团出席，成员不得不一天天地休会，直到 5 月 25 日与会人数达到法定的七个州。 通过讨论和争辩，6 月中旬时明确显示大会与其修改现有的《联邦条例》不如为政府重新起草一份全新的框架。 整个夏季，代表们都在非公开会议中辩论、起草、重新起草新宪法的条款。 主要的争论问题包括要赋予中央政府多大权利、允许各州在国会中有多少个代表席位以及这些代表应该如何选举产生——由人民直接选举还是由各州立法人员选举产生。 这部宪法是很多人智慧的结晶，是合作政治运作和妥协艺术的典范。&quot;@zh, &quot;A Convenção Federal reuniu-se na Casa de Estado (Hall da Independência), em Filadélfia, em 14 de maio de 1787 para revisar os Artigos da Confederação. Em virtude de estarem presentes, inicialmente, as delegações de apenas dois estados, os membros suspenderam os trabalhos, dia após dia, até que fosse atingido o quórum de sete estados em 25 de maio. Através de discussões e debates ficou claro, em meados de junho que, em vez de alterar os atuais artigos da Confederação, a convenção deveria elaborar uma estrutura inteiramente nova para o governo. Ao longo de todo o verão, os delegados debateram, elaboraram e reelaboraram os artigos da nova Constituição em sessões fechadas. Entre os principais pontos em questão estavam o grau de poder permitido ao governo central, o número de representantes no Congresso para cada Estado, e como estes representantes deveriam ser eleitos - diretamente pelo povo ou pelos legisladores do estado. A Constituição foi o trabalho de muitas mentes e permanece como um modelo de cooperação entre lideranças políticas e da arte da condescendência.&quot;@pt, &quot;La Convención Federal se reunió en la Cámara del Estado (Salón de la Independencia) en Filadelfia el 14 de mayo de 1787, para revisar los artículos de la Confederación. Debido a que las delegaciones de sólo dos estados estuvieron presentes inicialmente, los miembros levantaron sesión de un día para el siguiente hasta que se obtuvo un quórum de siete estados el 25 de mayo. A través de la discusión y el debate se hizo evidente a mediados de junio que, en lugar de modificar los actuales artículos de la Confederación, la convención prepararía un marco totalmente nuevo para el gobierno. Durante todo el verano, los delegados debatieron, prepararon y redactaron nuevamente los artículos de la nueva Constitución en sesiones a puerta cerrada. Entre los principales puntos en cuestión estuvieron cuánto poder otorgar al gobierno central, el número de representantes en el Congreso que se iban a permitir a cada Estado y la forma en que estos representantes debían ser elegidos, directamente por el pueblo o por los legisladores estatales. La Constitución fue el resultado del trabajo de muchas mentes y se erige como modelo de cooperación política y del arte del compromiso.&quot;@es, &quot;La Convention Fédérale s'assembla dans la Chambre Législative (Independence Hall) à Philadelphie le 14 mai 1787, pour réviser les articles de la Confédération. En raison de la seule présence initiale des délégations de deux États, les membres ajournèrent d'un jour à l'autre jusqu'à ce que le quorum de sept États soit obtenu le 25 mai. Â travers les discussions et les débats, il devint clair dès la mi-juin que, plutôt que de modifier les articles existants de la Confédération, la convention allait plutôt ébaucher un cadre entièrement nouveau pour le gouvernement. Tout au long de l'été, les délégués débattirent, élaborèrent, et remanièrent les articles de la nouvelle Constitution, à huis clos. Les principaux points litigieux portaient sur la puissance à accorder au gouvernement central, sur le nombre de représentants au Congrès pour chaque État, et sur le mode d'élection de ces représentants - directement par le peuple ou par les législateurs de l'état. La Constitution fut l'œuvre de nombreux esprits et reste un modèle de coopération politique et de l'art du compromis.&quot;@fr, &quot;The Federal Convention convened in the State House (Independence Hall) in Philadelphia on May 14, 1787, to revise the Articles of Confederation. Because the delegations from only two states were present initially, the members adjourned from one day to the next until a quorum of seven states was obtained on May 25. Through discussion and debate it became clear by mid-June that, rather than amend the existing Articles of Confederation, the convention would draft an entirely new framework for the government. All through the summer, the delegates debated, drafted, and redrafted the articles of the new Constitution in closed sessions. Among the chief points at issue were how much power to allow the central government, how many representatives in Congress to allow each state, and how these representatives should be elected--directly by the people or by the state legislators. The Constitution was the work of many minds and stands as a model of cooperative statesmanship and the art of compromise.&quot;@en, &quot;Федеральное собрание собралось на заседание в Доме правительства (зал Независимости) 14 мая 1787 года для пересмотра Статей Конфедерации. Поскольку вначале на заседании присутствовали представители только двух штатов, Собрание было распущено на несколько дней до тех пор, пока 25 мая не был обеспечен кворум из представителей семи штатов. В ходе дискуссий и дебатов к середине июня стало понятно, что собрание было намерено скорее составить новый вариант структуры правительства, нежели чем пересматривать существующие Статьи Конфедерации. В течение всего лета делегаты обсуждали, составляли черновые варианты статей новой Конституции и тут же их пересматривали в ходе закрытых заседаний. Среди основных обсуждавшихся вопросов были вопросы степени власти и полномочий, которыми должно быть наделено центральное правительство, количества представителей в Конгрессе от каждого штата, а также процедуры переизбрания этих представителей — непосредственно жителями штатов или законодательными собраниями штатов. Конституция была плодом работы многих политиков и является ярким примером сотрудничества государственных деятелей и искусства компромисса.&quot;@ru, &quot;اجتمع ممثلو الاتحاد الفدرالي في قصر الدولة (قاعة الاستقلال) في فيلادلفيا يوم ١٤  أيار ١٧٨٧ لتعديل النظام الأساسي للاتحاد. وحيث حضر وفدان اثنان فقط من وفود الولايات في البداية، رفع الأعضاء الحضور الجلسة من يوم إلى آخر حتى اكتمل النصاب القانوني بحضور وفود سبع ولايات في ٢٥ أيار. وقد اتضح خلال المناقشات والحوار بحلول منتصف حزيران أنه بدلا من تعديل مواد الاتحاد الكونفدرالي القائمة، كان على المؤتمرين صياغة إطار جديد تماما بالنسبة للحكومة. وطوال ذلك الصيف، ناقش المندوبون وصاغوا ثم أعادوا صياغة مواد الدستور الجديد في جلسات مغلقة. ومن بين النقاط الرئيسية التي دار حولها الجدل مدى صلاحيات الحكومة المركزية وعدد الممثلين في الكونغرس لكل ولاية ، وكيفية انتخاب هؤلاء ممثلين -- بالانتخاب المباشر من الشعب أو من قبل مشرّعي الولايات. لقد كان الدستور من عمل عقول كثيرة وهو يمثل نموذجا لفن الحكم التعاوني حنكة التوصل إلى الحلول الوسط.&quot;@ar ;
    dcterms:identifier &quot;http://localhost/item/2708/about.rdf#item&quot; ;
    dcterms:language &lt;http://www.lingvoj.org/lang/en&gt; ;
    dcterms:publisher &lt;http://dbpedia.org/resource/National_Archives_and_Records_Administration&gt; ;
    dcterms:spatial &lt;http://dbpedia.org/resource/North_America&gt;, &lt;http://dbpedia.org/resource/United_States_of_America&gt;, &quot;América del Norte&quot;@es, &quot;América do Norte&quot;@pt, &quot;Amérique du Nord&quot;@fr, &quot;Estados Unidos da América&quot;@pt, &quot;Estados Unidos de América&quot;@es, &quot;North America&quot;@en, &quot;United States of America&quot;@en, &quot;États-Unis d'Amérique&quot;@fr, &quot;Северная Америка&quot;@ru, &quot;Соединенные Штаты Америки&quot;@ru, &quot;أمريكا الشمالية&quot;@ar, &quot;الولايات المتحدة الأمريكية&quot;@ar, &quot;北美&quot;@zh, &quot;美国&quot;@zh ;
    dcterms:subject &lt;http://dbpedia.org/resource/Constitutions&gt; ;
    dcterms:temporal &quot;1700 AD - 1799 AD&quot;@en, &quot;1700 ap. J.-C. - 1799 ap. J.-C.&quot;@fr, &quot;1700 d.C. - 1799 d.C.&quot;@es, &quot;1700 d.C. - 1799 d.C.&quot;@pt, &quot;1700 н.э. - 1799 н.э.&quot;@ru, &quot;1700 公元 - 1799 公元&quot;@zh, &quot;١٧٠٠ م - ١٧٩٩ م&quot;@ar ;
    dcterms:title &lt;http://dbpedia.org/resource/Constitution_of_the_United_States&gt; ;
    ore:aggregates &lt;http://localhost/static/c/2708/reference/00303_2003_004_pr_thumb_item.gif&gt;, &lt;http://localhost/static/c/2708/service/00303_2003_001_pr.jpg&gt;, &lt;http://localhost/static/c/2708/service/00303_2003_002_pr.jpg&gt;, &lt;http://localhost/static/c/2708/service/00303_2003_003_pr.jpg&gt;, &lt;http://localhost/static/c/2708/service/00303_2003_004_pr.jpg&gt; ;
    ore:isDescribedBy &lt;http://localhost/item/2708/about.rdf&gt; ;
    a &lt;http://purl.org/ontology/bibo/Manuscript&gt; ;
    rdfs:seeAlso &lt;http://hdl.loc.gov/loc.wdl/dna.2708&gt; .</pre></div></div>

<h5>Notes</h5><ol class="footnotes"><li id="footnote_0_457" class="footnote">Sadly, the URIs are uglyish due to some constraints from our caching configuration.  I figure we can redirect uglyish URIs to cool ones and make use of owl:sameAs if those constraints go away.</li><li id="footnote_1_457" class="footnote"><em>sans</em> certain low-quality derivatives such as small thumbnails and tiles for the zoom interface</li><li id="footnote_2_457" class="footnote">I was poking through the DBpedia output for <a href="http://www.geonames.org/">Geonames</a> URIs as well, but my method was way too slow and clunky, so that&#039;s disabled for the time being.  Clients can always follow their noses from the DBpedia output.</li></ol><br/>
<hr/>]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/08/10/linking-world-digital-library-data/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>Validating ORE from the Command-line</title>
		<link>http://lackoftalent.org/michael/blog/2009/07/31/validating-ore-from-the-command-line/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/07/31/validating-ore-from-the-command-line/#comments</comments>
		<pubDate>Fri, 31 Jul 2009 18:52:54 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[OAI-ORE]]></category>
		<category><![CDATA[Python]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[World Digital Library]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=440</guid>
		<description><![CDATA[
I&#039;ve been periodically poking at getting Linked Data/RDF views hooked into the World Digital Library web application, following Ed Summers&#039; lead from his work on Chronicling America.  The RDF views also use the OAI-ORE vocabulary to express aggregations &#8212; in WDL, an item is an aggregation of its constituent files.  The goal is [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:440"><!-- &nbsp; --></abbr>
<p>I&#039;ve been periodically poking at getting <a href="http://linkeddata.org/">Linked Data</a>/RDF views hooked into the <a href="http://www.wdl.org/">World Digital Library</a> web application, following <a href="http://inkdroid.org/journal/">Ed Summers</a>&#039; <a href="http://lists.w3.org/Archives/Public/public-lod/2009May/0301.html">lead</a> from his work on <a href="http://chroniclingamerica.loc.gov/">Chronicling America</a>.  The RDF views also use the <a href="http://www.openarchives.org/ore/">OAI-ORE</a> vocabulary to express aggregations &#8212; in WDL, an item is an aggregation of its constituent files.  The goal is to provide a semantically rich and holistic representation of a WDL item (identifier, constituent files, metadata, translations, and so on). </p>
<p>The ORE format is a new one for me so it&#039;s hard to say whether the output of my dev branch is valid ORE or not.  Plus I&#039;m a sucker for validators.  Turns out <a href="http://www.csc.liv.ac.uk/~azaroth/">Rob Sanderson</a> has developed a <a href="http://code.google.com/p/foresite-toolkit/">Python library for validating ORE</a>, and this little snippet is what I&#039;ve been using to validate the ORE.  I didn&#039;t put much effort into making it readable, so much as banging something functional out so I can meet deadlines, so mea culpa and all that.  But without further hemming and hawing, the code:</p>

<div class="wp_syntax"><div class="code"><pre class="python" style="font-family:monospace;"><span style="color: #808080; font-style: italic;"># validate.py</span>
<span style="color: #ff7700;font-weight:bold;">import</span> <span style="color: #dc143c;">sys</span>
<span style="color: #ff7700;font-weight:bold;">from</span> foresite <span style="color: #ff7700;font-weight:bold;">import</span> <span style="color: #66cc66;">*</span>
&nbsp;
rem = RdfLibParser<span style="color: black;">&#40;</span><span style="color: black;">&#41;</span>.<span style="color: black;">parse</span><span style="color: black;">&#40;</span>ReMDocument<span style="color: black;">&#40;</span><span style="color: #dc143c;">sys</span>.<span style="color: black;">argv</span><span style="color: black;">&#91;</span><span style="color: #ff4500;">1</span><span style="color: black;">&#93;</span><span style="color: black;">&#41;</span><span style="color: black;">&#41;</span>
aggr = rem.<span style="color: black;">aggregation</span>
n3 = RdfLibSerializer<span style="color: black;">&#40;</span><span style="color: #483d8b;">'n3'</span><span style="color: black;">&#41;</span>
rem2 = aggr.<span style="color: black;">register_serialization</span><span style="color: black;">&#40;</span>n3<span style="color: black;">&#41;</span>
<span style="color: #ff7700;font-weight:bold;">print</span> rem2.<span style="color: black;">get_serialization</span><span style="color: black;">&#40;</span>n3<span style="color: black;">&#41;</span>.<span style="color: black;">data</span></pre></div></div>

<p>Most of this code is naively copied and pasted from Rob&#039;s excellent <a href="http://code.google.com/p/foresite-toolkit/wiki/PythonLibrary">Foresite documentation</a>.</p>
<p>I invoke it thusly: <code>python validate.py {URL}</code></p>
<p>And the output:</p>

<div class="wp_syntax"><div class="code"><pre class="n3" style="font-family:monospace;">@prefix _27: &lt;http://www.semanticdesktop.org/ontologies/nfo#&gt;.
@prefix _28: &lt;http://localhost/en/item/1/id#&gt;.
@prefix _29: &lt;http://localhost/en/item/1/&gt;.
@prefix bibo: &lt;http://purl.org/ontology/bibo/&gt;.
@prefix dc: &lt;http://purl.org/dc/elements/1.1/&gt;.
@prefix dcterms: &lt;http://purl.org/dc/terms/&gt;.
@prefix ore: &lt;http://www.openarchives.org/ore/terms/&gt;.
@prefix rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;.
@prefix rdfs: &lt;http://www.w3.org/2000/01/rdf-schema#&gt;.
@prefix rdfs1: &lt;http://www.w3.org/2001/01/rdf-schema#&gt;.
&nbsp;
 _28:ResourceMap a ore:ResourceMap;
     dc:format &quot;text/rdf+n3&quot;;
     dcterms:created &quot;2009-07-31T14:23:31Z&quot;;
     dcterms:modified &quot;2009-07-31T14:23:31Z&quot;;
     ore:describes _29:id. 
&nbsp;
 _29:id a bibo:Image,
         ore:Aggregation;
     dcterms:DDC &quot;973&quot;;
     dcterms:alternative &quot;Antietam, Maryland. Allan Pinkerton, President Lincoln, and Major General John A. McClernand&quot;@en;
     dcterms:created &quot;1862年10月3日&quot;@zh,
         &quot;3 de octubre de 1862&quot;@es,
         &quot;3 de outubro de 1862&quot;@pt,
         &quot;3 octobre 1862&quot;@fr,
         &quot;3 октября 1862 года&quot;@ru,
         &quot;October 3, 1862&quot;@en,
         &quot; ٣ آكتوبر، ١٨٦٢&quot;@ar;
     dcterms:creator &quot;Gardner, Alexander&quot;@en,
         &quot;Gardner, Alexander&quot;@es,
         &quot;Gardner, Alexander&quot;@fr,
         &quot;Gardner, Alexander&quot;@pt,
         &quot;Гарднер, Александр&quot;@ru,
         &quot;جاردنر, أليكسندر&quot;@ar,
         &quot;加德纳, 亚历山大&quot;@zh;
... (and so on and so forth)
     dcterms:title &quot;Antietam, Maryland. Allan Pinkerton, President Lincoln, and Major General John A. McClernand: Another View&quot;@en,
         &quot;Antietam, Maryland. Allan Pinkerton, el Presidente Lincoln y el General Principal John A. McClernand: Otra visión&quot;@es,
         &quot;Antietam, Maryland. Allan Pinkerton, le président Lincoln et le général-major John A. McClernand: Autre vue&quot;@fr,
         &quot;Antietam, Maryland. Allan Pinkerton,  Presidente Lincoln e Major-General John A. McClernand: Outra Vista&quot;@pt,
         &quot;Антитэм, штат Мэриленд. Аллан Пинкертон, президент Линкольн и генерал-майор Джон А. Макклернанд: Другой снимок&quot;@ru,
         &quot;أنتينام، ميريلاند ألان بينكرتون، الرئيس لينكولن، واللواء جون أ. ماكليرناند: منظر آخر&quot;@ar,
         &quot;安蒂特姆，马里兰州 艾伦·平克顿、林肯总统和少将约翰·A ·马克克拉南: 另一个视角&quot;@zh;
     ore:aggregates &lt;http://localhost/static/c/1/reference/04326u_thumb_item.gif&gt;,
         &lt;http://localhost/static/c/1/service/04326u.tif&gt;;
     ore:isDescribedBy &lt;http://localhost/en/item/1/item.rdf&gt;;
     rdfs:seeAlso &lt;http://hdl.loc.gov/loc.wdl/dlc.1&gt;. 
&nbsp;
 &lt;http://localhost/static/c/1/reference/04326u_thumb_item.gif&gt; a _27:FileDataObject;
     dcterms:format &quot;image/gif&quot;;
     _27:fileSize &quot;34531&quot;^^&lt;http://www.w3.org/2001/XMLSchema#long&gt;. 
&nbsp;
 &lt;http://localhost/static/c/1/service/04326u.tif&gt; a _27:FileDataObject;
     dcterms:format &quot;image/tiff&quot;;
     _27:fileSize &quot;1301614&quot;^^&lt;http://www.w3.org/2001/XMLSchema#long&gt;. 
&nbsp;
 ore:Aggregation rdfs1:isDefinedBy &lt;http://www.openarchives.org/ore/terms/&gt;;
     rdfs1:label &quot;Aggregation&quot;. 
&nbsp;
 ore:ResourceMap rdfs1:isDefinedBy &lt;http://www.openarchives.org/ore/terms/&gt;;
     rdfs1:label &quot;ResourceMap&quot;.</pre></div></div>

<p>You might pick up on some warts I have yet to fix, but there you go.</p>
]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/07/31/validating-ore-from-the-command-line/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>A Digital Object Defined</title>
		<link>http://lackoftalent.org/michael/blog/2009/07/15/a-digital-object-defined/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/07/15/a-digital-object-defined/#comments</comments>
		<pubDate>Thu, 16 Jul 2009 00:31:52 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Digital Libraries and Archives]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=435</guid>
		<description><![CDATA[
What happens to a digital object defined?[1]

Does its identifier dry up
like a raisin in the sun?
Or its relationships fester like a sore&#8211;
And then run?
Do its bits rot like meat?
Or become overwritten&#8211;
like some throw-away sheet?

Maybe its metadata just sags
like a heavy load.

Or does it fade into code?
NotesInspired by Langston Hughes&#039;s &#034;A Dream Deferred&#034; and a spirited [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:435"><!-- &nbsp; --></abbr>
<p>What happens to a digital object defined?[1]<br/><br />
<br/><br />
Does its identifier dry up<br/><br />
like a raisin in the sun?<br/><br />
Or its relationships fester like a sore&#8211;<br/><br />
And then run?<br/><br />
Do its bits rot like meat?<br/><br />
Or become overwritten&#8211;<br/><br />
like some throw-away sheet?<br/><br />
<br/><br />
Maybe its metadata just sags<br/><br />
like a heavy load.<br/><br />
<br/><br />
Or does it fade into code?</p>
<h5>Notes</h5><ol class="footnotes"><li id="footnote_0_435" class="footnote">Inspired by Langston Hughes&#039;s &#034;<a href="http://www.cswnet.com/~menamc/langston.htm">A Dream Deferred</a>&#034; and a spirited <a href="http://twitter.com/dchud/status/2656720839">conversation</a> in the office today.</li></ol><br/>
<hr/>]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/07/15/a-digital-object-defined/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>I2: Survey</title>
		<link>http://lackoftalent.org/michael/blog/2009/06/20/i2-survey/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/06/20/i2-survey/#comments</comments>
		<pubDate>Sat, 20 Jun 2009 18:30:53 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[NISO I2]]></category>
		<category><![CDATA[Persistent Identifiers]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Scholarly Communication]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=363</guid>
		<description><![CDATA[
[Series]
Near the end of my strawman post, I wrote:
The I2 repositories subgroup will be sending out its survey on identifier use cases in the coming week.  It will be interesting to see if the requirements we have thus far identified still obtain in light of the data we collect from the survey. 
We completed [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:363"><!-- &nbsp; --></abbr>
<p>[<a href="http://lackoftalent.org/michael/blog/category/niso-i2/">Series</a>]</p>
<p>Near the end of my <a href="http://lackoftalent.org/michael/blog/2009/06/13/i2-strawman/">strawman post</a>, I wrote:<br />
<blockquote>The I2 repositories subgroup will be sending out its survey on identifier use cases in the coming week.  It will be interesting to see if the requirements we have thus far identified still obtain in light of the data we collect from the survey. </p></blockquote>
<p>We completed the survey late last week and began distributing it.  Here&#039;s what we sent out:<br />
<blockquote>
<p>The NISO I2 Working Group is surveying repository managers to determine the current practices and needs of the repository community regarding institutional identifiers.  We value your time and your input in the process to create a standard for a new institutional identifier.  We hope that you will complete the survey which should take less than 15 minutes.  The survey will remain open through Monday, July 6th.</p>
<p>
Here is a link to the survey:<br />
<a href="http://www.surveymonkey.com/s.aspx?sm=RGQgZ3090DVrb3kFzr3P3Q_3d_3d">http://www.surveymonkey.com/s.aspx?sm=RGQgZ3090DVrb3kFzr3P3Q_3d_3d</a></p>
<p>
Please feel free to share this message with other interested parties.</p>
</blockquote>
<p>First we used <a href="http://www.surveymonkey.com/">Survey Monkey</a> to send the survey link to approximately one-hundred repository managers that the subgroup identified.   Our process for identifying repository managers involved pulling together a list of prominent repositories from subgroup members, and then gathering more from <a href="http://www.opendoar.org/">OpenDOAR</a>, &#034;an authoritative directory of academic open access repositories.&#034;  Then subgroup members were encouraged to share the survey link with colleagues, and post it far and wide via blogs, listservs, and <a href="http://twitter.com/mjgiarlo/status/2230486784">tweets</a>.  The listservs we targeted were: <a href="http://www.jiscmail.ac.uk/archives/jisc-repositories.html">JISC-REPOSITORIES</a>, <a href="http://metadatalibrarians.monarchos.com/">metadataLibrarians</a>, <a href="http://groups.google.com/group/digital-curation">digital-curation</a>, <a href="https://arl.org/Lists/SPARC-IR/">SPARC-IR</a>, <a href="http://mailman.anu.edu.au/mailman/listinfo/ir-net">ir-net</a>, <a href="http://www.lsoft.com/SCRIPTS/WL.EXE?SL1=REPOMAN-L&#038;H=LISTSERV.INDIANA.EDU">REPOMAN-L</a>, <a href="http://larch.palinet.org/archives/palinet-ir-l.html">PALINET-IR-L</a>, <a href="http://mailman.mit.edu/mailman/listinfo/dspace-general">dspace-general</a>, <a href="https://lists.sourceforge.net/lists/listinfo/fedora-commons-users">fedora-commons-users</a>, <a href="http://dublincore.org/groups/identifiers/">DC-IDENTIFIERS</a>, and <a href="http://dewey.library.nd.edu/mailing-lists/code4lib/">code4lib</a>.</p>
<p>I&#039;ve already received a few responses and have gotten useful feedback.  Two of the hardest questions to answer so far have been: &#034;What is an institutional identifier?&#034; and &#034;What is a repository?&#034;</p>
<dl>
<dt><strong>Institutional identifier</strong></dt>
<dd>
<p>An institutional identifier is defined as a symbol or code that uniquely identifies an institution.  Domain-specific examples of existing identifiers include SAN, IPEDS, GLN, MARC Org Code, and ISIL.  Another example might be a Handle prefix or ARK name authority assigning number.</p>
</dd>
<dt><strong>Repository</strong></dt>
<dd>
<p>Institutional repositories and subject repositories like arxiv.org are clearly &#039;repositories&#039;, but beyond that it is a somewhat ill-defined term.  One might look to the <a href="http://www.cnri.reston.va.us/k-w.html">Kahn-Wilensky architecture</a>, or the <a href="http://public.ccsds.org/publications/archive/650x0b1.pdf">OAIS reference model (PDF)</a>, or even <a href="http://en.wikipedia.org/wiki/Digital_repository">Wikipedia</a> for definitions, but it&#039;s not clear that even the authorities agree on what constitutes a repository.</p>
<p>It&#039;s a system.  It&#039;s network-accessible and typically has a web interface of some sort.  Files and groups of files sometimes known as objects tend to be deposited in them, perhaps for some combination of management, access, or preservation.  Many run Fedora, DSpace, and ePrints, and factor heavily in scholarly communication.  Some are document-centric.  Some will accept anything.  To some, a learning management system may be a repo.  To others, a content management system may fit.</p>
<p>My background is in academia so my own definition is somewhat based in that context, but I wouldn&#039;t say the term is necessarily limited to that context.  There are other NISO I2 scenarios for library workflows and electronic resources, so it&#039;s safe to assume that repository does not mean ILS or OPAC or ERP system.  My hope is that folks have their own working definitions of the term and can decide for themselves what it means.</p>
</dd>
</dl>
<p>We&#039;ve given folks a little over two weeks to respond to the survey, so the constant I2 drum-beating will quiet down for a while around here.  I am very interested in what sorts of responses we get from the survey.  Fun times!</p>
<p>Oh, and perhaps it goes without saying, but if you&#039;re a repository owner, manager, expert, developer, or stakeholder with an interest in identifiers, please feel free to take the <a href="http://www.surveymonkey.com/s.aspx?sm=RGQgZ3090DVrb3kFzr3P3Q_3d_3d">survey</a>!</p>
]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/06/20/i2-survey/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>I2: Strawman</title>
		<link>http://lackoftalent.org/michael/blog/2009/06/13/i2-strawman/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/06/13/i2-strawman/#comments</comments>
		<pubDate>Sat, 13 Jun 2009 23:02:36 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[NISO I2]]></category>
		<category><![CDATA[Persistent Identifiers]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Scholarly Communication]]></category>
		<category><![CDATA[Hypertext Transfer Protocol]]></category>
		<category><![CDATA[Metadata]]></category>
		<category><![CDATA[Tim Berners-Lee]]></category>
		<category><![CDATA[Uniform Resource Identifier]]></category>
		<category><![CDATA[World Wide Web]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=346</guid>
		<description><![CDATA[

[Series]
In the prior I2 post, I wrote about the requirements the repositories subgroup has come up with for an institutional identifier standard (with the hope that our findings re: repositories could be generalized to other scenarios).




Image by PhOtOnQuAnTiQuE via Flickr



My strawman proposal of sorts is to explore how well linked data patterns fit this problem [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:346"><!-- &nbsp; --></abbr>
<p style="text-align: left;">
<p style="text-align: left;">[<a href="http://lackoftalent.org/michael/blog/category/niso-i2/">Series</a>]</p>
<p style="text-align: left;">In the prior I2 post, I wrote about the requirements the repositories subgroup has come up with for an institutional identifier standard (with the hope that our findings re: repositories could be generalized to other scenarios).</p>
<div class="zemanta-img zemanta-action-dragged" style="margin: 1em; display: block; text-align: left;">
<div>
<dl class="wp-caption alignright" style="width: 250px;">
<dt class="wp-caption-dt"><a href="http://www.flickr.com/photos/67968452@N00/3272712288"><img title="PhotonQ-Tim Berners Lee on Linked Data at TED" src="http://farm4.static.flickr.com/3449/3272712288_2ef843a4b7_m.jpg" alt="PhotonQ-Tim Berners Lee on Linked Data at TED" width="240" height="180" /></a></dt>
<dd class="wp-caption-dd zemanta-img-attribution" style="font-size: 0.8em;">Image by <a href="http://www.flickr.com/photos/67968452@N00/3272712288">PhOtOnQuAnTiQuE</a> via Flickr</dd>
</dl>
</div>
</div>
<p style="text-align: left;">My strawman proposal of sorts is to explore how well <a href="http://linkeddata.org/">linked data patterns</a> fit this problem space.  Linked data, briefly, is a way to expose and link data on the web in a more semantically meaningful way, and is often summarized using the four principles put forward by <a href="http://en.wikipedia.org/wiki/Tim_Berners-Lee">Tim Berners-Lee</a>:</p>
<blockquote style="text-align: left;">
<ol>
<li>Use URIs as names for things</li>
<li>Use HTTP URIs so that people can look up those names.</li>
<li>When someone looks up a URI, provide useful information.</li>
<li>Include links to other URIs. so that they can discover more things.</li>
</ol>
</blockquote>
<p style="text-align: left;">That&#039;s the crux of it.  Linked data takes well-known patterns on the web (linking, dereferencing, etc.) and applies them to data, which in this case could be metadata for identifying institutions.</p>
<p style="text-align: left;">Let&#039;s examine each of the requirements and the applicability of linked data thereto.</p>
<ol style="text-align: left;">
<li><em>Should be agnostic to type of institution, e.g., libraries, museums, personal collections, historical societies</em>: The web is already agnostic to type of institution.  HTTP URIs do not favor one type of institution over another.<br/></li>
<li><em>Should handle varying institutional granularity, e.g., institution-level, campus-level, division-level, unit-level</em>: HTTP <a href="http://en.wikipedia.org/wiki/Uniform_Resource_Identifier">URIs</a> are flexible in this regard.  Hierarchy, should one wish it to be surfaced in the identifier, may be encoded in either a DNS hostname or the path appended to the DNS name.  One can imagine a URI like &#034;http://department.division.institution.tld/unit/subunit&#034; or &#034;http://institution.tld/campus/office/individual&#034;. <br/><br/>Hierarchy needn&#039;t be surfaced in the identifier if one favors opacity, in which case &#034;http://registry.tld/xnjsdasd&#034; would suffice as an identifier, and may instead be entirely reflected in the (RDF) representation returned by dereferencing the URI.<br/></li>
<li><em>Should handle linking among institutions and subordinate units</em>: Linked data handles linking via well-known HTTP mechanisms, referenced in the fourth principle of linked data.  Unlike the HTTP link, which has limited semantics, linked data links are semantically rich and extensible.<br/></li>
<li><em>Should express different sorts of relationships among these institutions and units</em>: The &#034;useful information&#034; in the third principle of linked data is typically provided by an RDF representation, which is itself a list of assertions.  These assertions, or triples, consist of subjects, predicates, and objects.  The ability to express the relationships in this requirement is limited only by the availability of vocabularies that contain sets of predicates and classes for subjects and objects.  Think of the predicates as elements defined within a metadata standard, e.g., <a href="http://dublincore.org/documents/dces/">Dublin Core</a> &#034;creator&#034;, <a href="http://www.loc.gov/standards/mods/">MODS</a> &#034;relatedItem&#034;, and so forth.  Vocabularies that contain these predicates and classes are growing and evolving daily, and should there not be a vocabulary that contains the relationship one wishes to express, it is fairly easy to create a custom vocabulary. <br/><br/>The ability to mix and match vocabularies provides an expressiveness that is often not found in document-based metadata formats and the flexibility to express radically different relationships on a per-industry or per-institution basis.  This latter point is important as the I2 group has identified both core metadata elements for identifying institutions of different types and additional elements for specific types of institutions.  Why re-invent a new metadata format or schema when all one needs to express may already be contained in others?<br/></li>
<li><em>Should relate to existing relevant identifiers and registries</em>: Same as requirement#4.  Linked data is all about expressing relationships between things, e.g., institutions, identifiers, registries, etc.<br/></li>
<li><em>Should be globally unique</em>: HTTP URIs are guaranteed to be globally unique by virtue of the distributed <a href="http://en.wikipedia.org/wiki/Domain_name_system">DNS</a> system and hierarchical naming within each HTTP service.<br/></li>
<li><em>Should be actionable</em>: HTTP URIs provide dereferenceability/actionability via the well-known HTTP protocol.<br/></li>
<li><em>Should enable retrieval of metadata sufficient to identify the institution, which may vary widely by institution</em>: HTTP URIs are actionable per requirement #7 and the metadata returned is flexible per requirement #4.<br/></li>
<li><em>Should accommodate changes as institutions come and go and re-organize and be able to relate defunct institutions to new ones</em>: Linked data patterns provide for redirecting from defunct representations (institutional identifiers) to new ones via HTTP redirects.  One may also add assertions to institutional metadata such as owl:sameAs, for instance, which says that the institution identified by the given URI is the same as another institution identified by another URI.<br/></li>
</ol>
<p>This seems like a compelling path to follow for the I2 standard.</p>
<p>The I2 repositories subgroup will be sending out its survey on identifier use cases in the coming week.  It will be interesting to see if the requirements we have thus far identified still obtain in light of the data we collect from the survey.  If so, I would like to explore the idea of linked data for institutional identifiers a bit more.</p>
]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/06/13/i2-strawman/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>I2: Requirements</title>
		<link>http://lackoftalent.org/michael/blog/2009/06/07/i2-requirements/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/06/07/i2-requirements/#comments</comments>
		<pubDate>Sun, 07 Jun 2009 21:14:56 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[NISO I2]]></category>
		<category><![CDATA[Persistent Identifiers]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Scholarly Communication]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=327</guid>
		<description><![CDATA[
[Series]
The I2 IR scenario subgroup approached the issue of institutional identifiers in repositories by first brainstorming about the various issues, problems, and sticking points that make identifiers in this space (and elsewhere) such a complex topic.  Folks on the subgroup are repository managers or are otherwise involved with or knowledgeable about the repository space, [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:327"><!-- &nbsp; --></abbr>
<p>[<a href="http://lackoftalent.org/michael/blog/category/niso-i2/">Series</a>]</p>
<p>The I2 IR scenario subgroup approached the issue of institutional identifiers in repositories by first brainstorming about the various issues, problems, and sticking points that make identifiers in this space (and elsewhere) such a complex topic.  Folks on the subgroup are repository managers or are otherwise involved with or knowledgeable about the repository space, so the brainstorming exercise yielded a good number of concerns.  </p>
<p>The purpose of the exercise was to enumerate concerns and issues that could inform a draft survey to be administered to repository managers and experts around the globe in different organizational contexts: libraries, subject disciplines, archives, historical societies, etc.  The purpose of the survey is to get an idea of the use cases and constraints around institutional identifiers in these different repository contexts, the assumption being that we ought to have requirements grounded in real world usage before we go off building a standard.</p>
<p>I will note here that the subgroup has worked up a draft survey that has just recently been reviewed by a small group of folks who know about survey design, and we hope to administer the survey to the aforementioned <em>Reporati</em> this week[1].  Which is to say that I don&#039;t yet have a strong grasp of the use cases out there in the wild, and this series should be construed as my own premature cognitive fumblings.  But let&#039;s assume for now that what we learn from the survey results matches our initial brainstorming exercise.  </p>
<p>Here is a slightly modified and boiled down version of the concerns and issues the subgroup came up with for a potential institutional identifier standard, which resembles a set of minimum requirements:</p>
<ol>
<li>Should be agnostic to type of institution, e.g., libraries, museums, personal collections, historical societies</li>
<li>Should handle varying institutional granularity, e.g., institution-level, campus-level, division-level, unit-level</li>
<li>Should handle linking among institutions and subordinate units</li>
<li>Should express different sorts of relationships among these institutions and units</li>
<li>Should relate to existing relevant identifiers and registries</li>
<li>Should be globally unique</li>
<li>Should be actionable</li>
<li>Should enable retrieval of metadata sufficient to identify the institution, which may vary widely by institution</li>
<li>Should accommodate changes as institutions come and go and re-organize and be able to relate defunct institutions to new ones</li>
</ol>
<p>I doubt the list is exhaustive; I am almost certain we will uncover all sorts of tangly and esoteric use cases that add requirements.  I expect it.  Why else would we be gathering to discuss the need for an institutional identifier if it were a solved problem or a simple one? [2]</p>
<p>Nevertheless, looking at the above list, the task we&#039;ve taken on starts to feel less onerous.  And thinking about identifier systems constrained by the list of concerns, the mind starts to cook up all sorts of possible solutions.  I&#039;ll share one in the next post in this series, a strawman proposal of sorts, and how it addresses each of these requirements.</p>
<h5>Notes</h5><ol class="footnotes"><li id="footnote_0_327" class="footnote">We will also x-post to repo-related mailing lists as well, and some of us may blog or tweet about it.  My inclination is to cast as wide a net as possible so as not to miss important use cases.  We can always scope things out later on, but it&#039;s useful to be inclusive at this point lest our own assumptions carry the group forward.</li><li id="footnote_1_327" class="footnote">The cynical among you might have interesting answers to this question.</li></ol><br/>
<hr/>]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/06/07/i2-requirements/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>I2: Background</title>
		<link>http://lackoftalent.org/michael/blog/2009/05/19/i2-background/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/05/19/i2-background/#comments</comments>
		<pubDate>Tue, 19 May 2009 19:16:19 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[NISO I2]]></category>
		<category><![CDATA[Persistent Identifiers]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Scholarly Communication]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=312</guid>
		<description><![CDATA[
[Series]
This is the first in a series of posts about institutional identifiers[1].  
In my last post, I alluded to some documentation that I&#039;ve written.  That was somewhat misleading, which will soon be apparent, but I liked the parallel construction I had going, and I am but a slave to orderliness.
For about the past [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:312"><!-- &nbsp; --></abbr>
<p>[<a href="http://lackoftalent.org/michael/blog/category/niso-i2/">Series</a>]</p>
<p>This is the first in a series of posts about institutional identifiers[1].  </p>
<p>In <a href="http://lackoftalent.org/michael/blog/2009/05/18/state-of-the-me/">my last post</a>, I alluded to some documentation that I&#039;ve written.  That was somewhat misleading, which will soon be apparent, but I liked the parallel construction I had going, and I am but a slave to orderliness.</p>
<p>For about the past six months, I have been working with a <a href="http://www.niso.org/workrooms/i2">NISO group</a> looking into how institutions are identified within information systems:</p>
<blockquote><p>
The I2 (Institutional Identifiers &#8212; pronounced &#034;I 2&#034;) working group will build on work from the Journal Supply Chain Efficiency Improvement Pilot (http://www.journalsupplychain.com/), which demonstrated the improved efficiencies of using an institutional identifier in the journal supply chain. The NISO working group will develop a standard for an institutional identifier that can be implemented in all library and publishing environments. The standard will include definition of the metadata required to be collected with the identifier and what uses can be made of that metadata. &#8230;
</p></blockquote>
<p>The I2 group is split into a few subgroups which have been charged with looking into how institutional identifiers are used in particular scenarios.  These scenarios are e-resources, repositories and e-learning systems, and library resource workflows.  The scenario names pain me a bit, but so be it; this is our industry, and there are bigger windmills to tilt at.</p>
<p>I am currently co-chairing the subgroup looking at repositories and e-learning, and apparently I am its &#034;tech lead.&#034;  I don&#039;t want to get caught up on names and roles and titles, though; this series isn&#039;t about those at all.  I&#039;m just setting the scene and explaining why my head&#039;s in this space and laying bare my stake in the issue.</p>
<p>The remainder of this series will provide a bit more detail on the issues around institutional identifiers, share how the repository subgroup is grappling with identifier issues and engaging the repository community to assess needs, propose an approach for an identifier system that may meet said needs, and explore what seems to be the thorniest issue[2].</p>
<h5>Notes</h5><ol class="footnotes"><li id="footnote_0_312" class="footnote">I offer that very tentatively, knowing what a <a href="/michael/blog/category/development/ruby/">spectacular failure</a> my last attempt at a series was.</li><li id="footnote_1_312" class="footnote">Hint: management.  I know, &#034;duh,&#034; right?</li></ol><br/>
<hr/>]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/05/19/i2-background/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Cataloging and institutional repositories</title>
		<link>http://lackoftalent.org/michael/blog/2009/02/09/cataloging-and-institutional-repositories/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/02/09/cataloging-and-institutional-repositories/#comments</comments>
		<pubDate>Mon, 09 Feb 2009 14:30:33 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Cataloging and Metadata]]></category>
		<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[Libraries]]></category>
		<category><![CDATA[Management]]></category>
		<category><![CDATA[Repositories]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=288</guid>
		<description><![CDATA[
While doing some reading for a little talk my colleague, Ed Summers, and I are giving at code4lib 2009, I came across a paragraph that sparked a crazy thought.  So crazy that it&#039;s not crazy at all.  So not crazy that I am sure other people have thought of it.  But nonetheless, [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:288"><!-- &nbsp; --></abbr>
<p>While doing some reading for a little <a href="http://code4lib.org/conference/2009/schedule#hcal10">talk</a> my colleague, <a href="http://inkdroid.org/ehs">Ed Summers</a>, and I are giving at <a href="http://code4lib.org/conference/2009">code4lib 2009</a>, I came across a paragraph that sparked a crazy thought.  So crazy that it&#039;s not crazy at all.  So not crazy that I am sure other people have thought of it.  But nonetheless, here I am writing about it just in case.</p>
<p>From Sarah Currier&#039;s <a href="http://www.elearning.ac.uk/features/sword">paper</a> on <a href="http://www.swordapp.org/">SWORD</a> (emphasis mine):<br />
<blockquote>One of the most frequently cited barriers to academics depositing their teaching materials into repositories is the keystroke-count involved in logging into a repository, uploading the resource, creating metadata, perhaps selecting a licence, and publishing the resource. It was a quick win, therefore, to create a drag-and-drop desktop tool to allow a single keystroke deposit of resources, including multiple resources in one action. For a repository that supports <b>automatic metadata generation</b>, administrative metadata can be created at the point of entry to the repository without the user needing to create any.</p></blockquote>
<p>And I wondered how many repositories supported automatic metadata generation.  I wondered how many repositories supported automatic generation of <em>rich</em> metadata.  And lastly I wondered, might this be a more or less natural role for catalogers: augmenting stub metadata records or doing original cataloging for institutional repository deposits?  Especially at a time when many of them are being reclassified as acquisitions specialists or digital projects managers?</p>
<p>Potential issues and questions:
<ul>
<li>Author ignorance: Maybe catalogers are already doing this and I&#039;m a moron?</li>
<li>Scale: Is it realistic to expect to be able to &#034;keep up&#034; with repository deposits?</li>
<li>Granularity: Does cataloging at the level of articles, and perhaps at even finer granularities, introduce challenges?</li>
<li>Duplication: If pre-prints are cataloged in the IR, for instance, will they need to be cataloged again later?</li>
<li>&#8230; there are others I thought of on my commute this morning but have since forgotten them.  Feel free to add comments.</li>
</ul>
<p>I will admit here that I&#039;ve been somewhat out of the (academic) institutional repository space a while, and cataloging is something I don&#039;t share thoughts about very often because my exposure is limited to having taken one course a couple years ago.  </p>
<p>I assume there&#039;s a body of research about this out there somewhere but I figured I&#039;d post this anyway.</p>
]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/02/09/cataloging-and-institutional-repositories/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
