<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>&#964;&#949;&#967;&#957;&#959;&#963;&#959;&#966;&#953;&#945; &#187; Repositories</title>
	<atom:link href="http://lackoftalent.org/michael/blog/category/libraries/repositories/feed/" rel="self" type="application/rss+xml" />
	<link>http://lackoftalent.org/michael/blog</link>
	<description>The occasional rambling of a digital library artisan</description>
	<lastBuildDate>Mon, 10 Oct 2011 12:33:06 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>I2: Resource Description</title>
		<link>http://lackoftalent.org/michael/blog/2010/05/19/i2-resource-description/</link>
		<comments>http://lackoftalent.org/michael/blog/2010/05/19/i2-resource-description/#comments</comments>
		<pubDate>Wed, 19 May 2010 13:38:18 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Cataloging and Metadata]]></category>
		<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[NISO I2]]></category>
		<category><![CDATA[Persistent Identifiers]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Scholarly Communication]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=568</guid>
		<description><![CDATA[I can hardly believe it&#039;s been eight months since I last wrote about the NISO I2 project. A lot has changed since then[1]. I continue to work on I2 however; they won&#039;t get rid of me that easily. In the last post, I wrote: The next step is to build upon the report to draw [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:568"><!-- &nbsp; --></abbr>
<p>I can hardly believe it&#039;s been eight months since I last wrote about the <a href="http://lackoftalent.org/michael/blog/category/projects/niso-i2/">NISO I2</a> project.  A lot has changed since then[<a href="http://lackoftalent.org/michael/blog/2010/05/19/i2-resource-description/#footnote_0_568" id="identifier_0_568" class="footnote-link footnote-identifier-link" title="I&amp;#8217;ve moved and changed jobs, in fact">1</a>].  I continue to work on I2 however; they won&#039;t get rid of me that easily.</p>
<p>In the <a href="http://lackoftalent.org/michael/blog/2009/09/15/i2-survey-results/">last post</a>, I wrote:<br />
<blockquote>The next step is to build upon the report to draw yet more conclusions from the data â€” there&#039;s an awful lot there â€” and flesh out some repository use cases for institutional identifiers. The I2 core group is moving quickly towards finalizing identifier metadata elements so that a standard may be drafted, and I think having some use cases documented will help drive the standard in a direction the community can get behind.</p></blockquote>
<p> Since that time, the three scenario groups &#8212; Electronic Resources; Institutional Repositories and Learning Management Systems; and Library Resource Management &#8212; have concluded their work.  The work of the scenario groups included surveys of over 300 people working in these fields.  The survey results have been analyzed and reports were posted on the NISO website.  These reports have been used to flesh out use cases for an institutional identifier.  Upon completion of this work, the scenario groups were disbanded and work continued in a broader I2 working group.</p>
<p>The I2 working group has concentrated its work on analysis of similar standards and, as I alluded to earlier, significant effort has gone into defining core metadata to identify institutions, such as institution name, institution type, location information, variant identifiers, domain name(s), URL(s), and (optionally-typed) relationships to other institutions.  During these discussions it was difficult for me to hear the issues and needs around I2&#039;s metadata and identifiers without <a href="http://lackoftalent.org/michael/blog/2009/06/13/i2-strawman/">linked data springing to mind</a>.  </p>
<p>While we are designing a standard and not a system or a service <em>per se</em>, it seems useful to include in the standard an informative section about implementation and architecture[<a href="http://lackoftalent.org/michael/blog/2010/05/19/i2-resource-description/#footnote_1_568" id="identifier_1_568" class="footnote-link footnote-identifier-link" title="This practice seems more or less common in my (admittedly limited) experience, cf. the unAPI specification.">2</a>]; I find that reading standards is much easier on the brain when you get not only the standard itself but some examples of implementation, and that will be true as well, one hopes, of I2 standard implementers.  To that end, the group will be producing an XML schema of the I2 metadata elements and also an RDF schema.</p>
<p>I have been working on the RDF for I2 on and off for the past month or two.  Below are my impressions, as someone who is new to modeling in RDF, and the procedures I used to produce the draft RDF schema.<br />
<span id="more-568"></span><br />
Despite their names, RDF schema and XML schema are quite different[<a href="http://lackoftalent.org/michael/blog/2010/05/19/i2-resource-description/#footnote_2_568" id="identifier_2_568" class="footnote-link footnote-identifier-link" title="This reflection should come as little surprise since RDF and XML are different kinds of things: RDF is a data model and XML is a serialization format.">3</a>].  The XML schema is a tool for validating an XML-based document or record, and it&#039;s a common tool for modeling metadata in libraryland.  Not so with RDF schema, where the notion of document or record is replaced by the notion of a set of triples.  The focus in RDF is on the triple not on the document, and so validation of documents or records is not the point of RDF schema.  This took some effort to wrap my mind around.</p>
<p>Before I modeled I2 in RDF, I sketched out a domain model of I2 by copying relevant bits of information from I2 documents and pasting them into a text editor.  Then I put them into classes.  In I2&#039;s case, the domain model contained three classes of things: metadata elements about an institution, relationships between institutions, and types of institution.</p>
<p>I gathered some examples of relatively simple RDF schemas and transformed them into the <a href="http://www.w3.org/TeamSubmission/turtle/">Turtle</a> serialization format[<a href="http://lackoftalent.org/michael/blog/2010/05/19/i2-resource-description/#footnote_3_568" id="identifier_3_568" class="footnote-link footnote-identifier-link" title="Using rapper, a nifty little tool.">4</a>] for ease of reading, using them as a template for the I2 schema.</p>
<p>In the <a href="http://www.w3.org/TR/rdf-schema/">RDF schema (RDFS)</a> specification, there are two classes of things in the domain model: classes and properties.  If you are familiar with object-oriented programming, chances are you already grok this way of modeling, but otherwise, generally: a class is like a type and a property is an attribute.  If I were to model myself in RDF schema, then, I might say I am in the class of human beings, and one of my many properties is having a particular birth date, and another is having been born in a particular city.  The next step was to take the I2 domain model (metadata elements about an institution, relationships between institutions, and types of institution) and decide whether each thing was a class or a property.  I decided that the former two were sets of properties and that type of institution could be modeled as a set of classes.</p>
<p>Having a conceptual model of I2 and how it fit into the RDF schema way of thinking about things, I wrote a simple ontology defining one RDFS class per type of institution, and one RDFS property per metadata element and one per relationship type.  This would have sufficed as an ontology.</p>
<p>Exposing RDF-based resources on the web as linked data, however, represents an opportunity for metadata element-level interoperability at global scale.  In order to interoperate with the existing corpus of linked data available on the web, I went through the new I2 ontology and looked for areas where I could re-use, or subclass or otherwise link to, classes and properties already defined in more widely-used ontologies.  I realized at this point just how different coming up with a new XML document format was from writing an RDF ontology; whereas I might have wanted the former to be comprehensive and inclusive of every single aspect within the I2 domain model, my goal with the latter became to eliminate it (by trimming it down to only those bits which are not defined elsewhere).</p>
<p>Since the RDF ontology for I2 is not inclusive of the entire domain model, it seemed necessary to produce another reference document: a set of instances of I2 resources showing the mingling of new I2-specific classes and properties with well-defined classes and properties from other ontologies.</p>
<p>I shared rough first drafts of these documents and received very helpful feedback from some folks who are better-versed in this than myself.  I&#039;ve now incorporated their feedback into the latest I2 ontology and instance document.  I hope to include both of these into a draft of the I2 specification which will go out for comment in the coming months.  Here&#039;s the latest <a href="http://gist.github.com/358857">ontology</a> and the latest <a href="http://gist.github.com/358858">set of instances</a>.</p>
<h5>Notes</h5><ol class="footnotes"><li id="footnote_0_568" class="footnote">I&#039;ve moved and <a href="http://lackoftalent.org/michael/blog/2009/12/22/forking/">changed jobs</a>, in fact</li><li id="footnote_1_568" class="footnote">This practice seems more or less common in my (admittedly limited) experience, cf. <a href="http://unapi.info/specs/">the unAPI specification</a>.</li><li id="footnote_2_568" class="footnote">This reflection should come as little surprise since RDF and XML are different kinds of things: RDF is a data model and XML is a serialization format.</li><li id="footnote_3_568" class="footnote">Using <a href="http://librdf.org/raptor/rapper.html">rapper</a>, a nifty little tool.</li></ol><br/>
<hr/>]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2010/05/19/i2-resource-description/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Exploring curation micro-services</title>
		<link>http://lackoftalent.org/michael/blog/2009/09/27/exploring-curation-micro-services/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/09/27/exploring-curation-micro-services/#comments</comments>
		<pubDate>Sun, 27 Sep 2009 04:00:29 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[Management]]></category>
		<category><![CDATA[Preservation]]></category>
		<category><![CDATA[Repositories]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=504</guid>
		<description><![CDATA[As far as I&#039;m concerned, the most exciting developments this year in repositories and digital curation have come out of the California Digital Library. It has been impossible not to notice their papers and presentations. Put simply, their idea is that digital curation is enabled by &#034;micro-services&#034; built upon well-known abstractions such as the filesystem. [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:504"><!-- &nbsp; --></abbr>
<p><img src="http://lackoftalent.org/images/micro_repo_thumb.png" alt="thumbnail of micro-repo tree" style="float: left"/>As far as I&#039;m concerned, the most exciting developments this year in  repositories and digital curation have come out of the California Digital Library.  It has been impossible <a href="http://conferences.library.gatech.edu/or/or09/paper/view/95">not</a> <a href="http://uccsc2009.ucdavis.edu/preso/UCCSC-2009-CDL-PODS-v05.ppt">to</a> <a href="http://www.ijdc.net/index.php/ijdc/article/view/98">notice</a> <a href="https://meeting-reg.com/sunpasig/abstracts.php">their</a> <a href="http://www.digitalpreservation.gov/news/events/ndiipp_meetings/ndiipp09/docs/NDIIPP%20Partner%20Meeting%202009_Breakout%20Session%20Schedule.pdf">papers</a> <a href="http://www.ijdc.net/index.php/ijdc/article/view/108/84">and</a> <a href="http://www.cdlib.org/iPres/confsched.html">presentations</a>.  Put simply, their idea is that digital curation is enabled by &#034;micro-services&#034; built upon well-known abstractions such as the filesystem.  The benefits are obvious: filesystem tools are ubiquitous and cross-platform, and there are strong market forces to ensure the filesystem persists.  The idea is radically simple and straightforward, though many questions remain about such a paradigm.  I&#039;ll return to those later. </p>
<p>If you have not yet taken a look at CDL&#039;s curation micro-service specifications, most of which may be printed on as few as one or two sheets of paper, see the <a href="http://www.cdlib.org/inside/diglib/">Digital Library Building Blocks</a>.</p>
<p>My co-workers in the LC Repository Development Center have been chatting about these specs on and off throughout the year.  After months of procrastinating, I finally read all of the specs on Thursday; it&#039;s wonderful that you can do so in the course of one reading session, I might add.  Yesterday a bunch of us RDCers got together to chat (informally) about the specs: what they&#039;re for, how they work, and how they interact with one another.  I learn by doing, by examples, so I combed through each of the specs in advance of our meeting and tried to <a href="http://twitter.com/mjgiarlo/statuses/4371794936">construct</a> a minimal repository[<a href="http://lackoftalent.org/michael/blog/2009/09/27/exploring-curation-micro-services/#footnote_0_504" id="identifier_0_504" class="footnote-link footnote-identifier-link" title="Perhaps it&amp;#8217;s more in line with the specs to refer to this space as &amp;#8220;a managed filesystem that drives repository and curation services,&amp;#8221; given the CDL philosophy that preservation is not a place/repository.  But it&amp;#8217;s easier to say &amp;#8220;repository,&amp;#8221; so there you go.">1</a>] based on micro-services.<br />
<span id="more-504"></span><br />
Here is a tree visualization of the final product, inevitable warts and all: <a href="http://lackoftalent.org/images/micro_repo.png"><img src="http://lackoftalent.org/images/micro_repo.png" alt="sample micro-services repo tree" /></a>  The services I used were <a href="http://www.cdlib.org/inside/diglib/namaste/namastespec.html">Namaste</a>, <a href="http://www.cdlib.org/inside/diglib/can/canspec.pdf">Content Access Node (CAN)</a>, <a href="http://www.cdlib.org/inside/diglib/pairtree/pairtreespec.html">Pairtree</a>, <a href="http://www.cdlib.org/inside/diglib/dflat/dflatspec.pdf">Dflat</a>, <a href="http://www.cdlib.org/inside/diglib/redd/reddspec.html">Reverse Directory Deltas (ReDD)</a>, <a href="http://www.cdlib.org/inside/diglib/clop/clopspec.pdf">Class-based System for Managing Object Properties (CLOP)</a>, and <a href="http://www.digitalpreservation.gov/library/resources/tools/docs/bagitspec.pdf">BagIt</a> (co-developed by LC and CDL).</p>
<p>As I mentioned in our Friday meeting, recounting my experience exploring the specs: the bad thing is that I spent an hour building a repository with rudimentary tools such as mkdir, touch, cp, ln, and emacs; but the good thing is that I built a <em>repository</em> in <em>one hour</em> using <em>common, rudimentary tools</em>.  It&#039;s a very compelling paradigm.  <a href="http://inkdroid.org/ehs">Ed</a>&#039;s already built a <a href="http://github.com/edsu/dflat">tool</a> implementing some of Dflat, further demonstrating how lightweight these micro-services are.  (<strong>UPDATE</strong>: Ed notes that this code is a work in progress and is &#034;barely functional.&#034;)  (<strong>UPDATE 2</strong>: The dflat library has come a long way.  Check it out if you&#039;re interested.  Also, I just committed a pretty basic Namaste library: <a href="http://github.com/mjgiarlo/namaste">http://github.com/mjgiarlo/namaste</a>.  Only took about an hour, which is a testament to the power of lightweight specs.)</p>
<p>I am certain this will be a running thread at work as the specifications evolve and our understanding of them grows.  Some questions and comments that occurred to me while exploring the micro-service specs and building the minimal repo:</p>
<ul>
<li>CAN was a bit puzzling.  The spec is simple enough, but I found some of the conventions confusing, and I was left wondering what CAN provides other than a container.  What I would like to see is a simple use case and perhaps more examples.  Thus, the CAN stuff in my sample repo doesn&#039;t feel very useful only because I had a hard time working with the spec.</li>
<li>CLOP feels like the least mature of the specifications.  It seems generally useful to be able to put digital objects, however you define that, into classes and define properties on those classes.  The spec did not clearly convey to me just how it accomplishes that aim.  A few examples would go a very long way.  I&#039;ve got some CLOP stuff in the sample repo but I have no idea how close my implementation matches the spec.</li>
<li>Is Dflat dependent on ReDD?  One would assume not since there&#039;s an optional property in the dflat-info.txt file for specifying a delta scheme.  But, say, could you stub out the v001 directory (reserved to hold the initial version of a digital object) and use a system such as <a href="http://git-scm.com/">git</a> or <a href="http://bazaar-vcs.org/">bazaar</a>?  <br/><br/>One might argue that these established delta schemes, if you want to call them that, have many more developers and users than a system such as ReDD and thus should persist longer and have more tools built around them.  I imagine the micro-service viewpoint would acknowledge that point, but counter that the spirit of these specs is to avoid dependencies from outside the filesystem?</li>
<li>Is the ReDD specification meaningful outside of a Dflat given that any one ReDD directory knows nothing of its successors and predecessors, or is it dependent upon Dflat?</li>
<li>Could a BagIt bag live inside of the ReDD reserved &#034;full&#034; directory?  That is, could the &#034;full&#034; directory be marked up appropriately to <em>be</em> a BagIt bag?</li>
<li>How many tools exist for these specs?  I notice there&#039;s code in CPAN for Pairtree and Namaste, which is a fabulous start.  Tools are the difference between YAMF (Yet Another Messy Filesystem) and reliably managed curation services.  Granted, tools such as cp and emacs already exist and are part of the appeal of these micro-services, but there&#039;s also tremendous room for error if operations are all done &#034;by hand.&#034;</li>
<li>To what extent has CDL transitioned to using these specs/tools?</li>
<li>Are other institutions using these specs/tools?  I have heard tell that digital library folks from the University of Michigan and the University of North Texas may be involved.</li>
</ul>
<p>I hope I don&#039;t sound overly critical.  I&#039;m really glad our colleagues at the California Digital Library have written these specifications and applied their deep experience to what could be a transformative paradigm[<a href="http://lackoftalent.org/michael/blog/2009/09/27/exploring-curation-micro-services/#footnote_1_504" id="identifier_1_504" class="footnote-link footnote-identifier-link" title="Please excuse the fanboyishness; this filesystem fetishism is exciting stuff!">2</a>] in the digital curation world.  Kudos to them!</p>
<h5>Notes</h5><ol class="footnotes"><li id="footnote_0_504" class="footnote">Perhaps it&#039;s more in line with the specs to refer to this space as &#034;a managed filesystem that drives repository and curation services,&#034; given the CDL philosophy that preservation is not a place/repository.  But it&#039;s easier to say &#034;repository,&#034; so there you go.</li><li id="footnote_1_504" class="footnote">Please excuse the fanboyishness; this filesystem fetishism is exciting stuff!</li></ol><br/>
<hr/>]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/09/27/exploring-curation-micro-services/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>I2: Survey results</title>
		<link>http://lackoftalent.org/michael/blog/2009/09/15/i2-survey-results/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/09/15/i2-survey-results/#comments</comments>
		<pubDate>Tue, 15 Sep 2009 12:29:55 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[NISO I2]]></category>
		<category><![CDATA[Persistent Identifiers]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Scholarly Communication]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=497</guid>
		<description><![CDATA[I wrote in June that the I2 subgroup surveyed &#034;repository managers to determine the current practices and needs of the repository community regarding institutional identifiers. Results from the survey will inform a set of use cases that will be shared with the community, and that are expected to drive the development of a new standard [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:497"><!-- &nbsp; --></abbr>
<p>I <a href="http://lackoftalent.org/michael/blog/2009/06/20/i2-survey/">wrote</a> in June that the I2 subgroup surveyed &#034;repository managers to determine the current practices and needs of the repository community regarding institutional identifiers. Results from the survey will inform a set of use cases that will be shared with the community, and that are expected to drive the development of a new standard for institutional identifiers.&#034;</p>
<p>The survey closed in July, and the subgroup spent August writing a report on the survey results.  That report is now <a href="http://www.niso.org/apps/group_public/document.php?document_id=2773">final</a> and it&#039;s available to the public.  Feedback may be sent to our (woefully underutilized) public <a href="http://www.niso.org/lists/i2info/">i2info</a> mailing list, left as a comment on this post, or e-mailed to me privately which I can forward to our internal list.</p>
<p>The next step is to build upon the report to draw yet more conclusions from the data &#8212; there&#039;s an awful lot there &#8212; and flesh out some repository use cases for institutional identifiers.  The I2 core group is moving quickly towards finalizing identifier metadata elements so that a standard may be drafted, and I think having some use cases documented will help drive the standard in a direction the community can get behind.</p>
<p>Onward and upward.</p>
]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/09/15/i2-survey-results/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>I2: Survey</title>
		<link>http://lackoftalent.org/michael/blog/2009/06/20/i2-survey/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/06/20/i2-survey/#comments</comments>
		<pubDate>Sat, 20 Jun 2009 18:30:53 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[NISO I2]]></category>
		<category><![CDATA[Persistent Identifiers]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Scholarly Communication]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=363</guid>
		<description><![CDATA[[Series] Near the end of my strawman post, I wrote: The I2 repositories subgroup will be sending out its survey on identifier use cases in the coming week. It will be interesting to see if the requirements we have thus far identified still obtain in light of the data we collect from the survey. We [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:363"><!-- &nbsp; --></abbr>
<p>[<a href="http://lackoftalent.org/michael/blog/category/niso-i2/">Series</a>]</p>
<p>Near the end of my <a href="http://lackoftalent.org/michael/blog/2009/06/13/i2-strawman/">strawman post</a>, I wrote:<br />
<blockquote>The I2 repositories subgroup will be sending out its survey on identifier use cases in the coming week.  It will be interesting to see if the requirements we have thus far identified still obtain in light of the data we collect from the survey. </p></blockquote>
<p>We completed the survey late last week and began distributing it.  Here&#039;s what we sent out:<br />
<blockquote>
<p>The NISO I2 Working Group is surveying repository managers to determine the current practices and needs of the repository community regarding institutional identifiers.  We value your time and your input in the process to create a standard for a new institutional identifier.  We hope that you will complete the survey which should take less than 15 minutes.  The survey will remain open through Monday, July 6th.</p>
<p>
Here is a link to the survey:<br />
<a href="http://www.surveymonkey.com/s.aspx?sm=RGQgZ3090DVrb3kFzr3P3Q_3d_3d">http://www.surveymonkey.com/s.aspx?sm=RGQgZ3090DVrb3kFzr3P3Q_3d_3d</a></p>
<p>
Please feel free to share this message with other interested parties.</p>
</blockquote>
<p>First we used <a href="http://www.surveymonkey.com/">Survey Monkey</a> to send the survey link to approximately one-hundred repository managers that the subgroup identified.   Our process for identifying repository managers involved pulling together a list of prominent repositories from subgroup members, and then gathering more from <a href="http://www.opendoar.org/">OpenDOAR</a>, &#034;an authoritative directory of academic open access repositories.&#034;  Then subgroup members were encouraged to share the survey link with colleagues, and post it far and wide via blogs, listservs, and <a href="http://twitter.com/mjgiarlo/status/2230486784">tweets</a>.  The listservs we targeted were: <a href="http://www.jiscmail.ac.uk/archives/jisc-repositories.html">JISC-REPOSITORIES</a>, <a href="http://metadatalibrarians.monarchos.com/">metadataLibrarians</a>, <a href="http://groups.google.com/group/digital-curation">digital-curation</a>, <a href="https://arl.org/Lists/SPARC-IR/">SPARC-IR</a>, <a href="http://mailman.anu.edu.au/mailman/listinfo/ir-net">ir-net</a>, <a href="http://www.lsoft.com/SCRIPTS/WL.EXE?SL1=REPOMAN-L&#038;H=LISTSERV.INDIANA.EDU">REPOMAN-L</a>, <a href="http://larch.palinet.org/archives/palinet-ir-l.html">PALINET-IR-L</a>, <a href="http://mailman.mit.edu/mailman/listinfo/dspace-general">dspace-general</a>, <a href="https://lists.sourceforge.net/lists/listinfo/fedora-commons-users">fedora-commons-users</a>, <a href="http://dublincore.org/groups/identifiers/">DC-IDENTIFIERS</a>, and <a href="http://dewey.library.nd.edu/mailing-lists/code4lib/">code4lib</a>.</p>
<p>I&#039;ve already received a few responses and have gotten useful feedback.  Two of the hardest questions to answer so far have been: &#034;What is an institutional identifier?&#034; and &#034;What is a repository?&#034;</p>
<dl>
<dt><strong>Institutional identifier</strong></dt>
<dd>
<p>An institutional identifier is defined as a symbol or code that uniquely identifies an institution.  Domain-specific examples of existing identifiers include SAN, IPEDS, GLN, MARC Org Code, and ISIL.  Another example might be a Handle prefix or ARK name authority assigning number.</p>
</dd>
<dt><strong>Repository</strong></dt>
<dd>
<p>Institutional repositories and subject repositories like arxiv.org are clearly &#039;repositories&#039;, but beyond that it is a somewhat ill-defined term.  One might look to the <a href="http://www.cnri.reston.va.us/k-w.html">Kahn-Wilensky architecture</a>, or the <a href="http://public.ccsds.org/publications/archive/650x0b1.pdf">OAIS reference model (PDF)</a>, or even <a href="http://en.wikipedia.org/wiki/Digital_repository">Wikipedia</a> for definitions, but it&#039;s not clear that even the authorities agree on what constitutes a repository.</p>
<p>It&#039;s a system.  It&#039;s network-accessible and typically has a web interface of some sort.  Files and groups of files sometimes known as objects tend to be deposited in them, perhaps for some combination of management, access, or preservation.  Many run Fedora, DSpace, and ePrints, and factor heavily in scholarly communication.  Some are document-centric.  Some will accept anything.  To some, a learning management system may be a repo.  To others, a content management system may fit.</p>
<p>My background is in academia so my own definition is somewhat based in that context, but I wouldn&#039;t say the term is necessarily limited to that context.  There are other NISO I2 scenarios for library workflows and electronic resources, so it&#039;s safe to assume that repository does not mean ILS or OPAC or ERP system.  My hope is that folks have their own working definitions of the term and can decide for themselves what it means.</p>
</dd>
</dl>
<p>We&#039;ve given folks a little over two weeks to respond to the survey, so the constant I2 drum-beating will quiet down for a while around here.  I am very interested in what sorts of responses we get from the survey.  Fun times!</p>
<p>Oh, and perhaps it goes without saying, but if you&#039;re a repository owner, manager, expert, developer, or stakeholder with an interest in identifiers, please feel free to take the <a href="http://www.surveymonkey.com/s.aspx?sm=RGQgZ3090DVrb3kFzr3P3Q_3d_3d">survey</a>!</p>
]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/06/20/i2-survey/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>I2: Strawman</title>
		<link>http://lackoftalent.org/michael/blog/2009/06/13/i2-strawman/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/06/13/i2-strawman/#comments</comments>
		<pubDate>Sat, 13 Jun 2009 23:02:36 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[NISO I2]]></category>
		<category><![CDATA[Persistent Identifiers]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Scholarly Communication]]></category>
		<category><![CDATA[Hypertext Transfer Protocol]]></category>
		<category><![CDATA[Metadata]]></category>
		<category><![CDATA[Tim Berners-Lee]]></category>
		<category><![CDATA[Uniform Resource Identifier]]></category>
		<category><![CDATA[World Wide Web]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=346</guid>
		<description><![CDATA[[Series] In the prior I2 post, I wrote about the requirements the repositories subgroup has come up with for an institutional identifier standard (with the hope that our findings re: repositories could be generalized to other scenarios). Image by PhOtOnQuAnTiQuE via Flickr My strawman proposal of sorts is to explore how well linked data patterns [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:346"><!-- &nbsp; --></abbr>
<p style="text-align: left;">
<p style="text-align: left;">[<a href="http://lackoftalent.org/michael/blog/category/niso-i2/">Series</a>]</p>
<p style="text-align: left;">In the prior I2 post, I wrote about the requirements the repositories subgroup has come up with for an institutional identifier standard (with the hope that our findings re: repositories could be generalized to other scenarios).</p>
<div class="zemanta-img zemanta-action-dragged" style="margin: 1em; display: block; text-align: left;">
<div>
<dl class="wp-caption alignright" style="width: 250px;">
<dt class="wp-caption-dt"><a href="http://www.flickr.com/photos/67968452@N00/3272712288"><img title="PhotonQ-Tim Berners Lee on Linked Data at TED" src="http://farm4.static.flickr.com/3449/3272712288_2ef843a4b7_m.jpg" alt="PhotonQ-Tim Berners Lee on Linked Data at TED" width="240" height="180" /></a></dt>
<dd class="wp-caption-dd zemanta-img-attribution" style="font-size: 0.8em;">Image by <a href="http://www.flickr.com/photos/67968452@N00/3272712288">PhOtOnQuAnTiQuE</a> via Flickr</dd>
</dl>
</div>
</div>
<p style="text-align: left;">My strawman proposal of sorts is to explore how well <a href="http://linkeddata.org/">linked data patterns</a> fit this problem space.  Linked data, briefly, is a way to expose and link data on the web in a more semantically meaningful way, and is often summarized using the four principles put forward by <a href="http://en.wikipedia.org/wiki/Tim_Berners-Lee">Tim Berners-Lee</a>:</p>
<blockquote style="text-align: left;">
<ol>
<li>Use URIs as names for things</li>
<li>Use HTTP URIs so that people can look up those names.</li>
<li>When someone looks up a URI, provide useful information.</li>
<li>Include links to other URIs. so that they can discover more things.</li>
</ol>
</blockquote>
<p style="text-align: left;">That&#039;s the crux of it.Â  Linked data takes well-known patterns on the web (linking, dereferencing, etc.) and applies them to data, which in this case could be metadata for identifying institutions.</p>
<p style="text-align: left;">Let&#039;s examine each of the requirements and the applicability of linked data thereto.</p>
<ol style="text-align: left;">
<li><em>Should be agnostic to type of institution, e.g., libraries, museums, personal collections, historical societies</em>: The web is already agnostic to type of institution.Â  HTTP URIs do not favor one type of institution over another.<br/></li>
<li><em>Should handle varying institutional granularity, e.g., institution-level, campus-level, division-level, unit-level</em>: HTTP <a href="http://en.wikipedia.org/wiki/Uniform_Resource_Identifier">URIs</a> are flexible in this regard.Â  Hierarchy, should one wish it to be surfaced in the identifier, may be encoded in either a DNS hostname or the path appended to the DNS name.Â  One can imagine a URI like &#034;http://department.division.institution.tld/unit/subunit&#034; or &#034;http://institution.tld/campus/office/individual&#034;. <br/><br/>Hierarchy needn&#039;t be surfaced in the identifier if one favors opacity, in which case &#034;http://registry.tld/xnjsdasd&#034; would suffice as an identifier, and may instead be entirely reflected in the (RDF) representation returned by dereferencing the URI.<br/></li>
<li><em>Should handle linking among institutions and subordinate units</em>: Linked data handles linking via well-known HTTP mechanisms, referenced in the fourth principle of linked data.Â  Unlike the HTTP link, which has limited semantics, linked data links are semantically rich and extensible.<br/></li>
<li><em>Should express different sorts of relationships among these institutions and units</em>: The &#034;useful information&#034; in the third principle of linked data is typically provided by an RDF representation, which is itself a list of assertions.Â  These assertions, or triples, consist of subjects, predicates, and objects.Â  The ability to express the relationships in this requirement is limited only by the availability of vocabularies that contain sets of predicates and classes for subjects and objects.Â  Think of the predicates as elements defined within a metadata standard, e.g., <a href="http://dublincore.org/documents/dces/">Dublin Core</a> &#034;creator&#034;, <a href="http://www.loc.gov/standards/mods/">MODS</a> &#034;relatedItem&#034;, and so forth.Â  Vocabularies that contain these predicates and classes are growing and evolving daily, and should there not be a vocabulary that contains the relationship one wishes to express, it is fairly easy to create a custom vocabulary. <br/><br/>The ability to mix and match vocabularies provides an expressiveness that is often not found in document-based metadata formats and the flexibility to express radically different relationships on a per-industry or per-institution basis.Â  This latter point is important as the I2 group has identified both core metadata elements for identifying institutions of different types and additional elements for specific types of institutions.Â  Why re-invent a new metadata format or schema when all one needs to express may already be contained in others?<br/></li>
<li><em>Should relate to existing relevant identifiers and registries</em>: Same as requirement#4.Â  Linked data is all about expressing relationships between things, e.g., institutions, identifiers, registries, etc.<br/></li>
<li><em>Should be globally unique</em>: HTTP URIs are guaranteed to be globally unique by virtue of the distributed <a href="http://en.wikipedia.org/wiki/Domain_name_system">DNS</a> system and hierarchical naming within each HTTP service.<br/></li>
<li><em>Should be actionable</em>: HTTP URIs provide dereferenceability/actionability via the well-known HTTP protocol.<br/></li>
<li><em>Should enable retrieval of metadata sufficient to identify the institution, which may vary widely by institution</em>: HTTP URIs are actionable per requirement #7 and the metadata returned is flexible per requirement #4.<br/></li>
<li><em>Should accommodate changes as institutions come and go and re-organize and be able to relate defunct institutions to new ones</em>: Linked data patterns provide for redirecting from defunct representations (institutional identifiers) to new ones via HTTP redirects.Â  One may also add assertions to institutional metadata such as owl:sameAs, for instance, which says that the institution identified by the given URI is the same as another institution identified by another URI.<br/></li>
</ol>
<p>This seems like a compelling path to follow for the I2 standard.</p>
<p>The I2 repositories subgroup will be sending out its survey on identifier use cases in the coming week.Â  It will be interesting to see if the requirements we have thus far identified still obtain in light of the data we collect from the survey.Â  If so, I would like to explore the idea of linked data for institutional identifiers a bit more.</p>
]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/06/13/i2-strawman/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>I2: Requirements</title>
		<link>http://lackoftalent.org/michael/blog/2009/06/07/i2-requirements/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/06/07/i2-requirements/#comments</comments>
		<pubDate>Sun, 07 Jun 2009 21:14:56 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[NISO I2]]></category>
		<category><![CDATA[Persistent Identifiers]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Scholarly Communication]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=327</guid>
		<description><![CDATA[[Series] The I2 IR scenario subgroup approached the issue of institutional identifiers in repositories by first brainstorming about the various issues, problems, and sticking points that make identifiers in this space (and elsewhere) such a complex topic. Folks on the subgroup are repository managers or are otherwise involved with or knowledgeable about the repository space, [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:327"><!-- &nbsp; --></abbr>
<p>[<a href="http://lackoftalent.org/michael/blog/category/niso-i2/">Series</a>]</p>
<p>The I2 IR scenario subgroup approached the issue of institutional identifiers in repositories by first brainstorming about the various issues, problems, and sticking points that make identifiers in this space (and elsewhere) such a complex topic.  Folks on the subgroup are repository managers or are otherwise involved with or knowledgeable about the repository space, so the brainstorming exercise yielded a good number of concerns.  </p>
<p>The purpose of the exercise was to enumerate concerns and issues that could inform a draft survey to be administered to repository managers and experts around the globe in different organizational contexts: libraries, subject disciplines, archives, historical societies, etc.  The purpose of the survey is to get an idea of the use cases and constraints around institutional identifiers in these different repository contexts, the assumption being that we ought to have requirements grounded in real world usage before we go off building a standard.</p>
<p>I will note here that the subgroup has worked up a draft survey that has just recently been reviewed by a small group of folks who know about survey design, and we hope to administer the survey to the aforementioned <em>Reporati</em> this week[<a href="http://lackoftalent.org/michael/blog/2009/06/07/i2-requirements/#footnote_0_327" id="identifier_0_327" class="footnote-link footnote-identifier-link" title="We will also x-post to repo-related mailing lists as well, and some of us may blog or tweet about it.  My inclination is to cast as wide a net as possible so as not to miss important use cases.  We can always scope things out later on, but it&amp;#8217;s useful to be inclusive at this point lest our own assumptions carry the group forward.">1</a>].  Which is to say that I don&#039;t yet have a strong grasp of the use cases out there in the wild, and this series should be construed as my own premature cognitive fumblings.  But let&#039;s assume for now that what we learn from the survey results matches our initial brainstorming exercise.  </p>
<p>Here is a slightly modified and boiled down version of the concerns and issues the subgroup came up with for a potential institutional identifier standard, which resembles a set of minimum requirements:</p>
<ol>
<li>Should be agnostic to type of institution, e.g., libraries, museums, personal collections, historical societies</li>
<li>Should handle varying institutional granularity, e.g., institution-level, campus-level, division-level, unit-level</li>
<li>Should handle linking among institutions and subordinate units</li>
<li>Should express different sorts of relationships among these institutions and units</li>
<li>Should relate to existing relevant identifiers and registries</li>
<li>Should be globally unique</li>
<li>Should be actionable</li>
<li>Should enable retrieval of metadata sufficient to identify the institution, which may vary widely by institution</li>
<li>Should accommodate changes as institutions come and go and re-organize and be able to relate defunct institutions to new ones</li>
</ol>
<p>I doubt the list is exhaustive; I am almost certain we will uncover all sorts of tangly and esoteric use cases that add requirements.  I expect it.  Why else would we be gathering to discuss the need for an institutional identifier if it were a solved problem or a simple one? [<a href="http://lackoftalent.org/michael/blog/2009/06/07/i2-requirements/#footnote_1_327" id="identifier_1_327" class="footnote-link footnote-identifier-link" title="The cynical among you might have interesting answers to this question.">2</a>]</p>
<p>Nevertheless, looking at the above list, the task we&#039;ve taken on starts to feel less onerous.  And thinking about identifier systems constrained by the list of concerns, the mind starts to cook up all sorts of possible solutions.  I&#039;ll share one in the next post in this series, a strawman proposal of sorts, and how it addresses each of these requirements.</p>
<h5>Notes</h5><ol class="footnotes"><li id="footnote_0_327" class="footnote">We will also x-post to repo-related mailing lists as well, and some of us may blog or tweet about it.  My inclination is to cast as wide a net as possible so as not to miss important use cases.  We can always scope things out later on, but it&#039;s useful to be inclusive at this point lest our own assumptions carry the group forward.</li><li id="footnote_1_327" class="footnote">The cynical among you might have interesting answers to this question.</li></ol><br/>
<hr/>]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/06/07/i2-requirements/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>I2: Background</title>
		<link>http://lackoftalent.org/michael/blog/2009/05/19/i2-background/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/05/19/i2-background/#comments</comments>
		<pubDate>Tue, 19 May 2009 19:16:19 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[NISO I2]]></category>
		<category><![CDATA[Persistent Identifiers]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Scholarly Communication]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=312</guid>
		<description><![CDATA[[Series] This is the first in a series of posts about institutional identifiers[1]. In my last post, I alluded to some documentation that I&#039;ve written. That was somewhat misleading, which will soon be apparent, but I liked the parallel construction I had going, and I am but a slave to orderliness. For about the past [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:312"><!-- &nbsp; --></abbr>
<p>[<a href="http://lackoftalent.org/michael/blog/category/niso-i2/">Series</a>]</p>
<p>This is the first in a series of posts about institutional identifiers[<a href="http://lackoftalent.org/michael/blog/2009/05/19/i2-background/#footnote_0_312" id="identifier_0_312" class="footnote-link footnote-identifier-link" title="I offer that very tentatively, knowing what a spectacular failure my last attempt at a series was.">1</a>].  </p>
<p>In <a href="http://lackoftalent.org/michael/blog/2009/05/18/state-of-the-me/">my last post</a>, I alluded to some documentation that I&#039;ve written.  That was somewhat misleading, which will soon be apparent, but I liked the parallel construction I had going, and I am but a slave to orderliness.</p>
<p>For about the past six months, I have been working with a <a href="http://www.niso.org/workrooms/i2">NISO group</a> looking into how institutions are identified within information systems:</p>
<blockquote><p>
The I2 (Institutional Identifiers &#8212; pronounced &#034;I 2&#034;) working group will build on work from the Journal Supply Chain Efficiency Improvement Pilot (http://www.journalsupplychain.com/), which demonstrated the improved efficiencies of using an institutional identifier in the journal supply chain. The NISO working group will develop a standard for an institutional identifier that can be implemented in all library and publishing environments. The standard will include definition of the metadata required to be collected with the identifier and what uses can be made of that metadata. &#8230;
</p></blockquote>
<p>The I2 group is split into a few subgroups which have been charged with looking into how institutional identifiers are used in particular scenarios.  These scenarios are e-resources, repositories and e-learning systems, and library resource workflows.  The scenario names pain me a bit, but so be it; this is our industry, and there are bigger windmills to tilt at.</p>
<p>I am currently co-chairing the subgroup looking at repositories and e-learning, and apparently I am its &#034;tech lead.&#034;  I don&#039;t want to get caught up on names and roles and titles, though; this series isn&#039;t about those at all.  I&#039;m just setting the scene and explaining why my head&#039;s in this space and laying bare my stake in the issue.</p>
<p>The remainder of this series will provide a bit more detail on the issues around institutional identifiers, share how the repository subgroup is grappling with identifier issues and engaging the repository community to assess needs, propose an approach for an identifier system that may meet said needs, and explore what seems to be the thorniest issue[<a href="http://lackoftalent.org/michael/blog/2009/05/19/i2-background/#footnote_1_312" id="identifier_1_312" class="footnote-link footnote-identifier-link" title="Hint: management.  I know, &amp;#8220;duh,&amp;#8221; right?">2</a>].</p>
<h5>Notes</h5><ol class="footnotes"><li id="footnote_0_312" class="footnote">I offer that very tentatively, knowing what a <a href="/michael/blog/category/development/ruby/">spectacular failure</a> my last attempt at a series was.</li><li id="footnote_1_312" class="footnote">Hint: management.  I know, &#034;duh,&#034; right?</li></ol><br/>
<hr/>]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/05/19/i2-background/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Cataloging and institutional repositories</title>
		<link>http://lackoftalent.org/michael/blog/2009/02/09/cataloging-and-institutional-repositories/</link>
		<comments>http://lackoftalent.org/michael/blog/2009/02/09/cataloging-and-institutional-repositories/#comments</comments>
		<pubDate>Mon, 09 Feb 2009 14:30:33 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Cataloging and Metadata]]></category>
		<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[Libraries]]></category>
		<category><![CDATA[Management]]></category>
		<category><![CDATA[Repositories]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=288</guid>
		<description><![CDATA[While doing some reading for a little talk my colleague, Ed Summers, and I are giving at code4lib 2009, I came across a paragraph that sparked a crazy thought. So crazy that it&#039;s not crazy at all. So not crazy that I am sure other people have thought of it. But nonetheless, here I am [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:288"><!-- &nbsp; --></abbr>
<p>While doing some reading for a little <a href="http://code4lib.org/conference/2009/schedule#hcal10">talk</a> my colleague, <a href="http://inkdroid.org/ehs">Ed Summers</a>, and I are giving at <a href="http://code4lib.org/conference/2009">code4lib 2009</a>, I came across a paragraph that sparked a crazy thought.  So crazy that it&#039;s not crazy at all.  So not crazy that I am sure other people have thought of it.  But nonetheless, here I am writing about it just in case.</p>
<p>From Sarah Currier&#039;s <a href="http://www.elearning.ac.uk/features/sword">paper</a> on <a href="http://www.swordapp.org/">SWORD</a> (emphasis mine):<br />
<blockquote>One of the most frequently cited barriers to academics depositing their teaching materials into repositories is the keystroke-count involved in logging into a repository, uploading the resource, creating metadata, perhaps selecting a licence, and publishing the resource. It was a quick win, therefore, to create a drag-and-drop desktop tool to allow a single keystroke deposit of resources, including multiple resources in one action. For a repository that supports <b>automatic metadata generation</b>, administrative metadata can be created at the point of entry to the repository without the user needing to create any.</p></blockquote>
<p>And I wondered how many repositories supported automatic metadata generation.  I wondered how many repositories supported automatic generation of <em>rich</em> metadata.  And lastly I wondered, might this be a more or less natural role for catalogers: augmenting stub metadata records or doing original cataloging for institutional repository deposits?  Especially at a time when many of them are being reclassified as acquisitions specialists or digital projects managers?</p>
<p>Potential issues and questions:
<ul>
<li>Author ignorance: Maybe catalogers are already doing this and I&#039;m a moron?</li>
<li>Scale: Is it realistic to expect to be able to &#034;keep up&#034; with repository deposits?</li>
<li>Granularity: Does cataloging at the level of articles, and perhaps at even finer granularities, introduce challenges?</li>
<li>Duplication: If pre-prints are cataloged in the IR, for instance, will they need to be cataloged again later?</li>
<li>&#8230; there are others I thought of on my commute this morning but have since forgotten them.  Feel free to add comments.</li>
</ul>
<p>I will admit here that I&#039;ve been somewhat out of the (academic) institutional repository space a while, and cataloging is something I don&#039;t share thoughts about very often because my exposure is limited to having taken one course a couple years ago.  </p>
<p>I assume there&#039;s a body of research about this out there somewhere but I figured I&#039;d post this anyway.</p>
]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2009/02/09/cataloging-and-institutional-repositories/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Molotovs away!</title>
		<link>http://lackoftalent.org/michael/blog/2008/12/23/molotovs-away/</link>
		<comments>http://lackoftalent.org/michael/blog/2008/12/23/molotovs-away/#comments</comments>
		<pubDate>Wed, 24 Dec 2008 03:29:50 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[Digital Libraries and Archives]]></category>
		<category><![CDATA[Libraries]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Software]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=243</guid>
		<description><![CDATA[Lest I be criticized for unfairly calling out former employers in my recent Burn the Walled Gardens rant, I share news that the Rutgers University Libraries have boldly ventured into the world of open source software: RUcore Open Source Development. Huzzah! Thanks to the molotov-hurling Shaun Ellis, a peacenik/code monkey/musician extraordinaire, for all of his [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:243"><!-- &nbsp; --></abbr>
<p>Lest I be criticized for unfairly calling out former employers in my recent <a href="/michael/blog/2008/12/15/burn-the-walled-gardens/">Burn the Walled Gardens rant</a>, I share news that the Rutgers University Libraries have boldly ventured into the world of open source software: <a href="http://rucore.libraries.rutgers.edu/open/" target="_blank">RUcore Open Source Development</a>.  Huzzah!  Thanks to the molotov-hurling <a href="http://www.sdellis.com/" target="_blank">Shaun Ellis</a>, a peacenik/code monkey/musician extraordinaire, for all of his work and for bringing this to my attention.</p>
<p>On the RUcore open source page you can get a list of ongoing projects, a release schedule, and a rationale for their licensing decisions (i.e., choosing <a href="http://www.gnu.org/licenses/gpl-3.0.html" target="_blank">GPL 3</a>).</p>
<p>The first project to be released (as of 2008/12/19) is the METS-based bibliographic utility, <a href="http://rucore.libraries.rutgers.edu/open/projects/openmic/" target="_blank">OpenMIC</a>:<br />
<blockquote>
OpenMIC is an open source, web-based cataloging tool that can be used as a standalone application or integrated with other repository architectures by a wide range of organizations. It provides a complete metadata creation system for analog and digital materials, with services to export these metadata in standard formats.</p>
<ul>
<li> Low overhead and infrastructure requirements </li>
<li> Events-based model for management and rights documentation </li>
<li> Mapping and import from standard and in-house formats </li>
<li> Unicode and CJK vernacular character support </li>
</ul>
<p>OpenMIC is a core application for the Moving Image Collections (MIC) initiative developed at the Rutgers University Libraries with funding from the Library of Congress. </p></blockquote>
<p>I look forward to following along as Rutgers releases yet more of the tools they have developed as part of their impressive digital library infrastructure.  It will be even more interesting to hear what their model will be for taking patches / commits from the broader open source community.  These things do take time, even though I failed to show an appreciation for that in my original rant, but I am reminded (by <a href="http://bibwild.wordpress.com/2008/12/23/more-on-doing-open-source-right/" target="_blank">Jonathan Rochkind</a>) that it&#039;s better to take the time and get it right.  I cringe a bit to say that, knowing full well how things tend to languish in committees and fall victim to analysis paralysis in academia; surely there is some middle ground?  There are some very talented and experienced folks at Rutgers, so I will be excited to see them take a leadership role in this space.</p>
<p>Go, Scarlet Knights!</p>
]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2008/12/23/molotovs-away/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>ORE plugin updated</title>
		<link>http://lackoftalent.org/michael/blog/2008/07/25/ore-plugin-updated/</link>
		<comments>http://lackoftalent.org/michael/blog/2008/07/25/ore-plugin-updated/#comments</comments>
		<pubDate>Fri, 25 Jul 2008 17:19:29 +0000</pubDate>
		<dc:creator>Michael Giarlo</dc:creator>
				<category><![CDATA[APIs]]></category>
		<category><![CDATA[Development]]></category>
		<category><![CDATA[OAI-ORE]]></category>
		<category><![CDATA[Repositories]]></category>

		<guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=135</guid>
		<description><![CDATA[I&#039;ve been using my time at RepoCamp today to get the OAI-ORE plugin for WordPress validating again.Â  I&#039;m having some trouble using the validator so I say that with some diffidence.Â  But the latest code which is now checked in to the WordPress plugins svn repo ought to be close, if not fully conformant, to [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="oai:lackoftalent.org:technosophia:135"><!-- &nbsp; --></abbr>
<p>I&#039;ve been using my time at <a href="http://barcamp.org/RepoCamp" target="_blank">RepoCamp</a> today to get the OAI-ORE <a href="http://lackoftalent.org/michael/blog/ore-wordpress-plug-in/" target="_blank">plugin</a> for WordPress <a href="http://african.lanl.gov/ovalnet/validate.jsp" target="_blank">validating</a> again.Â  I&#039;m having some trouble using the validator so I say that with some diffidence.Â  But the latest code which is now checked in to the WordPress plugins svn repo ought to be close, if not fully conformant, to the 0.9 version of the ORE spec.</p>
<p>I&#039;m not sure the plugin is really useful; it&#039;s just an Atom feed of all posts and pages in a WP instance.Â  I can think of some ways to make this more useful, by allowing blog authors to create their own aggregations, pulling in content outside of the particular instance.Â  I am certain that others can come up with even better uses.Â  I&#039;m open to suggestions.</p>
<p>Thanks to Jay Datema for <a href="http://www.bookism.org/open/2008/07/17/repurposing-metadata/" target="_blank">prodding</a> me a bit, if indirectly.</p>
]]></content:encoded>
			<wfw:commentRss>http://lackoftalent.org/michael/blog/2008/07/25/ore-plugin-updated/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

