<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Threepress Consulting blog &#187; libraries</title>
	<atom:link href="http://blog.threepress.org/category/libraries/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.threepress.org</link>
	<description>Threepress creates software for publishers, educators and authors.</description>
	<lastBuildDate>Mon, 09 Jan 2012 13:02:39 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>jiscPUB report: Digital publishing landscape, exemplars and recommendations</title>
		<link>http://blog.threepress.org/2011/12/15/jiscpub-report-digital-publishing-landscape-exemplars-and-recommendations/</link>
		<comments>http://blog.threepress.org/2011/12/15/jiscpub-report-digital-publishing-landscape-exemplars-and-recommendations/#comments</comments>
		<pubDate>Thu, 15 Dec 2011 19:41:34 +0000</pubDate>
		<dc:creator>Liza Daly</dc:creator>
				<category><![CDATA[articles]]></category>
		<category><![CDATA[ebooks]]></category>
		<category><![CDATA[libraries]]></category>
		<category><![CDATA[linking]]></category>

		<guid isPermaLink="false">http://blog.threepress.org/?p=2184</guid>
		<description><![CDATA[I was quite pleased to be asked to write a technical report for JISC. The paper provides background on the state of ebook publishing today, and concludes with a set of recommended projects that aim to improve digital scholarly publishing in the UK.  The report is available online as Digital Monograph Technical Landscape: Exemplars [...]]]></description>
			<content:encoded><![CDATA[<p>I was quite pleased to be asked to write a technical report for <a href="http://www.jisc.ac.uk/">JISC</a>. The paper provides background on the state of ebook publishing today, and concludes with a set of recommended projects that aim to improve digital scholarly publishing in the UK.  The report is available online as <a href="http://jiscpub.blogs.edina.ac.uk/final-report/">Digital Monograph Technical Landscape: Exemplars and Recommendations</a> under a <a href="http://creativecommons.org/licenses/by-nc-sa/2.5/scotland/">Creative Commons license</a>.  Other formats, including EPUB and PDF, are available from the <a href="http://jiscpub.blogs.edina.ac.uk/2011/12/02/final-post/">project blog</a>.  Warning, the paper is super-long!</p>
<p>Though I was the principal author of the paper, I&#8217;d like to thank the other members of the JISC team for their support, edits, and encouragement:  Theo Andrew, Peter Sefton, Emma Tonkin, and David Flanders. Many other deliverables were produced as a part of this project, including digital publishing toolkits and usability studies; please check them out on the <a href="http://jiscpub.blogs.edina.ac.uk/2011/12/02/final-post/">jiscPUB blog</a>.</p>
<h2>Executive summary</h2>
<blockquote><p>&#8230;This report aims to describe some historical perspective on electronic publishing, leading up to why the “ebook revolution” has happened in the 2010s when it had failed to take hold before. We will describe some of the details of how digital books are authored, both in a scholarly context and in general ebook production terms. We have included in-depth coverage of the unexpected outcomes of ebook distribution, including issues of rights, royalties, copyright, academic impact, and the implications of limiting access to and reproduction of digital books. Finally, we review findings from a number of ebook pilot programs conducted in U.S. universities, and draw from the work done by the rest of the “JISCpub” team in uncovering possible future work that could be actionable and relevant to a scholarly publishing audience, with a goal towards providing better tooling for both authors and readers of scholarly works.</p></blockquote>
<h2>Recommendations in brief</h2>
<ol>
<li>Rich full-text semantic search tools for scholarly ebook collections.
</li>
<li>Tools for generating or traversing highly-specific stable citations.
</li>
<li>Development of a pilot to produce student theses with high-engagement linked-data content.
</li>
<li>Plugins or add-ons to provide simple, ebook output for popular word processing tools.
</li>
<li>Improved workflows for authoring attractive, accessible, standards-based mathematical notation in ebooks.
</li>
<li>Development of an ereading system with an emphasis on scholarly annotation and research-gathering.
</li>
<li>Provisions to train and share scholars interested in digital publishing.
</li>
<li>Aggregate ebook services for authors and university presses.
</li>
<li>Maximize use of orphan works.
</li>
<li>Community resources for institutions with digital collection.
</li>
<p><a href="http://jiscpub.blogs.edina.ac.uk/final-report/">Read the complete report</a>.</ol>
]]></content:encoded>
			<wfw:commentRss>http://blog.threepress.org/2011/12/15/jiscpub-report-digital-publishing-landscape-exemplars-and-recommendations/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>The past of the future of the book (part 5)</title>
		<link>http://blog.threepress.org/2009/05/01/the-past-of-the-future-of-the-book-part-5/</link>
		<comments>http://blog.threepress.org/2009/05/01/the-past-of-the-future-of-the-book-part-5/#comments</comments>
		<pubDate>Fri, 01 May 2009 13:38:50 +0000</pubDate>
		<dc:creator>Liza Daly</dc:creator>
				<category><![CDATA[libraries]]></category>

		<guid isPermaLink="false">http://blog.threepress.org/?p=460</guid>
		<description><![CDATA[For skilled cataloging, work that takes intelligence and the best education attainable, the demand is always above the supply. In the library of the future, the free library, the greatest missionary force of the age, there is going to be a great opening for women&#8217;s work, too. 
I don&#8217;t mean by the ideal free library [...]]]></description>
			<content:encoded><![CDATA[<blockquote><p>For skilled cataloging, work that takes intelligence and the best education attainable, the demand is always above the supply. In the library of the future, the free library, the greatest missionary force of the age, there is going to be a great opening for women&#8217;s work, too. </p>
<p>I don&#8217;t mean by the ideal free library a big central book collection in this city or any city, but smaller ones, so scattered as to bring the reading-room within half a mile of every workman&#8217;s home.</p></blockquote>
<p>&#8211;&#8221;Facts for the Fair Sex, Some Things that Women Can Do Better Than Men&#8221;, <a href="http://pqasb.pqarchiver.com/chicagotribune/access/693527372.html?FMT=AI&#038;dids=693527372:693527372&#038;FMTS=CITE:AI&#038;type=historic&#038;date=May+14%2C+1887&#038;author=&#038;pub=Chicago+Tribune&#038;desc=FACTS+FOR+THE+FAIR+SEX.">Chicago Tribune, May 1887</a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.threepress.org/2009/05/01/the-past-of-the-future-of-the-book-part-5/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A case study in converting image-based ebooks into XML</title>
		<link>http://blog.threepress.org/2009/02/21/a-case-study-in-converting-image-based-ebooks-into-xml/</link>
		<comments>http://blog.threepress.org/2009/02/21/a-case-study-in-converting-image-based-ebooks-into-xml/#comments</comments>
		<pubDate>Sat, 21 Feb 2009 22:01:18 +0000</pubDate>
		<dc:creator>Liza Daly</dc:creator>
				<category><![CDATA[book design]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[ebooks]]></category>
		<category><![CDATA[libraries]]></category>
		<category><![CDATA[images]]></category>
		<category><![CDATA[xml]]></category>

		<guid isPermaLink="false">http://blog.threepress.org/?p=369</guid>
		<description><![CDATA[There&#8217;s a great deal of valuable information in this recently-released white paper by The American Council of Learned Societies: ACLS Humanities E-Book XML Conversion Experiment: Report on Workflow, Costs, and User Preferences.  Although the study was based on scholarly books, their findings would apply to many other digitization projects.
The Humanities E-Book (HEB) project took [...]]]></description>
			<content:encoded><![CDATA[<p>There&#8217;s a great deal of valuable information in this recently-released white paper by The American Council of Learned Societies: <a href="http://www.humanitiesebook.org/HEBWhitePaper2.pdf">ACLS Humanities E-Book XML Conversion Experiment: Report on Workflow, Costs, and User Preferences</a>.  Although the study was based on scholarly books, their findings would apply to many other digitization projects.</p>
<p><a href="http://www.humanitiesebook.org/">The Humanities E-Book</a> (HEB) project took 20 books (as scanned page images + uncorrected OCR) and converted them to an in-house XML format.  They compared the workflow impact, costs and user experience of the final XML product versus that of the page-image ebooks.</p>
<p>Many of their findings consorted with my own experience in this area:</p>
<ol>
<li>The quality of the OCRed text was worse than expected: good enough for search, but not always suitable for reading.  However, the cost of double-keying the text from scratch was prohibitive.</li>
<li>The encoding vendor, while skilled and diligent, nevertheless produced output that would require a trained editor to correct properly.  HEB spent 4-8 hours hand-correcting each book in the sample set.</li>
<li>The average cost for conversion to XML was approximately 3X greater than for scans + OCR only.  This did not include in-house correction and review.</li>
</ol>
<h3>User survey results</h3>
<p>After the 20 sample books were made available to their community, users were polled for their reactions.  I feel these are worth mentioning at length.</p>
<p>69% of readers preferred the XML-encoded books (presented as HTML in a browser).</p>
<p>Reasons for preferring the XML scans included:</p>
<ol>
<li>Readability (despite the fact that not all books were completely proofed)</li>
<li>Usability (e.g. cut and paste, ability to use screen readers)</li>
<li>Layout (the HTML presentation had few distracting elements on the pages, and more content was available per web page than in the page-based scans)</li>
</ol>
<p>Interestingly, of those readers who preferred the image scans, one of the primary reasons cited was the more book-like paginated layout. I&#8217;m very conscious of this tension: many Bookworm users complain about the chapter-at-a-time, scrolling layout of the pages, while others absolutely hate arbitrary emulation of the printed work. It seems to be a strong personal preference that runs in one direction or the other.</p>
<h3>Ebook interface considerations</h3>
<p>Although not directly related to the study at hand, I found some of the publisher-imposed constraints on their user interface illustrative.  I feel these would be best be avoided when designing an ebook reading site:</p>
<blockquote><p>Foremost among user requests was a desire for better printing options. Printing of HEB titles has always been restricted to fair-use provisions, and for this reason there had neverbeen any immediate way of printing out pages without prior browser adjustment to<br />
accommodate frames—the intention being to discourage printing out long sections of copyrighted text at once.</p></blockquote>
<p>The ability to print text at length is critical for any serious work.  I&#8217;m always unhappy when a site prevents me from doing an ordinary task like printing or downloading.  I hope that publishers reconsider these types of restrictions.</p>
<p>Similarly, revenue models should not constrain the ways in which licensed users can access content:</p>
<blockquote><p>XML titles normally suppress all higher-level “container” sections, so that users always access only the smallest available text chunk in each overarching section. [...]</p>
<p>&#8230;for this set of titles, we would simply make all section levels accessible.  This would affect the process of tallying hits for these titles—something needed in order to calculate royalties  for publishers and usage statistics for libraries—as users could now potentially read an entire book by accessing only a small number of chapter-level sections (which in turn would generate fewer hits than reading the page-image version).</p></blockquote>
<p>As a reader, I should absolutely be able to read content &#8212; especially XML-based content &#8212; in as fluid a manner as possible.  Generating accurate accounting is a programming problem, and not one that should drive decisions about the reading interface.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.threepress.org/2009/02/21/a-case-study-in-converting-image-based-ebooks-into-xml/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>New England code4lib report</title>
		<link>http://blog.threepress.org/2008/12/10/new-england-code4lib-report/</link>
		<comments>http://blog.threepress.org/2008/12/10/new-england-code4lib-report/#comments</comments>
		<pubDate>Wed, 10 Dec 2008 17:55:54 +0000</pubDate>
		<dc:creator>Liza Daly</dc:creator>
				<category><![CDATA[libraries]]></category>
		<category><![CDATA[code4lib]]></category>

		<guid isPermaLink="false">http://blog.threepress.org/?p=216</guid>
		<description><![CDATA[I spent Tuesday in the company of some very interesting developers and librarians as part of necode4lib, the regional chapter of code4lib.  
The event was hosted by the Boston Public Library. We were given a fascinating tour of the Open Library scanning facility: ten &#8220;scribe&#8221; workstations, manned by full-time staffers. I didn&#8217;t have my [...]]]></description>
			<content:encoded><![CDATA[<p>I spent Tuesday in the company of some very interesting developers and librarians as part of <a href="http://wiki.code4lib.org/index.php/NEC4L">necode4lib</a>, the regional chapter of <a href="http://code4lib.org/">code4lib</a>.  </p>
<p>The event was hosted by the <a href="http://bpl.org/">Boston Public Library</a>. We were given a fascinating tour of the <a href="http://openlibrary.org/bpl">Open Library scanning facility</a>: ten &#8220;scribe&#8221; workstations, manned by full-time staffers. I didn&#8217;t have my camera (and presumably the scanners&#8217; Canon 5Ds were off-limits), but there&#8217;s a good report on the facility from <a href="http://www.everythingismiscellaneous.com/2008/05/31/scan-and-release-digitizing-the-boston-public-library/">Everything Miscellaneous</a>. As developers we were encouraged to build on top of the free content that&#8217;s pouring out of the project, and I&#8217;ll definitely be giving that some thought.</p>
<p>I haven&#8217;t worked directly with or for libraries yet, so for me the day was mostly a note-taking and networking experience.  There are a number of technologies I need to learn more about, including the <a href="http://www.librarything.com/api">LibraryThing APIs</a>, <a href="http://ocoins.info/">COinS</a>, <a href="http://unapi.info/">unAPI</a>. Bookworm needs to export more structured data in a useful way, especially when it comes to playing nicely with citation management software.  I think I now know enough to stay away from <a href="http://www.w3.org/MarkUp/Forms/">XForms</a>.</p>
<p>I&#8217;ll definitely be at the <a href="http://www.code4lib.org/taxonomy/term/19">code4lib 2009 conference</a>.  I even have a vague memory of promising to give a lightning talk on ePub.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.threepress.org/2008/12/10/new-england-code4lib-report/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>The real Internet Archive</title>
		<link>http://blog.threepress.org/2008/10/22/the-real-internet-archive/</link>
		<comments>http://blog.threepress.org/2008/10/22/the-real-internet-archive/#comments</comments>
		<pubDate>Thu, 23 Oct 2008 01:44:21 +0000</pubDate>
		<dc:creator>Liza Daly</dc:creator>
				<category><![CDATA[content]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[libraries]]></category>
		<category><![CDATA[archiving]]></category>
		<category><![CDATA[ftp]]></category>
		<category><![CDATA[rfc]]></category>
		<category><![CDATA[tei]]></category>
		<category><![CDATA[telnet]]></category>

		<guid isPermaLink="false">http://blog.threepress.org/?p=135</guid>
		<description><![CDATA[My attention was caught by this quote from Clay Shirky on the excellent ReadWriteWeb blog:
Back in 1974, when the Internet was a fraction of what it is now, the acorn to an oak, there were really only two applications,&#8221; said Shirky, &#8220;Telnet, and FTP.&#8221;
Surely he&#8217;s wrong, I thought.  Those protocols aren&#8217;t that old.
But I was [...]]]></description>
			<content:encoded><![CDATA[<p>My attention was caught by<a href="http://www.readwriteweb.com/archives/health_20_economics_of_aggregation.php"> this quote from Clay Shirky</a> on the excellent <a href="http://www.readwriteweb.com/">ReadWriteWeb blog:</a></p>
<blockquote><p>Back in 1974, when the Internet was a fraction of what it is now, the acorn to an oak, there were really only two applications,&#8221; said Shirky, &#8220;Telnet, and FTP.&#8221;</p></blockquote>
<p><em>Surely he&#8217;s wrong</em>, I thought.  <em>Those protocols aren&#8217;t that old.</em></p>
<p>But I was wrong. FTP was invented in 1971, and telnet was developed in 1969.</p>
<p>(Telnet is a way to connect interactively with another computer. In practice it&#8217;s been replaced by the more secure <a href="http://en.wikipedia.org/wiki/Ssh">ssh</a>, but vestigial copies remain on all modern computers.)</p>
<p>What really threw me wasn&#8217;t that telnet was from 1969 as much as that it was <a href="http://tools.ietf.org/html/rfc15">RFC 15</a>.  In the networked world, Requests for Comments are documents which define the standards that computers use when communicating with each other.  To understand how old RFC 15 is, consider that the venerable FTP is <a href="http://www.rfc-editor.org/rfc/rfc114.txt">RFC 114</a>,  while email as we know it is <a href="http://www.faqs.org/rfcs/rfc821.html">RFC 821</a> (1982), and HTTP is <a href="http://www.faqs.org/rfcs/rfc1945.html">RFC 1945</a> (1996, although obviously it had been in use for years). The most recent RFC is <a href="http://tools.ietf.org/html/rfc5382">5382</a>. RFC 15 is <em><span style="text-decoration: underline;">ancient history</span></em>.</p>
<p>Because I am a nerd I spent some time browsing the early RFCs, and I was struck by how charmingly antique they are. <a href="http://tools.ietf.org/html/rfc16">RFC 16</a> says that M.I.T. should receive copies of RFCs. <a href="http://tools.ietf.org/html/rfc6">RFC 6</a> begins, &#8220;I talked with Bob Kahn at BB&amp;N yesterday.&#8221; RFC 14 <a href="http://tools.ietf.org/html/rfc14">never existed</a>.</p>
<p>RFC 7 (&#8220;Host-IMP Interface&#8221;) includes a prefatory note:</p>
<blockquote><p>The original of <a href="http://tools.ietf.org/html/rfc7">RFC 7</a> was hand-written, and only partially illegible [sic]<br />
copies exist.</p></blockquote>
<p>Indeed, the actual RFC begins:</p>
<blockquote><p>This paper is concerned with the preliminary software design of the<br />
Host IMP interface.  Its main purpose is on the one hand to define<br />
functions that will be implemented, and on the other hand to provide<br />
a base for discussions and &#8230;(unreadable).</p></blockquote>
<p>I&#8217;m on the mailing list for users of the <a href="http://www.tei-c.org/index.xml">Text Encoding Initiative</a> (TEI), an XML schema used primarily for encoding historical texts. The schema is equipped with tags for tracking everything about a document, including changes that occur over centuries of time. On the TEI list, people ask questions like, &#8220;How do I represent a medieval manuscript and also indicate which passages were underlined by an 18th century owner?&#8221; or &#8220;What tag should I use for a poem title that was handwritten vertically in the left margin?&#8221; (Promptly followed by vigorous scholarly debates over the &#8220;correct&#8221; answers.)</p>
<p>There&#8217;s something charming about how early internet history, just 40 years old, is almost as poorly documented and in need of careful archivists.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.threepress.org/2008/10/22/the-real-internet-archive/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>TEI + Python + lxml + Dutch = Corpus Toneelkritiek Interbellum</title>
		<link>http://blog.threepress.org/2008/10/14/corpus-toneelkritiek-interbellum/</link>
		<comments>http://blog.threepress.org/2008/10/14/corpus-toneelkritiek-interbellum/#comments</comments>
		<pubDate>Wed, 15 Oct 2008 02:42:29 +0000</pubDate>
		<dc:creator>Liza Daly</dc:creator>
				<category><![CDATA[digitization]]></category>
		<category><![CDATA[libraries]]></category>
		<category><![CDATA[tools]]></category>
		<category><![CDATA[clowns]]></category>
		<category><![CDATA[dutch]]></category>
		<category><![CDATA[lxml]]></category>
		<category><![CDATA[python]]></category>
		<category><![CDATA[tei]]></category>
		<category><![CDATA[xpath]]></category>
		<category><![CDATA[xslt]]></category>

		<guid isPermaLink="false">http://blog.threepress.org/?p=103</guid>
		<description><![CDATA[I was pleased to be able to assist with the Corpus Toneelkritiek Interbellum project, which allows reading, browsing and searching of early 20th-century Dutch theater reviews.  I can&#8217;t read Dutch, but Google&#8217;s automated translation tells me that the review of Hamlet mentions a &#8220;long modern clown,&#8221; which sounds disturbing enough that I&#8217;ll leave the [...]]]></description>
			<content:encoded><![CDATA[<p>I was pleased to be able to assist with the <a href="http://webh01.ua.ac.be/theso/cti/index.html">Corpus Toneelkritiek Interbellum</a> project, which allows reading, browsing and searching of early 20th-century Dutch theater reviews.  I can&#8217;t read Dutch, but Google&#8217;s automated translation tells me that the review of <a href="http://webh01.ua.ac.be/theso/cti/1926-05-30_putman70.html">Hamlet</a> mentions a &#8220;long modern clown,&#8221; which sounds disturbing enough that I&#8217;ll leave the actual reading to someone else.
</p>
<div style="text-align:center;margin:auto;float:none">
<a href="http://webh01.ua.ac.be/theso/cti/index.html"><img style="float:none" src="http://blog.threepress.org/wp-content/uploads/2008/10/picture-6-300x253.png" alt="" title="picture-6" width="300" height="253"  align="right" /></a>
</div>
<p style="clear:both">
The source documents are encoded in <a href="http://www.tei-c.org/index.xml">TEI</a> XML and rendered to the browser using Python and <a href="http://codespeak.net/lxml/">lxml</a>, three of my favorite technologies.</p>
<p>
There are a few take-aways from this project that might benefit anyone working in a similar area and scale: </p>
<ul>
<li> Use a standard encoding format (in this case TEI, but choose an appropriate one based on the source content)</li>
<li> Use a modern programming language, even in a humanities context (e.g. Python)</li>
<li> Use modern XML parsing tools (e.g. lxml + XPath + XSLT)</li>
</ul>
<p>
The key advantage of libraries such as lxml in publishing and digitization projects is that it allows the developer to freely mix XML-native languages like XPath and XSLT with the expressive, procedural programming style of Python.  I&#8217;m still amazed by how many people are &#8220;parsing&#8221; XML using regular expressions (or worse), or using plain CGI/Perl scripts to serve up content. There are easier ways!</p>
<p> &#8220;Free&#8221; doesn&#8217;t have to mean primitive. In fact I would argue that projects like <a href="http://pinaxproject.com/">Pinax</a> can jump-start library or digital archive sites into the 21st century with less work than a grad student will spend crafting a bespoke Perl script.
</p>
<p> Congratulations to Thomas Crombez and his team!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.threepress.org/2008/10/14/corpus-toneelkritiek-interbellum/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>ALA 2008: Technical solutions to increasing the visibility of libraries</title>
		<link>http://blog.threepress.org/2008/07/08/ala-2008-technical-solutions-to-increasing-the-visibility-of-libraries/</link>
		<comments>http://blog.threepress.org/2008/07/08/ala-2008-technical-solutions-to-increasing-the-visibility-of-libraries/#comments</comments>
		<pubDate>Tue, 08 Jul 2008 15:14:55 +0000</pubDate>
		<dc:creator>Liza Daly</dc:creator>
				<category><![CDATA[libraries]]></category>
		<category><![CDATA[api]]></category>
		<category><![CDATA[books]]></category>
		<category><![CDATA[discovery]]></category>
		<category><![CDATA[library]]></category>
		<category><![CDATA[librarything]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[worldcat]]></category>

		<guid isPermaLink="false">http://blog.threepress.org/?p=16</guid>
		<description><![CDATA[I had a great time meeting people and attending talks at this year&#8217;s ALA conference in Anaheim.  Although I&#8217;ve so far focused on software development for publishers, there&#8217;s a lot of need for innovation in library software as well, and is something I&#8217;m interested in exploring. 
User-generated content
Tim Spalding from LibraryThing convincingly demonstrated that ordinary [...]]]></description>
			<content:encoded><![CDATA[<p>I had a great time meeting people and attending talks at this year&#8217;s ALA conference in Anaheim.  Although I&#8217;ve so far focused on software development for publishers, there&#8217;s a lot of need for innovation in library software as well, and is something I&#8217;m interested in exploring. <img class="alignright" style="float: right;" src="http://blog.threepress.org/wp-content/uploads/2008/07/an08-logo-sm.jpg" alt="" width="200" /></p>
<p><strong>User-generated content</strong></p>
<p>Tim Spalding from <a href="http://www.librarything.org/">LibraryThing</a> <a href="http://www.meanboyfriend.com/overdue_ideas/2008/06/ala-2008-the-future-of-cataloging-as-seen-from-librarything---tim-spalding.html">convincingly demonstrated</a> that ordinary readers can, in aggregate, contribute accurate metadata and even <a href="http://www.vla.org/blog/?p=223">scholarly initiatives</a>.  UGC initiatives don&#8217;t replace professional cataloging or research, but they can galvanize interest in a subject by using tools &#8220;where people are&#8221; on the net, whether it&#8217;s LibraryThing, Facebook or Amazon.</p>
<p>Once a resource&#8217;s content and metadata are available on the net, the library can broaden the scope of what its &#8220;local community&#8221; means. It may no longer service just people living in its immediate vicinity but anyone who has a interest in the library&#8217;s holdings, e.g. retirees who grew up in that area, or individuals with historical interest in the location.</p>
<p>This <a href="http://www.flickr.com/photos/library_of_congress/">Flickr photostream</a> from the Library of Congress allows anyone to add historical notes and corrections. Ironically, this project also validates the need for editorial control, as some popular photos are <a href="http://www.flickr.com/photos/library_of_congress/2179930812/in/set-72157603671370361/">overloaded with inane comments</a>. A sensible moderation policy admits potentially-useful information while deleting random valueless statements (&#8220;nice hat!&#8221;).</p>
<p>For better or worse, most archival library holdings will draw less attention, and thus UGC is likely to be of higher quality.  Without UGC many collections might languish unseen for decades because the resources don&#8217;t exist to professionally catalog them.</p>
<p><strong>Software services and discovery</strong></p>
<p>I&#8217;m reading about a book on the net, and decide I&#8217;m interested in it &#8212; but not to buy.  Perhaps it&#8217;s out of print, or extremely expensive, or I&#8217;m only mildly curious about the title.  I should be a maximum of one or two clicks away from finding out that it&#8217;s available via my local library and <em>ordering it</em>.</p>
<p>(I don&#8217;t especially care where the book is or how the library acquires it, although I do need to know an estimated time of arrival in case that&#8217;s important to my use case. One ALA speaker suggested the unorthodox practice of buying used books online and mailing them directly to patrons, simply because it can be cheaper than old-fashioned inter-library loan.)</p>
<p>Right now, my <a href="http://library.minlib.net/search/">local library catalog</a> accepts only <em>inbound</em> requests.  I have to go to the site and initiate a search for the title of interest (assuming I even know what I want). My library network (a consortia of many city libraries in a well-off, highly-educated region) isn&#8217;t part of <a href="http://www.worldcat.org/">WorldCat</a> and certainly doesn&#8217;t provide any advanced discovery tools of its own.</p>
<p>Libraries need to move in the true Web 2.0 direction of providing <em>outbound</em> services.  They should broadcast their catalogs using a simple <a href="http://tomayko.com/writings/rest-to-my-wife">REST-like</a> API.  It could be as simple as asking for <em>http://mylibrary.org/isbn/123456789</em> and getting a brief XML response back: <em>the book is available via loan and will take 3-5 days to arrive at the local branch.</em> (An authenticated POST request could then reserve it.) There are already good models for these services in the form of the Google Books and Amazon APIs and there is nothing technically infeasible about it.</p>
<p>The regional library of the future should not be just a physical building to store books but a public service for getting books into its community.</p>
<p>Inspired by the conference, I did come back and make my first online request to my local library. It wasn&#8217;t difficult, and this morning I got an email notice that the book is waiting for me at the regional branch a couple blocks away. But it could be even easier, and I&#8217;d love to help build out that infrastructure.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.threepress.org/2008/07/08/ala-2008-technical-solutions-to-increasing-the-visibility-of-libraries/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

