<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Threepress Consulting blog &#187; content</title>
	<atom:link href="http://blog.threepress.org/category/content/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.threepress.org</link>
	<description>Threepress creates software for publishers, educators and authors.</description>
	<lastBuildDate>Tue, 27 Jul 2010 16:34:57 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>HTML5 for publishers</title>
		<link>http://blog.threepress.org/2010/04/12/html5-for-publishers/</link>
		<comments>http://blog.threepress.org/2010/04/12/html5-for-publishers/#comments</comments>
		<pubDate>Mon, 12 Apr 2010 14:58:15 +0000</pubDate>
		<dc:creator>Keith Fahlgren</dc:creator>
				<category><![CDATA[content]]></category>
		<category><![CDATA[css]]></category>
		<category><![CDATA[ebooks]]></category>
		<category><![CDATA[html5]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://blog.threepress.org/?p=1402</guid>
		<description><![CDATA[The launch of the iPad and iBooks and the tremendous stream of one-off demos reimagining publishing have made it extremely difficult to understand what technologies for enriched content are available to publishers today. In particular, I’ve seen a lot of confusion about what HTML5 might actually mean and what specific opportunities it might bring for [...]]]></description>
			<content:encoded><![CDATA[<p>The <a href="blog.threepress.org/2010/04/05/ibooks-and-epub/">launch of the iPad and iBooks</a> and the tremendous stream of <a href="http://www.wired.com/epicenter/2010/02/the-wired-ipad-app-a-video-demonstration/">one-off demos</a> <q><a href="toc.oreilly.com/2010/04/why-ipad-adaptation-is-an-uphill-battle-for-incumbent-publishers.html">reimagining</a></q> publishing have made it extremely difficult to understand what technologies for enriched content are available to publishers today. In particular, I’ve seen a lot of confusion about what <q>HTML5</q> might actually mean and what specific opportunities it might bring for digital reading.</p>
<p>To try to shed some light on some complicated work, I’ve pieced together a very incomplete introduction to some of the new things that HTML5 and related specifications <em>might</em> bring (remember: they aren&#8217;t finished or implemented everywhere).</p>
<p><span id="more-1402"></span></p>
<p>Seven concrete areas of opportunity:</p>
<dl>
<dt><a href="http://www.w3.org/TR/html5/semantics.html#semantics">New semantics</a></dt>
<dd>
<p>HTML5 adds <a href="http://diveintohtml5.org/semantics.html#new-elements<br />
">a bunch of new elements</a> that might help publishers represent their content more clearly.</p>
<ul>
<li class="upside"><code>section</code>, <code>article</code>, <code>header</code>, and <code>aside</code> should be very useful wrappers for real-world content.</li>
<li class="downside">Older web browsers will safely ignore unknown tags, but some ereaders that aren&#8217;t based on browsers may exhibit unexpected behavior. Also, typically, semantic tagging alone won&#8217;t drive a lot of sales.</li>
</ul>
</dd>
<dt><a href="http://www.w3.org/TR/html5/the-canvas-element.html#the-canvas-element">Drawing</a></dt>
<dd>
<p>HTML5 adds a <code>canvas</code> element, which allows for (<a href="http://www.benjoffe.com/code/games/torus/">really</a> <a href="http://n96.org/#lat=41.52&#038;lon=-100.11&#038;dist=2500">snazzy</a>) complex drawing and animation. Mark Pilgrim’s chapter in <a href="http://diveintohtml5.org/canvas.html">Dive into HTML5</a> offers not only the hairy details but also an example of how embedded HTML5 can change instructional materials (view it in Safari).</p>
<ul>
<li class="upside">This might replace some of what we think about when we say &#8216;Flash&#8217; casually. It might allow for more interactive elements to be included in content.</li>
<li class="downside">There are no easy-to-use tools that generate canvases — they must be coded by hand (Although it sounds like some basic tools <a href="http://www.9to5mac.com/Flash-html5-canvas-35409730">are close</a>).</li>
</ul>
</dd>
<dt><a href="http://www.w3.org/TR/html5/video.html#video">Video</a></dt>
<dd>
<p>HTML5 adds a <code>video</code> element, which offers the first standards-based way to embed video in a web page. Similarly, an <code>audio</code> element has been added.</p>
<ul>
<li class="upside">Seamlessly embedable video may offer new ways of assembling and delivering multimedia content.</li>
<li class="downside">Licensing for the actual encodings of the video content (the way that they&#8217;re compressed and stored) is <a href="http://www.streamingmedia.com/Articles/ReadArticle.aspx?ArticleID=65403">unbelievably messed up and getting worse</a>.</li>
</ul>
</dd>
<dt><a href="http://www.w3.org/Style/CSS/current-work#CSS3">CSS3</a></dt>
<dd>
<p>A related set of updates to the Cascading Stylesheets specifications, CSS3, is often discussed alongside HTML5.</p>
<ul>
<li class="upside">CSS3 may bring a range of delightful updates for content creators, from better support for font-faces on the web to animations and transitions that may (along with canvas) allow alternatives to Flash for <a href="http://girliemac.com/sandbox/matrix.html">flashy</a> <a href="http://devfiles.myopera.com/articles/1041/image-gallery.html">stuff</a>.</li>
<li class="downside">Licensing solutions for fonts on the web are still in-progress. Authoring tools non-existent; support not complete across browsers.</li>
</ul>
</dd>
<dt><a href="http://www.w3.org/TR/geolocation-API/">Geolocation</a></dt>
<dd>
<p>Updates to a JavaScript API alongside HTML5 allow for users to reveal their physical location.</p>
<ul>
<li class="upside">There are probably a lot of opportunities to connect reading, readers, booksellers, and where people actually are.</li>
<li class="downside"><a href="http://diveintohtml5.org/geolocation.html#w3c">Very little support</a> on desktops or more basic phones.
</li>
</ul>
</dd>
<dt><a href="http://www.w3.org/TR/offline-webapps/">Offline Applications</a></dt>
<dd>
<p>A <a href="http://www.w3.org/TR/offline-webapps/">set</a> of <a href="http://www.w3.org/TR/webdatabase/">specifications</a> related to HTML5 make it possible to create web applications that run without an active internet connection.</p>
<ul>
<li class="upside">Offline web applications can offer many of the features that standalone iPhone and Android Apps have and do not have to go through any App Store. They may also be more portable, as they don&#8217;t<br />
have to have a separate set of software for each platform. [<em>Explicitly</em>: This is the part of <q>HTML5</q> that allows <a href="http://ibisreader.com">Ibis Reader</a>  to behave just like a <q>normal</q> App on iPhones, iPads, and Android phones.]</li>
<li class="downside">Offline web applications are typically slower than standalone apps, although this will become less important as faster devices like the iPad become more common. Support on platforms is not uniform. Firefox doesn&#8217;t seem interested in supporting the current database specification.</li>
</ul>
</dd>
<dt><a href="http://www.w3.org/TR/microdata/">Microdata</a></dt>
<dd>
<p>Another related specification that provides way of adding machine-readable annotations to content.</p>
<ul>
<li class="upside">One use of microdata might be to embed content licensing and other details inside pages.</li>
<li class="downside">Yawn. Some of these features have been around in microformats for years and haven&#8217;t really taken off, although this could be very important for specialized content with regularly-structured content (cookbooks would be an easy example).</li>
</ul>
</dd>
</dl>
<p>Sadly, HTML5 support (on any of the above) in Internet Explorer (even the unreleased IE9) is woefully incomplete and IE is still the most ubiquitous browser. Look at the number of red Xs in the right-hand columns of this support chart: <a href="http://www.findmebyip.com/litmus/">http://www.findmebyip.com/litmus/</a></p>
<p>Does the above inspire you to try out some of these new opportunities now that you can pierce some of the HTML5 hype? Please <a href="http://threepress.org/contact/">let us know</a>, as we’re actively seeking publishers and authors interested in innovating and experimenting with actual content to develop short- or long-form examples that take avantage of these new possibilities for enthralling, educating, and entertaining readers.</p>
<hr />
<p>Want to explore in more detail? Both <a href="http://html5doctor.com">http://html5doctor.com</a> &#038; <a href="http://diveintohtml5.org">http://diveintohtml5.org</a> have great details on the state of the above technologies. </p>
]]></content:encoded>
			<wfw:commentRss>http://blog.threepress.org/2010/04/12/html5-for-publishers/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>The real Internet Archive</title>
		<link>http://blog.threepress.org/2008/10/22/the-real-internet-archive/</link>
		<comments>http://blog.threepress.org/2008/10/22/the-real-internet-archive/#comments</comments>
		<pubDate>Thu, 23 Oct 2008 01:44:21 +0000</pubDate>
		<dc:creator>Liza Daly</dc:creator>
				<category><![CDATA[content]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[libraries]]></category>
		<category><![CDATA[archiving]]></category>
		<category><![CDATA[ftp]]></category>
		<category><![CDATA[rfc]]></category>
		<category><![CDATA[tei]]></category>
		<category><![CDATA[telnet]]></category>

		<guid isPermaLink="false">http://blog.threepress.org/?p=135</guid>
		<description><![CDATA[My attention was caught by this quote from Clay Shirky on the excellent ReadWriteWeb blog:
Back in 1974, when the Internet was a fraction of what it is now, the acorn to an oak, there were really only two applications,&#8221; said Shirky, &#8220;Telnet, and FTP.&#8221;
Surely he&#8217;s wrong, I thought.  Those protocols aren&#8217;t that old.
But I was [...]]]></description>
			<content:encoded><![CDATA[<p>My attention was caught by<a href="http://www.readwriteweb.com/archives/health_20_economics_of_aggregation.php"> this quote from Clay Shirky</a> on the excellent <a href="http://www.readwriteweb.com/">ReadWriteWeb blog:</a></p>
<blockquote><p>Back in 1974, when the Internet was a fraction of what it is now, the acorn to an oak, there were really only two applications,&#8221; said Shirky, &#8220;Telnet, and FTP.&#8221;</p></blockquote>
<p><em>Surely he&#8217;s wrong</em>, I thought.  <em>Those protocols aren&#8217;t that old.</em></p>
<p>But I was wrong. FTP was invented in 1971, and telnet was developed in 1969.</p>
<p>(Telnet is a way to connect interactively with another computer. In practice it&#8217;s been replaced by the more secure <a href="http://en.wikipedia.org/wiki/Ssh">ssh</a>, but vestigial copies remain on all modern computers.)</p>
<p>What really threw me wasn&#8217;t that telnet was from 1969 as much as that it was <a href="http://tools.ietf.org/html/rfc15">RFC 15</a>.  In the networked world, Requests for Comments are documents which define the standards that computers use when communicating with each other.  To understand how old RFC 15 is, consider that the venerable FTP is <a href="http://www.rfc-editor.org/rfc/rfc114.txt">RFC 114</a>,  while email as we know it is <a href="http://www.faqs.org/rfcs/rfc821.html">RFC 821</a> (1982), and HTTP is <a href="http://www.faqs.org/rfcs/rfc1945.html">RFC 1945</a> (1996, although obviously it had been in use for years). The most recent RFC is <a href="http://tools.ietf.org/html/rfc5382">5382</a>. RFC 15 is <em><span style="text-decoration: underline;">ancient history</span></em>.</p>
<p>Because I am a nerd I spent some time browsing the early RFCs, and I was struck by how charmingly antique they are. <a href="http://tools.ietf.org/html/rfc16">RFC 16</a> says that M.I.T. should receive copies of RFCs. <a href="http://tools.ietf.org/html/rfc6">RFC 6</a> begins, &#8220;I talked with Bob Kahn at BB&amp;N yesterday.&#8221; RFC 14 <a href="http://tools.ietf.org/html/rfc14">never existed</a>.</p>
<p>RFC 7 (&#8220;Host-IMP Interface&#8221;) includes a prefatory note:</p>
<blockquote><p>The original of <a href="http://tools.ietf.org/html/rfc7">RFC 7</a> was hand-written, and only partially illegible [sic]<br />
copies exist.</p></blockquote>
<p>Indeed, the actual RFC begins:</p>
<blockquote><p>This paper is concerned with the preliminary software design of the<br />
Host IMP interface.  Its main purpose is on the one hand to define<br />
functions that will be implemented, and on the other hand to provide<br />
a base for discussions and &#8230;(unreadable).</p></blockquote>
<p>I&#8217;m on the mailing list for users of the <a href="http://www.tei-c.org/index.xml">Text Encoding Initiative</a> (TEI), an XML schema used primarily for encoding historical texts. The schema is equipped with tags for tracking everything about a document, including changes that occur over centuries of time. On the TEI list, people ask questions like, &#8220;How do I represent a medieval manuscript and also indicate which passages were underlined by an 18th century owner?&#8221; or &#8220;What tag should I use for a poem title that was handwritten vertically in the left margin?&#8221; (Promptly followed by vigorous scholarly debates over the &#8220;correct&#8221; answers.)</p>
<p>There&#8217;s something charming about how early internet history, just 40 years old, is almost as poorly documented and in need of careful archivists.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.threepress.org/2008/10/22/the-real-internet-archive/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Seven new books added</title>
		<link>http://blog.threepress.org/2008/05/12/seven-new-books-added/</link>
		<comments>http://blog.threepress.org/2008/05/12/seven-new-books-added/#comments</comments>
		<pubDate>Tue, 13 May 2008 01:06:11 +0000</pubDate>
		<dc:creator>Liza Daly</dc:creator>
				<category><![CDATA[content]]></category>
		<category><![CDATA[project gutenberg]]></category>
		<category><![CDATA[tei]]></category>
		<category><![CDATA[wiki]]></category>
		<category><![CDATA[wikipedia]]></category>
		<category><![CDATA[xml]]></category>
		<category><![CDATA[xslt]]></category>

		<guid isPermaLink="false">http://blog.threepress.org/?p=8</guid>
		<description><![CDATA[The last set of Gutenberg HTML books that were planned for demonstration on threepress have been added.  As usual, data-loading took more time and uncovered up more problems than expected, which is always a reason to add as many samples as possible.  This set includes one non-fiction book (On the Origin of Species) and one [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: left;">The last set of <a href="http://gutenberg.hwg.org/checkdoc1.html">Gutenberg HTML</a> books that were planned for demonstration on threepress have been added.  As usual, data-loading took more time and uncovered up more problems than expected, which is always a reason to add as many samples as possible.  This set includes one non-fiction book (<a href="http://www.threepress.org/document/On-the-Origin-of-Species-by-Means-of-Natural-Selection_Charles-Darwin/">On the Origin of Species</a>) and one with verse components (<a href="http://www.threepress.org/document/The-Jungle-Book_Rudyard-Kipling/">The Jungle Book</a>); both required significant updates to the XSLT that converts the Gutenberg DTD to TEI.</p>
<p style="text-align: left;">To expand the project in useful ways I&#8217;d like to be able to add:</p>
<ol>
<li>Other content types besides novels, especially reference</li>
<li>Content from other document formats, such as DocBook</li>
<li>Native, highly-tagged TEI documents</li>
</ol>
<p>Wikipedia and its cohorts are by far the largest source of public domain data on the web now, but they aren&#8217;t encoded in XML. Publishers are unlikely to use wiki formatting to mark up their content and thus developing a workflow to convert from wiki to TEI doesn&#8217;t seem productive.</p>
<p>XML data welcome!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.threepress.org/2008/05/12/seven-new-books-added/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>New books added: A Tale of Two Cities and The Cask of Amontillado</title>
		<link>http://blog.threepress.org/2008/05/05/new-books-added-a-tale-of-two-cities-and-the-cask-of-amontillado/</link>
		<comments>http://blog.threepress.org/2008/05/05/new-books-added-a-tale-of-two-cities-and-the-cask-of-amontillado/#comments</comments>
		<pubDate>Tue, 06 May 2008 02:11:12 +0000</pubDate>
		<dc:creator>Liza Daly</dc:creator>
				<category><![CDATA[content]]></category>

		<guid isPermaLink="false">http://blog.threepress.org/?p=5</guid>
		<description><![CDATA[Two books that should&#8217;ve been in the initial release were added today: A Tale of Two Cities by Charles Dickens and The Cask of Amontillado by Edgar Allen Poe.
Tale was challenging because of the way the &#8220;books&#8221; were organized (they&#8217;re called parts in threepress).  This book exposed a bug in the way I was handling [...]]]></description>
			<content:encoded><![CDATA[<p>Two books that should&#8217;ve been in the initial release were added today: <a href="http://www.threepress.org/document/A-Tale-of-Two-Cities_Charles-Dickens/">A Tale of Two Cities</a> by Charles Dickens and <a href="http://www.threepress.org/document/The-Cask-of-Amontillado_Edgar-Allan-Poe/">The Cask of Amontillado</a> by Edgar Allen Poe.</p>
<p><em>Tale </em>was challenging because of the way the &#8220;books&#8221; were organized (they&#8217;re called <em>parts</em> in threepress).  This book exposed a bug in the way I was handling chapter ordering, which I&#8217;ve fixed.</p>
<p><em>Cask</em> is my only content with no chapters, as it&#8217;s a short story.  I could make that more transparent to the user than the current implementation (right now content is assigned to a pseudo-chapter called &#8220;Complete story&#8221;), but whether I do that will depend on which is the outlying case: books or single-chaptered works.  Right now it&#8217;s mostly books, so that feels like the natural way to organize the site.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.threepress.org/2008/05/05/new-books-added-a-tale-of-two-cities-and-the-cask-of-amontillado/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
