<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Threepress Consulting blog &#187; wiki</title>
	<atom:link href="http://blog.threepress.org/tag/wiki/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.threepress.org</link>
	<description>Threepress creates software for publishers, educators and authors.</description>
	<lastBuildDate>Fri, 03 Sep 2010 23:28:54 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Seven new books added</title>
		<link>http://blog.threepress.org/2008/05/12/seven-new-books-added/</link>
		<comments>http://blog.threepress.org/2008/05/12/seven-new-books-added/#comments</comments>
		<pubDate>Tue, 13 May 2008 01:06:11 +0000</pubDate>
		<dc:creator>Liza Daly</dc:creator>
				<category><![CDATA[content]]></category>
		<category><![CDATA[project gutenberg]]></category>
		<category><![CDATA[tei]]></category>
		<category><![CDATA[wiki]]></category>
		<category><![CDATA[wikipedia]]></category>
		<category><![CDATA[xml]]></category>
		<category><![CDATA[xslt]]></category>

		<guid isPermaLink="false">http://blog.threepress.org/?p=8</guid>
		<description><![CDATA[The last set of Gutenberg HTML books planned for demonstration on threepress has been added.  As usual, data loading took more time and uncovered more problems than expected, which is always a reason to add as many samples as possible.  This set includes one non-fiction book (On the Origin of Species) and one [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: left;">The last set of <a href="http://gutenberg.hwg.org/checkdoc1.html">Gutenberg HTML</a> books planned for demonstration on threepress has been added.  As usual, data loading took more time and uncovered more problems than expected, which is always a reason to add as many samples as possible.  This set includes one non-fiction book (<a href="http://www.threepress.org/document/On-the-Origin-of-Species-by-Means-of-Natural-Selection_Charles-Darwin/">On the Origin of Species</a>) and one with verse components (<a href="http://www.threepress.org/document/The-Jungle-Book_Rudyard-Kipling/">The Jungle Book</a>); both required significant updates to the XSLT that converts the Gutenberg DTD to TEI.</p>
<p style="text-align: left;">To expand the project in useful ways I&#8217;d like to be able to add:</p>
<ol>
<li>Content types other than novels, especially reference works</li>
<li>Content from other document formats, such as DocBook</li>
<li>Native, highly-tagged TEI documents</li>
</ol>
<p>Wikipedia and its cohorts are by far the largest source of freely licensed data on the web now, but they aren&#8217;t encoded in XML. Publishers are unlikely to use wiki formatting to mark up their content, so developing a workflow to convert from wiki to TEI doesn&#8217;t seem productive.</p>
<p>XML data welcome!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.threepress.org/2008/05/12/seven-new-books-added/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
