<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Using Nutch to download large binary media and image files</title>
	<atom:link href="http://www.spicylogic.com/allenday/blog/2008/08/29/using-nutch-to-download-large-binary-media-and-image-files/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.spicylogic.com/allenday/blog/2008/08/29/using-nutch-to-download-large-binary-media-and-image-files/</link>
	<description>♥data♥</description>
	<lastBuildDate>Thu, 10 Jun 2010 05:18:18 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Michael Hanney</title>
		<link>http://www.spicylogic.com/allenday/blog/2008/08/29/using-nutch-to-download-large-binary-media-and-image-files/comment-page-1/#comment-401</link>
		<dc:creator>Michael Hanney</dc:creator>
		<pubDate>Fri, 15 Jan 2010 06:40:39 +0000</pubDate>
		<guid isPermaLink="false">http://www.spicylogic.com/allenday/blog/?p=55#comment-401</guid>
		<description>Nice work Allen,

This helped me a lot. Thanks.

I followed your example and updated it to work with Nutch 1.0 and Hadoop 0.20. The changes are minor:

import org.apache.commons.cli.Options;
import org.apache.nutch.util.NutchConfiguration;


then

Options opts = new Options();
Configuration conf = NutchConfiguration.create();
GenericOptionsParser parser = new GenericOptionsParser(conf, opts, argv);
FileSystem fs = FileSystem.get(conf);

Mike.</description>
		<content:encoded><![CDATA[<p>Nice work Allen,</p>
<p>This helped me a lot. Thanks.</p>
<p>I followed your example and updated it to work with Nutch 1.0 and Hadoop 0.20. The changes are minor:</p>
<p>import org.apache.commons.cli.Options;<br />
import org.apache.nutch.util.NutchConfiguration;</p>
<p>then</p>
<p>Options opts = new Options();<br />
Configuration conf = NutchConfiguration.create();<br />
GenericOptionsParser parser = new GenericOptionsParser(conf, opts, argv);<br />
FileSystem fs = FileSystem.get(conf);</p>
<p>Mike.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Extracting Nutch crawl data &#124; mhanney</title>
		<link>http://www.spicylogic.com/allenday/blog/2008/08/29/using-nutch-to-download-large-binary-media-and-image-files/comment-page-1/#comment-400</link>
		<dc:creator>Extracting Nutch crawl data &#124; mhanney</dc:creator>
		<pubDate>Fri, 15 Jan 2010 06:32:22 +0000</pubDate>
		<guid isPermaLink="false">http://www.spicylogic.com/allenday/blog/?p=55#comment-400</guid>
		<description>[...] this example &#8211; Using Nutch to download large binary media and image files by Allen Day I made a basic Java console app to simply list the keys (I also wrote the data to stdout as a test, [...]</description>
		<content:encoded><![CDATA[<p>[...] this example &#8211; Using Nutch to download large binary media and image files by Allen Day I made a basic Java console app to simply list the keys (I also wrote the data to stdout as a test, [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>
