<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: Ruby Web Crawler</title>
	<atom:link href="http://blog.netphase.com/2007/04/19/ruby-web-crawler/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.netphase.com/2007/04/19/ruby-web-crawler/</link>
	<description>for a connected world</description>
	<pubDate>Thu, 20 Nov 2008 11:15:07 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6.3</generator>
		<item>
		<title>By: Carsten</title>
		<link>http://www.netphase.com/2007/04/19/ruby-web-crawler/#comment-53</link>
		<dc:creator>Carsten</dc:creator>
		<pubDate>Wed, 23 Jan 2008 11:04:38 +0000</pubDate>
		<guid isPermaLink="false">http://blog.netphase.com/2007/04/19/ruby-web-crawler/#comment-53</guid>
		<description>Great code,

Will have to try it out.

Thanks man!</description>
		<content:encoded><![CDATA[<p>Great code,</p>
<p>Will have to try it out.</p>
<p>Thanks man!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Shig</title>
		<link>http://www.netphase.com/2007/04/19/ruby-web-crawler/#comment-26</link>
		<dc:creator>Shig</dc:creator>
		<pubDate>Thu, 13 Sep 2007 15:16:38 +0000</pubDate>
		<guid isPermaLink="false">http://blog.netphase.com/2007/04/19/ruby-web-crawler/#comment-26</guid>
		<description>Sounds good. Thanks for the response.</description>
		<content:encoded><![CDATA[<p>Sounds good. Thanks for the response.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: scott</title>
		<link>http://www.netphase.com/2007/04/19/ruby-web-crawler/#comment-25</link>
		<dc:creator>scott</dc:creator>
		<pubDate>Thu, 13 Sep 2007 15:09:06 +0000</pubDate>
		<guid isPermaLink="false">http://blog.netphase.com/2007/04/19/ruby-web-crawler/#comment-25</guid>
		<description>While it would be easy to add this to a controller, that's probably not the best place for it.  One possibility would be to create an &lt;a href="http://www.railsmanual.org/class/ActiveRecord%3A%3AObserver" rel="nofollow"&gt;Observer&lt;/a&gt; to handle it; however, I would be more inclined to create a task (in lib/tasks) to handle the crawling and set up cron to run it periodically.  That would ensure a constant level of resource usage and make it easier to process the requests serially.</description>
		<content:encoded><![CDATA[<p>While it would be easy to add this to a controller, that&#8217;s probably not the best place for it.  One possibility would be to create an <a href="http://www.railsmanual.org/class/ActiveRecord%3A%3AObserver" rel="nofollow">Observer</a> to handle it; however, I would be more inclined to create a task (in lib/tasks) to handle the crawling and set up cron to run it periodically.  That would ensure a constant level of resource usage and make it easier to process the requests serially.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Shig</title>
		<link>http://www.netphase.com/2007/04/19/ruby-web-crawler/#comment-24</link>
		<dc:creator>Shig</dc:creator>
		<pubDate>Thu, 13 Sep 2007 03:46:58 +0000</pubDate>
		<guid isPermaLink="false">http://blog.netphase.com/2007/04/19/ruby-web-crawler/#comment-24</guid>
		<description>Scott - I'm currently in the process of learning Ruby and RoR. Is this something that would integrate easily into an RoR app? I'm wondering how this would fit into the MVC architecture.</description>
		<content:encoded><![CDATA[<p>Scott - I&#8217;m currently in the process of learning Ruby and RoR. Is this something that would integrate easily into an RoR app? I&#8217;m wondering how this would fit into the MVC architecture.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: scott</title>
		<link>http://www.netphase.com/2007/04/19/ruby-web-crawler/#comment-5</link>
		<dc:creator>scott</dc:creator>
		<pubDate>Sun, 27 May 2007 20:15:13 +0000</pubDate>
		<guid isPermaLink="false">http://blog.netphase.com/2007/04/19/ruby-web-crawler/#comment-5</guid>
		<description>You could store it wherever you like.  In this example, the contents of the page are in the page_text variable.  You can see an &lt;a href="http://rubyforge.org/snippet/detail.php?type=snippet&#038;id=155" rel="nofollow"&gt;updated version&lt;/a&gt; at RubyForge where I'm storing the results in a hash.</description>
		<content:encoded><![CDATA[<p>You could store it wherever you like.  In this example, the contents of the page are in the page_text variable.  You can see an <a href="http://rubyforge.org/snippet/detail.php?type=snippet&#038;id=155" rel="nofollow">updated version</a> at RubyForge where I&#8217;m storing the results in a hash.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: webug</title>
		<link>http://www.netphase.com/2007/04/19/ruby-web-crawler/#comment-4</link>
		<dc:creator>webug</dc:creator>
		<pubDate>Sun, 27 May 2007 03:02:18 +0000</pubDate>
		<guid isPermaLink="false">http://blog.netphase.com/2007/04/19/ruby-web-crawler/#comment-4</guid>
		<description>Dear sir:
I am a Ruby beginner. I wonder, in your program above, where do you store the downloaded HTML pages?
Thanks</description>
		<content:encoded><![CDATA[<p>Dear sir:<br />
I am a Ruby beginner. I wonder, in your program above, where do you store the downloaded HTML pages?<br />
Thanks</p>
]]></content:encoded>
	</item>
</channel>
</rss>
