<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: MapReduce in 10 or so lines of Python</title>
	<atom:link href="http://www.chencer.com/dave/blog/2008/07/09/mapreduce-in-10-or-so-lines-of-python/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.chencer.com/dave/blog/2008/07/09/mapreduce-in-10-or-so-lines-of-python/</link>
	<description>David Spencer's personal weblog</description>
	<lastBuildDate>Wed, 16 Feb 2011 06:42:44 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.5</generator>
	<item>
		<title>By: dave</title>
		<link>http://www.chencer.com/dave/blog/2008/07/09/mapreduce-in-10-or-so-lines-of-python/comment-page-1/#comment-126494</link>
		<dc:creator>dave</dc:creator>
		<pubDate>Mon, 27 Apr 2009 05:39:25 +0000</pubDate>
		<guid isPermaLink="false">http://www.tropo.com/dave/blog/2008/07/09/mapreduce-in-10-or-so-lines-of-python/#comment-126494</guid>
		<description>Yes, nice, thank you, this is much more logical, concise, crisp, and in the spirit of things.</description>
		<content:encoded><![CDATA[<p>Yes, nice, thank you, this is much more logical, concise, crisp, and in the spirit of things.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Amit Patel</title>
		<link>http://www.chencer.com/dave/blog/2008/07/09/mapreduce-in-10-or-so-lines-of-python/comment-page-1/#comment-126493</link>
		<dc:creator>Amit Patel</dc:creator>
		<pubDate>Sun, 26 Apr 2009 17:12:38 +0000</pubDate>
		<guid isPermaLink="false">http://www.tropo.com/dave/blog/2008/07/09/mapreduce-in-10-or-so-lines-of-python/#comment-126493</guid>
		<description>I think of MapReduce grouping the results as the mapper produces them:

def MrSimpler(producer, mapper, reducer, consumer):
… sharded = {}
… for n, v in producer():
… … for key, value in mapper(n, v):
… … … sharded[key].setdefault([]).append(value)
… for key, values in sharded:
… … consumer(key, reducer(key, values))

I&#039;m not sure what the actual implementation does though.</description>
		<content:encoded><![CDATA[<p>I think of MapReduce grouping the results as the mapper produces them:</p>
<p>def MrSimpler(producer, mapper, reducer, consumer):<br />
… sharded = {}<br />
… for n, v in producer():<br />
… … for key, value in mapper(n, v):<br />
… … … sharded[key].setdefault([]).append(value)<br />
… for key, values in sharded:<br />
… … consumer(key, reducer(key, values))</p>
<p>I&#8217;m not sure what the actual implementation does though.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

