blogmarks.net Get Firefox!

happy

2 month ago

joshua : happy - Happy is a framework for writing map-reduce programs for Hadoop using Jython. It files off the sharp edges on Hadoop and makes writing map-reduce programs a breeze.

Rod Begbie : Hadoop + Python = Happy - Framework that combines Jython and Hadoop to make writing distributed mapreduce in Python easy. This might finally get me to dive into Hadoopyness. [via#

Tags : dist python hadoop jython programming

  copy

Cascading

8 month ago

joshua : Cascading - a large dataset build tool and a processing API for Hadoop.

Tags : data dist hadoop

  copy

Jeff on Cloud Computing: The First Hadoop Summit

8 month ago

joshua : Jeff on Cloud Computing: The First Hadoop Summit

Tags : dist hadoop

  copy

Hadoop + EC2 + S3 = NYT PDF

9 month ago

nelson : Hadoop + EC2 + S3 = NYT PDF - Nice breakdown of how to use cloud computing services to do a large scale rendering job

Tags : amazon ec2 hadoop news nytimes s3 systems

  copy

Hadoop success

10 month ago

nelson : Hadoop success - Yahoo talks about how they do web indexing. Google considers their equivalent system highly proprietary; neat that Hadoop is open source.

Tags : code dougcutting google hadoop opensource search systems yahoo

  copy

Yahoo's Doug Cutting on MapReduce and the Future of Hadoop

15 month ago

Jeremy Zawodny : Yahoo's Doug Cutting on MapReduce and the Future of Hadoop - Yahoo's Doug Cutting on MapReduce and the Future of Hadoop: "In this special InfoQ interview Cutting discusses how Hadoop is used at Yahoo, the challenges of its development, and the future direction of the project."

nelson : Hadoop interview - Doug Cutting is one of the smartest programmers I know

Tags : links cluster code cutting distributed grid hadoop lucene mapreduce opensource scalability via:zawodny yahoo

  copy

Hadoop

28 month ago

Simon Willison : Hadoop - Open-source Google File System / map-reduce equivalent. Apparently scales amazingly well.

Rod Begbie : Welcome to Hadoop! - Open-source project to allow the creation of massive massively-parallelized systems. I'm so glad my CompSci course taught me about parallel programming in 1997, because it's only going to become more important. [via#

Tags : apache hadoop opensource oss parallelprogramming softwareengineering

  copy
xml
Upian.