Recently, I played with Project Daytona which is a MapReduce on Windows Azure.
Playing with Iterative MapReduce on #Azure aka Project Daytona: goo.gl/zNQa3
— Paras Doshi (@paras_doshi) May 3, 2012
It seems like a great “Data Analytic’s as a service”. I tried the k-means and the word-count sample application that comes bundled with the project run-time download: http://parasdoshi.visibli.com/share/z14Ty2
The documentation along with the project guides you in a step by step fashion on how to go about setting up the environment but for those who are curious, here is a brief description on how I setup the environment:
1) Uploaded the sample data-sets to Azure Storage
2) Edited the configuration file (ServiceConfiguration.cscfg) to point to correct Azure Storage
3) Chose the Instance size and the no. of Instances for the deployment
4) Deployed the binaries to Windows Azure (.cspkg and .cscfg)
5) Ran the Word Count Sample
6) Ran the K-means Sample
Conclusion: It was pretty amazing to run MapReduce on Windows Azure. If you are into BigData, MapReduce, Data Analytic’s – then check out “Project Daytona”
That’s about for this post. And what do you think about Project Daytona – MapReduce on Windows Azure?
- How do you reduce the network “latency” between application and SQL Azure? (parasdoshi.com)
- Is AWS or Windows Azure the Right Choice? It’s Not That Easy. (seroter.wordpress.com)
- What’s in a Name? Change in Windows Azure Billing Portal Causes Confusion (infoq.com)