The Hadoop on Azure’s Javascript console has basic graphing functions: Bar, Line & Chart. I think this is great becuase it gives an opportunity to visualize data that’s in HDFS directly from the Interactive Javascript Console! Here’s a screenshot:
In the console, I ran the help(“graph”) command to see how I can use this function: Draw a graph of data graph.bar(data, options) Bar graph graph.line(data, options) Line graph graph.pie(data, options) Pie chart
Parameters data (array) Array of data objects options (object) Options object, with x (string) Property to use for x-axis values y (string) Property to use for y-axis values title (string) Graph title orientation (number) x-axis label orientation in degrees tickInterval (number) x-axis tick interval
Conclusion:
In this blog-post, I posted that Hadoop on Azure’s Javascript Interactive Console has basic graphing functions.
1. Upload Twitter Text Data into Hadoop on Azure cluster
2. Create a Hive Table and load the data uploaded in step 1 to the Hive Table
3. Analyze data in Hive via Excel Add-in
Before we begin, I assume you have access to Hadoop on azure, Have your sample data (don’t have one? learn from a blog post), familiar with Hadoop ecosystem and know your way around the Hadoop on Azure Dashboard.
Now, Here are the steps involved:
STEP 1: Upload Twitter Text Data into Hadoop on Azure cluster
1. Have your data to be uploaded ready! I am just going to Copy Paste the File from my host machine to the RDP’ed machine. In this case, the machine that I am going is the Hadoop on Azure cluster.
For the purpose of this blog post, I have a text file having 1500 tweets:
2. Open web browser > Go to your cluster in Hadoop on Azure
3. RDP into your Hadoop on Azure cluster
4. Copy-Paste the File. It’s a small data file so this approach works for now.
Step 2: Create a Hive Table and load the data uploaded in step 1 to the Hive Table
1. Stay on the machine that you Remote Desktop (RDP’ed) into.
2. Open the Hadoop command line (you’ll see a icon on your Desktop)
3. switch to Hive:
4. Use the following Hive Commands:
DROP TABLE IF EXISTS TweetSampleTable;
CREATE TABLE TweetSampleTable ( id string, text string, favorited string, replyToSN string, created string, truncated string, replyToSID string, replyToUID string, statusSource string, screenName string );
LOAD DATA LOCAL INPATH ‘C:appsdistexamplesdatatweets.txt’ OVERWRITE INTO TABLE TweetSampleTable;
Note that for the purpose of this blog-post, I’ve chose string as data type for all fields. This is something that depends on the data that you have. If I were building a solution, I would spend some more time choosing the right data type.
Step 3. Analyze data in Hive via Excel Add-in
1. Switch to Hadoop on Azure Dashboard
2. Go to the Hive Console and run the show tables to verify that there is a tweetsampletable.
3. Now if you haven’t, Download and Install the Hive ODBC Driver from the Downloads section of your Hadoop on Azure Dashboard.
Join the Azure PASS VC’s session on “Getting Started with Windows Azure” on:
Date: 24th Sep (Monday)
Time: 11 AM Eastern Time; 8 AM Pacific; 8:30 PM India Time; You can download the event calendar from here
Speaker:Brian Prince, Principal Cloud Evangelist Microsoft
Session Abstract: Windows Azure is Microsoft’s cloud platform for quickly building and running scalable applications. We will cover just what the cloud is, as an industry, and what Microsoft is offering. We will see into the data-centers, how they work, and the a high level view of all the components of the platform.
Aim of “Getting started with SQL Azure” series is to offer you a set of brief articles that could act as a Launchpad for your to-be wonderful journey of exploring Microsoft’s cloud based database solution i.e. SQL Azure.
In this blog post, i have discussed:
1) How to sign up for a Free trial of windows azure (to play with SQL Azure)!
2) How to create your very first SQL Azure database (and a table too!)
3) How to connect to SQL Azure server Via SQL server Management studio.
And you want to know why i am writing a blog post for it? spare couple of minutes and you will realize that you were better off just knowing the short answer. yeah seriously. And still if you are adamant on reading it – please drop me an email on contact[at]parasdoshi[dot]com, I want to talk to you! seriously!
Have you ever wondered how to import data from Azure Data market to PowerPivot Excel? And you know what I did? – since I knew we could load data from datamarket into powerpivot, I did that! There is an inbuilt support btw:
Now, I copied this data (CTRL C) and tried pasting it in Excel sheet (CTRL V). And you know what – nothing happened! So tried again! And again nothing happened. Now, i again selected the data from powerpivot window via right clicked -> copy. Went to excel worksheet and right clicked -> paste special. And guess, my laptop froze for a while and in a weird way I was happy because I thought that the copy was successful! But again it did not work. If it had, well I would not have written this blog post.
Any-who, so it was time to read some whitepapers blog posts. some googling and binging. And you know what, while I was binging and googling stuff, I liked the bing wallpaper, so i had to change my wallpaper. So I did that! Look at it, don’t you like it too:
And after a little tweeting facebooking searching, i found EXCEL ADD IN!! yeah! you can download it here: https://datamarket.azure.com/addin
After installation, you will find under the DATA tab. you can sign in to datamarket directly from there. you can create a datamarket account if you do not have one. you allow access if you have not done so before. And then you can browse available data-sets! it’s that easy.
Then you could just select the data-set you want to import and click on “import data”:
And then click on “import data” that you see at the bottom of the below screenshot.
And that’s it – downloading started! optionally you could filter the data if you want.
That’s it. Moral of the story:
Download Excel addin to import data from azure datamarket to excel
I just watched the amazing episode number 52 of cloud cover show. it was on the newly released “windows azure toolkit for social games” and it was fun!
Anywho! I am a big fan of this show – it keeps me updated on the new and shiny things happening in the domain of the Microsoft’s cloud platform i.e. windows Azure. It’s fun and it’s one of the best way to stay updated –> check it out http://channel9.msdn.com/Shows/Cloud+Cover
Alternatively, Azure blobs can be used to store contents too. For eg. a video can be stored in blobs and since a URI is provided for a blob content, a website hosted somewhere else (say a website hosted in web role) can access the video through the URI.
Once upon a time, there was a geek. He loved playing with new technology. And one day he decided to play with Windows Azure, Microsoft’s cloud computing platform. He was happy as he was learning new things and clearly (as the picture shows) was very excited!
After few days, he got a credit card bill and he was surprised (Shocked rather) to see an entry that said 150$ and further inspection showed that it corresponds to Windows Azure subscription.
No wonder, he was mad! he cursed the service and was in no mood to investigate what had really happened. hurriedly, he posted defamatory comments on Forums (Plus Twitter Plus Facebook plus other forums plus wait….Circles).
After a while, he calmed down and opened the MOCP (Microsoft online customer portal) site to view details regarding his bill. he realized that he was charged money for a Azure web role.
Now, he went to the Azure Management portal and saw that he had already stopped the role. Now, he was confused that why was he still charged? later, after taking his case to support guys plus few helpful comments in Windows Azure Forums, he figured that he was charged since he had not stopped the role. To stop being charged, he needed to delete the deployed service.
he realized that even though, he had stopped the service, the service was still on Azure machines and the compute resources that were assigned to his service could not be assigned to some other subscriber and so he was being charged. So he learned a very important lesson that day:
*********STORY ENDS****************
Based on some (real life) questions asked regarding this issue on Windows Azure forums. I have also seen people first cursing the service to no end without understanding what really happened. Later, when they get the refund (in legitimate cases) they apologize. In the process, they manage to harm their reputation and embarrass Microsoft in the process. I just wish they knew that all they had to do was delete the service when no longer required. That’s it !