I have researched about Presentation Tips over past few weeks, one Tip that I read again and again is that “Make Fonts larger for readability”. To that end, I just changed the font size of the command line prompt so that when I am presenting, the audience is able to see what I am typing. So If you’ve have to present to audience something via command prompt, this should be of helps:
1. Open command prompt > Right click near the Title bar > select properties
2. switch to font tab > select the font and the size. you can also change the color, layout among other things here.
3. see how it looks:
In this post, we saw how to change the Font size of the windows command prompt.
your comments are very welcome!
Once in a while I go back to basics to revisit some of the fundamental technology concepts that I’ve learned over past few years. Today, I want to revisit Data Mining and Knowledge Discovery Process:
Here are the steps:
1) Raw Data
2) Data Pre processing (cleaning, sampling, transformation, integration etc)
3) Modeling (Building a Data Mining Model)
4) Testing the Model a.k.a assessing the Model
5) Knowledge Discovery
Here is the visualization:
In the world of Data Mining and Knowledge discovery, we’re looking for a specific type of intelligence from the data which is Patterns. This is important because patterns tend to repeat and so if we find patterns from our data, we can predict/forecast that such things can happen in future.
In this blog post, we saw the Knowledge Discovery and Data Mining process.
This is a Quick Post, Just want to share a command to upload local data to HDFS using Hadoop Command Line.
The command looks like:
> hadoop fs -copyFromLocal input.txt input/SqrtJob/input.txt
I just researched about Machine Generated Data from the context of “Big data”, Here’s the list I compiled:
– Data sent from Satellites
– Temperature sensing devices
– Flood Detection/Sensing devices
– web logs
– location data
– Data collected by Toll sensors (context: Road Toll)
– Phone call records
And a Futuristic one:
Imagine sensors on human bodies that continuously “monitor” health. How about if we use them to detect diabetes/cancer/other-diseases in their early phases. Possible? May be!
Machine can generate data “faster” than humans. This characteristics makes it interesting to think about to analyze machine generate data and in some cases, how to analyze them in real-time or near real-time
Search for Machine Generated Data, you’ll be able to find much more, it’s worth reading about from the context of Big Data.
There are two main steps:
1. Installing Nuget Package manager if you haven’t already.
2. Installing Microsoft .Net SDK for Hadoop
Installing Nuget Package manager
1) Open Visual Studio
2) Tools Menu > Extensions Manager > Search online gallery > Nuget
3) Downloaded and Installed Nuget:
4. Restarted Visual Studio
Installing Microsoft .NET SDK for Hadoop
1. Tools menu > Library Package Manager > Package Manager console
2. Installed Map/Reduce, Linq to Hive and WebHDFS component by running following commands in the package manager prompt:
install-package Microsoft.Hadoop.MapReduce -pre
In this post, we saw how to install Microsoft .NET SDK for Hadoop.
Continue learning: Programming MapReduce Jobs with HDInsight Server for Windows
HDFS and MapReduce inner workings in a nutshell.
Click on the image to view larger sized image