Why do I Need Hadoop
Too Much Data Hadoop provides storage for Big Data at reasonable cost Storing Big Data using traditional storage can be expensive. Hadoop is built around commodity hardware. Hence it can…
Read more »Big Data
What is Big Data Big Data is very large, loosely structured data set that defies traditional storage. “Big data is a term applied to data sets whose size is beyond…
Read more »HDInsight Installation on Windows Platform
First Install Windows 8 then after install the HDInsight. HDInsight installer is powered by Microsoft Web Platform Installer. To download it you can use the following link: http://www.microsoft.com/web/gallery/install.aspx?appid=HDINSIGHT-PREVIEW After installing…
Read more »What's in the HDP Sandbox and Installing the Sandbox
Use the HDP Sandbox to Develop Your Hadoop Admin and Development Skills Unless you have your own Hadoop Cluster to play with, I strongly recommend you get the HDP Sandbox…
Read more »Using the HDP Sandbox to Learn Sqoop
Once you have your HDP Sandbox up and running, you can use Sqoop to move data between your Hadoop cluster and your relational database. Your Hadoop Hive/HCatalog environment uses a…
Read more »Hadoop – It's All About The Data
A key point to understand about Hadoop is that it’s all about the data. Don’t lose focus. It’s easy to get hung up on Hive, Pig, HBase, HCatalog and lose…
Read more »Weaknesses in Traditional Data Platforms
Everyone understands that Hadoop brings high performance commercial computing to organizations using relatively low cost commodity storage. What is accelerating the move to Hadoop are weaknesses in traditional relational…
Read more »Hadoop Reference Architectures
Some initial key factors for success when building a Hadoop cluster is to build a solid foundation. This includes: Selecting the right hardware. In working with a hardware vendor…
Read more »How to Learn Hadoop
Big data is one the hottest area in IT, so everyone is wanting to learn Hadoop. I am constantly being asked how to…
Read more »