This post is the first in a series of blog posts that will explore and
exploit the Big Data and analytics tools. I will walk through easy steps to
start working with such tools like Apache Hadoop, Pig, Mahout and solve some
problems related to analytics and learning in the large scale by exploiting
such tools, and shed the light on some of the challenges we face while
working with these tools.
1. Apache Hadoop
Hadoop is a framework that simplifies the processing of data sets distributed
across clusters of servers. Two of the main components of Hadoop are HDFS and
MapReduce.HDFS is the file system that is used by Hadoop to store all the
data. This file system spans across all the nodes that are being used by
Hadoop. These nodes could be on a single server or they can be spread across
a large number of servers.In this section, we will go through the instru... (more)
Mark Shuttleworth, the founder and leader of Canonical Ltd., the authors and
distributors of the Ubuntu distro of Linux, announced the code name of the
new version Friday and noted that the Ubuntu team had several priorities,
including the design of the system’s “look”, according to a CNET
article. The most interesting of the priorities, perhaps, was the inclusion
of cloud computing features as a basic part of the package, especially on the
server side of the release.
The statement released by Shuttleworth was very detailed, although somewhat
metaphorical, in the area of cloud c... (more)
Cloudonomics Journal on Ulitzer
Though am not that fond of certificates in general and believe in practical
hands-on to the extremes, it looks that within two years I should be aware of
the training kits and certificates exist for cloud computing and
virtualization technology :) ...
3Tera (Cloud Computing Platform and services company) announced today their
cloud computing certifications and education offerings ...
Mark Ortenzi, president of 3Tera partner CariNet, mentioned that
"If you don't do it, in two years you won't have a business,"
According to these announcements there will ... (more)
Edward Screven, Oracle's chief corporate architect, mentioned that Oracle
would not be offering Sun's long-planned and highly-vaunted compute resource
They will just play the role of supplier to components and pieces that can be
used in cloud services;
From the beginning Larry Ellison hates Cloud, and it looks that this will be
applied for Sun as well after the complete acquisition done recently ...
Steve Herrod (VMware CTO) announced today that VMware is intending to acquire
SpringSource, a 5 year-old company rapidly becoming a leader in enterprise
and web application development and management.
Seems interesting but we will wait till we see how they will complement each
other, cloud/virtualization core vendor and enterprise level Java platform.