Recognizing Apache Hadoop’s growing presence in the field of data analysis, Dell will start selling servers preloaded with the open-source data processing platform, the company announced Thursday.
The package “is a direct response to feedback we’ve been hearing from our customers,” said Joseph George, director of cloud marketing for Dell. A significant portion of Dell’s enterprise customers are either considering Hadoop or already running it.
“Hadoop is becoming a de facto standard,” George said. “We’ve built a reference architecture on top of hardware that is attuned to this environment.”
Created by search specialist Doug Cutting, Apache Hadoop has been increasingly used by organizations to sift through large sets of unstructured data, such as server logs.
Dell’s Cloudera Solution for Hadoop uses a bundle of Hadoop software offered by Cloudera, including the Cloudera Distribution of Hadoop (CDH) and the Cloudera Enterprise suite of management tools. For managing the deployment of software, the package also includes a copy of Dell’s own Crowbar software.
CDH is a collection of commonly used Hadoop components, including Hadoop itself, Hive, Pig, HBase, Zookeeper, Whirr, Flume, Hue, Oozie and Sqoop. The servers can be outfitted with Red Hat Enterprise Linux (version 5.6 or 6), CentOS, Ubuntu or SUSE operating systems. Users can order the servers with the software fully installed, or they can use Crowbar to install the software themselves.
On the hardware side, the package can come with Dell PowerEdge C2100, C6100 or C6105 servers. The PowerEdge C-series servers are uniquely suited for Hadoop’s multiserver deployments because of their modest physical size and power usage, George said. The package also includes a set of PowerConnect 6248 48-port Gigabit Ethernet Layer 3 switches. A deployment based on the reference architecture could scale from six nodes to 720 nodes.
Dell will also offer training and technology support.