May 5: Hadoop and Hive by Scott Leberknight

What: Hadoop and Hive by Scott Leberknight

Presentation Description: Hadoop is an open source framework maintained by the Apache Software Foundation for creating fault-tolerant, distributed applications that process vast amounts of data in parallel across a cluster of commodity servers. Hadoop consists of two primary components: the Hadoop Distributed Filesystem (HDFS) and a MapReduce framework. HDFS is a distributed filesystem which efficiently stores very large files across a cluster in a fault-tolerant manner. MapReduce is a framework for dividing data processing into two distinct phases, mapping and reducing, in order to deconstruct a problem so it can be run in parallel across many machines in order to speed data transformation and aggregation. In this talk we’ll look at both HDFS and the MapReduce framework. We’ll also look at one specific Hadoop subproject, Hive, which provides a data warehousing capability on top of Hadoop and allows developers and analysts to query their data stored in HDFS using SQL queries.

Speaker: Scott Leberknight is the Chief Architect at Near Infinity Corporation.

Food: Pizza and Liquid Refreshments will be provided by Near Infinity Corporation.

Registration: Click HERE!

When: Thurs May 5, 6:30 PM – 9:00 PM, presentation starts at 7 PM

Location:
Near Infinity Corporation
1881 Campus Commons Drive
Suite 203 (the training center)
Reston, VA 20191
Call 703 727-1307 (Gray’s cell phone) if you have trouble getting in.

Parking: There should be plenty of parking in the garage behind or out in front of the building.

Advertisements

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

%d bloggers like this: