Big Data: From Here To There

Knowledge of Big Data Helps Frame Smart Business

Vaughn Bullard at Versailles

Every day, large volumes of structured, relational data are mined with the help of data warehousing technology to support marketing and financial decision making.

The recent explosion of social media and the computerization of every aspect of life have resulted in the creation of mountains of unstructured data (big data): web logs, videos, speech, photographs, e-mails, tweets, and more. Over the same period, computers have become more powerful and storage has become cheaper.

This course brings together several key technologies used in manipulating, storing, and analyzing big data. We look at the details of Hadoop and of Pig and Hive, related tools that provide SQL-like access to unstructured data.

We will also analyze so-called NoSQL storage solutions such as HBase, Cassandra, and Oracle NoSQL for their critical features: speed of reads and writes, data consistency, and the ability to scale to extreme volumes. In addition, we will examine memory-resident databases and streaming technologies that allow analysis of data in flight.

A large section of the course is devoted to methods of statistical analysis, where we will work with open-source frameworks such as Mahout and Open R, along with other statistical tools. We will also look at the public cloud as a resource for big data analytics.

At the end of this course, students should be able to design highly scalable systems that accept, process, store, and analyze large volumes of unstructured data in (near) real time.

Prerequisites: knowledge of PHP and XML.

NOTE: If you would like more course information, including dates and cost, or have additional questions, fill out our Contact Us form and our staff will get in touch with you directly.