A Comparison between Cassandra and MySQL

Introduction Cassandra is a distributed, no single point of failure, continuously available and scalable. NoSQL database that manages a large amount of data across many data centres and cloud servers. It…

Continue Reading A Comparison between Cassandra and MySQL

Web Server Log Analysis with Spark

Web Server Log Analysis with Spark This lab will demonstrate how easy it is to perform web server log analysis with Apache Spark. Server log analysis is an ideal use…

Continue Reading Web Server Log Analysis with Spark

Spark Tutorial: Learning Apache Spark

Spark Tutorial: Learning Apache Spark includes my solution for the EdX course. This tutorial will teach you how to use Apache Spark, a framework for large-scale data processing, within a…

Continue Reading Spark Tutorial: Learning Apache Spark