Categories
Data Visualization

Brexit – InfoGraphic

Categories
Data Analysis Resources

A Comparison between Cassandra and MySQL

Introduction: Cassandra is a distributed, continuously available and scalable NoSQL database with no single point of failure. It manages large amounts of data across many data centres and cloud servers, and offers both operational simplicity and the capacity to scale linearly. MySQL, by contrast, is the world’s most popular, cost-effective, high-performance relational database (Kumar, 2016). It comes with a […]

Categories
Business Data Analysis Resources Data Warehousing ETL

Effect on Facebook’s Stock Price after Cambridge Analytica Scandal

Recently, Facebook has been in the news almost daily for its inability to prevent Cambridge Analytica (CA) from gathering personal data from 87 million users. CA used the harvested data to profile users and predict voting patterns in the 2016 US presidential election. Some of the important dates for the event are: March 16, 2018: The news broke of […]

Categories
Data Analysis Resources Spark

Solutions to Support Real-Time Data Analytics

Organisations embracing big data use non-traditional strategies and technologies to gather, organise, process and derive insights from large datasets. These solutions, however, do not support real-time analytics. Real-time analytics requires technology that can handle data generated at high velocity and sent by the sources simultaneously in small sizes. Data is required to be processed sequentially […]

Categories
Data Analysis Resources Predictive Analysis

Text Analytics in the Healthcare Industry: Data Warehousing and Applications

Abstract: Text analytics is the method of extracting information from text. It involves structuring the text in order to evaluate it, discover patterns and interpret the output. It adds meaning to data and finds nuggets of information in both transaction-based and decision support systems by removing the barrier between structured and unstructured data. Analysis of text data helps […]

Categories
Data Analysis Resources Experience

Analysis of winning numbers of Irish Lotto

This blog analyses the winning numbers of the Irish Lotto from the last two years. The National Lottery introduced changes on Thursday, September 3, 2015, adding two numbers to the draw, so that players now choose from 47 numbers rather than 45. With this change, the odds of picking the six winning numbers went from just […]

Categories
Machine Learning Predictive Analysis

Guide for Linear Regression using Python – Part 2

This blog is the continuation of the guide for linear regression using Python from this post. Linear regression assumes that there is no correlation among the independent variables. Multicollinearity is the presence of correlation among the independent variables. If variables are correlated, it becomes extremely difficult for the model to determine the […]

Categories
Machine Learning Predictive Analysis

Guide for Linear Regression using Python – Part 1

Regression is the first algorithm we need to master if we aspire to become data scientists. It is one of the easiest algorithms to learn, yet it requires understanding and effort to master. This blog is a guide for linear regression using Python. It will focus on linear and multiple […]