Categories
Data Analysis Resources

Novelty, Anomaly and Segmentation Discovery using Matrix Profile

In this notebook, novelty and anomaly and segmentation discovery using Matrix Profile. We are using Stumpy for time series data mining tasks. We’ll examine a data set containing daily opening values for the United Health Group from 2016 up to present day. UnitedHealth Group Incorporated is an American for-profit managed health care company based in Minnetonka, […]

Categories
Data Analysis Resources

Principles of Good Visualization

The following are the principles of good visualization by Andy Kirk: Good data visualization is trustworthy: Truthfulness and accuracy should be an obligation. Trustworthiness is about being transparent giving readers all the information they need in order to feel confident about what they are reading and what interpretations are legitimate. Good data visualization is accessible: […]

Categories
Data Analysis Resources Machine Learning

Classification of Alzheimer’s Disease Stages using Radiology Imaging and Longitudinal Clinical Data – Part 2

Literature Review of Alzheimer’s Disease Progression Introduction This section discusses the papers published between 2004 and 2019 regarding Alzheimer’s disease progression. It starts by reviewing the application of machine learning on radiology images followed by different tests that are used to measure the clinical stage of the progression. This post is a continuation from this […]

Categories
Data Analysis Resources Machine Learning

Classification of Alzheimer’s Disease Stages using Radiology Imaging and Longitudinal Clinical Data – Part 1

“Classification of Alzheimer’s Disease Stages using Radiology Imaging and Longitudinal Clinical Data” is the topic of my final year project as part of my MSc in Data Analytics. I am publishing the technical report in full. Please let me know if you have any questions. The report and code are available from this GitHub repository. […]

Categories
Business Data Analysis Resources Data Visualization

An Analysis of the Impact of EU Membership on the Economic Development of Ireland

Recent economic recession and Brexit has raised doubts about the EU membership among member nations. This paper analyzes the public data from World Bank to determine the impact to the economic development of Ireland since joining the EU on 1st January 1973. The paper shows that EU membership has a positive relationship with the development […]

Categories
Data Analysis Resources

A Comparison between Cassandra and MySQL

Introduction Cassandra is a distributed, no single point of failure, continuously available and scalable. NoSQL database that manages a large amount of data across many data centres and cloud servers. It offers both operation simplicity and capacity to scale linearly. While MySQL is the world’s most popular, cost-effective, high-performance relational database(Kumar, 2016). It comes with a […]

Categories
Business Data Analysis Resources Data Warehousing ETL

Effect on Facebook’s Stock Price after Cambridge Analytica Scandal

Recently, Facebook is in news almost daily for its inability to prevent Cambridge Analytica(CA) from gathering personal data from 87 million users. CA used the harvested data to profile users to predict voting patterns in 2016 US presidential elections. Some of the important dates for the event are: March 16, 2018: The news broke of […]

Categories
Data Analysis Resources Spark

Solutions to Support Real-Time Data Analytics

Organisations embracing big data use non-traditional strategies and technologies to gather, organize, process and gather insights from large datasets. These solutions do not support real-time analytics. Real-time analytics require technology to handle data that is generated at high velocity and send by the sources simultaneously in small sizes. Data is required to be processed sequentially […]