Categories
Data Analysis Resources

Novelty, Anomaly and Segmentation Discovery using Matrix Profile

In this notebook, novelty and anomaly and segmentation discovery using Matrix Profile. We are using Stumpy for time series data mining tasks. We’ll examine a data set containing daily opening values for the United Health Group from 2016 up to present day. UnitedHealth Group Incorporated is an American for-profit managed health care company based in Minnetonka, […]

Categories
Data Analysis Resources Machine Learning

Classification of Alzheimer’s Disease Stages using Radiology Imaging and Longitudinal Clinical Data – Part 1

“Classification of Alzheimer’s Disease Stages using Radiology Imaging and Longitudinal Clinical Data” is the topic of my final year project as part of my MSc in Data Analytics. I am publishing the technical report in full. Please let me know if you have any questions. The report and code are available from this GitHub repository. […]

Categories
Machine Learning

Developing a Web Application for a Machine Learning Model

This post describes developing a web application for a machine learning model and deploying it so that it can be accessed by anyone. The web application is available at: https://arrear-model.herokuapp.com/ The process of deployment consists of transferring all flask application files from a local computer to the web server. Once completed the web application can […]

Categories
Machine Learning Predictive Analysis

Guide for Linear Regression using Python – Part 2

Guide for Linear Regression using Python – Part 2 This blog is the continuation of guide for linear regression using Python from this post. There must be no correlation among independent variables. Multicollinearity is the presence of correlation in independent variables. If variables are correlated, it becomes extremely difficult for the model to determine the […]

Categories
Machine Learning Predictive Analysis

Guide for Linear Regression using Python – Part 1

Regression is the first algorithm we need to master if we are aspiring to become a data scientist. It is one of the easiest algorithms to learn yet requires understanding and effort to get to the master it. In this blog is a guide for linear regression using Python. It will focus on linear and multiple […]

Categories
Machine Learning Predictive Analysis scikit-learn

Predicting NBA winners with Decision Trees and Random Forests in Scikit-learn

In this blog, we will be predicting NBA winners with Decision Trees and Random Forests in Scikit-learn.The National Basketball Association (NBA) is the major men’s professional basketball league in North America and is widely considered to be the premier men’s professional basketball league in the world. It has 30 teams (29 in the United States and […]

Categories
Business Data Analysis Resources

Jobs which are most susceptible to automation

Throughout history, the technological advances have raised fears that traditional jobs will become obsolete. In this post, I find out the jobs which are most susceptible to automation. Elon Musk told the National Governors Association: “There certainly will be job disruption. Because what’s going to happen is robots will be able to do everything better […]

Categories
Data Analysis Resources Machine Learning Predictive Analysis

10 groups of Machine Learning Algorithms

In this article, I grouped some of the popular machine learning algorithms either by learning or problem type. There is a brief description of how these algorithms work and their potential use case. Regression How it works: A regression uses the historical relationship between an independent and a dependent variable to predict the future values […]