3 Examples of User Personas – Clear and Practical Way

The following post provides 3 examples of user personas created using Google Analytics and Data Studio. Data Studio automates data collection and consolidation to generate dashboards and reports. What are user personas? User personas are profiles of our customers that help us understand their needs, behavior, and goals. These users share similar characteristics, […]

Principles of Good Visualization

The following are the principles of good visualization by Andy Kirk: Good data visualization is trustworthy: Truthfulness and accuracy should be an obligation. Trustworthiness is about being transparent, giving readers all the information they need to feel confident about what they are reading and what interpretations are legitimate. Good data visualization is accessible: […]

Highlight Text using Weights

Texts tend to have a hierarchical structure, and the importance of words and sentences is highly context-dependent. This post is a short tutorial on highlighting text using sample weights. The text is displayed in a Jupyter notebook. The weights can come from a model such as a logistic regression or an attention model. The different intensity of […]
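A minimal sketch of the idea, not the post's own code: each token is wrapped in an HTML span whose background opacity is proportional to its weight. The function name `highlight_html`, the red base colour, and the sample tokens and weights are all assumptions made for illustration.

```python
from html import escape


def highlight_html(tokens, weights, rgb=(255, 0, 0)):
    """Return an HTML string where each token's background opacity tracks its weight.

    Hypothetical helper for illustration; weights may come from any model.
    """
    if len(tokens) != len(weights):
        raise ValueError("tokens and weights must have the same length")
    # Scale by the largest absolute weight so opacities fall in [0, 1].
    max_w = max((abs(w) for w in weights), default=0.0) or 1.0
    spans = []
    for token, w in zip(tokens, weights):
        alpha = abs(w) / max_w
        spans.append(
            f'<span style="background-color: rgba({rgb[0]}, {rgb[1]}, {rgb[2]}, {alpha:.2f})">'
            f"{escape(token)}</span>"
        )
    return " ".join(spans)


# Made-up example weights, e.g. attention scores for a short sentence.
html = highlight_html(["the", "movie", "was", "great"], [0.1, 0.4, 0.1, 0.8])
```

In a Jupyter notebook, the resulting string could be rendered with `IPython.display.HTML(html)`. Scaling by the largest absolute weight is one design choice among several; a softmax or min-max scaling would work just as well.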

Developing a Web Application for a Machine Learning Model

This post describes developing a web application for a machine learning model and deploying it so that it can be accessed by anyone. The web application is available at: https://arrear-model.herokuapp.com/ The deployment process consists of transferring all Flask application files from a local computer to the web server. Once completed, the web application can […]

Economic Impact of the EU Membership on the UK

On 23rd June 2016, voters in the United Kingdom (UK) voted in the Brexit referendum to leave the European Union (EU) by a small majority of 51.9%. The decision to leave the EU, which the UK joined in 1973, has consequences that are both profound and catastrophic. Post-referendum analysis found that the issues behind Brexit can be categorized […]

An Analysis of the Impact of EU Membership on the Economic Development of Ireland

The recent economic recession and Brexit have raised doubts about EU membership among member nations. This paper analyzes public data from the World Bank to determine the impact on the economic development of Ireland since it joined the EU on 1st January 1973. The paper shows that EU membership has a positive relationship with the development […]

Analysis of Residential Property Prices in Dublin

Living in Dublin, Ireland, is amazingly expensive. Residential property prices in Dublin are rising. Yet we all think about buying a home while still wondering whether we might be better off continuing to rent. The data analyst in me wanted to dive deeper, to look back historically, to quantify, to visualize the trends, etc. to […]

What Makes a Really Good Diamond?

The aim of this blog is to assess the quality and characteristics of diamonds and gain insights into what makes a really good diamond. The data set is from ggplot2. The exploratory data analysis is done in Python and the notebooks are available on my GitHub. This blog addresses a few important questions such as: […]

Visualise Categorical Variables in Python

It is crucial to learn how to deal with categorical variables, as they are known to hide and mask lots of interesting information in a data set. A categorical variable identifies the group to which an observation belongs. You could categorise people according to their race or ethnicity, cities according to their geographic […]

Visualisation of House Prices

Visualization is the presentation of data in a pictorial or graphical format. It enables decision-makers to see analytics presented visually, so they can grasp difficult concepts or identify new patterns. This visualization of house prices is for the Kaggle dataset. With 79 explanatory variables describing (almost) every aspect of residential homes in Ames, Iowa, this […]