-
Find me a Friend – Social Networks and Graph
Introduction In the last post I discussed how certain recommendation algorithms work – with the examples being various ways in which you could code a movie recommendation. I thought what would be interesting is to take another approach to this kind of problem and demonstrate an approach for how a social network identifies people who…
-
What should I watch tonight – Movie Recommendations
Introduction Its cold, very cold. There little better to do in early January than sit at home and watch TV, except no one really watches TV live now (sport aside), we all use streaming services such as Netflix or Amazon Prime. Last night I found myself wondering – how does the recommendation system in this…
-
Machine Learning – Predicting Fatalities on the Titanic
Introduction This is a sequel to a post I wrote on the Data Analysis of deaths on the titanic and any correlations that can be drawn from them. I recommend you read that post before you read this one. The aim of this post is to use the same dataset to build a model that…
-
Iceberg Right Ahead – Analysing Titanic Survival Chances
Introduction The Sinking of the titanic is one of the most famous events of the last century. The “unsinkable” ship sinking on its maiden voyage was a truly humbling event and the introduction of radio / telegram at the time allowed the story to be relayed around the world. Datasets detailing the list of passengers…
-
Unstructured Data to Structured – JSON Normalisation
Introduction I had a job interview today and one of the topics that was raised was importing JSON based data (which can be considered as unstructured) into a table format (structured). I offered a few ways of how this could be done and I think answered the question appropriately, but the interviewer said a number…
-
Benford’s Law
Introduction Let’s speculate on a distribution involving the population of cities. We will look at only the first digit of the population size. So for example greater London has 9 million people living in it. the first digit is therefore 9. If we took the first digit of each capital city, how do we think…
-
What If Analysis – Power BI
A couple of posts ago I showed the creation of a data model in Power BI, from start to finished and produces reporting showing expenditure vs budgets for personal bank accounts. This was quite interesting and showed how once the model is created within power BI basic reporting can be produced very fast. But what…
-
Modelling Personal Finances in Power BI
Introduction Microsoft Power BI is a platform Microsoft provided that allows for self service BI work. It is a mistake to think this software simply generates reporting, the central aspect of Power BI is the ability to create a dimensional model representing various entities of a business process. It is a software I have some…
-
Is the Lottery Fair?
We all dream of winning the lottery and the life changing opportunities this would give us, the houses, cars and holidays. But how likely are you of winning the lottery if every number drawn is random? And how do we know it is a random chance? are there potentially exploits where some numbers are more…
-
Data Analysis – Airbnb in London
Let’s for a minute imagine a wonderful parallel universe – where a decision I need to make is where should I buy a property in London to rent out on Airbnb (for those that don’t know me this is about as far fetched as it gets). How could we do this? We would need some…