Using the Levenshtein distance as a name pairing tool

We commonly find different data sources that share a character variable as a key (for example, city names). Frequently these character columns do not match because of misspellings or typos. In this post I am going to use the Levenshtein distance as a tool for pairing strings from two different data sources.
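
As a rough illustration of the idea (not the code from the post itself), the sketch below computes the Levenshtein distance with the standard dynamic-programming recurrence and then pairs each name from one source with its closest name in the other. It is written in Python purely for illustration; the function names `levenshtein` and `pair_names` and the sample city names are hypothetical.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    or substitutions needed to turn string a into string b."""
    # Keep the shorter string in the inner loop to use less memory.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] holds the distance between the first i-1 chars of a
    # and the first j chars of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution (or match)
            ))
        previous = current
    return previous[-1]


def pair_names(left, right):
    """For each name in left, return (name, closest name in right, distance).
    Comparison is case-insensitive (an assumption of this sketch)."""
    pairs = []
    for name in left:
        best = min(right, key=lambda cand: levenshtein(name.lower(), cand.lower()))
        pairs.append((name, best, levenshtein(name.lower(), best.lower())))
    return pairs


if __name__ == "__main__":
    # Hypothetical example: one source has typos, the other is clean.
    with_typos = ["Lisbn", "Madird"]
    clean = ["Madrid", "Lisbon"]
    for row in pair_names(with_typos, clean):
        print(row)
```

In practice one would probably also set a maximum distance above which a pair is rejected, so that names with no real counterpart are left unmatched instead of being forced onto the nearest string.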


Adaptive Rejection Sampling app

In this post, I will present an application demonstrating the Adaptive Rejection Sampling method in practice. This tool was created using the ‘shiny’ package and the script is available on my GitHub.
