2022-05-28

Understanding interaction effects in regression analysis

22 mins read In regression, an interaction effect exists when the effect of an independent variable on a dependent variable changes, depending on […]
2022-05-24

Performing A/B test in Python example – A case study from Udacity Data Scientist Nano Degree

11 mins read This is a simple walkthrough of an A/B test case study developed and used by Udacity. It is part of […]
2022-05-23

A guide to Bootstrapping for Statistical Inference – Confidence Interval and Hypothesis Testing

14 mins read Introduction Inferential Statistics is the process of examining the observed data (sample) in order to make conclusions about the properties/parameters […]
2021-07-03

Understating and discovering multicollinearity in regression analysis with Python code

9 mins read In this post, I will explain the concept of collinearity and multicollinearity and why it is important to understand them […]
2021-07-02

Measure the correlation between numerical and categorical variables and the correlation between two categorical variables in Python: Chi-Square and ANOVA

27 mins read This scenario can happen when we are doing regression or classification in machine learning. Regression: The target variable is numeric […]
2020-05-31

Understanding hypothesis testing with Covid-19 case study (Z-test and t-test)

13 mins read Introduction The coronavirus pandemic has made a statistician out of us all. We are constantly checking the numbers, making our […]
2019-05-31

An explanation of z-distribution (standard normal distribution)

14 mins read The standard normal distribution, also called the z-distribution, is a special normal distribution where the mean is 0 and the standard deviation is 1. Any normal distribution can […]