2022-11-26

Coursera Deep Learning Specialization Notes

3 mins read A couple of years ago I completed the Deep Learning Specialization taught by AI pioneer Andrew Ng. I found this series […]
2022-11-16

Repository implementing statistics concepts for Data Science in Python

3 mins read The field of statistics is becoming increasingly important in the world of data science and machine learning. I have recently […]
2022-08-24

Performing an A/B test in Python – A case study from the Udacity Data Scientist Nanodegree

11 mins read This is a simple walkthrough of an A/B test case study developed and used by Udacity. It is part of […]
2022-08-18

A guide on regression error metrics (MSE, RMSE, MAE, MAPE, sMAPE, MPE) with Python code

25 mins read Regressions are one of the most commonly used tools in a data scientist’s kit. The quality of a regression model is how […]
2022-08-12

A review on information theory concepts for machine learning: Entropy, Cross-Entropy, KL divergence, Information gain, and Mutual Information

58 mins read Information Theory Information theory is a field of study concerned with quantifying information for communication. It is a subfield of mathematics […]
2022-08-02

Measure the correlation between numerical and categorical variables and the correlation between two categorical variables in Python: Chi-Square and ANOVA

27 mins read Data analysis is an essential part of any research or business endeavor, and one of the most fundamental techniques is […]
2022-08-01

A simple tutorial on Importance Sampling and Monte Carlo with Python code

16 mins read Introduction In this post, I’m going to explain importance sampling. Importance sampling is an approximation method instead of a […]
2022-07-30

What is Reservoir Sampling in Stream Processing?

4 mins read Reservoir sampling is a fascinating algorithm that is especially useful when you have to deal with streaming data, which is […]
2022-07-23

A guide to Bootstrapping for Statistical Inference – Confidence Interval and Hypothesis Testing

14 mins read Introduction Inferential Statistics is the process of examining the observed data (sample) in order to make conclusions about the properties/parameters […]
2022-06-19

Evaluation metrics for Multi-Label Classification with Python code

10 mins read In a traditional classification problem formulation, classes are mutually exclusive. In other words, under the condition of mutual exclusivity, each […]
2022-05-28

How to interpret logistic regression coefficients?

15 mins read Logistic Regression is a fairly simple yet powerful Machine Learning model that can be applied to various use cases. It’s […]
2022-05-28

Understanding interaction effects in regression analysis

22 mins read In regression, an interaction effect exists when the effect of an independent variable on a dependent variable changes, depending on […]
2022-05-24

A guide on Maximum likelihood and Bayesian inference for parameter estimation

28 mins read Introduction In this post, I’ll explain what the maximum likelihood and Bayesian inference methods for parameter estimation are and go […]
2022-05-23

Understanding p-value using bootstrapping technique in statistics

13 mins read For context, we are using the bootstrapping methods (that I’ve referenced previously) for simulating null and sampling distributions (rather than standard […]
2022-05-22

Understanding Bootstrapping approach vs. Traditional approaches in statistics

13 mins read Bootstrapping is a statistical procedure that resamples a single dataset to create many simulated samples. This process allows you to […]
2022-05-19

Understanding the basics of Bayesian Inference with Python Code

10 mins read Why did someone have to invent Bayesian inference? In one sentence: to update the probability as we gather more data. The […]
2022-05-02

Understanding Jacobian and Hessian matrices with examples

19 mins read This post will provide you with an introduction to the Jacobian matrix and the Hessian matrix, including their definitions and […]
2022-04-27

Understanding and interpreting Residuals Plot for linear regression

27 mins read Interpreting Residual Plots to Improve Your Regression When you run a regression, calculating and plotting residuals help you understand and improve your […]
2022-03-30

Understanding the Discrete Fourier Transform with mathematics and Python code

16 mins read Introduction The Fourier transform is applied in engineering to determine the dominant frequencies in a vibration signal. When the dominant […]
2022-03-17

Methods for sampling from complex distributions

8 mins read This writeup includes descriptions from a recent paper on algorithmic sampling, to describe in simpler terms the motivation and approach for […]