Principal Component Analysis (PCA) Explained

14 mins read What is PCA? Let’s say that you want to predict what the gross domestic product (GDP) of the United States will be […]

Understanding basic Trading Terminology

12 mins read This article is written for readers who want to understand the most common phrases and trading terminology that is used […]

What are eigenvectors and eigenvalues?

16 mins read Introduction Eigenvectors and eigenvalues have many important applications in computer vision and machine learning in general. Well-known examples are PCA (Principal […]

Automatic Differentiation Explained

8 mins read Introduction There are several methods to calculate gradients in computer programs: (1) Manual differentiation; (2) Symbolic differentiation; (3) Finite differences […]

How to use black, flake8, isort, and pre-commit framework to format Python codes

12 mins read black: The Uncompromising Code Formatter With black you can format Python code from 2.7 all the way to 3.8 (as of version […]

Understanding Model Calibration and Brier Score

12 mins read Do you ever encounter a storm when the probability of rain in your weather app is below 10%? Well, this […]

ANOVA (Analysis of variance) simply explained

27 mins read Introduction Buying a new product or testing a new technique but not sure how it stacks up against the alternatives? […]

Python Modules and Packages tutorial

45 mins read Modules If you quit the Python interpreter and enter it again, the definitions you have made (functions and variables) are […]

Exponential Distribution and its applications

10 mins read We always start with the “why” instead of going straight to the formulas. If you understand the why, it actually […]

The Poisson Distribution and its applications explained

26 mins read Before setting the parameter λ and plugging it into the formula, let’s pause a second and ask a question. Why […]

Fourier Transform basics and its applications

18 mins read The frequency domain Sound is a mechanical wave, a vibration in the air or another medium. Musical notes correspond to […]

Performance evaluation metrics for binary classification with Python code

30 mins read Classification metrics let you assess the performance of machine learning models but there are so many of them, each one has its […]

What is Word2vec word embedding?

24 mins read I find the concept of embeddings to be one of the most fascinating ideas in machine learning. If you’ve ever […]

Feature Scaling with Scikit-Learn

9 mins read 1 Introduction 2 Loading the libraries 3 Scaling methods 3.1 Standard Scaler 3.2 Min-Max Scaler 3.3 Robust Scaler 3.4 Comparison […]

Understating and discovering multicollinearity in regression analysis with Python code

9 mins read In this post, I will explain the concept of collinearity and multicollinearity and why it is important to understand them […]

Measure the correlation between numerical and categorical variables and the correlation between two categorical variables in Python: Chi-Square and ANOVA

27 mins read This scenario can happen when we are doing regression or classification in machine learning. Regression: The target variable is numeric […]

Understanding Dates, Times, Periods, and Time Zones in Pandas

15 mins read Introduction  Time-series data is quite common among many datasets related to fields like finance, geography, earthquakes, healthcare, etc. Properly interpreting […]

Resampling time series in Pandas: resample and asfreq methods

23 mins read This article is an introductory dive into the technical aspects of resampling methods in pandas. 1. Resampling  Resampling is necessary […]

Time series analysis with Pandas: Power consumption case study

24 mins read Originally developed for financial time series such as daily stock market prices, the robust and flexible data structures in pandas […]

Labeling financial data for Machine Learning

24 mins read In this article, we’ll be looking at one method for labeling our data and getting it ready for our model. By the […]