2021-11-01

A review of techniques for Time Series prediction

43 mins read Working with time series data? Here’s a guide for you. In this article, you will learn how to compare and […]
2021-10-19

Difference between Probability Density and Probability

5 mins read The probability density at x can be greater than one but then, how can it integrate to one? It’s a […]
2021-10-19

What is Conjugate Prior?

5 mins read What is Prior? Prior probability is the probability of an event before we see the data. In Bayesian Inference, the prior […]
2021-10-17

Important probability distributions for Data Science with Python code

33 mins read For a data scientist aspirant, Statistics is a must-learn thing. It can process complex and challenging problems in the real […]
2021-10-08

Fundamentals of statistics for Data Scientists and Analysts with Python Code

36 mins read As Karl Pearson, a British mathematician once stated, Statistics is the grammar of science and this holds especially for Computer and Information […]
2021-10-07

Monte Carlo Simulation Explained

29 mins read Monte Carlo Methods: I Am Feeling (Un-)Lucky! In short, Monte Carlo methods refer to a series of statistical methods essentially […]
2021-09-12

Best storage formats to save Pandas dataframes

6 mins read When working on data analytical projects, I usually use Jupyter notebooks and a great pandas library to process and move my data around. It […]
2021-09-06

Shannon entropy and its properties

25 mins read Suppose you are talking with three patients in the waiting room of a doctor’s office. All three of them have […]
2021-08-22

Understanding Attention Mechanism in Sequence 2 Sequence Machine Translation

39 mins read Introduction Recurrent Neural Networks (or more precisely LSTM/GRU) have been found to be very effective in solving complex sequence-related problems […]
2021-08-03

Understanding Model Calibration and Brier Score

12 mins read Do you ever encounter a storm when the probability of rain in your weather app is below 10%? Well, this […]
2021-07-30

ANOVA (Analysis of variance) simply explained

27 mins read Introduction Buying a new product or testing a new technique but not sure how it stacks up against the alternatives? […]
2021-07-23

Exponential Distribution and its applications

10 mins read We always start with the “why” instead of going straight to the formulas. If you understand the why, it actually […]
2021-07-23

The Poisson Distribution and its applications explained

26 mins read Before setting the parameter λ and plugging it into the formula, let’s pause a second and ask a question. Why […]
2021-07-09

Performance evaluation metrics for binary classification with Python code

30 mins read Classification metrics let you assess the performance of machine learning models but there are so many of them, each one has its […]
2021-07-08

What is Word2vec word embedding?

24 mins read I find the concept of embeddings to be one of the most fascinating ideas in machine learning. If you’ve ever […]
2021-07-04

Feature Scaling with Scikit-Learn

9 mins read 1 Introduction 2 Loading the libraries 3 Scaling methods 3.1 Standard Scaler 3.2 Min-Max Scaler 3.3 Robust Scaler 3.4 Comparison […]
2021-07-03

Understating and discovering multicollinearity in regression analysis with Python code

9 mins read In this post, I will explain the concept of collinearity and multicollinearity and why it is important to understand them […]
2021-06-27

Understanding Dates, Times, Periods, and Time Zones in Pandas

15 mins read Introduction  Time-series data is quite common among many datasets related to fields like finance, geography, earthquakes, healthcare, etc. Properly interpreting […]
2021-06-27

Resampling time series in Pandas: resample and asfreq methods

23 mins read This article is an introductory dive into the technical aspects of resampling methods in pandas. 1. Resampling  Resampling is necessary […]
2021-06-26

Time series analysis with Pandas: Power consumption case study

24 mins read Originally developed for financial time series such as daily stock market prices, the robust and flexible data structures in pandas […]