2021-06-24

A complete guide on Pandas Hierarchical Indexing (MultiIndex)

31 mins read Pandas is the go-to library when for data analysis when working with tabular datasets. It is the best solution available for […]
2021-06-24

Data selection (indexing and slicing) in Pandas MultiIndex DataFrames

6 mins read A MultiIndex (also known as a hierarchical index) DataFrame allows you to have multiple columns acting as a row identifier and multiple […]
2021-06-21

Data Science and Machine Learning Cheat Sheets

5 mins read Click on the links to get the high-resolution cheat sheets. Algebra Linear Algebra Calculus Probability Statistics Python R Machine Learning […]
2021-06-18

Positional-only and Keyword-only arguments in Python

4 mins read In this lesson, you’ll learn about how to add positional-only arguments to your functions in Python 3.8. The built-in function float() can be used […]
2021-06-12

Introduction to advanced candlesticks in finance: tick bars, dollar bars, volume bars, and imbalance bars

55 mins read In this article, we will explore why traditional time-based candlesticks are an inefficient method to aggregate price data, especially under […]
2021-06-06

Python __getattr__ and __getattribute__ magic methods

6 mins read Python Magic Methods Magic Methods are a broad and general term that refers to “special” methods in a Python class. […]
2021-06-05

Python filter() method tutorial

7 mins read Introduction The Python built-in filter() function can be used to create a new iterator from an existing iterable (like a list or dictionary) that will […]
2021-06-05

Python Assignment Expressions with walrus operator use cases

5 mins read Assignment expressions allow variable assignments to occur inside of larger expressions. While assignment expressions are never strictly necessary to write […]
2021-05-30

A tutorial on Context Managers in Python

9 mins read Python’s context managers are great for resource management and stopping the propagation of leaked abstractions. You’ve probably used it while […]
2021-05-26

5 steps to start becoming a Machine Learning Engineer

16 mins read Step 1: Adjusting Your Mindset Whenever I lead my workshops I always get a lot of questions afterward from developers […]
2021-05-11

Which Mean should we use? A guide on Arithmetic, Geometric, and Harmonic Means in Data Analysis

45 mins read Introduction It’s probably the most common data analytic task: You have a bunch of numbers. You want to summarize them […]
2021-05-04

Setup and run Jupyter notebook from a remote server by ssh

5 mins read In my research, I usually work with remote servers to run deep learning models inside machines more powerful than my […]
2021-04-28

Python Scipy sparse matrices explained

8 mins read What is a Sparse Matrix? Imagine you have a two-dimensional data set with 10 rows and 10 columns such that […]
2021-04-20

Improvements in Deep Q-Learning with Python code: Dueling Double DQN, Prioritized Experience Replay, and Fixed Q-targets

28 mins read Deep Q-Learning was introduced in 2014. Since then, a lot of improvements have been made. So, today we’ll see four […]
2021-04-17

Understanding intuition behind Markov Chain Monte Carlo Methods (MCMC)

15 mins read For many of us, Bayesian statistics is voodoo magic at best or completely subjective nonsense at worst. Among the trademarks […]
2021-04-14

A complete tutorial on tmux in Linux

16 mins read What’s tmux? tmux authors describe it as a terminal multiplexer. Behind this fancy term hides a simple concept: Within one terminal […]
2021-03-23

Review of important offline evaluation metrics for recommendation systems

28 mins read We are in an era of personalization. The user wants personalized content and businesses are capitalizing on the same. Recommendation […]
2021-03-17

Methods for sampling from complex distributions

8 mins read This writeup includes descriptions from a recent paper on algorithmic sampling, to describe in simpler terms the motivation and approach for […]
2021-03-02

ARIMA for time series forecasting in Python

11 mins read Making out-of-sample forecasts can be confusing when getting started with time series data. The statsmodels Python API provides functions for […]
2021-02-25

Identifying time series AR, MA, ARMA, or ARIMA Models using ACF and PACF plots

4 mins read In time series analysis, the Autocorrelation Function (ACF) and the partial autocorrelation function (PACF) plots are essential in providing the […]