Home
Blog
Filter by
Categories
Tags
Authors
Show all
All
Apache Kafka
Artificial Intelligence
Big Data
Cassandra
Computer Vision
Data Engineering
Data Science
Database
Deep Learning
Design Pattern
DevOps
Django
Docker
ELK
English
Feature Engineering
Finance
Java
Java Script
Keras
Linear Algebra
Linux
Machine Learning
Mathematics
NLP
Python
PyTorch
Recommendation Systems
Reinforcement Learning
Research
Software Engineering
Spark
Statistics
Tensorflow
Uncategorized
All
Apache Kafka
Big Data
boltzman
broadcasting
Cython
Deploy
Django
Docker
Elasticsearch
ELK
Exception Handling
Feature Engineering
GPU
GRU
Linux
Logging
lstm
MinIO
numpy
Pandas
PySpark
Python
PyTorch
Reinforcement Learning
remover
S3
scikit-learn
Self-Driving Car
separator
sklearn
tmux
transfer learning
Tutorial
Ubuntu
vocal
All
Amir Masoud Sefidian
2022-02-11
A tutorial on Apache Cassandra data modeling – RowKeys, Columns, Keyspaces, Tables, and Keys
24
mins read
In this post, I will discuss the basic concepts of data modeling in Apache Cassandra. It is important to understand
[…]
2022-02-11
Understanding Cassandra Partition Key, Composite Key, and Clustering Key
13
mins read
1. Overview Data distribution and data modeling in the Cassandra NoSQL database are different from those in a traditional relational
[…]
2022-02-09
Out of Bag (OOB) score in Random Forests with example
12
mins read
Introduction This post describes the intuition behind the Out of Bag (OOB) score in Random forest, how it is calculated,
[…]
2022-02-08
Understanding the Random Forest algorithm and its hyperparameters
17
mins read
In this post, we will see how the Random Forest algorithm works internally. To truly appreciate it, it might be
[…]
2022-02-04
Connect to Cassandra Cluster with Dbeaver Community edition
2
mins read
DataStax offers the JDBC driver from Magnitude (formerly Simba) to users at no cost so you should be able to
[…]
2022-02-03
Difference between discriminative and generative machine learning models
8
mins read
Introduction In today’s world, Machine learning becomes one of the popular and exciting fields of study that gives machines the ability
[…]
2022-02-03
Feature Selection for categorical data with Python code
17
mins read
Feature selection is the process of identifying and selecting a subset of input features that are most relevant to the target
[…]
2022-02-03
Basic feature engineering tasks for numeric and categorical data with Python code
34
mins read
Machine learning pipelines Any intelligent system basically consists of an end-to-end pipeline starting from ingesting raw data and leveraging data
[…]
2022-01-30
Understanding Expectation-Maximization (EM) algorithm
18
mins read
The EM algorithm is often used in machine learning as an algorithm for data clustering. Sometimes, one of the clustering problems
[…]
2022-01-29
A guide to different Cross-Validation methods in Machine Learning
19
mins read
In machine learning (ML), generalization usually refers to the ability of an algorithm to be effective across various inputs. It
[…]
2022-01-27
Understanding the Dummy Variable Trap with example
4
mins read
Linear regression is a method we can use to quantify the relationship between one or more predictor variables and a response variable.
[…]
2022-01-25
Understanding Alternating Least Squares algorithm for implicit collaborative filtering recommendations
23
mins read
Overview We’re going to write a simple implementation of an implicit (more on that below) recommendation algorithm. We want to
[…]
2022-01-23
Understanding AdaBoost algorithm and its mathematics
14
mins read
If you’re going through this tutorial, you’ve probably heard of XGBoost, LightGBM, or something of those sorts before. These are
[…]
2022-01-11
Theory of Generalization: growth function, dichotomies, and break points
15
mins read
The size of our data set N plays a major role when it comes to the reliability of the generalization Ein
[…]
2022-01-01
Mathematical view of Bias-Variance trade-off
6
mins read
The bias-variance trade-off is an important concept in statistics and machine learning. This is used to get better performance out
[…]
Prev page
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
Next page