Home
Sefidian Academy
Filter by
Categories
Tags
Authors
Show all
All
Apache Kafka
Artificial Intelligence
Bayesian Statistics
Big Data
Cassandra
Computer Vision
Data Engineering
Data Science
Database
Deep Learning
DevOps
Django
Docker
ELK
Feature Engineering
Finance
Keras
Linear Algebra
Linux
Machine Learning
Mathematics
MLOps
NLP
Python
PyTorch
Recommendation Systems
Reinforcement Learning
Software Engineering
Spark
Statistics and Probability
Tensorflow
Time Series
Uncategorized
All
Apache Kafka
Big Data
Cython
Elasticsearch
ELK
Exception Handling
GPU
MinIO
monte carlo
Pandas
PySpark
Python
PyTorch
Reinforcement Learning
remover
S3
separator
transfer learning
Tutorial
Ubuntu
vocal
All
Amir Masoud Sefidian
2023-09-12
Categories
Big Data
Docker
Spark
Run spark-submit for Apache Spark (PySpark) using Docker
3
mins read
Pre-Requisites docker-compose file Below is a docker-compose file to set up a Spark cluster with 1 master and 2 worker
[…]
2023-01-11
Categories
Artificial Intelligence
Big Data
Data Science
Machine Learning
NLP
Python
Recommendation Systems
Spark
Time Series
Machine Learning for Big Data using PySpark with real-world projects
10
mins read
Introduction I have prepared a GitHub Repository that provides a set of self-study tutorials on Machine Learning for big data
[…]
2022-02-17
Categories
Big Data
Python
Spark
A guide on PySpark Window Functions with Partition By
11
mins read
When analyzing data within groups, Pyspark window functions can be more useful than using groupBy for examining relationships. First, a
[…]