Kafka Data Processing Guarantees: An explanation of at-least-once, at-most-once, and exactly-once semantics

12 mins read Distributed event stream processing has become an increasingly hot topic in the area of Big Data. Notable Stream Processing Engines […]

Apache Kafka comprehensive tutorial series – Part 4 – Kafka Broker, Kafka Queuing, Kafka Client

28 mins read Kafka Broker In this section, we are going to learn Kafka Broker. Kafka Broker manages the storage of messages in […]

Apache Kafka comprehensive tutorial series – Part 3 – Kafka Cluster, Producer, and Consumer

17 mins read Kafka Cluster In this Kafka section, we will see Kafka Cluster Setup. This Kafka Cluster tutorial provides us with some simple steps […]

Apache Kafka comprehensive tutorial series – Part 2 – Kafka Architecture and Its Fundamental Concepts

10 mins read Kafka Architecture – Apache Kafka APIs Apache Kafka Architecture has four core APIs: the Producer API, Consumer API, Streams API, and […]

Kafka Comprehensive Tutorial – Part 1

21 mins read What is Kafka? We use Apache Kafka when it comes to enabling communication between producers and consumers using message-based topics. […]

Kafka Architecture: Log Compaction

5 mins read This article is heavily inspired by the Kafka section on design around log compaction. You can think of it as the cliff notes […]

Writing a Kafka Consumer in Java

11 mins read In this tutorial, you are going to create a simple Kafka Consumer. This consumer consumes messages from the Kafka Producer you wrote […]

Writing a Kafka Producer in Java

9 mins read In this tutorial, we are going to create a simple Java example that creates a Kafka producer. You create a […]

Understand Kafka Clusters, Kafka Consumer Failover, and Kafka Broker Failover with examples

11 mins read In this tutorial, we are going to run many Kafka Nodes on our development laptop so you will need at […]

The Kafka Ecosystem: Kafka Core, Kafka Streams, Kafka Connect, Kafka REST Proxy, and the Schema Registry

3 mins read The core of Kafka is the brokers, topics, logs, partitions, and clusters. The core also consists of related tools like […]

Kafka, Avro Serialization, and Schema Registry

12 mins read Confluent Schema Registry stores Avro Schemas for Kafka producers and consumers. The Schema Registry provides a RESTful interface for managing Avro schemas […]

Avro for Big Data, Data Streaming Architectures, and Kafka

16 mins read Introduction Apache Avro™ is a data serialization system. Avro provides: Rich data structures. A compact, fast, binary data format. A […]

What is Apache Kafka?

26 mins read Kafka’s growth is exploding: more than one-third of all Fortune 500 companies use Kafka. These companies include the top ten travel companies, 7 […]

Complete guide on Logging in Python

17 mins read The Logging Module The logging module in Python is a ready-to-use and powerful module that is designed to meet the needs of […]

Best practices for Python exceptions

9 mins read How do I manually throw/raise an exception in Python? Use the most specific Exception constructor that semantically fits your issue. […]

What is the Region of Interest Pooling?

8 mins read Region of interest pooling (also known as RoI pooling) is an operation widely used in object detection tasks using convolutional […]

Difference between sessions and cookies – Django Example

15 mins read Cookie A cookie is just a key-value pair that is stored in the user’s browser. A cookie is sent to […]

Understanding L1 and L2 as Loss Function and Regularization

6 mins read While practicing machine learning, you may have come upon a choice of the mysterious L1 vs L2. Usually, the two […]

Different missing data mechanisms

3 mins read Missing data mechanisms concern the relationship between missing data and the values of variables in the data matrix. Given this focus, […]

A guide on different BibTeX bibliography styles

8 mins read The next two commands are the ones that set the bibliography style and import the bibliography file. See Bibliography management […]