Distributed-Systems

There are 4 notes tagged with "Distributed-Systems".

Sun, Apr 30, 2023

Kubernetes

Kubernetes is an open-source container orchestration platform that automates the deployment, scaling, and management of containerized applications.
Sat, Feb 29, 2020

Kafka

Kafka is a distributed event streaming platform designed for building high-throughput, fault-tolerant, and scalable data streaming applications. This article covers key design decisions in Kafka, such as how the messages for a topic are split into partitions that are assigned to brokers. It then looks at the guarantees Kafka provides for producers, consumers, and consumer groups.
Fri, Feb 28, 2020

Cassandra

Cassandra is a highly scalable, distributed NoSQL (non-relational) database management system designed for handling large amounts of data across multiple commodity servers. This article covers key design features of Cassandra, such as its use of consistent hashing, the write path to a write-ahead log and a memtable, the read path from the memtable and from SSTables, and, most importantly, some examples of data modeling for different types of queries.
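As a rough illustration of the consistent hashing idea mentioned in this summary, here is a minimal sketch of a hash ring. It is not Cassandra's implementation: the `HashRing` class, the `stable_hash` helper, the node names, and the use of MD5 as the hash function are all assumptions made for the example.

```python
import bisect
import hashlib


def stable_hash(value: str) -> int:
    """Map a string to a ring position (MD5 is an illustrative choice only)."""
    return int.from_bytes(hashlib.md5(value.encode("utf-8")).digest()[:8], "big")


class HashRing:
    """Hypothetical hash ring with a few virtual nodes per physical node."""

    def __init__(self, nodes, vnodes=8):
        # Each node contributes `vnodes` tokens; the ring is sorted by token.
        self._ring = sorted(
            (stable_hash(f"{node}#{i}"), node)
            for node in nodes
            for i in range(vnodes)
        )
        self._tokens = [token for token, _ in self._ring]

    def node_for(self, key: str) -> str:
        # Walk clockwise to the first token >= hash(key), wrapping at the end.
        idx = bisect.bisect_left(self._tokens, stable_hash(key)) % len(self._ring)
        return self._ring[idx][1]


ring = HashRing(["node-a", "node-b", "node-c"])
bigger = HashRing(["node-a", "node-b", "node-c", "node-d"])

keys = [f"user:{i}" for i in range(1000)]
moved = [k for k in keys if ring.node_for(k) != bigger.node_for(k)]
print(f"{len(moved)} of {len(keys)} keys moved after adding node-d")
```

In this sketch, adding `node-d` only reassigns the keys that now fall on its tokens; every other key keeps its original owner, which is the property that makes consistent hashing attractive when nodes join or leave.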
Mon, Jan 8, 2018

Partitioning

Data partitioning refers to the process of dividing a system's data into smaller, more manageable subsets, which are distributed across multiple storage locations or nodes. This article covers several partitioning strategies, including random partitioning, partitioning by hash key, partitioning by range, and a hybrid approach for skewed workloads. It also discusses strategies for rebalancing partitions, both when the number of partitions is static and when it is dynamic.
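To make two of the strategies named in this summary concrete, the following minimal sketch contrasts partitioning by hash key with partitioning by range. The function names, the SHA-256 hash, and the boundary values are assumptions chosen for illustration and are not taken from the article.

```python
import bisect
import hashlib


def hash_partition(key: str, num_partitions: int) -> int:
    """Partition by hash key: spreads keys evenly but loses key ordering."""
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_partitions


def range_partition(key: str, boundaries: list[str]) -> int:
    """Partition by range: keeps adjacent keys together for range scans."""
    # `boundaries` must be sorted; partition i holds keys in
    # [boundaries[i - 1], boundaries[i]).
    return bisect.bisect_right(boundaries, key)


# Hypothetical split points:
# partition 0: key < "h", partition 1: "h" <= key < "p", partition 2: key >= "p"
boundaries = ["h", "p"]

for key in ["apple", "mango", "zebra"]:
    print(key, hash_partition(key, 4), range_partition(key, boundaries))
```

Note that `hash_partition` takes the hash modulo a fixed partition count, so changing `num_partitions` would remap most keys. That trade-off is one reason rebalancing needs its own strategy.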