datastrophic

Resource Allocation in Mesos: Dominant Resource Fairness

27 March 2016·13 mins

Mesos Sheduling DRF

Apache Mesos provides a unique approach to cluster resource management called two-level scheduling: instead of storing information about available cluster resources in a centralized manner it operates with a notion of resource offers which slave nodes advertise to running frameworks via Mesos master, thus keeping the whole system architecture concise and scalable. Master’s allocation module is responsible for making the decisions about which application should receive the next resource offer and it relies on Dominant Resource Fairness(DRF) algorithm for making these decisions.

Apache Spark: core concepts, architecture and internals

3 March 2016·8 mins

Spark Sheduling RDD DAG Shuffle

This post covers core concepts of Apache Spark such as RDD, DAG, execution workflow, forming stages of tasks, and shuffle implementation and also describes the architecture and main components of Spark Driver. There’s a github.com/datastrophic/spark-workshop project created alongside this post which contains Spark Applications examples and dockerized Hadoop environment to play with. Slides are also available at slideshare. Intro Spark is a generalized framework for distributed data processing providing functional API for manipulating data at scale, in-memory data caching, and reuse across computations.

Data processing platforms architectures with SMACK: Spark, Mesos, Akka, Cassandra and Kafka

16 September 2015·12 mins

Spark Mesos Akka Cassandra Kafka SMACK

This post is a follow-up of the talk given at Big Data AW meetup in Stockholm and focused on different use cases and design approaches for building scalable data processing platforms with SMACK(Spark, Mesos, Akka, Cassandra, Kafka) stack. While stack is really concise and consists of only several components it is possible to implement different system designs which list not only purely batch or stream processing, but more complex Lambda and Kappa architectures as well.

Cassandra 2.1 Counters: Testing Consistency During Node Failures

3 September 2015·9 mins

Akka Cassandra

For some cases such as the ones present in AdServing, the counters come really handy to accumulate totals for events coming into a system compared to batch aggregates. While distributed counters consistency is a well-known problem Cassandra counters in version 2.1 are claimed to be more accurate compared to the prior ones. This post describes the approach and the results of Cassandra counters consistency testing in different failure scenarios such as rolling restarts, abnormal termination of nodes, and network splits.

In the Wake of Scala Days 2015

1 July 2015·13 mins

Scala

Scala Days Amsterdam conference was full of interesting topics so in this post I’ll cover talks on the Scala platform, core concepts for making Scala code more idiomatic, monad transformers, consistency in distributed systems, distributed domain driven design, and a little more.

Recent

Resource Allocation in Mesos: Dominant Resource Fairness

Apache Spark: core concepts, architecture and internals

Data processing platforms architectures with SMACK: Spark, Mesos, Akka, Cassandra and Kafka

Cassandra 2.1 Counters: Testing Consistency During Node Failures

In the Wake of Scala Days 2015