What is it all about?
Kudu is a columnar storage manager developed for the Hadoop platform. Kudu shares the common technical properties of Hadoop ecosystem applications: it runs on commodity hardware, is horizontally scalable, and supports highly available operation.
* Fast processing of OLAP workloads. * Integration with MapReduce, Spark and other Hadoop ecosystem components. * Tight integration with Cloudera Impala, making it a good, mutable alternative to using HDFS with Parquet. * Strong but flexible consistency model, allowing you to choose consistency requirements on a per-request basis, including the option for strict-serializable consistency. * Strong performance for running sequential and random workloads simultaneously. * Easy to administer and manage with Cloudera Manager. * High availability. Tablet Servers and Masters use the Raft Consensus Algorithm, which ensures that as long as more than half the total number of replicas is available, the tablet is available for reads and writes. For instance, if 2 out of 3 replicas or 3 out of 5 replicas are available, the tablet is available. * Reads can be serviced by read-only follower tablets, even in the event of a leader tablet failure. * Structured data model.