CubeFS is a cloud native distributed storage platform.

Get Started Download GitHub

An Introducation to CubeFS

CubeFS is a new generation of cloud-native storage product that is compatible with multiple access protocols such as S3, POSIX, HDFS, etc. It supports two storage engines, namely multi-replica and erasure coding, and provides users with multiple features such as multi-tenancy, multi-AZ deployment, and cross-region replication. It is widely used in scenarios such as big data, AI, container platforms, databases, middleware storage and computing separation, data sharing, and data protection.

The Key features of CubeFS


Compatible with various access protocols such as S3, POSIX, HDFS, etc., and the access between protocols can be interoperable


Support replicas and erasure coding engines, users can choose flexibly according to business scenarios

Highly Scalable

Easy to build a PB or EB-scale distributed storage service, and each module can be expanded horizontally


Supports multi-tenant management and provides fine-grained tenant isolation policies

High Performance

Supports multi-level caching, multiple high-performance replication protocols, and optimizes specific performance for small files


Easy to use CubeFS in Kubernetes via CSI Driver

CubeFS Application Scenarios

Machine learning

Backend storage for HBase

Elastic search

Nginx log storage

MySQL database backup

CubeFS Deployment

Kubernetes CSI Driver

Docker Compose

Helm 2/3

CubeFS is a Cloud Native Computing Foundation Incubating Project.
For more details, please refer to our SIGMOD 2019 paper "CFS: A Distributed File System for Large Scale Container Platforms".