mirror of https://github.com/seaweedfs/seaweedfs.git synced 2024-11-30 23:29:02 +08:00

SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lakes, built for billions of files. The blob store has O(1) disk seek and cloud tiering. The Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, and Erasure Coding.

blob-storage cloud-drive distributed-file-system distributed-storage distributed-systems erasure-coding fuse hadoop-hdfs hdfs kubernetes object-storage posix replication s3 s3-storage seaweedfs tiered-file-system

Ryan Russell b6a1b84a00 docs: `orignial` -> `original` (#3661)		2022-09-14 09:13:59 -07:00
.github	Add an End-to-End workflow for FUSE mount (#3562)	2022-08-31 09:27:53 -07:00
docker	add tmux for dev	2022-09-01 14:47:45 -07:00
k8s/helm_charts2	3.27	2022-09-11 19:47:53 -07:00
note	add piknik	2022-08-05 17:02:55 -07:00
other	Bump hadoop-common from 2.10.1 to 3.2.3 in /other/java/hdfs2 (#2909)	2022-09-05 10:42:43 -07:00
snap	move to https://github.com/seaweedfs/seaweedfs	2022-07-29 00:17:28 -07:00
test	minor (typos...), done while reading around	2022-05-16 22:11:33 +08:00
unmaintained	close responses	2022-08-31 00:24:17 -07:00
util	util: added gostd script	2019-04-30 03:23:20 +00:00
weed	docs: `orignial` -> `original` (#3661)	2022-09-14 09:13:59 -07:00
.gitignore	del	2022-03-28 17:26:43 +00:00
backers.md	Update backers.md	2022-06-29 12:50:31 -07:00
go.mod	fix build	2022-09-13 10:29:16 -07:00
go.sum	fix build	2022-09-13 10:29:16 -07:00
LICENSE	clean up	2020-06-19 13:53:54 -07:00
Makefile	exclude directories to sync on filer	2022-07-27 19:22:57 +05:00
README.md	docs: functional readme hashlinks (#3656)	2022-09-14 04:45:17 -07:00

README.md

SeaweedFS

Sponsor SeaweedFS via Patreon

SeaweedFS is an independent Apache-licensed open source project with its ongoing development made possible entirely thanks to the support of these awesome backers. If you'd like to grow SeaweedFS even stronger, please consider joining our sponsors on Patreon.

Your support will be really appreciated by me and other supporters!

Quick Start

Quick Start for S3 API on Docker

docker run -p 8333:8333 chrislusf/seaweedfs server -s3

Quick Start with Single Binary

Download the latest binary from https://github.com/seaweedfs/seaweedfs/releases and unzip it to get a single binary file, weed or weed.exe.
Run weed server -dir=/some/data/dir -s3 to start one master, one volume server, one filer, and one S3 gateway.

Also, to increase capacity, just add more volume servers by running weed volume -dir="/some/data/dir2" -mserver="<master_host>:9333" -port=8081 locally, or on a different machine, or on thousands of machines. That is it!
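Once a master is running, a client typically asks it where to write before talking to a volume server. The sketch below illustrates that two-step flow in Go. It assumes the master's `/dir/assign` HTTP endpoint and a JSON reply with `fid` and `url` fields, as used in the project's object store example; the fake server and its sample reply are illustrative stand-ins so the program runs without a cluster.

```go
package main

import (
	"encoding/json"
	"fmt"
	"net/http"
	"net/http/httptest"
)

// assignResult holds the fields of the master's reply that this sketch uses.
// The field names are an assumption based on the project's example output.
type assignResult struct {
	Count int    `json:"count"`
	Fid   string `json:"fid"`
	URL   string `json:"url"`
}

// assign asks the master for a file id and the volume server to write to.
func assign(masterURL string) (assignResult, error) {
	var res assignResult
	resp, err := http.Get(masterURL + "/dir/assign")
	if err != nil {
		return res, err
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return res, fmt.Errorf("assign failed: %s", resp.Status)
	}
	err = json.NewDecoder(resp.Body).Decode(&res)
	return res, err
}

// uploadURL builds the volume server address the file content would be sent to.
func uploadURL(a assignResult) string {
	return "http://" + a.URL + "/" + a.Fid
}

func main() {
	// A fake master (illustrative only) so the sketch runs without a cluster.
	fake := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		if r.URL.Path != "/dir/assign" {
			http.NotFound(w, r)
			return
		}
		w.Header().Set("Content-Type", "application/json")
		fmt.Fprint(w, `{"count":1,"fid":"3,01637037d6","url":"127.0.0.1:8080"}`)
	}))
	defer fake.Close()

	a, err := assign(fake.URL)
	if err != nil {
		panic(err)
	}
	fmt.Println(uploadURL(a))
}
```

Against a real cluster, `fake.URL` would be replaced by the master address, for example `http://<master_host>:9333`, and the file content would then be sent to the printed URL.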

Quick Start SeaweedFS S3 on AWS

Set up a fast, production-ready SeaweedFS S3 on AWS with CloudFormation

Introduction

SeaweedFS is a simple and highly scalable distributed file system. There are two objectives:

to store billions of files!
to serve the files fast!

SeaweedFS started as an Object Store to handle small files efficiently. Instead of managing all file metadata in a central master, the central master only manages volumes on volume servers, and these volume servers manage files and their metadata. This relieves concurrency pressure from the central master and spreads file metadata into volume servers, allowing faster file access (O(1), usually just one disk read operation).
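To make the O(1) read path concrete, here is a minimal Go sketch of a Haystack-style volume: an append-only data area plus an index from file key to offset and size. The `volume` and `needleLocation` types are hypothetical simplifications, not SeaweedFS's actual data structures, and the byte slice stands in for the on-disk volume file.

```go
package main

import (
	"bytes"
	"fmt"
	"io"
)

// needleLocation records where one file's bytes live inside a volume.
type needleLocation struct {
	offset int64
	size   int64
}

// volume is a hypothetical, simplified stand-in for a volume server's
// append-only data file plus its index.
type volume struct {
	data  []byte                    // stands in for the on-disk volume file
	index map[uint64]needleLocation // file key -> location
}

func newVolume() *volume {
	return &volume{index: make(map[uint64]needleLocation)}
}

// write appends the content and remembers its location in the index.
func (v *volume) write(key uint64, content []byte) {
	v.index[key] = needleLocation{offset: int64(len(v.data)), size: int64(len(content))}
	v.data = append(v.data, content...)
}

// read does one map lookup followed by a single positioned read.
func (v *volume) read(key uint64) ([]byte, error) {
	loc, ok := v.index[key]
	if !ok {
		return nil, fmt.Errorf("key %d not found", key)
	}
	buf := make([]byte, loc.size)
	r := bytes.NewReader(v.data)
	if _, err := r.ReadAt(buf, loc.offset); err != nil && err != io.EOF {
		return nil, err
	}
	return buf, nil
}

func main() {
	v := newVolume()
	v.write(1, []byte("hello"))
	v.write(2, []byte("seaweed"))
	got, err := v.read(2)
	if err != nil {
		panic(err)
	}
	fmt.Println(string(got)) // prints "seaweed"
}
```

The cost of a read does not depend on how many files the volume holds: the lookup is a hash map access and the data fetch is one read at a known offset.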

There are only 40 bytes of disk storage overhead for each file's metadata. It is so simple with O(1) disk reads that you are welcome to challenge the performance with your actual use cases.
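As a rough illustration of what 40 bytes per file means at scale, the following Go sketch multiplies the quoted overhead by a few file counts. The file counts are arbitrary examples, and the result covers only the per-file metadata overhead stated above, not file content or replication.

```go
package main

import "fmt"

// metadataOverheadBytes is the per-file on-disk overhead quoted above.
const metadataOverheadBytes = 40

// totalOverheadBytes returns the metadata overhead for fileCount files.
func totalOverheadBytes(fileCount int64) int64 {
	return fileCount * metadataOverheadBytes
}

func main() {
	for _, n := range []int64{1_000_000, 1_000_000_000} {
		b := totalOverheadBytes(n)
		// GB here means 10^9 bytes.
		fmt.Printf("%d files -> %d bytes (%.2f GB)\n", n, b, float64(b)/1e9)
	}
}
```

At this rate, one billion files carry about 40 GB of metadata overhead in total.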

SeaweedFS started by implementing Facebook's Haystack design paper. Also, SeaweedFS implements erasure coding with ideas from f4: Facebook’s Warm BLOB Storage System, and has a lot of similarities with Facebook’s Tectonic Filesystem.

On top of the object store, an optional Filer can support directories and POSIX attributes. Filer is a separate linearly-scalable stateless server with customizable metadata stores, e.g., MySQL, Postgres, Redis, Cassandra, HBase, MongoDB, Elasticsearch, LevelDB, RocksDB, SQLite, MemSQL, TiDB, Etcd, CockroachDB, YDB, etc.
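One way to picture a stateless server with a pluggable metadata store is a small interface that any backend can implement. The Go sketch below is an assumption for illustration only: `MetadataStore`, `Entry`, and `memoryStore` are hypothetical names, not the Filer's real interfaces, and the file id is a sample value.

```go
package main

import (
	"errors"
	"fmt"
	"os"
)

// Entry is a hypothetical directory entry: POSIX-style attributes plus the
// ids of the blobs in the object store that hold the file content.
type Entry struct {
	Path    string
	Mode    os.FileMode
	Size    int64
	FileIDs []string
}

// MetadataStore is a hypothetical interface; each backend (SQL, Redis,
// LevelDB, ...) would provide its own implementation.
type MetadataStore interface {
	Put(e Entry) error
	Get(path string) (Entry, error)
}

// ErrNotFound is returned when no entry exists for a path.
var ErrNotFound = errors.New("entry not found")

// memoryStore is an in-memory backend used only to make the sketch runnable.
type memoryStore struct {
	entries map[string]Entry
}

func newMemoryStore() *memoryStore {
	return &memoryStore{entries: make(map[string]Entry)}
}

func (s *memoryStore) Put(e Entry) error {
	s.entries[e.Path] = e
	return nil
}

func (s *memoryStore) Get(path string) (Entry, error) {
	e, ok := s.entries[path]
	if !ok {
		return Entry{}, ErrNotFound
	}
	return e, nil
}

func main() {
	// The server keeps no state of its own; everything lives in the store.
	var store MetadataStore = newMemoryStore()
	err := store.Put(Entry{
		Path:    "/photos/cat.jpg",
		Mode:    0o644,
		Size:    12345,
		FileIDs: []string{"3,01637037d6"}, // sample file id, illustrative only
	})
	if err != nil {
		panic(err)
	}
	e, err := store.Get("/photos/cat.jpg")
	if err != nil {
		panic(err)
	}
	fmt.Println(e.Path, e.Size, e.FileIDs)
}
```

Because all state sits behind the interface, several such servers could share one store, which is the property that makes a stateless design scale by adding instances.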

For any distributed key-value store, the large values can be offloaded to SeaweedFS. With the fast access speed and linearly scalable capacity, SeaweedFS can work as a distributed Key-Large-Value store.
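The Key-Large-Value idea can be sketched as a thin wrapper that keeps small values inline and stores only a reference for large ones. In the Go sketch below, `blobStore` is a hypothetical in-memory stand-in for SeaweedFS, and the `inlineLimit` threshold is an arbitrary assumption.

```go
package main

import "fmt"

// inlineLimit is an arbitrary threshold chosen for this sketch.
const inlineLimit = 1024

// blobStore is a hypothetical in-memory stand-in for the blob store.
type blobStore struct {
	next  int
	blobs map[string][]byte
}

func newBlobStore() *blobStore {
	return &blobStore{blobs: make(map[string][]byte)}
}

// put stores a copy of data and returns a generated blob id.
func (b *blobStore) put(data []byte) string {
	b.next++
	id := fmt.Sprintf("blob-%d", b.next)
	b.blobs[id] = append([]byte(nil), data...)
	return id
}

func (b *blobStore) get(id string) ([]byte, bool) {
	data, ok := b.blobs[id]
	return data, ok
}

// record is what the key-value store keeps: either the value itself
// or a reference to a blob.
type record struct {
	inline []byte
	blobID string
}

// largeValueKV offloads values above inlineLimit to the blob store.
type largeValueKV struct {
	kv    map[string]record
	blobs *blobStore
}

func newLargeValueKV() *largeValueKV {
	return &largeValueKV{kv: make(map[string]record), blobs: newBlobStore()}
}

func (s *largeValueKV) Put(key string, value []byte) {
	if len(value) <= inlineLimit {
		s.kv[key] = record{inline: append([]byte(nil), value...)}
		return
	}
	s.kv[key] = record{blobID: s.blobs.put(value)}
}

func (s *largeValueKV) Get(key string) ([]byte, bool) {
	rec, ok := s.kv[key]
	if !ok {
		return nil, false
	}
	if rec.blobID == "" {
		return rec.inline, true
	}
	return s.blobs.get(rec.blobID)
}

func main() {
	store := newLargeValueKV()
	store.Put("small", []byte("tiny value"))
	store.Put("large", make([]byte, 4096))

	small, _ := store.Get("small")
	large, _ := store.Get("large")
	fmt.Println(len(small), len(large), len(store.blobs.blobs))
}
```

The key-value store itself only ever holds small records, so its size and compaction cost stay modest while the bulk of the bytes live in the blob store.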

SeaweedFS can transparently integrate with the cloud. With hot data on the local cluster and warm data on the cloud with O(1) access time, SeaweedFS can achieve both fast local access and elastic cloud storage capacity. What's more, the cost of cloud storage API access is minimized. Faster and cheaper than direct cloud storage!

System	File Metadata	File Content Read	POSIX	REST API	Optimized for large number of small files
SeaweedFS	lookup volume id, cacheable	O(1) disk seek		Yes	Yes
SeaweedFS Filer	Linearly Scalable, Customizable	O(1) disk seek	FUSE	Yes	Yes
GlusterFS	hashing		FUSE, NFS
Ceph	hashing + rules		FUSE	Yes
MooseFS	in memory		FUSE		No
MinIO	separate meta file for each file			Yes	No

SeaweedFS	comparable to Ceph	advantage
Master	MDS	simpler
Volume	OSD	optimized for small files
Filer	Ceph FS	linearly scalable, Customizable, O(1) or O(logN)

Table of Contents

Quick Start

Quick Start for S3 API on Docker

Quick Start with Single Binary

Quick Start SeaweedFS S3 on AWS

Introduction

Features

Additional Features

Filer Features

Kubernetes

Example: Using Seaweed Object Store

Start Master Server

Start Volume Servers

Write File

Save File Id

Read File

Rack-Aware and Data Center-Aware Replication

Allocate File Key on Specific Data Center

Other Features

Object Store Architecture

Master Server and Volume Server

Write and Read files

Storage Size

Saving memory

Tiered Storage to the cloud

Compared to Other File Systems

Compared to HDFS

Compared to GlusterFS, Ceph

Compared to GlusterFS

Compared to MooseFS

Compared to Ceph

Compared to MinIO

Dev Plan

Installation Guide

Disk Related Topics

Hard Drive Performance

Solid State Disk

Benchmark

License

Stargazers over time
