This page describes hardware used in actual deployments.
From Aaron: 37 PiB raw storage
Currently, we operate a cluster of 14 1U servers. Each server has two Seagate 5U 84-bay SAS JBOD enclosures attached, populated with 18 TB 7.2k RPM drives. This yields about 37 PiB of raw capacity. Initially, the servers were deployed with one volume server per drive. This was later changed to a simple RAID 0 across 4 drives to address edge cases around single-threaded throughput, which dramatically improved rebuilding and moving of volume files.
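The raw-capacity figure can be checked with a little arithmetic. The sketch below assumes every bay in every enclosure is populated and that "18TB" means decimal terabytes (10^12 bytes); the variable names are illustrative only.

```python
# Figures taken from the description above.
servers = 14
enclosures_per_server = 2
bays_per_enclosure = 84          # assumption: all bays are populated
drive_bytes = 18 * 10**12        # assumption: 18 TB in decimal terabytes

drives = servers * enclosures_per_server * bays_per_enclosure
raw_bytes = drives * drive_bytes
raw_pib = raw_bytes / 2**50      # 1 PiB = 2^50 bytes

print(f"{drives} drives, {raw_pib:.1f} PiB raw")
```

Under those assumptions the cluster holds 2352 drives, which is consistent with the stated figure of about 37 PiB.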
Everything is protected with replication=010 (one additional copy on a different rack in the same data center). Erasure coding is not used because, as of 2023-01-18, it cannot guarantee that multiple fragments are stored on separate physical servers.
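To make the replication=010 setting concrete, here is a minimal sketch that decodes a three-digit replication string. It assumes the SeaweedFS convention that the digits count extra copies on other data centers, on other racks in the same data center, and on other servers in the same rack, in that order. The function name and dictionary keys are hypothetical and are not part of any SeaweedFS API.

```python
def describe_replication(code: str) -> dict:
    """Decode a three-digit replication string such as "010" (hypothetical helper)."""
    if len(code) != 3 or any(c not in "0123456789" for c in code):
        raise ValueError(f"expected three digits, got {code!r}")
    other_dcs, other_racks, other_servers = (int(c) for c in code)
    return {
        "other_data_centers": other_dcs,
        "other_racks_same_dc": other_racks,
        "other_servers_same_rack": other_servers,
        # One original copy plus every extra copy requested.
        "total_copies": 1 + other_dcs + other_racks + other_servers,
    }


layout = describe_replication("010")
usable_fraction = 1 / layout["total_copies"]
print(layout["total_copies"], usable_fraction)
```

With two copies of every volume, roughly half of the raw capacity is available for unique data, before any file system or volume overhead.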
The cluster has experienced multiple drive failures without any impact. The biggest issue encountered to date is that cluster maintenance tasks such as volume.fix.replication, volume.check.disk, and volume.vacuum are executed against only a single volume file at a time, so they take a very long time to complete across the cluster.
Details about the Seagate 5U 84-bay SAS JBOD enclosures can be found at https://www.seagate.com/products/storage/data-storage-systems/jbod/exos-e-5u84/