Network & Infrastructure2026-04-07

Amazon S3 Express One Zone Complete Guide — Single-Digit Millisecond Latency Storage for ML/HPC [2026]

Amazon S3 Express One Zone delivers up to 10x performance and single-digit millisecond latency vs S3 Standard. This guide covers the 2025 price cuts (up to 85% on GET requests), real-world gains (283% for ClickHouse, 10x for Pinterest), the March 2026 CloudWatch update, and full setup for ML/HPC workloads.

AWS S3 S3 Express One Zone 低遅延 ML HPC

What is Amazon S3 Express One Zone?

Amazon S3 Express One Zone delivers up to 10x performance and single-digit millisecond latency compared to S3 Standard. It is a purpose-built, high-performance storage class designed for latency-sensitive workloads including ML training data loading, HPC jobs, and real-time analytics. The performance gains come from a single-AZ architecture and a dedicated request processing path.

2025 Price Cuts — Cost Barrier Significantly Reduced

In 2025 AWS dramatically reduced S3 Express One Zone pricing, addressing the primary adoption barrier:

Item	Price Reduction	Impact
Storage	-31%	Lower cost for large datasets
PUT requests	-55%	ML write-heavy pipelines much cheaper
GET requests	-85%	Repeated training data reads become very affordable

Real-World Performance Results

Multiple organizations have published measured results after migrating to S3 Express One Zone:

ClickHouse: Query performance improved by 283% after switching to S3 Express One Zone, with the largest gains in workloads involving concurrent access to many small files.

Pinterest: Data processing pipeline speed increased approximately 10x after migrating storage to S3 Express One Zone, significantly reducing ML model training cycle times.

These results demonstrate that ROI is highest for workloads where storage I/O latency is the primary bottleneck.

S3 Express One Zone Architecture

Loading diagram...

March 2026 CloudWatch Update

The March 2026 update expanded CloudWatch metrics for S3 Express One Zone significantly:

- Per-minute request counts: Enables burst detection and capacity planning
- Data transfer (IN/OUT): Granular network cost analysis
- Error rates (4xx/5xx): Application failure detection
- Latency percentiles (P50/P95/P99): SLA monitoring and anomaly detection

This makes it possible to immediately detect performance degradation in ML pipelines using S3 Express One Zone and trigger automated alerts.

Ideal Use Cases for S3 Express One Zone

Use Case	Why S3 Express One Zone Helps
ML/DL model training data	Repeated full dataset reads per epoch
Real-time analytics	Sub-1ms P99 latency for aggregation queries
Interactive applications	Faster response times for user requests
HPC simulations	High-speed access to large intermediate files
Game servers	Fast asset delivery and session state management
Financial trading	Ultra-fast data reads for microsecond decisions

When S3 Express One Zone Is Not the Right Choice

Not every workload benefits from S3 Express One Zone. Consider alternatives for:

- Long-term archival: S3 Glacier Instant Retrieval is far more cost-effective for infrequently accessed data
- Multi-AZ redundancy required: Single-AZ design means data is inaccessible during an AZ outage
- Cost-first workloads: Higher price than S3 Standard when latency is not critical
- Large cold datasets: S3 Intelligent-Tiering is better for large volumes with unpredictable access patterns

S3 Standard vs S3 Express One Zone Comparison

Feature	S3 Standard	S3 Express One Zone
Latency	Tens of milliseconds	Single-digit milliseconds
Throughput	Standard	Up to 10x
TPS	Standard	Up to 10x
Availability SLA	99.99%	99.95% (single AZ)
Durability	11 Nines	11 Nines (single AZ)
Storage price	~$0.023/GB/month	~$0.016/GB/month (after cuts)
GET request price	~$0.0004/1,000	~$0.00006/1,000 (after cuts)
Multi-AZ redundancy	Yes	No
Primary use	General purpose	Low-latency, high-throughput

Directory Buckets — Express One Zone's Dedicated Bucket Type

S3 Express One Zone uses a dedicated bucket type called a "directory bucket". Unlike general-purpose buckets, directory buckets feature a hierarchical namespace optimized for fast prefix operations.

Key characteristics:
- Bucket names follow the format <name>--<az-id>--x-s3 (e.g., my-ml-bucket--use1-az4--x-s3)
- Fully compatible with the standard S3 API; existing SDKs work unchanged
- Access via CreateSession API for temporary session-based credentials
- Optimized for parallel, high-frequency small-object operations

Setup Guide — Directory Bucket Creation and IAM

Step 1: Find Your AZ ID and Create the Bucket

bash

# Find AZ IDs in your region
aws ec2 describe-availability-zones \
  --region us-east-1 \
  --query 'AvailabilityZones[*].[ZoneName,ZoneId]'

# Create directory bucket
aws s3api create-bucket \
  --bucket my-ml-data--use1-az4--x-s3 \
  --region us-east-1

Step 2: IAM Policy

json

{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Action": [
        "s3express:CreateSession",
        "s3express:GetObject",
        "s3express:PutObject"
      ],
      "Resource": "arn:aws:s3express:us-east-1:123456789012:bucket/my-ml-data--use1-az4--x-s3"
    }
  ]
}

Step 3: Access with boto3

python

import boto3

s3 = boto3.client('s3', region_name='us-east-1')

# Upload training data
s3.put_object(
    Bucket='my-ml-data--use1-az4--x-s3',
    Key='datasets/train/batch_001.parquet',
    Body=open('batch_001.parquet', 'rb').read()
)

# High-speed read
response = s3.get_object(
    Bucket='my-ml-data--use1-az4--x-s3',
    Key='datasets/train/batch_001.parquet'
)
data = response['Body'].read()

Cost Optimization — Hybrid Architecture with S3 Standard

The most cost-effective design separates data by access temperature:

- Hot data (active training, last 7 days): S3 Express One Zone
- Warm data (evaluation, reuse, last 30 days): S3 Standard
- Cold data (archive, 30+ days): S3 Glacier Instant Retrieval

Configure S3 Lifecycle Policies to automatically move data between tiers, eliminating manual management overhead while keeping costs optimized.

Monitoring and Alerts — CloudWatch Configuration

Use the March 2026 CloudWatch metrics to set up production-grade alerting:

python

import boto3

cw = boto3.client('cloudwatch', region_name='us-east-1')

# Alert when P99 latency exceeds 10ms
cw.put_metric_alarm(
    AlarmName='S3Express-HighLatency',
    MetricName='FirstByteLatency',
    Namespace='AWS/S3Express',
    Statistic='p99',
    Dimensions=[
        {'Name': 'BucketName', 'Value': 'my-ml-data--use1-az4--x-s3'},
        {'Name': 'FilterId', 'Value': 'EntireBucket'}
    ],
    Period=60,
    EvaluationPeriods=5,
    Threshold=10.0,
    ComparisonOperator='GreaterThanThreshold',
    AlarmActions=['arn:aws:sns:us-east-1:123456789012:ml-ops-alerts']
)

Recommended monitoring targets:
- P99 FirstByteLatency: threshold 10ms
- 5xx error rate: threshold 0.1%
- Per-minute request count: spike detection

Frequently Asked Questions

Q1. Which AWS regions support S3 Express One Zone?
Major regions including US East (N. Virginia), US West (Oregon), and Europe (Ireland). Check the AWS documentation for the latest list.

Q2. Do existing S3 SDKs and APIs work with directory buckets?
Yes. S3 Express One Zone uses S3-compatible APIs. Existing boto3, AWS CLI, and SDK code works without modification; only the bucket name format differs.

Q3. What happens during an AZ outage?
Data becomes temporarily inaccessible. Always configure replication to S3 Standard for important datasets to maintain access during AZ-level failures.

Q4. Is there a throttling risk for large ML training jobs?
Yes. Request rate limits apply. Implement exponential backoff and manage CreateSession connections carefully for large parallel training jobs.

Q5. Can S3 Express One Zone be combined with Intelligent-Tiering?
Not directly. Use lifecycle policies to move data from S3 Express One Zone to S3 Standard or Intelligent-Tiering buckets as it ages.

Q6. Do per-minute CloudWatch metrics incur additional charges?
Yes, CloudWatch custom metrics are billed per metric. Limit monitoring to the most critical metrics to manage costs.

Q7. Does Amazon SageMaker support S3 Express One Zone natively?
Yes. SageMaker natively supports directory buckets for training datasets, significantly reducing job startup and per-epoch data loading times.

How Oflight Can Help

Optimizing ML training pipelines with S3 Express One Zone, designing hybrid storage architectures, or setting up CloudWatch monitoring? Oflight provides AWS infrastructure design and implementation services tailored to ML and HPC workloads. Visit our Network & Infrastructure Services page to get started.

Feel free to contact us