Monitoring - A fix has been implemented and we are monitoring the results.
Mar 18, 2026 - 12:40 UTC
Identified - The team is rolling out additional capacity in DFW, which should help ease Machine start failures across the region.
Mar 18, 2026 - 11:44 UTC
Investigating - We are investigating reports of machines failing to start in the DFW (Dallas) region with "insufficient memory" errors. This may cause deployment failures for applications running in DFW.

Our team is actively working to restore full capacity in the region. If you are affected, deploying to an alternate region may serve as a temporary workaround.

We will provide updates as the situation progresses.

Mar 18, 2026 - 09:58 UTC

About This Site

This page is for updates about global incidents. It does not include updates about routine hardware failures or isolated infrastructure events that have limited impact. For a personalized view of all events that might affect your apps, please check the personalized status page in your Fly Organization's dashboard. For all internal incidents and other activities, please check Infra Log.

Customer Applications Operational
Dashboard Operational
Machines API Operational
Regional Availability Operational
AMS - Amsterdam, Netherlands Operational
ARN - Stockholm, Sweden Operational
BOM - Mumbai, India Operational
CDG - Paris, France Operational
DFW - Dallas, Texas (US) Operational
EWR - Secaucus, NJ (US) Operational
FRA - Frankfurt, Germany Operational
GRU - Sao Paulo, Brazil Operational
IAD - Ashburn, Virginia (US) Operational
JNB - Johannesburg, South Africa Operational
LAX - Los Angeles, California (US) Operational
LHR - London, United Kingdom Operational
NRT - Tokyo, Japan Operational
ORD - Chicago, Illinois (US) Operational
SIN - Singapore Operational
SJC - San Jose, California (US) Operational
SYD - Sydney, Australia Operational
YYZ - Toronto, Canada Operational
Persistent Storage (Volumes) Operational
Deployments Operational
Remote Builds Operational
Logs Operational
Metrics Operational
SSL/TLS Certificate Provisioning Operational
UDP Anycast Operational
Fly Machine Image Registry 1 Operational
Fly Machine Image Registry 2 Operational
Extensions Operational
Upstash for Redis Operational
DNS Operational
Fly Machine .internal DNS Operational
Fly Machine External DNS Operational
*.flyio.net Nameservers Operational
flydns.net Operational
Billing Operational
Usage Metrics API Operational
Stripe API Connection Operational
Corrosion Operational
Managed Postgres Operational (99.95% uptime over the past 90 days)
Management Plane - ORD Operational (99.96% uptime over the past 90 days)
Management Plane - IAD Operational (99.81% uptime over the past 90 days)
Management Plane - FRA Operational (100.0% uptime over the past 90 days)
Management Plane - GRU Operational (100.0% uptime over the past 90 days)
Management Plane - LAX Operational (100.0% uptime over the past 90 days)
Management Plane - SYD Operational (99.98% uptime over the past 90 days)
Management Plane - AMS Operational (99.77% uptime over the past 90 days)
Management Plane - LHR Operational (100.0% uptime over the past 90 days)
Management Plane - NRT Operational (100.0% uptime over the past 90 days)
Management Plane - SIN Operational (99.87% uptime over the past 90 days)
Management Plane - SJC Operational (100.0% uptime over the past 90 days)
Management Plane - YYZ Operational (100.0% uptime over the past 90 days)
Phoenix.new Operational
Support Portal Operational
Sprites Operational
Mar 18, 2026
Resolved - This incident has been resolved.
Mar 18, 17:02 UTC
Monitoring - A fix has been implemented and we are monitoring the results.
Mar 18, 16:31 UTC
Investigating - We are investigating intermittent network issues in the SJC region affecting outbound public IPv6 access from Machines. Connections to IPv6 internet resources from apps hosted in the SJC region may be slow or may fail at this time.

IPv4 access, as well as 6PN private networking, are unaffected.

Mar 18, 16:12 UTC
Resolved - This incident has been resolved.
Mar 18, 14:18 UTC
Monitoring - Between 13:55 and 14:03 UTC machines and MPG clusters hosted in the SJC region saw elevated connection errors. Users may have seen errors connecting to or from most machines in the region, as well as with deployments or updates to machines in the region.

Networking has returned to normal in the region, and we are continuing to monitor closely to ensure stable recovery.

Mar 18, 14:07 UTC
Resolved - This incident has been resolved.
Mar 18, 14:18 UTC
Monitoring - A fix has been implemented, and we are seeing `fly ssh console` commands succeed as normal.
Mar 18, 14:17 UTC
Identified - We have identified an issue causing new `fly ssh console` connections to fail with 500 errors. A fix is in progress.
Mar 18, 14:12 UTC
Mar 17, 2026

No incidents reported.

Mar 16, 2026

No incidents reported.

Mar 15, 2026

No incidents reported.

Mar 14, 2026
Resolved - This incident has been resolved.
Mar 14, 14:05 UTC
Monitoring - Organizations whose names begin with numerical digits may experience 401 errors. Affected operations include Sprite creation and listing, among others.

A fix has been in place since 2026-03-14 12:30 UTC, and we are monitoring the results.

Mar 14, 04:20 UTC
Mar 13, 2026

No incidents reported.

Mar 12, 2026

No incidents reported.

Mar 11, 2026
Resolved - This incident has been resolved.
Mar 11, 11:37 UTC
Update - While the secret storage service was in a read-only state, app creation requests queued up because of retry logic and insufficient request concurrency limits in our GraphQL API. The queued requests prevented the GraphQL API from serving any other requests. We have scaled up the GraphQL API and are continuing to monitor the situation.
Mar 11, 11:03 UTC
Monitoring - A fix has been implemented and we are monitoring the results.
Mar 11, 10:14 UTC
Identified - An ongoing data migration in our secret storage service is causing degraded Machines API functionality.
Mar 11, 09:19 UTC
Mar 10, 2026

No incidents reported.

Mar 9, 2026

No incidents reported.

Mar 8, 2026

No incidents reported.

Mar 7, 2026
Resolved - This incident has been resolved.
Mar 7, 15:56 UTC
Monitoring - A fix has been implemented and we are monitoring the results.
Mar 7, 15:10 UTC
Investigating - We are investigating a private networking failure between SYD and other regions. Apps continue to run, and private networking within SYD is unaffected.
Mar 7, 14:42 UTC
Mar 6, 2026

No incidents reported.

Mar 5, 2026
Resolved - This incident has been resolved. Due to a BGP issue, some North American traffic was routed to edges in Singapore (SIN). Users in North America would have seen additional request latency during this period.
Mar 5, 19:50 UTC
Monitoring - A fix has been implemented and we are monitoring the results.
Mar 5, 19:38 UTC
Investigating - We're aware of routing issues affecting some customers in North American regions, and we're actively investigating.
Mar 5, 19:24 UTC
Mar 4, 2026
Completed - The scheduled maintenance has been completed.
Mar 4, 09:00 UTC
In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.
Mar 4, 03:00 UTC
Scheduled - An upstream provider is performing network maintenance in GRU on 2026-03-04, from 03:00 UTC (00:00 local time) to 09:00 UTC (06:00 local time). A loss of connectivity for up to 30 minutes is expected within the scheduled maintenance window.
Feb 20, 19:32 UTC