All Systems Operational

About This Site

This page is for updates about global incidents. It does not include updates about routine hardware failures or isolated infrastructure events that have limited impact. For a personalized view of all events that might affect your apps, please check the personalized status page in your Fly Organization's dashboard. For all internal incidents and other activities, please check Infra Log.

Customer Applications Operational
Dashboard Operational
Machines API Operational
Regional Availability Operational
AMS - Amsterdam, Netherlands Operational
ARN - Stockholm, Sweden Operational
BOM - Mumbai, India Operational
CDG - Paris, France Operational
DFW - Dallas, Texas (US) Operational
EWR - Secaucus, NJ (US) Operational
FRA - Frankfurt, Germany Operational
GRU - Sao Paulo, Brazil Operational
IAD - Ashburn, Virginia (US) Operational
JNB - Johannesburg, South Africa Operational
LAX - Los Angeles, California (US) Operational
LHR - London, United Kingdom Operational
NRT - Tokyo, Japan Operational
ORD - Chicago, Illinois (US) Operational
SIN - Singapore Operational
SJC - San Jose, California (US) Operational
SYD - Sydney, Australia Operational
YYZ - Toronto, Canada Operational
Persistent Storage (Volumes) Operational
Deployments Operational
Remote Builds Operational
Logs Operational
Metrics Operational
SSL/TLS Certificate Provisioning Operational
UDP Anycast Operational
Fly Machine Image Registry 1 Operational
Fly Machine Image Registry 2 Operational
Extensions Operational
Upstash for Redis Operational
DNS Operational
Fly Machine .internal DNS Operational
Fly Machine External DNS Operational
*.flyio.net Nameservers Operational
flydns.net Operational
Billing Operational
Usage Metrics API Operational
Stripe API Connection Operational
Corrosion Operational
Managed Postgres Operational (99.91 % uptime over the past 90 days)
Management Plane - ORD Operational (99.92 % uptime over the past 90 days)
Management Plane - IAD Operational (99.86 % uptime over the past 90 days)
Management Plane - FRA Operational (99.92 % uptime over the past 90 days)
Management Plane - GRU Operational (99.92 % uptime over the past 90 days)
Management Plane - LAX Operational (99.92 % uptime over the past 90 days)
Management Plane - SYD Operational (99.92 % uptime over the past 90 days)
Management Plane - AMS Operational (100.0 % uptime over the past 90 days)
Management Plane - LHR Operational (100.0 % uptime over the past 90 days)
Management Plane - NRT Operational (100.0 % uptime over the past 90 days)
Management Plane - SIN Operational (100.0 % uptime over the past 90 days)
Management Plane - SJC Operational (100.0 % uptime over the past 90 days)
Management Plane - YYZ Operational (100.0 % uptime over the past 90 days)
Phoenix.new Operational
Support Portal Operational
Jan 12, 2026
Resolved - This incident has been resolved and all hosts in BOM are accurately reporting metrics.
Jan 12, 04:29 UTC
Update - One host in BOM is still reporting delayed metrics as it continues to catch up. All other hosts in BOM are reporting metrics correctly.

If needed, users with impacted machines can use `fly machine clone` to create new machines in the region, which should land on a different host.

Jan 11, 18:43 UTC
Update - Metrics have returned to normal for most hosts in BOM. Two hosts are still reporting delayed metrics, but are continuing to catch up.

Users with impacted machines can use `fly machine clone` to add new machines, which should land on a different host.

Jan 11, 05:36 UTC
Update - Metrics have completed backfilling and are up to date on most hosts in the BOM region.
Two hosts are still working through the backlog; machines on those two hosts are still reporting delayed metrics at this time.

Jan 10, 01:28 UTC
Update - We are continuing to process the metrics backlog in BOM. Progress is being made, but due to the volume of metrics this may still take some time to fully complete.

At this time, users with machines in BOM should see metrics from the past 12h beginning to backfill into fly-metrics.net. However, most will not be fully caught up yet.

Jan 9, 20:36 UTC
Update - We are continuing to work through the backlog of metrics in BOM. Metrics will remain unavailable for BOM machines until this is complete.
Jan 9, 17:59 UTC
Update - Our metrics cluster is continuing to work through the backlog of metrics in BOM. Metrics for machines running in the BOM region will continue to be unavailable until this is complete.
Jan 9, 15:20 UTC
Identified - The cause of the issue has been identified and a fix is being implemented.

Metrics for machines in the BOM region remain unavailable in fly-metrics.net; however, the machines themselves continue to run, start, and stop normally.

Jan 9, 14:07 UTC
Investigating - We are currently investigating issues collecting machine metrics for machines running in the BOM (Mumbai, India) region.
Machines in the region continue to run, start, and stop normally; however, metrics for these machines are not displaying in fly-metrics.net.

Jan 9, 12:53 UTC
Jan 11, 2026
Resolved - This incident has been resolved. We have seen API latency normalize and remain normal since ~19:00 UTC.
Jan 11, 20:44 UTC
Monitoring - API error rates have normalized; however, users may still see elevated latency reaching some GraphQL endpoints. Latency continues to trend in the right direction, and we continue to monitor for full recovery.
Jan 11, 17:53 UTC
Update - We have deployed an initial fix and are seeing improvements. GraphQL error rates and latency remain elevated over the baseline at this time. We are continuing to keep a close eye on recovery.
Jan 11, 17:33 UTC
Identified - The issue has been identified and a fix is being implemented.
Jan 11, 17:22 UTC
Update - We are continuing to investigate elevated latency and error rates on our GraphQL API endpoints. Users may see errors on parts of the platform that use these APIs. This includes Flyctl actions such as deploys, as well as the fly.io dashboard.
Jan 11, 17:21 UTC
Investigating - We are investigating elevated API latency and error rates on the platform. Users may see delays or errors creating apps, as well as on some dashboard pages.

Machines API actions appear unimpacted at this time.

Jan 11, 17:12 UTC
Jan 10, 2026
Jan 9, 2026
Jan 8, 2026

No incidents reported.

Jan 7, 2026

No incidents reported.

Jan 6, 2026
Resolved - This incident has been resolved.
Jan 6, 19:30 UTC
Monitoring - A fix has been implemented and we are seeing system performance return to normal. Machine API and general platform operations are succeeding again, although users may see slightly elevated error rates as things finish stabilizing.

We are continuing to closely monitor the platform to ensure full recovery and stability.

Jan 6, 08:59 UTC
Update - We are continuing to work on a fix for this issue.
Jan 6, 08:26 UTC
Update - Services are starting to come up. The dashboard should be accessible, and deploys and other flyctl-based commands should work. Some services may feel sluggish while things warm up.
Jan 6, 08:01 UTC
Update - The team is getting closer to a fix. We will provide another update within the next 30 minutes.
Jan 6, 07:23 UTC
Update - We are continuing to make progress on a fix for this issue.
Jan 6, 06:48 UTC
Update - We are continuing to work on restoring service to the Machines API and other affected platform components.
Jan 6, 06:01 UTC
Update - We are continuing to work on deploying a fix. The fly.io dashboard and the machines API continue to be unavailable at this time.

Fly Managed Postgres (MPG) clusters continue to run normally; however, creating new clusters will fail at this time. Users may also see scheduled backups remain in a running or pending state. These backups will resume as scheduled once the platform-level issues are resolved.

Jan 6, 05:32 UTC
Identified - We have identified the cause of the outage and are working on a fix. The fly.io dashboard and the machines API continue to be unavailable at this time.

Running machines should continue to stay up and be reachable at this time. However, creating, starting, or stopping machines, running new deployments, and other operations that rely on the Machines API remain unavailable.

Jan 6, 04:47 UTC
Investigating - We are investigating a major outage of our control plane. Apps may continue to run, but it is not currently possible to log in to the dashboard or use the Machines API.
Jan 6, 04:21 UTC
Jan 5, 2026

No incidents reported.

Jan 4, 2026

No incidents reported.

Jan 3, 2026

No incidents reported.

Jan 2, 2026

No incidents reported.

Jan 1, 2026
Resolved - This incident has been resolved.
Jan 1, 05:52 UTC
Monitoring - We're seeing performance on impacted routes return to normal levels. Prior to recovering, we observed intermittent high packet loss for traffic between the US and EU, most acutely from approximately Dec 31 23:35 to 23:45 UTC, and later from Jan 1 00:40 to 01:35 UTC.
Jan 1, 02:01 UTC
Investigating - We've detected degraded network performance on some of our upstream network providers, impacting traffic between US and EU regions. We're in contact with these teams as we monitor for recovery.
Jan 1, 01:09 UTC
Dec 31, 2025

No incidents reported.

Dec 30, 2025

No incidents reported.

Dec 29, 2025

No incidents reported.