All Systems Operational

About This Site


Personalized Status


This page is for updates about global incidents. It does not include updates about routine hardware failures or isolated infrastructure events that have limited impact. For a personalized view of all events that might affect your apps, please check the status page in your Fly Organization's dashboard.

View Status page


DNS Operational
Fly Machine .internal DNS ? Operational
Fly Machine External DNS Operational
*.fly.dev Nameservers Operational
*.flyio.net Nameservers Operational
Logs Operational
Metrics ? Operational
Platform and Tools Operational
90 days ago
99.95 % uptime
Today
API Operational
90 days ago
99.72 % uptime
Today
Deployments Operational
90 days ago
99.96 % uptime
Today
Dashboard Operational
90 days ago
100.0 % uptime
Today
Remote Builds Operational
90 days ago
99.96 % uptime
Today
SSL/TLS Certificate Provisioning Operational
90 days ago
100.0 % uptime
Today
Upstash Redis ? Operational
90 days ago
100.0 % uptime
Today
Persistent Storage (Volumes) ? Operational
90 days ago
100.0 % uptime
Today
UDP Anycast ? Operational
90 days ago
100.0 % uptime
Today
Fly Machine Image Registry 1 Operational
Fly Machine Image Registry 2 Operational
Regional Availability Operational
90 days ago
99.99 % uptime
Today
AMS - Amsterdam, Netherlands Operational
90 days ago
100.0 % uptime
Today
ARN - Stockholm, Sweden Operational
90 days ago
100.0 % uptime
Today
ATL - Atlanta, Georgia (US) Operational
90 days ago
100.0 % uptime
Today
BOM - Mumbai, India Operational
90 days ago
100.0 % uptime
Today
CDG - Paris, France Operational
90 days ago
100.0 % uptime
Today
DFW - Dallas, Texas (US) Operational
90 days ago
100.0 % uptime
Today
EWR - Secaucus, New Jersey (US) Operational
90 days ago
100.0 % uptime
Today
FRA - Frankfurt, Germany Operational
90 days ago
100.0 % uptime
Today
GRU - São Paulo, Brazil Operational
90 days ago
100.0 % uptime
Today
HKG - Hong Kong Operational
90 days ago
99.97 % uptime
Today
IAD - Ashburn, Virginia (US) Operational
90 days ago
100.0 % uptime
Today
JNB - Johannesburg, South Africa Operational
90 days ago
100.0 % uptime
Today
LAX - Los Angeles, California (US) Operational
90 days ago
99.98 % uptime
Today
LHR - London, United Kingdom Operational
90 days ago
99.84 % uptime
Today
MAA - Chennai (Madras), India Operational
90 days ago
100.0 % uptime
Today
MAD - Madrid, Spain Operational
90 days ago
100.0 % uptime
Today
MIA - Miami, Florida (US) Operational
90 days ago
100.0 % uptime
Today
MRS - Marseille, France Operational
90 days ago
100.0 % uptime
Today
NRT - Tokyo, Japan Operational
90 days ago
99.99 % uptime
Today
ORD - Chicago, Illinois (US) Operational
90 days ago
100.0 % uptime
Today
SCL - Santiago, Chile Operational
90 days ago
100.0 % uptime
Today
SEA - Seattle, Washington (US) Operational
90 days ago
100.0 % uptime
Today
SIN - Singapore Operational
90 days ago
100.0 % uptime
Today
SJC - Sunnyvale, California (US) Operational
90 days ago
99.99 % uptime
Today
SYD - Sydney, Australia Operational
90 days ago
100.0 % uptime
Today
YYZ - Toronto, Canada Operational
90 days ago
100.0 % uptime
Today
Billing (Stripe API Connection) Operational
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance
Major outage
Partial outage
No downtime recorded on this day.
No data exists for this day.
had a major outage.
had a partial outage.
API Success Rate
Fetching
Past Incidents
Apr 25, 2024

No incidents reported today.

Apr 24, 2024
Resolved - We experienced a temporary outage in the wireguard mesh connecting all of our physical hosts which resulted in a number of internal systems losing connectivity with each other. The mesh has been repaired, but some regions required manual intervention, notably: AMS, BOM, CDG, HKG, IAD, MIA, SJC, SYD, YYZ. The following was impacted (a non-exhaustive list, this was a full network outage): fly token validation (including github actions and flyctl auth), 6pn networking calls, log delivery, routing requests to user machines.
Apr 24, 21:28 UTC
Update - We have identified a small number of physical servers that are still impacted and are recovering them.
Apr 24, 19:56 UTC
Monitoring - A fix has been implemented and we are monitoring the results.
Apr 24, 19:25 UTC
Investigating - We are observing elevated errors and connectivity issues across Fly.io
Apr 24, 18:58 UTC
Apr 23, 2024

No incidents reported.

Apr 22, 2024

No incidents reported.

Apr 21, 2024
Resolved - This incident has been resolved.
Apr 21, 05:45 UTC
Identified - We have confirmed an upstream network issue affecting traffic routing through the Pacific Northwest region.
Apr 21, 05:23 UTC
Investigating - We are investigating a network issue primarily affecting the SEA and HKG regions.
Apr 21, 04:21 UTC
Apr 20, 2024

No incidents reported.

Apr 19, 2024

No incidents reported.

Apr 18, 2024
Resolved - This incident has been resolved
Apr 18, 16:42 UTC
Monitoring - A fix has been implemented upstream by our providers and we're monitoring the network.
Apr 18, 15:01 UTC
Update - We've identified that the network issues in MAD are affecting remote builds in this region as well.
We continue investigating with our provider in the region to get the issue identified and resolved.

Apr 18, 12:19 UTC
Investigating - We are investigating a networking-related issue in our MAD region workers. This is manifesting as timeouts for some customer's apps trying to reach external services.
Apr 18, 08:51 UTC
Apr 17, 2024
Resolved - This issue is resolved.
Apr 17, 23:35 UTC
Monitoring - Metrics propagation has normalized and missing metrics are being backfilled.
Apr 17, 22:35 UTC
Identified - We've identified and applied a potential fix. We're currently monitoring the results. Some metrics continue to be delayed
Apr 17, 22:01 UTC
Investigating - We're investigating delays in metrics distribution. Some apps may see delays in their metrics showing up on fly-metrics.net
Apr 17, 20:51 UTC
Apr 16, 2024
Resolved - This incident has been resolved.
Apr 16, 19:49 UTC
Monitoring - A fix has been implemented and we are monitoring the results.
Apr 16, 09:36 UTC
Identified - The issue has been identified and a fix is being implemented.
Apr 16, 09:23 UTC
Update - We are continuing to investigate this issue.
Apr 16, 09:11 UTC
Investigating - We are currently investigating this issue.
If you are experiencing this issue, a reploy/update might fix it while we work to resolve this.

Apr 16, 09:10 UTC
Apr 15, 2024
Resolved - This issue is resolved
Apr 15, 05:26 UTC
Monitoring - Customer log access has been restored. We are continuing to monitor logging functionality.
Apr 15, 05:03 UTC
Investigating - We are currently investigating an issue with logs from machines being unavailable.
Apr 15, 04:52 UTC
Apr 14, 2024

No incidents reported.

Apr 13, 2024

No incidents reported.

Apr 12, 2024
Resolved - This incident is now resolved. After the fix was deployed, the managed metrics websites are responding swiftly now.
Apr 12, 13:28 UTC
Monitoring - A fix has been implemented and we are monitoring the results.
Apr 12, 12:46 UTC
Resolved - This incident is now resolved. After the fix was deployed, 502 responses rates are back to baseline.
Apr 12, 13:25 UTC
Update - We've deployed a fix that recovered the machines affected by this incident.
Apr 12, 12:59 UTC
Monitoring - This issue is a fallout from previous incident. For a solution, please refer to https://community.fly.io/t/fly-app-servers-down/19229/40

We are working on finding a fix for the stuck machines.

Apr 12, 09:44 UTC
Resolved - This incident has been resolved.
Apr 12, 07:47 UTC
Update - Creating new machines should work fine. Some machines may be stuck in an invalid state. We are working towards unclogging those.
Apr 12, 07:03 UTC
Monitoring - A fix has been implemented and we are monitoring the results.
Apr 12, 06:21 UTC
Apr 11, 2024

No incidents reported.