Management plane outage

Incident Report for Fly.io

Resolved

This incident has been resolved.
Posted Jan 06, 2026 - 19:30 UTC

Monitoring

A fix has been implemented and we are seeing system performance return to normal. Machine API and general platform operations are succeeding again, although users may see slightly elevated error rates as things finish stabilizing.

We are continuing to closely monitor the platform to ensure full recovery and stability.
Posted Jan 06, 2026 - 08:59 UTC

Update

We are continuing to work on a fix for this issue.
Posted Jan 06, 2026 - 08:26 UTC

Update

Services are starting to come up. The dashboard should be accessible and deploys and other flyctl based commands should work. Some services may feel sluggish while things heat up.
Posted Jan 06, 2026 - 08:01 UTC

Update

The team is getting closer to a fix. We will provide another update within the next 30 minutes.
Posted Jan 06, 2026 - 07:23 UTC

Update

We are continuing to make progress on a fix for this issue.
Posted Jan 06, 2026 - 06:48 UTC

Update

We are continuing to work on restoring service to the Machines API and other affected platform components..
Posted Jan 06, 2026 - 06:01 UTC

Update

We are continuing to work on deploying a fix. The fly.io dashboard and the machines API continue to be unavailable at this time.

Fly Managed Postgres (MPG) clusters continue to run normally, however creating new clusters will fail at this time. Users may also see scheduled backups remain in a running or pending state at this time. These backups will resume as scheduled once the platform level issues are resolved
Posted Jan 06, 2026 - 05:32 UTC

Identified

We have identified the cause of the outage and are working on a fix. The fly.io dashboard and the machines API continue to be unavailable at this time.

Running machines should continue to stay up and be reachable at this time. However creating/starting/stopping machines, running new deployments, or other operations that rely on the machines API remain unavailable.
Posted Jan 06, 2026 - 04:47 UTC

Investigating

We are investigating a major outage of our control plane. Apps may continue to run, but it is not currently possible to log in to the dashboard or use the Machines API.
Posted Jan 06, 2026 - 04:21 UTC
This incident affected: Customer Applications, Dashboard, Machines API, Deployments, Metrics, Phoenix.new and Managed Postgres (Management Plane - ORD, Management Plane - IAD, Management Plane - FRA, Management Plane - GRU, Management Plane - LAX, Management Plane - SYD, Management Plane - AMS, Management Plane - LHR, Management Plane - NRT, Management Plane - SIN, Management Plane - SJC, Management Plane - YYZ).