All Systems Operational
Intake ? Operational
Main website ? Operational
Error processing ? Operational
Metrics processing ? Operational
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance
Past Incidents
Sep 26, 2016

No incidents reported today.

Sep 25, 2016

No incidents reported.

Sep 24, 2016

No incidents reported.

Sep 23, 2016

No incidents reported.

Sep 22, 2016

No incidents reported.

Sep 21, 2016
Resolved - Metrics processing fully caught up.
Sep 21, 15:26 UTC
Monitoring - Delay was caused by a commit putting too much overhead on our Cassandra stores. It has been reverted and now we are catching up steadily.
Sep 21, 13:54 UTC
Investigating - We are experiencing problems with metrics processing, 24h and 72h data is delayed
Sep 21, 11:52 UTC
Sep 20, 2016
Resolved - The database cluster is fully functional again. The outage was caused by network connectivity issues of a leader database server. We migrated the server to another host, which resolved the issue. Unfortunately, during the outage, the intake stopped accepting data and returned HTTP 500 errors. We will work on making the intake more resilient against such scenarios.
Sep 20, 22:53 UTC
Monitoring - The database cluster stabilized again and the intake resumed accepting data. We are investigating the root cause.
Sep 20, 22:31 UTC
Investigating - We're experiencing problems with our database cluster and are investigating
Sep 20, 22:18 UTC
Sep 19, 2016

No incidents reported.

Sep 18, 2016

No incidents reported.

Sep 17, 2016

No incidents reported.

Sep 16, 2016

No incidents reported.

Sep 15, 2016

No incidents reported.

Sep 14, 2016

No incidents reported.

Sep 13, 2016

No incidents reported.

Sep 12, 2016

No incidents reported.