All Systems Operational
Cloudera Data Platform (US) Operational
90 days ago
99.97 % uptime
Today
CDP Management Console ? Operational
90 days ago
99.95 % uptime
Today
CDP Workload Manager ? Operational
90 days ago
99.97 % uptime
Today
CDP IAM ? Operational
90 days ago
99.98 % uptime
Today
Cloudera SSO Operational
90 days ago
99.9 % uptime
Today
Cloudera Data Platform (AP) Operational
90 days ago
99.99 % uptime
Today
CDP IAM Operational
90 days ago
100.0 % uptime
Today
CDP Management Console Operational
90 days ago
99.98 % uptime
Today
Cloudera Data Platform (EU) Operational
90 days ago
99.99 % uptime
Today
CDP IAM Operational
90 days ago
100.0 % uptime
Today
CDP Management Console Operational
90 days ago
99.99 % uptime
Today
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance
Major outage
Partial outage
No downtime recorded on this day.
No data exists for this day.
had a major outage.
had a partial outage.
Past Incidents
Sep 29, 2022

No incidents reported today.

Sep 28, 2022
Resolved - AWS reports this issue as resolved.

Excerpt from incident:
[02:05 PM PDT] As of 1:43 PM PDT, error rates and latencies for invoked on API Gateway endpoints in the US-WEST-2 Region are now at normal levels. The issue began at 9:20 AM PDT when error rates and latencies for API Gateway began to increase. Error rates began to improve at 10:38 AM PDT, when engineers took action to reduce contention within the subsystem that handles request processing for API Gateway. Error rates continued to improve until 1:10 PM PDT, when engineers applied a mitigation to resolve the contention within the affected subsystem. These actions accelerated recovery, and by 1:43 PM PDT, error rates and latencies had returned to normal levels. Affected AWS services have now recovered as well. The issue has been resolved and the service is operating normally.

Sep 28, 21:23 UTC
Monitoring - Error rates have decreased within CDP. However, AWS have not reported the issue as resolved, so we are continuing to monitor the situation.
Sep 28, 19:29 UTC
Update - Update from AWS:
> 11:33 AM PDT We continue to work on resolving the elevated error rates and latencies for invokes on API Gateway endpoints in the US-WEST-2 Region. We continue to see a significant improvement in error rates, starting at 10:40 AM PDT, but are not seeing full recovery yet. The issue is caused by contention within the subsystem that is responsible for request processing within the API Gateway service. Engineers are engaged and have applied traffic filters as a precautionary measure, while they work to identify the root cause and resolve the issue. Engineers continue to work to reduce contention within the affected subsystem, which we believe will resolve the elevated error rates and latencies. Customers with applications that use API Gateway, or customers invoking Lambda functions via API Gateway, will be experiencing elevated error rates and latencies as a result of this issue. The AWS services listed below are also experiencing elevated error rates as a result of this issue. While we have seen improvements in error rates since 10:40 AM PDT, recovery has stalled and we do not have a clear ETA on full recovery. For customers that have dependencies on API Gateway and are experiencing error rates, we do not have any mitigations to recommend to address the issue on the customer side. We do expect error rates to continue to improve as contention with the affected subsystem resides, and will provide further updates as recovery progresses.

Sep 28, 18:58 UTC
Update - We are continuing to work on a fix for this issue.
Sep 28, 18:23 UTC
Identified - This has been confirmed to be due to issues at AWS: https://health.aws.amazon.com/health/status

We are monitoring the situation, and once resolved, we will follow-up with AWS for mitigation options if it occurs again.

Quote from AWS:
> We continue to see elevated error rates and latencies for invokes on API Gateway endpoints in the US-WEST-2 Region. While engineers continue to work towards root cause, we have deployed traffic filters from sources with significant increases in traffic prior to the event. As a result of these traffic filters, we are seeing a reduction in error rates and latencies, but continue to work towards full recovery. Although error rates are improving, we do not yet have an ETA for full recovery. The issue is also affecting API requests to some AWS services, including those listed below. Amazon Connect is experiencing increased failures in handling new calls, chats, and tasks as well as issues with user login in the US-WEST-2 Region. We will continue to provide updates as we progress.

Sep 28, 18:22 UTC
Investigating - We are currently investigating this issue.
Sep 28, 18:15 UTC
Sep 27, 2022

No incidents reported.

Sep 26, 2022

No incidents reported.

Sep 25, 2022

No incidents reported.

Sep 24, 2022

No incidents reported.

Sep 23, 2022

No incidents reported.

Sep 22, 2022

No incidents reported.

Sep 21, 2022

No incidents reported.

Sep 20, 2022

No incidents reported.

Sep 19, 2022

No incidents reported.

Sep 18, 2022

No incidents reported.

Sep 17, 2022

No incidents reported.

Sep 16, 2022

No incidents reported.

Sep 15, 2022

No incidents reported.