Monitoring - We have put a temporary fix in. More permanent fixes are being worked on.
Apr 14, 22:17 UTC
Update - We are continuing to work on a fix for this issue.
Apr 14, 18:57 UTC
Identified - Due to an issue with DNS resource limits, multiple services may have issues with creating , updating, and scaling certain resources.

We have discovered the problem and are taking steps to remediate the problem.
Apr 13, 18:10 UTC
Monitoring - Microsoft applied a hotfix for this issue to WestUS2 Azure region and Cloudera has verified that hotfix addresses this issue. Microsoft is rolling out this hotfix to other regions in a phased manner.
Apr 8, 18:15 UTC
Update - Azure Kubernetes Cluster Provisioning for CML (Cloudera Machine Learning) and CDW (Cloudera Data Warehouse) is currently failing due to an issue with AKS (Azure Kubernetes Service).

Root cause of the issue is that Azure recently disabled DNS caching on Kubernetes 1.18 and above that is causing internal Azure timeout issues and impacts AKS provisioning, scaling and upgrading in various Azure regions where this Azure change was rolled out. Microsoft is actively looking into this issue.
Apr 5, 18:11 UTC
Update - Azure Kubernetes Cluster Provisioning for CML (Cloudera Machine Learning) and CDW (Cloudera Data Warehouse) is currently failing due to an issue with AKS (Azure Kubernetes Service). This issue is caused by an Azure networking issue. The AKS team did notice a higher than normal amount of this same error in the westUS2 region and are continuing to investigate. It is not yet clear if other regions are also impacted. The AKS team is still investigating the issue.
Apr 2, 16:32 UTC
Update - Azure Kubernetes Cluster Provisioning for CML (Cloudera Machine Learning is currently failing due to an issue with AKS (Azure Kubernetes Service). This issue is caused by an Azure networking issue. The AKS team did notice a higher than normal amount of this same error in the westUS2 region and are continuing to investigate. It is not yet clear if other regions are also impacted. The AKS team is still investigating the issue.
Apr 1, 17:05 UTC
Update - This issue is caused due to an Azure networking issue. The AKS team did notice a higher than normal amount of this same error in the westUS2 region and are continuing to investigate. It is not yet clear if other regions are also impacted. The AKS team is still investigating the issue.
Mar 31, 21:17 UTC
Update - This issue is caused due to an Azure networking issue. The AKS team did notice a higher than normal amount of this same error in the westUS2 region and are continuing to investigate. It is not yet clear if other regions are also impacted. The AKS team is still investigating the issue.
Mar 31, 16:18 UTC
Update - Tuesday - We are continuing to work on a fix for this issue.
Mar 30, 20:49 UTC
Update - We are continuing to work on a fix for this issue.
Mar 29, 20:22 UTC
Identified - Azure Kubernetes Cluster Provisioning for CML(Cloudera Machine Learning and CDE(Cloudera Data Engineering) is currently failing due to an issue with AKS(Azure Kubernetes Service). Teams are engaged and actively triaging the issue with Microsoft Support.
Mar 26, 21:18 UTC
Cloudera Data Platform Operational
90 days ago
99.72 % uptime
Today
CDP Management Console ? Operational
90 days ago
98.41 % uptime
Today
CDP Workload Manager Operational
90 days ago
100.0 % uptime
Today
Cloudera SSO Operational
90 days ago
99.97 % uptime
Today
Billing and Metering ? Operational
90 days ago
100.0 % uptime
Today
CDP Workload XM ? Operational
90 days ago
100.0 % uptime
Today
CDP IAM ? Operational
90 days ago
99.95 % uptime
Today
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance
Major outage
Partial outage
No downtime recorded on this day.
No data exists for this day.
had a major outage.
had a partial outage.
Past Incidents
Apr 15, 2021

No incidents reported today.

Apr 14, 2021
Resolved - This incident has been resolved.
Apr 14, 06:03 UTC
Monitoring - The certificate has been renewed and updated for the Cloudera docker container repository.
Machine learning and other experiences are operating as expected
Apr 14, 06:01 UTC
Identified - A CDP-required Cloudera docker container repository's certificate expired (Tuesday, April 13, 2021 at 2:06:00 PM US Pacific), causing some (Machine Learning, and possibly others) new experience creations to fail to install or upgrade, and may affect automatic scaling or pod scheduling events in some circumstances.

We are installing a new certificate.
Apr 13, 21:06 UTC
Apr 13, 2021
Apr 12, 2021

No incidents reported.

Apr 11, 2021

No incidents reported.

Apr 10, 2021

No incidents reported.

Apr 9, 2021

No incidents reported.

Apr 8, 2021
Apr 7, 2021

No incidents reported.

Apr 6, 2021

No incidents reported.

Apr 5, 2021
Apr 4, 2021

No incidents reported.

Apr 3, 2021

No incidents reported.

Apr 2, 2021
Apr 1, 2021