Intermittent Performance and Access Issues with the Cloudera Management Console

Incident Report for Cloudera

Postmortem

The intermittent performance and access issues with the Cloudera Management Console and FreeIPA were triggered by a scheduled service upgrade intended to improve platform stability.

Our investigation determined that a change in a core component introduced a latent configuration issue. This condition was not exposed in our testing environments. In production, it prevented dependent services from applying dynamic configuration updates, which led to the outage.

We've taken immediate action to prevent this issue from recurring:
System Fix: The problematic component change was rolled back, and a permanent patch was deployed to restore proper dynamic configuration functionality.
Process Overhaul: We've implemented a more rigorous upgrade process with mandatory validation steps at near-production scale that specifically test for this type of configuration failure.
Enhanced Monitoring: We've significantly improved our monitoring and alerting capabilities to detect abnormal service behaviour of this kind much earlier, enabling a faster response.

We are dedicated to providing a reliable platform and will continue to invest in our infrastructure and processes. Thank you again for your patience and understanding.

Posted Oct 13, 2025 - 20:04 UTC

Resolved

Our engineering teams have confirmed that the issues affecting the Management Console and FreeIPA services are fully resolved and that all systems are operating normally. If you continue to experience any trouble, please raise a support case with us.

We sincerely apologise for any inconvenience this may have caused. We will publish the root cause analysis (RCA) within 7 business days.
Posted Sep 25, 2025 - 17:01 UTC

Monitoring

Current Status: Our engineering teams have identified the likely root cause and have rolled out a fix. We are currently monitoring the system to confirm that this has resolved the performance and access issues for the Management Console.

Customer Experience: Service availability and performance for the Management Console should now be returning to normal for all users. We will continue to monitor for any residual effects.
Posted Sep 24, 2025 - 20:53 UTC

Investigating

Current Status: We are investigating an issue causing intermittent performance and access problems for the Management Console. Our engineering teams are working to identify the root cause and restore full functionality. We will provide another update within the next 90 minutes.
Customer Experience: Customers may experience intermittent performance degradation, slow load times, or errors when attempting to access the Management Console.

Incident start time: 19:13 UTC, September 24, 2025
Posted Sep 24, 2025 - 19:42 UTC
This incident affected: Cloudera Data Platform (US) (CDP Management Console), Cloudera Data Platform (EU) (CDP Management Console), and Cloudera Data Platform (AP) (CDP Management Console).