Skip to main content

Release Management

Root causes that are triggered by a code change or new release and result in a measurable reliability regression.

Code Change Regression: CPU Congestion

Integration sources:Infrastructure Scraper

After a version upgrade, application containers experience high CPU usage, leading to performance degradation or unresponsiveness. This issue impacts the system's ability to handle requests effectively, potentially causing downtime or delays for end users. High CPU usage post-upgrade typically stems from changes in the application code, dependencies, or configurations.

Code Change Regression: Database Connection Pool Saturated

Integration sources:MetricsInfrastructure ScraperSymptom Activation

After a version upgrade, the client-side database connection pool is exhausted when all available connections are in use, preventing new database queries from being executed. This can cause application requests to hang or fail, impacting user experience and potentially leading to downtime for database-dependent features.

Code Change Regression: Frequent Crash Failure

Integration sources:Infrastructure Scraper

One or multiple containers of a workload are frequently crashing with a non-zero exit code after a version upgrade. This disrupts the application's functionality, leading to downtime or degraded performance depending on the workload design. The issue likely stems from changes introduced in the new version.

Code Change Regression: Frequent Memory Failure

Integration sources:Infrastructure Scraper

The application is running out of memory after a version upgrade, leading to crashes, degraded performance, or instability. This impacts availability and user experience, often requiring container restarts or manual intervention to restore functionality. The issue is likely tied to changes in the updated version that increase memory usage or introduce inefficiencies.

Code Change Regression: Inefficient Garbage Collection

Integration sources:MetricsSymptom ActivationInfrastructure Scraper

After a version upgrade, the garbage collector is frequently running, leading to performance degradation or crashes. This issue is likely caused by changes in the application code or dependencies that increase memory usage or introduce inefficiencies.

Code Change Regression: Java Heap Saturated

Integration sources:MetricsSymptom ActivationInfrastructure Scraper

After a version upgrade, the Java heap is frequently congested, leading to performance degradation or crashes. This issue is likely caused by changes in the application code or dependencies that increase memory usage or introduce inefficiencies.

Code Change Regression: Lock Contention

Integration sources:MetricsSymptom ActivationInfrastructure Scraper

After a version upgrade, the application is experiencing frequent locking contention, leading to performance degradation or crashes. This issue is likely caused by changes in the application code or dependencies that increase locking or introduce inefficiencies.

Code Change Regression: Memory Failure

Integration sources:Infrastructure Scraper

Memory failures after a code change can cause containers to crash or degrade performance, resulting in errors for end users or failed service requests. These issues occur when newly introduced code leads to unexpected increases in memory usage, triggering out-of-memory (OOM) errors and destabilizing the system.

Code Change Regression: Redis Connection Pool Saturated

Integration sources:MetricsSymptom ActivationInfrastructure Scraper

After a version upgrade, the Redis connection pool is frequently congested, leading to performance degradation or crashes. This issue is likely caused by changes in the application code or dependencies that increase Redis usage or introduce inefficiencies.

Code Change Regression: Slow Database Queries

After a version upgrade, the application is experiencing slow database queries that lead to downstream slow consumer behavior and potential resource starvation. This condition affects instance performance, particularly when query execution times become excessively long.