Question
Alight.com
IN
Last activity: 4 Dec 2024 6:31 EST
Cassandra Optimization and Upgrade Recommendations
I've noticed that our Cassandra nodes in the CDH-based Pega application experience significant load fluctuations.
Each node's data load reaches approximately 75 GB, but after running garbage collection and a full repair, the load drops to 41 GB, a reduction of roughly 45%. This suggests that obsolete or temporary data is accumulating excessively, or that data is being managed inefficiently.
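To put those figures in perspective, here is a minimal sketch of the arithmetic. The function name `reclaimed_fraction` is only illustrative, and the inputs are the approximate per-node loads quoted above, not exact measurements.

```python
def reclaimed_fraction(load_before_gb: float, load_after_gb: float) -> float:
    """Return the fraction of on-disk load removed by cleanup (0.0 to 1.0)."""
    if load_before_gb <= 0:
        raise ValueError("load_before_gb must be positive")
    return (load_before_gb - load_after_gb) / load_before_gb


# Approximate per-node loads before and after garbage collection + full repair.
before_gb, after_gb = 75.0, 41.0
fraction = reclaimed_fraction(before_gb, after_gb)
print(f"Reclaimed {before_gb - after_gb:.0f} GB ({fraction:.1%} of the pre-cleanup load)")
```

In other words, close to half of the data on each node at peak appears to be reclaimable, which is why I suspect the configuration or data model rather than normal growth.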
Could you provide insights into optimizing data storage and garbage collection processes in this context, and whether this indicates any underlying issues with our current configuration or data models?
Can PDC help us understand this behavior?
Additionally, we are planning to upgrade to Pega 24.1.1 / CDH 24.1. Could you please suggest the stable, recommended 4.x version of Cassandra to use? We are currently running Cassandra 3.11.3 (ReleaseVersion). Any relevant documentation would also be appreciated.