Optimizing data collection
-
- UpdatedJan 30, 2025
- 2 minutes to read
- Yokohama
- Performance Analytics
The optimized Performance Analytics data collector reduces the time, memory, and CPU usage for processing large data sets.
The optimized data collector is active by default on all instances starting in Tokyo. It is activated upon upgrading. To deactivate the optimized collector, create the system property com.snc.pa.dc.hsql and set it to false. For more information, see Add a system property.
The optimized data collector supports the following capabilities:
- Ability to handle over 10 million records without adverse impact to performance.
- Support for at least 10 breakdowns with breakdown matrix enabled. Breakdown support
includes:
- Dot-walked conditions
- Two large breakdowns, with up to 1 million records
- Breakdown relations
- Support for additional conditions on indicators and breakdowns.
- Optimizations for Count Distinct aggregation.
The optimizations stem from the use of an embedded database. A standard data collection job stores all scores in the node memory during the job. However, an optimized data collection job moves packets of scores to temporary storage on disk. At the end of the job, the scores are written to the scores tables and the temporary database is cleared.