COST OPTIMIZATION

Dynamic data pipeline optimization

Reduce unnecessary data processing and improve resource utilization.

Collect metrics:
1. Collect runtime metrics from processing engines, including when, how often, and for how long data processing occurs.
2. Collect metrics from each data store (per data asset), including how often the data changes and how often it is accessed.
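
As an illustration of the two kinds of metrics above, the following is a minimal sketch of how they might be represented and summarized. The record shapes (`JobRun`, `AssetActivity`) and the helper names are assumptions made for this example, not part of any particular processing engine or data store.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import List


@dataclass
class JobRun:
    """One execution of a processing job (hypothetical record shape)."""
    job_id: str
    asset_id: str
    started_at: datetime   # when processing occurred
    duration: timedelta    # for how long it ran


@dataclass
class AssetActivity:
    """Change and access counts for one data asset over an observation window."""
    asset_id: str
    window: timedelta
    change_events: int
    access_events: int


def runs_per_day(runs: List[JobRun], job_id: str, window: timedelta) -> float:
    """Average number of runs per day for one job over the observation window."""
    count = sum(1 for r in runs if r.job_id == job_id)
    return count / (window.total_seconds() / 86400)


def mean_duration(runs: List[JobRun], job_id: str) -> timedelta:
    """Mean run duration for one job; zero if the job never ran."""
    durations = [r.duration for r in runs if r.job_id == job_id]
    if not durations:
        return timedelta(0)
    return sum(durations, timedelta(0)) / len(durations)
```

In practice these records would be filled from engine logs and data store audit trails; the sketch only fixes a shape that the recommendation step can consume.
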
Make recommendations:
1. Avoid processing data more often than it changes or is accessed. For example, do not reprofile a data source daily if its data changes by less than 1-2% per week.
2. Stagger processing schedules so that jobs do not all start at the same time, which spreads load and improves resource utilization.
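
The two recommendations above can be sketched as follows. This is a minimal illustration under stated assumptions: `AssetStats`, `recommend_runs_per_week`, and `stagger_start_minutes` are hypothetical names, `changes_per_week` is assumed to count only changes considered significant, and the floor of one run per week is an arbitrary default.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class AssetStats:
    """Weekly rates for one data asset (hypothetical summary of collected metrics)."""
    asset_id: str
    runs_per_week: float      # how often the asset is processed today
    changes_per_week: float   # assumed: significant changes only
    accesses_per_week: float


def recommend_runs_per_week(stats: AssetStats, min_runs: float = 1.0) -> float:
    """Cap the processing rate at the slower of the change and access rates.

    The rate is never raised above the current one, and never dropped
    below min_runs (an assumed safety floor).
    """
    useful_rate = min(stats.changes_per_week, stats.accesses_per_week)
    target = max(min_runs, useful_rate)
    return min(stats.runs_per_week, target)


def stagger_start_minutes(job_ids: List[str], window_minutes: int) -> Dict[str, int]:
    """Spread job start offsets evenly across a scheduling window."""
    if not job_ids:
        return {}
    step = window_minutes / len(job_ids)
    # Sort for a deterministic assignment of offsets to jobs.
    return {job_id: int(i * step) for i, job_id in enumerate(sorted(job_ids))}
```

For example, an asset processed seven times a week that changes significantly once a week would be recommended for one run per week, and three jobs staggered over a 60-minute window would start at minutes 0, 20, and 40.
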
Auto-schedule: Apply the recommended scheduling changes to the data processing environment.
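
One way the auto-schedule step might look is sketched below. The `Scheduler` interface and `InMemoryScheduler` stand in for whatever scheduling API the data processing environment actually exposes; both are assumptions for illustration. The sketch defaults to a dry run so that proposed changes can be reviewed before they are applied.

```python
from dataclasses import dataclass
from typing import Dict, List, Protocol


class Scheduler(Protocol):
    """Hypothetical interface to the processing environment's scheduler."""

    def get_interval_hours(self, job_id: str) -> int: ...

    def set_interval_hours(self, job_id: str, hours: int) -> None: ...


class InMemoryScheduler:
    """Stand-in scheduler that keeps job intervals in a dictionary."""

    def __init__(self, intervals: Dict[str, int]) -> None:
        self.intervals = dict(intervals)

    def get_interval_hours(self, job_id: str) -> int:
        return self.intervals[job_id]

    def set_interval_hours(self, job_id: str, hours: int) -> None:
        self.intervals[job_id] = hours


@dataclass
class Change:
    """One schedule change, recorded for review or audit."""
    job_id: str
    old_hours: int
    new_hours: int


def apply_recommendations(
    scheduler: Scheduler,
    recommended_hours: Dict[str, int],
    dry_run: bool = True,
) -> List[Change]:
    """Return the schedule changes; write them to the scheduler unless dry_run."""
    changes: List[Change] = []
    for job_id, new_hours in sorted(recommended_hours.items()):
        old_hours = scheduler.get_interval_hours(job_id)
        if old_hours == new_hours:
            continue  # nothing to change for this job
        changes.append(Change(job_id, old_hours, new_hours))
        if not dry_run:
            scheduler.set_interval_hours(job_id, new_hours)
    return changes
```

Returning the list of changes in both modes keeps a record of what was (or would be) modified, which makes it easier to roll back a recommendation that turns out to be too aggressive.
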