Data Preparation

RequirementEvaluationRemarks

DP Selection Labeling

high

Platform allows to label data.

DP Record Tagging

high

Platform supports record tagging.

DP Version Control

high

It supports check-in and check to Git.

DP Distributed Query Engine

medium

It can be done via pipeline Spark SQL component.

DP Processing Pipelines

very high

Yes, the Data Preparation can be integrated directly with Pipeline.

DP Visual Interface

very high

We have Visual Interface Designer.

DP API Interface

very high

API interfaces are available.

DP Data Catalogue Integration

high

Platform generates Data Catalogue from the underlying meta data automatically.

DP Parquet Files support

medium

Yes, it is supported in the Data Pipeline.

DP Time Series support

medium

Yes, it is supported.

DP Time Series operations

medium

Yes, it is supported, forecasting, and anomaly detection.

DP Secret Management Integration

high

It is supported via Kubernetes secrets.

DP Access Management

high

Support RBAC

DP Reports & Metrics

high

Pipeline generate reporting metric about every process, like Memory used, CPU used , no. of records processed etc.

DP Access Audit Logs

high

Platform captures every user operations and activities

DP Operation Audit Logs

high

Logs can be pushed to third-party log monitoring systems like Datadog, Prometheus, etc.

DP Export as Pipeline

high

Yes, Data Preparation steps can be exported to Pipeline

Last updated