Data Preparation
Requirement | Evaluation | Remarks |
---|---|---|
DP Selection Labeling | high | Platform allows to label data. |
DP Record Tagging | high | Platform supports record tagging. |
DP Version Control | high | It supports check-in and check to Git. |
DP Distributed Query Engine | medium | It can be done via pipeline Spark SQL component. |
DP Processing Pipelines | very high | Yes, the Data Preparation can be integrated directly with Pipeline. |
DP Visual Interface | very high | We have Visual Interface Designer. |
DP API Interface | very high | API interfaces are available. |
DP Data Catalogue Integration | high | Platform generates Data Catalogue from the underlying meta data automatically. |
DP Parquet Files support | medium | Yes, it is supported in the Data Pipeline. |
DP Time Series support | medium | Yes, it is supported. |
DP Time Series operations | medium | Yes, it is supported, forecasting, and anomaly detection. |
DP Secret Management Integration | high | It is supported via Kubernetes secrets. |
DP Access Management | high | Support RBAC |
DP Reports & Metrics | high | Pipeline generate reporting metric about every process, like Memory used, CPU used , no. of records processed etc. |
DP Access Audit Logs | high | Platform captures every user operations and activities |
DP Operation Audit Logs | high | Logs can be pushed to third-party log monitoring systems like Datadog, Prometheus, etc. |
DP Export as Pipeline | high | Yes, Data Preparation steps can be exported to Pipeline |
Last updated