Transformations

The Transformations section of the Jobs Editor provides a set of configurable tasks to modify, filter, compute, and join data before it is written to a target. Each transformation exposes a Meta Information tab where users define the parameters of the operation.

Available Transformations

The following transformations are supported:

  • Alter Columns

  • Select Columns

  • Date Formatter

  • Query

  • Filter

  • Formula

  • Join

  • Aggregation

  • Sort Task


Alter Columns

Change the data type or rename existing columns.

Meta Information Fields

Field
Description

Name

Name of the column to alter.

Alias Name

New name of the column.

Column Type

Select the new data type from the drop-down.

Add New Column

Allows multiple column definitions for modification.


Select Columns

Choose specific columns from the dataset and optionally rename or re-type them.

Meta Information Fields

Field
Description

Name

Name of the column to select.

Alias Name

New name of the column.

Column Type

Select the data type from the drop-down.

Add New Column

Allows multiple columns to be selected.


Date Formatter

Convert date or datetime columns into a specified format.

Meta Information Fields

Field
Description

Name

Column name to format.

Input Format

Select from 61 supported input formats.

Output Format

Format for the transformed output.

Output Column Name

Name of the resulting column.

Add New Row

Add multiple formatter rules.


Query

Apply SQL queries to transform data (e.g., SELECT queries, views).

Meta Information Fields

Field
Description

Query

Provide a valid SQL query. (Note: ALTER queries are not supported.)

Table Name

Target table name.

Schema File Name

Upload a Spark schema file (JSON format).

Choose File

Upload schema file from the system.


Filter

Filter records based on conditions and logical operators.

Meta Information Fields

Field
Description

Field Name

Column to apply the filter on.

Condition

Choose from 8 available condition operators.

Logical Condition

Apply AND or OR to combine filters.

Add New Column

Add multiple filter conditions.


Formula

Apply mathematical, string, or bitwise formulas to generate new columns.

Meta Information Fields

Field
Description

Field Name

Input field name.

Formula Type

Choose from: <ul><li>Math (22 operations)</li><li>String (16 operations)</li><li>Bitwise (3 operations)</li></ul>

Output Field Name

Name of the result column.

Add New Column

Add multiple formulas.


Join

Join two datasets based on column conditions.

Meta Information Fields

Field
Description

Join Type

Choose from: Inner, Outer, Full, Full Outer, Left Outer, Left, Right Outer, Right, Left Semi, Left Anti.

Left Column

Join key column from the left dataset.

Right Column

Join key column from the right dataset.

Add New Column

Add multiple join conditions.


Aggregation

Perform aggregation functions with optional grouping.

Meta Information Fields

Field
Description

Group By Columns

Name of the grouping column(s).

Field Name

Column on which the operation is applied.

Operation

Select from 30 aggregation operations.

Alias

Alias name for the aggregation result.

Add New Column

Add multiple aggregation rules.


Sort Task

Sort data based on column(s) and order.

Meta Information Fields

Field
Description

Sort Key

Column used for sorting.

Order

Sorting order: Ascending or Descending.

Add New Column

Add multiple sort keys.