SQL Component

SQL transformer applies SQL operations to transform and manipulate data, providing flexibility and expressiveness in data transformations within a data pipeline.

The SQL component serves as a bridge between the extracted data and the desired transformed data, leveraging the power of SQL queries and database systems to enable efficient data processing and manipulation.

It also provides an option of using aggregation functions on the complete streaming data processed by the component. The user can use SQL transformations on Spark data frames with the help of this component.

All component configurations are classified broadly into the following sections:

Follow the given steps in the demonstration to configure the SQL transformation component.

Please Note: The schema file that can be uploaded here is a JSON spark schema.

Configuring Meta Information of SQL Component.

  1. Query Type: There are two options available under this field:

    • Batch Query: When this option is selected, then there is no need to upload schema file.

    • Aggregate Query: When this option is selected, it is mandatory to upload the spark schema file in JSON format of the in-event data.

  2. Schema File name: Upload the spark schema file in JSON format when Aggregate query is selected in the query type field.

  3. Table name: Provide the table name

  4. Query: Write a SQL query in this field.

  5. Selected Columns: select the column name from the table, provide the Alias name and the desired data type for that column.

Please Note: The SQL component, when set to Aggregate Query mode and connected to DB Sync, will not write data to the DB Sync event.

Last updated