Sandbox Writer

The Sandbox Writer task is used to write data into the network pool of the Data Sandbox, enabling datasets to be staged in formats suitable for downstream processing and analysis.

Prerequisites

  • Access to the Sandbox environment with write permissions.

  • Knowledge of the Sandbox File name to be created or updated.

  • A defined schema (JSON) when writing CSV or JSON files to enforce consistent datatypes.

Configuring the Meta Information Tab

To configure the Sandbox Writer:

  1. Drag the Sandbox Writer task to the Workspace.

  2. Click on the task to open the configuration tabs.

  3. The Meta Information tab opens by default. Configure the following fields:

Field
Description

Storage Type

Pre-defined field indicating Sandbox storage.

Sandbox File

Enter the file name to which data will be written.

File Type

Select the file format. Supported options: CSV, JSON.

Save Mode

Select how data should be written: <ul><li>Append — Add new data to the existing file.</li><li>Overwrite — Replace existing data with new data.</li></ul>

Schema File Name

Upload a Spark schema file in JSON format to enforce column names, datatypes, and nullability.

Best Practices

  • Use Append for incremental data writes and Overwrite for full data refresh scenarios.

  • Provide a schema JSON file to maintain consistent typing across writes, especially for CSV/JSON outputs.

  • Ensure file names are unique and follow organizational conventions to avoid accidental overwrites.

Save & Next Steps

  • After configuration, click Save Task In Storage to persist the Sandbox Writer settings.

  • Run a test write to confirm schema and data integrity.

  • Validate output files in the Sandbox Network pool before scheduling production jobs.