Sandbox Writer
The Sandbox Writer task writes data to the network pool of the Data Sandbox so that datasets can be staged in formats suitable for downstream processing and analysis.
Prerequisites
Access to the Sandbox environment with write permissions.
The name of the Sandbox file to be created or updated.
A defined schema (JSON) when writing CSV or JSON files to enforce consistent datatypes.
Configuring the Meta Information Tab
To configure the Sandbox Writer:
Drag the Sandbox Writer task to the Workspace.
Click on the task to open the configuration tabs.
The Meta Information tab opens by default. Configure the following fields:
Storage Type
Pre-defined field indicating Sandbox storage.
Sandbox File
Enter the file name to which data will be written.
File Type
Select the file format. Supported options: CSV, JSON.
Save Mode
Select how data should be written: Append (add new data to the existing file) or Overwrite (replace existing data with new data).
Schema File Name
Upload a Spark schema file in JSON format to enforce column names, datatypes, and nullability.
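A schema file for the Schema File Name field could be generated as in the minimal sketch below. It assumes the task accepts the JSON layout that Spark itself produces for a StructType (a "struct" object holding a list of fields); the column names and the helper `field` are hypothetical and shown only for illustration.

```python
import json


def field(name, dtype, nullable=True):
    """Return one field entry in the layout Spark uses for StructType JSON."""
    return {"name": name, "type": dtype, "nullable": nullable, "metadata": {}}


# Hypothetical columns, for illustration only.
schema = {
    "type": "struct",
    "fields": [
        field("order_id", "long", nullable=False),
        field("customer", "string"),
        field("amount", "double"),
        field("created_at", "timestamp"),
    ],
}

# Serialize the schema; save this text as a .json file before uploading it.
schema_json = json.dumps(schema, indent=2)
print(schema_json)
```

Each entry fixes the column name, datatype, and nullability in one place, so every write against the same Sandbox file can reuse the same definition.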
Best Practices
Use Append for incremental data writes and Overwrite for full data refresh scenarios.
Provide a schema JSON file to keep column names and datatypes consistent across writes, because CSV and JSON files do not carry an embedded schema.
Ensure file names are unique and follow organizational conventions to avoid accidental overwrites.
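The difference between the two save modes can be illustrated with a small local sketch. It uses an ordinary CSV file in a temporary directory as a stand-in for a Sandbox file; the helper names `write_rows` and `read_rows` are hypothetical, and the way the actual task handles headers on append is an assumption, not documented behavior.

```python
import csv
import os
import tempfile


def write_rows(path, rows, header, save_mode):
    """Write rows to a CSV file using 'append' or 'overwrite' semantics."""
    if save_mode not in ("append", "overwrite"):
        raise ValueError(f"unsupported save mode: {save_mode}")
    file_has_data = os.path.exists(path) and os.path.getsize(path) > 0
    append = save_mode == "append" and file_has_data
    with open(path, "a" if append else "w", newline="", encoding="utf-8") as fh:
        writer = csv.writer(fh)
        if not append:
            # Assumption: the header is written once, when the file is (re)created.
            writer.writerow(header)
        writer.writerows(rows)


def read_rows(path):
    """Return every line of the CSV file as a list of strings."""
    with open(path, newline="", encoding="utf-8") as fh:
        return list(csv.reader(fh))


header = ["order_id", "amount"]
path = os.path.join(tempfile.mkdtemp(), "orders.csv")

write_rows(path, [[1, 10.0]], header, "overwrite")  # initial full load
write_rows(path, [[2, 20.5]], header, "append")     # incremental write keeps row 1
after_append = read_rows(path)

write_rows(path, [[3, 30.0]], header, "overwrite")  # full refresh drops rows 1 and 2
after_overwrite = read_rows(path)

print(after_append)
print(after_overwrite)
```

After the append the file holds both earlier rows, while the final overwrite leaves only the refreshed row, which is why a reused file name combined with Overwrite can silently discard data.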
Save & Next Steps
After configuration, click Save Task In Storage to persist the Sandbox Writer settings.
Run a test write to confirm schema and data integrity.
Validate the output files in the Sandbox network pool before scheduling production jobs.
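One way to check a test write is to compare a copy of the output file against the schema, as in the sketch below. It assumes a CSV output with a header row, a schema in Spark's StructType JSON layout, and that an empty cell represents a null value; `validate_csv` and `CASTERS` are hypothetical names, and only a few type names are covered.

```python
import csv
import io
import json

# Casters for a few Spark type names; types not listed here are not checked.
CASTERS = {"string": str, "integer": int, "long": int, "double": float}


def validate_csv(csv_text, schema):
    """Return a list of problems; an empty list means the file matches the schema."""
    fields = schema["fields"]
    expected = [f["name"] for f in fields]
    reader = csv.DictReader(io.StringIO(csv_text))
    if reader.fieldnames != expected:
        return [f"header mismatch: expected {expected}, found {reader.fieldnames}"]
    problems = []
    for line_no, row in enumerate(reader, start=2):  # line 1 is the header
        for f in fields:
            value = row[f["name"]]
            if value is None or value == "":
                # Assumption: an empty cell stands for a null value.
                if not f["nullable"]:
                    problems.append(f"line {line_no}: {f['name']} must not be empty")
                continue
            caster = CASTERS.get(f["type"])
            if caster is None:
                continue
            try:
                caster(value)
            except ValueError:
                problems.append(f"line {line_no}: {f['name']}={value!r} is not {f['type']}")
    return problems


schema = json.loads("""
{"type": "struct", "fields": [
  {"name": "order_id", "type": "long", "nullable": false, "metadata": {}},
  {"name": "amount", "type": "double", "nullable": true, "metadata": {}}
]}
""")

good = "order_id,amount\n1,10.0\n2,\n"
bad = "order_id,amount\n,10.0\nx,abc\n"

print(validate_csv(good, schema))
for problem in validate_csv(bad, schema):
    print(problem)
```

An empty result for the test file suggests the column names, datatypes, and nullability rules were applied as intended; any reported line points to a row worth inspecting before the job is scheduled.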