Datasets

Now the Notebook page facilitates the user to add the datasets.

Check out the given walk-through on how to add a dataset using the Notebook page.

Please Note: The Datasets even if added from a Notebook infrastructure, they get added at the project level, so the added Datasets are available for all the Notebooks under the the same project.

Adding Datasets

  • Navigate to the Notebook page.

  • Click the Datasets tab.

  • Click on the Add Dataset option.

  • The Add Datasets page appears.

  • Select Data Sets or Data Sandbox option from the Data Source drop-down menu.

  • Search for a Data Set using the search bar.

  • Select dataset(s) as per the requirement using the check box. The user can select multiple datasets/ data sandbox.

  • Click the Add option.

Please Note: The Add option gets displayed only after you select at least one dataset.

  • A notification message appears.

  • The selected Dataset(s) get added to the given Datasets tab.

  • Click on the More icon for an added Dataset.

  • The drop-down menu appears displaying the Preview and Data Preparation actions for the added dataset appears.

Uploading Datasets (Data Sandbox)

Please Note: The Upload option is provided for the Sandbox files inside the Data Science Notebook.

  • Navigate to the Add Datasets panel from a Data Science Notebook.

  • Select the Data Sandbox option as Data Source.

  • Click the Upload option.

  • The Upload Data Sandbox window appears.

  • Provide a Sandbox Name.

  • Provide Description (it is optional).

  • Use the Choose File option to select a file from the system.

  • Select a file from the system and upload.

  • Once the selected file name appears next to the Choose File, click the Save option to upload the selected file.

  • A notification message appears to inform completion of the action.

  • The uploaded file lists below with a checkbox to select it.

  • Select the File using the checkbox.

  • Click the Add option.

  • A notification message appears.

  • The uploaded Data Sandbox dataset gets added to the Notebook.

​Reading Datasets

Please Note: Using get_data function datasets and data sandbox files (csv & xlsx files) can be read.

  • Add a new Code cell to Notebook or access an empty Code cell.

  • Select a dataset from the Datasets tab.

  • The get_data function appears in the code cell.

  • Provide the df to print the data from the selected Dataset.

  • Run the cell.

  • The Data preview appears below.

Please Note:

  • The Text files added as Datasets to a Notebook will be disabled for the data load function. Only Copy Path option will be provided for such datasets.

  • Refer the Data Science Lab Quick Start Flow page to get an overview of the Data Science Lab module in nutshell.

Last updated