Datasets
Now the Notebook page facilitates the user to add the datasets.
Last updated
Now the Notebook page facilitates the user to add the datasets.
Last updated
Check out the given walk-through on how to add a dataset using the .ipynb file.
Please Note: The Datasets even if added from a .ipynb infrastructure, they get added at the project level, so the added Datasets are available for all the files under the the same project.
Navigate to the Notebook page.
Click the Datasets tab.
Click on the Add Dataset option.
The Add Datasets page appears.
Select Data Sets or Data Sandbox option from the Data Source drop-down menu.
Search for a Data Set using the search bar.
Select dataset(s) as per the requirement using the check box. The user can select multiple datasets/ data sandbox.
Click the Add option.
Please Note: The Add option gets displayed only after you select at least one dataset.
A notification message appears.
The selected Dataset(s) get added to the given Datasets tab.
Click on the More icon for an added Dataset.
The drop-down menu appears displaying the Preview and Data Preparation actions for the added dataset appears.
Please Note: The Upload option is provided for the Sandbox files inside the Data Science Notebook.
Navigate to the Add Datasets panel from a Data Science Notebook.
Select the Data Sandbox option as Data Source.
Click the Upload option.
The Upload Data Sandbox window appears.
Provide a Sandbox Name.
Provide Description (it is optional).
Use the Choose File option to select a file from the system.
Select a file from the system and upload.
Once the selected file name appears next to the Choose File, click the Save option to upload the selected file.
A notification message appears to inform completion of the action.
The uploaded file lists below with a checkbox to select it.
Select the File using the checkbox.
Click the Add option.
A notification message appears.
The uploaded Data Sandbox dataset gets added to the Notebook.
Please Note: Using get_data function datasets and data sandbox files (csv & xlsx files) can be read.
Add a new Code cell to Notebook or access an empty Code cell.
Select a dataset from the Datasets tab.
The get_data function appears in the code cell.
Select the df from the get_data function of the dataset.
Add a code cell below.
Provide the df to display the data from the selected Dataset using the show function.
Run all the cells.
The Data preview appears below. It will display 20 rows by default.
The user can insert the number of rows to get the preview of the data up to that limit.
Please Note:
The Text files added as Datasets to a Notebook will be disabled for the data load function. Only Copy Path option will be provided for such datasets.
Refer the Data Science Lab Quick Start Flow page to get an overview of the Data Science Lab module in nutshell.