Data Science Lab
  • What is Data Science Lab?
  • Accessing the Data Science Lab Module
  • Data Science Lab Quick Start Flow
  • Project
    • Environments
    • Creating a Project
    • Project List
      • View
      • Keep Multiple Versions of a Project
      • Sharing a Project
      • Editing a Project
      • Activating a Project
      • Deactivating a Project
      • Deleting a Project
    • Tabs for a Data Science Lab Project
      • Tabs for TensorFlow and PyTorch Environment
        • Notebook
          • Ways to Access Notebook
            • Create
            • Import
              • Importing a Notebook
              • Pull from Git
          • Notebook Page
            • Preview Notebook
            • Notebook Cells
              • Using a Code Cell
              • Using a Markdown Cell
              • Using an Assist Cell
            • Renaming a Notebook
            • Resource Utilization Graph
            • Notebook Taskbar
            • Notebook Operations
              • Datasets
                • Copy Path (for Sandbox files)
              • Secrets
              • Algorithms
              • Transforms
              • Utility Notebook Operation
              • Models
                • Model Explainer
                • Registering & Unregistering a Model
                • Model Filter
              • Artifacts
              • Files
              • Variable Explorer
              • Writers
              • Find and Replace
            • Notebook Actions
          • Notebook List
            • Notebook List Actions
              • Export
                • Export to Pipeline
                • Export to GIT
              • Register as Job
              • Notebook Version Control
              • Sharing a Notebook
              • Deleting a Notebook
        • Dataset
          • Adding Data Sets
            • Data Sets
            • Data Sandbox
          • Dataset List Page
            • Preview
            • Data Profile
            • Create Experiment
            • Data Preparation
            • Delete
        • Utility
          • Pull from Git (Utility)
        • Model
          • Model Explainer
          • Share a Model
          • Import Model
          • Export to GIT
          • Register a Model
          • Unregister A Model
          • Register a Model as an API Service
            • Register a Model as an API
            • Register an API Client
            • Pass Model Values in Postman
          • AutoML Models
        • Auto ML
          • Creating AutoML Experiments
            • Creating an Experiment
          • AutoML List Page
            • View Report
              • Details
              • Models
                • View Explanation
                  • Model Summary
                  • Model Interpretation
                    • Classification Model Explainer
                    • Regression Model Explainer
                    • Forecasting Model Explainer
                  • Dataset Explainer
            • Delete
      • Tabs for PySpark Environment
        • Notebook
          • Ways to Access Notebook
            • Create
            • Import
              • Importing a Notebook
          • Notebook Page
            • Preview Notebook
            • Notebook Cells
              • Using a Code Cell
              • Using a Markdown Cell
              • Using an Assist Cell
            • Renaming a Notebook
            • Resource Utilization Graph
            • Notebook Taskbar
            • Notebook Operations
              • Datasets
                • Copy Path (for Sandbox files)
              • Secrets
              • Utility
              • Files
              • Variable Explorer
              • Writers
              • Find and Replace
            • Notebook Actions
          • Notebook List
            • Notebook List Actions
              • Export
                • Export to Pipeline
                • Export to GIT
              • Register as Job
              • Notebook Version Control
              • Sharing a Notebook
              • Deleting a Notebook
        • Dataset
          • Adding Data Sets
            • Data Sets
            • Data Sandbox
          • Dataset List Page
            • Preview
            • Data Profile
            • Data Preparation
            • Delete
        • Utility
  • Repo Sync Project
    • Environments
    • Creating a Repo Sync Project
    • Project List
      • View
      • Project Migration
      • Keep Multiple Versions of a Project
      • Sharing a Project
      • Editing a Project
      • Activating a Project
      • Deactivating a Project
      • Deleting a Project
    • Tabs for a Data Science Lab Project
      • Tabs for TensorFlow and PyTorch Environment
        • Notebook
          • Accessing the Notebook Tab
          • Adding a Folder or File
          • Notebook Page
            • Preview File
            • .ipynb Cells
              • Using a Code Cell
              • Using a Markdown Cell
              • Using an Assist Cell
            • Resource Utilization Graph
            • Notebook Taskbar
            • Operations for an .ipynb File
              • Datasets
                • Copy Path (for Sandbox files)
              • Secrets
              • Algorithms
              • Transforms
              • Models
                • Model Explainer
                • Registering & Unregistering a Model
                • Model Filter
              • Files
              • Variable Explorer
              • Writers
              • Find and Replace
            • Actions Icons for .ipynb File
          • File Options
            • Export
            • Register
            • Delete
          • Git Console
        • Dataset
          • Adding Data Sets
            • Data Sets
            • Data Sandbox
          • Dataset List Page
            • Preview
            • Data Profile
            • Create Experiment
            • Data Preparation
            • Delete
        • Model
          • Import Model
          • Model Explainer
          • Share a Model
          • Export to GIT
          • Register a Model
          • Unregister A Model
          • Register a Model as an API Service
            • Register a Model as an API
            • Register an API Client
            • Pass Model Values in Postman
          • AutoML Models
        • Auto ML
          • Creating AutoML Experiments
            • Creating an Experiment
          • AutoML List Page
            • Experiment Status
            • Actions
              • View Report
                • Details
                • Models
                  • View Explanation
                    • Model Summary
                    • Model Interpretation
                      • Classification Model Explainer
                      • Regression Model Explainer
                      • Forecasting Model Explainer
                    • Dataset Explainer
              • Delete
      • Tabs for PySpark Environment
        • Notebook
          • Accessing the Notebook Tab
          • Adding a Folder or File
          • Notebook Page
            • Preview a File
            • Cells for .ipynb Files
              • Using a Code Cell
              • Using a Markdown Cell
              • Using an Assist Cell
            • Resource Utilization Graph
            • Notebook Taskbar
            • Operations for an .ipynb File
              • Datasets
                • Copy Path (for Sandbox files)
              • Secrets
              • Files
              • Variable Explorer
              • Writers
              • Find and Replace
            • Actions for .ipynb Files
            • File Options
              • Export
              • Register
              • Delete
            • Git Console
        • Dataset
          • Adding Data Sets
            • Data Sets
            • Data Sandbox
          • Dataset List Page
            • Preview
            • Data Profile
            • Data Preparation
            • Delete
Powered by GitBook
On this page
  • Adding Datasets
  • Uploading Datasets (Data Sandbox)
  • ​Reading Datasets
  • Project Level Dataset Tab
  1. Project
  2. Tabs for a Data Science Lab Project
  3. Tabs for TensorFlow and PyTorch Environment
  4. Notebook
  5. Notebook Page
  6. Notebook Operations

Datasets

Now the Notebook page facilitates the user to add the datasets.

PreviousNotebook OperationsNextCopy Path (for Sandbox files)

Last updated 1 year ago

Check out the given walk-through on how to add a dataset using the Notebook page.

Please Note: Once the Datasets are added to a Notebook, they get added by default to all the other Notebooks under the same project.

Adding Datasets

  • Navigate to the Notebook page.

  • Click the Datasets tab.

  • Click on the Add icon.

  • The Add Datasets page appears.

  • Select Data Sets or Data Sandbox option from the Data Source drop-down menu (In this case, Data Sets option is selected).

  • Search for the datasets or data sandbox files.

  • Select dataset(s) as per the requirement using the check box. The user can select multiple datasets/ data sandbox.

  • Click the Add option.

Please Note: The Add option gets displayed only after you select at least one dataset.

  • A notification message appears.

  • The selected Dataset(s) get added to the given Datasets tab.

  • Click on the More icon.

  • The drop-down menu appears displaying the Preview and Data Preparation actions for the added dataset appears.

Uploading Datasets (Data Sandbox)

Please Note: The Upload option is provided for the Sandbox files inside the Data Science Notebook.

  • Navigate to the Add Datasets page from the Datasets option provided under a Notebook.

  • Select the Data Sandbox option as Data Source.

  • Click the Upload option.

  • The Upload Data Sandbox window appears.

  • Provide a Sandbox Name.

  • Provide Description (it is optional).

  • Use the Choose File option to select a file from the system.

  • The selected file gets uploaded and the file name appears next to the Choose File option.

  • Click the Save option to upload the selected file.

  • After clicking the Save option progressive digits reaching 100 indicate completion of the upload file action.

  • A notification message appears to inform completion of the action.

  • The uploaded Sandbox file gets listed below.

  • Select the File using the checkbox.

  • Click the Add option.

  • A notification message appears.

  • The uploaded Data Sandbox dataset gets added to the Notebook.

  • Click on the More icon.

  • The drop-down menu appears displaying the Preview and Data Preparation actions for the added dataset appears.

​Reading Datasets

Please Note: Using get_data function datasets and data sandbox files (csv & xlsx files) can be read.

  • Add a new Code cell to Notebook or access an empty Code cell.

  • Select a dataset from the Datasets tab.

  • The get_data function appears in the code cell.

  • Provide the df (DataFrame) to print the data from the selected Dataset.

  • Run the cell.

  • The Data preview appears below after the cell run gets completed.

Project Level Dataset Tab

The Data Sets or sandbox files added to a Notebook will also be listed under the Dataset tab provided under the same project. Hence, the added datasets will be available for all the Notebooks created or imported under the same project.

Please Note: Refer the page to get an overview of the Data Science Lab module in nutshell.

Data Science Lab Quick Start Flow
Adding a Dataset to Notebook
Upload option for Data Sandbox
Adding the Data Sandbox file
Dataset Tab provided at Project Level
Adding a Dataset to a Notebook and reading data
Uploading a Datastore file, Adding it to Notebook and reading the Data