Data Science Lab
  • What is Data Science Lab?
  • Accessing the Data Science Lab Module
  • Data Science Lab Quick Start Flow
  • Project
    • Environments
    • Creating a Project
    • Project List
      • View
      • Keep Multiple Versions of a Project
      • Sharing a Project
      • Editing a Project
      • Activating a Project
      • Deactivating a Project
      • Deleting a Project
    • Tabs for a Data Science Lab Project
      • Tabs for TensorFlow and PyTorch Environment
        • Notebook
          • Ways to Access Notebook
            • Create
            • Import
              • Importing a Notebook
              • Pull from Git
          • Notebook Page
            • Preview Notebook
            • Notebook Cells
              • Using a Code Cell
              • Using a Markdown Cell
              • Using an Assist Cell
            • Renaming a Notebook
            • Resource Utilization Graph
            • Notebook Taskbar
            • Notebook Operations
              • Datasets
                • Copy Path (for Sandbox files)
              • Secrets
              • Algorithms
              • Transforms
              • Utility Notebook Operation
              • Models
                • Model Explainer
                • Registering & Unregistering a Model
                • Model Filter
              • Artifacts
              • Files
              • Variable Explorer
              • Writers
              • Find and Replace
            • Notebook Actions
          • Notebook List
            • Notebook List Actions
              • Export
                • Export to Pipeline
                • Export to GIT
              • Register as Job
              • Notebook Version Control
              • Sharing a Notebook
              • Deleting a Notebook
        • Dataset
          • Adding Data Sets
            • Data Sets
            • Data Sandbox
          • Dataset List Page
            • Preview
            • Data Profile
            • Create Experiment
            • Data Preparation
            • Delete
        • Utility
          • Pull from Git (Utility)
        • Model
          • Model Explainer
          • Share a Model
          • Import Model
          • Export to GIT
          • Register a Model
          • Unregister A Model
          • Register a Model as an API Service
            • Register a Model as an API
            • Register an API Client
            • Pass Model Values in Postman
          • AutoML Models
        • Auto ML
          • Creating AutoML Experiments
            • Creating an Experiment
          • AutoML List Page
            • View Report
              • Details
              • Models
                • View Explanation
                  • Model Summary
                  • Model Interpretation
                    • Classification Model Explainer
                    • Regression Model Explainer
                    • Forecasting Model Explainer
                  • Dataset Explainer
            • Delete
      • Tabs for PySpark Environment
        • Notebook
          • Ways to Access Notebook
            • Create
            • Import
              • Importing a Notebook
          • Notebook Page
            • Preview Notebook
            • Notebook Cells
              • Using a Code Cell
              • Using a Markdown Cell
              • Using an Assist Cell
            • Renaming a Notebook
            • Resource Utilization Graph
            • Notebook Taskbar
            • Notebook Operations
              • Datasets
                • Copy Path (for Sandbox files)
              • Secrets
              • Utility
              • Files
              • Variable Explorer
              • Writers
              • Find and Replace
            • Notebook Actions
          • Notebook List
            • Notebook List Actions
              • Export
                • Export to Pipeline
                • Export to GIT
              • Register as Job
              • Notebook Version Control
              • Sharing a Notebook
              • Deleting a Notebook
        • Dataset
          • Adding Data Sets
            • Data Sets
            • Data Sandbox
          • Dataset List Page
            • Preview
            • Data Profile
            • Data Preparation
            • Delete
        • Utility
  • Repo Sync Project
    • Environments
    • Creating a Repo Sync Project
    • Project List
      • View
      • Project Migration
      • Keep Multiple Versions of a Project
      • Sharing a Project
      • Editing a Project
      • Activating a Project
      • Deactivating a Project
      • Deleting a Project
    • Tabs for a Data Science Lab Project
      • Tabs for TensorFlow and PyTorch Environment
        • Notebook
          • Accessing the Notebook Tab
          • Adding a Folder or File
          • Notebook Page
            • Preview File
            • .ipynb Cells
              • Using a Code Cell
              • Using a Markdown Cell
              • Using an Assist Cell
            • Resource Utilization Graph
            • Notebook Taskbar
            • Operations for an .ipynb File
              • Datasets
                • Copy Path (for Sandbox files)
              • Secrets
              • Algorithms
              • Transforms
              • Models
                • Model Explainer
                • Registering & Unregistering a Model
                • Model Filter
              • Files
              • Variable Explorer
              • Writers
              • Find and Replace
            • Actions Icons for .ipynb File
          • File Options
            • Export
            • Register
            • Delete
          • Git Console
        • Dataset
          • Adding Data Sets
            • Data Sets
            • Data Sandbox
          • Dataset List Page
            • Preview
            • Data Profile
            • Create Experiment
            • Data Preparation
            • Delete
        • Model
          • Import Model
          • Model Explainer
          • Share a Model
          • Export to GIT
          • Register a Model
          • Unregister A Model
          • Register a Model as an API Service
            • Register a Model as an API
            • Register an API Client
            • Pass Model Values in Postman
          • AutoML Models
        • Auto ML
          • Creating AutoML Experiments
            • Creating an Experiment
          • AutoML List Page
            • Experiment Status
            • Actions
              • View Report
                • Details
                • Models
                  • View Explanation
                    • Model Summary
                    • Model Interpretation
                      • Classification Model Explainer
                      • Regression Model Explainer
                      • Forecasting Model Explainer
                    • Dataset Explainer
              • Delete
      • Tabs for PySpark Environment
        • Notebook
          • Accessing the Notebook Tab
          • Adding a Folder or File
          • Notebook Page
            • Preview a File
            • Cells for .ipynb Files
              • Using a Code Cell
              • Using a Markdown Cell
              • Using an Assist Cell
            • Resource Utilization Graph
            • Notebook Taskbar
            • Operations for an .ipynb File
              • Datasets
                • Copy Path (for Sandbox files)
              • Secrets
              • Files
              • Variable Explorer
              • Writers
              • Find and Replace
            • Actions for .ipynb Files
            • File Options
              • Export
              • Register
              • Delete
            • Git Console
        • Dataset
          • Adding Data Sets
            • Data Sets
            • Data Sandbox
          • Dataset List Page
            • Preview
            • Data Profile
            • Data Preparation
            • Delete
Powered by GitBook
On this page
  • Adding Datasets
  • Uploading Datasets (Data Sandbox)
  • ​Reading Datasets
  • Project Level Dataset Tab
  1. Repo Sync Project
  2. Tabs for a Data Science Lab Project
  3. Tabs for TensorFlow and PyTorch Environment
  4. Notebook
  5. Notebook Page
  6. Operations for an .ipynb File

Datasets

Now the Notebook page facilitates the user to add the datasets.

PreviousOperations for an .ipynb FileNextCopy Path (for Sandbox files)

Last updated 10 months ago

Check out the given walk-through on how to add a dataset using the Notebook page.

Please Note: Once the Datasets are added to a Notebook, they get added by default to all the other Notebooks under the same project.

Adding Datasets

  • Navigate to the Notebook page.

  • Click the Datasets tab.

  • Click on the Add icon.

  • The Add Datasets page appears.

  • Select the Data Sets or Data Sandbox option from the Data Source drop-down menu (In this case, the Data Sets option is selected).

  • Search for the datasets or data sandbox files.

  • Select dataset(s) as per the requirement using the check box. The user can select multiple datasets/ data sandboxes.

  • Click the Add option.

Please Note: The Add option gets displayed only after you select at least one dataset.

  • A notification message appears.

  • The selected Dataset(s) get added to the given Datasets tab.

  • Click on the More icon.

  • The drop-down menu appears displaying the Preview and Data Preparation actions for the added dataset.

Uploading Datasets (Data Sandbox)

Please Note: The Upload option is provided for the Sandbox files inside the Data Science Notebook.

  • Navigate to the Add Datasets page from the Datasets option provided under a Notebook.

  • Select the Data Sandbox option as Data Source.

  • Click the Upload option.

  • The Upload Data Sandbox window appears.

  • Provide a Sandbox Name.

  • Provide Description (it is optional).

  • Use the Choose File option to select a file from the system.

  • The selected file gets uploaded and the file name appears next to the Choose File option.

  • Click the Save option to upload the selected file.

  • After clicking the Save option progressive digits reaching 100 indicate completion of the upload file action.

  • A notification message appears to inform the completion of the action.

  • The uploaded Sandbox file is listed below.

  • Select the File using the checkbox.

  • Click the Add option.

  • A notification message appears.

  • The uploaded Data Sandbox dataset gets added to the Notebook.

  • Click on the More icon.

  • The drop-down menu appears displaying the Preview and Data Preparation actions for the added dataset.

​Reading Datasets

Please Note: Using the get_data function datasets and data sandbox files (csv & xlsx files) can be read.

  • Add a new Code cell to Notebook or access an empty Code cell.

  • Select a dataset from the Datasets tab.

  • The get_data function appears in the code cell.

  • Provide the df (DataFrame) to print the data from the selected Dataset.

  • Run the cell.

  • The Data preview appears below after the cell run is completed.

Project Level Dataset Tab

The Data Sets or sandbox files added to a Notebook will also be listed under the Dataset tab provided under the same project. Hence, the added datasets will be available for all the Notebooks created or imported under the same project.

Please Note: Refer to the page to get an overview of the Data Science Lab module in a nutshell.

Data Science Lab Quick Start Flow
Adding a Dataset to a Notebook and reading data
Uploading a Datastore file, Adding it to Notebook and reading the Data
Adding a Dataset to Notebook
Upload option for Data Sandbox
Dataset Tab provided at Project Level