Data Science Lab
  • What is Data Science Lab?
  • Accessing the Data Science Lab Module
  • Data Science Lab Quick Start Flow
  • Project
    • Environments
    • Creating a Project
    • Project List
      • View
      • Keep Multiple Versions of a Project
      • Sharing a Project
      • Editing a Project
      • Activating a Project
      • Deactivating a Project
      • Deleting a Project
    • Tabs for a Data Science Lab Project
      • Tabs for TensorFlow and PyTorch Environment
        • Notebook
          • Ways to Access Notebook
            • Create
            • Import
              • Importing a Notebook
              • Pull from Git
          • Notebook Page
            • Preview Notebook
            • Notebook Cells
              • Using a Code Cell
              • Using a Markdown Cell
              • Using an Assist Cell
            • Renaming a Notebook
            • Resource Utilization Graph
            • Notebook Taskbar
            • Notebook Operations
              • Datasets
                • Copy Path (for Sandbox files)
              • Secrets
              • Algorithms
              • Transforms
              • Utility
              • Models
                • Model Explainer
                • Registering & Unregistering a Model
                • Model Filter
              • Artifacts
              • Files
              • Variable Explorer
              • Writers
              • Find and Replace
            • Notebook Actions
          • Notebook List
            • Notebook List Actions
              • Export
                • Export to Pipeline
                • Export to GIT
              • Register as Job
              • Notebook Version Control
              • Sharing a Notebook
              • Deleting a Notebook
        • Dataset
          • Adding Data Sets
            • Data Sets
            • Data Sandbox
          • Dataset List Page
            • Preview
            • Data Profile
            • Create Experiment
            • Data Preparation
            • Delete
        • Utility
          • Pull from Git (Utility)
        • Model
          • Model Explainer
          • Import Model
          • Export to GIT
          • Register a Model
          • Unregister A Model
          • Register a Model as an API Service
            • Register a Model as an API
            • Register an API Client
            • Pass Model Values in Postman
          • AutoML Models
        • Auto ML
          • Creating AutoML Experiments
            • Creating an Experiment
          • AutoML List Page
            • View Report
              • Details
              • Models
                • View Explanation
                  • Model Summary
                  • Model Interpretation
                    • Classification Model Explainer
                    • Regression Model Explainer
                    • Forecasting Model Explainer
                  • Dataset Explainer
            • Delete
      • Tabs for PySpark Environment
        • Notebook
          • Ways to Access Notebook
            • Create
            • Import
              • Importing a Notebook
          • Notebook Page
            • Preview Notebook
            • Notebook Cells
              • Using a Code Cell
              • Using a Markdown Cell
              • Using an Assist Cell
            • Renaming a Notebook
            • Resource Utilization Graph
            • Notebook Taskbar
            • Notebook Operations
              • Datasets
                • Copy Path (for Sandbox files)
              • Secrets
              • Utility
              • Files
              • Variable Explorer
              • Writers
              • Find and Replace
            • Notebook Actions
          • Notebook List
            • Notebook List Actions
              • Export
                • Export to Pipeline
                • Export to GIT (on hold)
              • Register as Job
              • Notebook Version Control
              • Sharing a Notebook
              • Deleting a Notebook
        • Dataset
          • Adding Data Sets
            • Data Sets
            • Data Sandbox
          • Dataset List Page
            • Preview
            • Data Profile
            • Data Preparation
            • Delete
        • Utility
Powered by GitBook
On this page
  • Adding Datasets
  • Uploading Datasets (Data Sandbox)
  • ​Reading Datasets
  1. Project
  2. Tabs for a Data Science Lab Project
  3. Tabs for PySpark Environment
  4. Notebook
  5. Notebook Page
  6. Notebook Operations

Datasets

Now the Notebook page facilitates the user to add the datasets.

PreviousNotebook OperationsNextCopy Path (for Sandbox files)

Last updated 1 year ago

Check out the given walk-through on how to add a dataset using the Notebook page.

Please Note: The Datasets even if added from a Notebook infrastructure, they get added at the project level, so the added Datasets are available for all the Notebooks under the the same project.

Adding Datasets

  • Navigate to the Notebook page.

  • Click the Datasets tab.

  • Click on the Add Dataset option.

  • The Add Datasets page appears.

  • Select Data Sets or Data Sandbox option from the Data Source drop-down menu.

  • Search for a Data Set using the search bar.

  • Select dataset(s) as per the requirement using the check box. The user can select multiple datasets/ data sandbox.

  • Click the Add option.

Please Note: The Add option gets displayed only after you select at least one dataset.

  • A notification message appears.

  • The selected Dataset(s) get added to the given Datasets tab.

  • Click on the More icon for an added Dataset.

  • The drop-down menu appears displaying the Preview and Data Preparation actions for the added dataset appears.

Uploading Datasets (Data Sandbox)

Please Note: The Upload option is provided for the Sandbox files inside the Data Science Notebook.

  • Navigate to the Add Datasets panel from a Data Science Notebook.

  • Select the Data Sandbox option as Data Source.

  • Click the Upload option.

  • The Upload Data Sandbox window appears.

  • Provide a Sandbox Name.

  • Provide Description (it is optional).

  • Use the Choose File option to select a file from the system.

  • Select a file from the system and upload.

  • Once the selected file name appears next to the Choose File, click the Save option to upload the selected file.

  • A notification message appears to inform completion of the action.

  • The uploaded file lists below with a checkbox to select it.

  • Select the File using the checkbox.

  • Click the Add option.

  • A notification message appears.

  • The uploaded Data Sandbox dataset gets added to the Notebook.

​Reading Datasets

Please Note: Using get_data function datasets and data sandbox files (csv & xlsx files) can be read.

  • Add a new Code cell to Notebook or access an empty Code cell.

  • Select a dataset from the Datasets tab.

  • The get_data function appears in the code cell.

  • Provide the df to print the data from the selected Dataset.

  • Run the cell.

  • The Data preview appears below.

Please Note:

  • The Text files added as Datasets to a Notebook will be disabled for the data load function. Only Copy Path option will be provided for such datasets.

Refer the page to get an overview of the Data Science Lab module in nutshell.

Data Science Lab Quick Start Flow
Adding a Dataset to Notebook
Uploading a Datastore file, Adding it to Notebook
The Add Datasets page