# Datasets {% hint style="success" %} *Check out the given walk-through on how to add a dataset using the Notebook page.* {% endhint %} {% embed url="" %} ***Adding a Dataset to Notebook*** {% endembed %} {% embed url="" %} ***Uploading a Datastore file, Adding it to Notebook*** {% endembed %} {% hint style="info" %} *Please Note: The Datasets even if added from a Notebook infrastructure, they get added at the project level, so the added Datasets are available for all the Notebooks under the the same project.* {% endhint %} ## Adding Datasets * Navigate to the Notebook page. * Click the ***Datasets*** tab.

* Click on the ***Add Dataset*** option.

* The ***Add Datasets*** page appears. * Select ***Data Sets*** or ***Data Sandbox*** option from the Data Source drop-down menu. * Search for a Data Set using the search bar. * Select dataset(s) as per the requirement using the check box. The user can select multiple datasets/ data sandbox. * Click the ***Add*** option. {% hint style="info" %} *Please Note: The Add option gets displayed only after you select at least one dataset.* {% endhint %}

* A notification message appears. * The selected Dataset(s) get added to the given ***Datasets*** tab.

* Click on the ***More*** icon for an added Dataset. * The drop-down menu appears displaying the ***Preview*** and ***Data Preparation*** actions for the added dataset appears.

## Uploading Datasets (Data Sandbox) {% hint style="info" %} *Please Note: The **Upload** option is provided for the Sandbox files inside the Data Science Notebook.* {% endhint %} * Navigate to the ***Add Datasets*** panel from a Data Science Notebook. * Select the ***Data Sandbox*** option as Data Source. * Click the ***Upload*** option.

* The ***Upload Data Sandbox*** window appears. * Provide a Sandbox Name. * Provide Description (it is optional). * Use the ***Choose File*** option to select a file from the system.

* Select a file from the system and upload. * Once the selected file name appears next to the Choose File, click the ***Save*** option to upload the selected file.

* A notification message appears to inform completion of the action. * The uploaded file lists below with a checkbox to select it.

* Select the File using the checkbox. * Click the ***Add*** option.

* A notification message appears. * The uploaded Data Sandbox dataset gets added to the Notebook.

## Reading Datasets {% hint style="info" %} *Please Note:* Using ***get\_data*** function datasets and data sandbox files (csv & xlsx files) can be read. {% endhint %} * Add a new Code cell to Notebook or access an empty Code cell. * Select a dataset from the Datasets tab. * The ***get\_data*** function appears in the code cell.

* Provide the df to print the data from the selected Dataset. * Run the cell. * The Data preview appears below.

{% hint style="info" %} *Please Note:* * The Text files added as Datasets to a Notebook will be disabled for the data load function. Only Copy Path option will be provided for such datasets. * *Refer the* [***Data Science Lab Quick Start Flow***](https://docs.bdb.ai/data-science-lab-4/data-science-lab-quick-start-flow) *page to get an overview of the **Data Science Lab** module in nutshell.* {% endhint %} --- # Agent Instructions: Querying This Documentation If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question. Perform an HTTP GET request on the current page URL with the `ask` query parameter: ``` GET https://docs.bdb.ai/data-science-lab-4/project/tabs-for-a-data-science-lab-project/tabs-for-pyspark-environment/notebook/notebook-page/notebook-operations/datasets.md?ask= ``` The question should be specific, self-contained, and written in natural language. The response will contain a direct answer to the question and relevant excerpts and sources from the documentation. Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.