> For the complete documentation index, see [llms.txt](https://docs.bdb.ai/data-science-lab-3/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.bdb.ai/data-science-lab-3/project/tabs-for-a-data-science-lab-project/tabs-for-tensorflow-and-pytorch-environment/notebook/notebook-page/notebook-operations/datasets.md).

# Datasets

{% hint style="success" %}
*Check out the given walk-through on how to add a dataset using the Notebook page.*
{% endhint %}

{% embed url="<https://files.gitbook.com/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FBLGYLEkBUnc8nVEBAuEI%2Fuploads%2Fp00KkD1DhW0Q1cQTq8wf%2FAdding%20A%20Dataset%20and%20reading%20V2.mp4?alt=media&token=5101a4cd-e512-43ce-8f04-e477ca81878e>" %}
***Adding a Dataset to a Notebook and reading data***
{% endembed %}

{% embed url="<https://files.gitbook.com/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FBLGYLEkBUnc8nVEBAuEI%2Fuploads%2FztxG0IjAnL9zzdGQ2FFS%2FUploading%20a%20Data%20Sandbox%20to%20DSL%20Project%20V2.mp4?alt=media&token=fe7791cc-257b-41af-88ee-c55944cad4e8>" %}
***Uploading a Datastore file, Adding it to Notebook and reading the Data***
{% endembed %}

{% hint style="info" %}
*<mark style="color:green;">Please Note:</mark> Once the Datasets are added to a Notebook, they get added by default to all the other Notebooks under the same project.*
{% endhint %}

## Adding Datasets

* Navigate to the Notebook page.
* Click the ***Datasets*** tab.

<figure><img src="/files/iFJ1892osUL2J9qM9K83" alt=""><figcaption></figcaption></figure>

* Click on the ***Add*** icon.

<figure><img src="/files/ILh4vf99QgFAaZP7XMOH" alt=""><figcaption></figcaption></figure>

* The ***Add Datasets*** page appears.&#x20;
* Select ***Data Sets*** or ***Data Sandbox*** option from the Data Source drop-down menu (In this case, Data Sets option is selected).
* Search for the datasets or data sandbox files.
* Select dataset(s) as per the requirement using the check box. The user can select multiple datasets/ data sandbox.
* Click the ***Add*** option.&#x20;

{% hint style="info" %}
*<mark style="color:green;">Please Note:</mark> The Add option gets displayed only after you select at least one dataset.*
{% endhint %}

<figure><img src="/files/nzIBDbEeVqBuBQ8rSAOy" alt=""><figcaption><p><em><strong>Adding a Dataset to Notebook</strong></em></p></figcaption></figure>

* A notification message appears.
* The selected Dataset(s) get added to the given ***Datasets*** tab.

<figure><img src="/files/VIz7VD5qZSohHClI2JY2" alt=""><figcaption></figcaption></figure>

* Click on the ***More*** icon.
* The drop-down menu appears displaying the ***Preview*** and ***Data Preparation*** actions for the added dataset appears.

&#x20;     ![](/files/dGBelUxCn6HmKgpVJvYt)

## Uploading Datasets (Data Sandbox)

{% hint style="info" %}
*<mark style="color:green;">Please Note:</mark>  The **Upload** option is provided for the Sandbox files inside the Data Science Notebook.*
{% endhint %}

* Navigate to the ***Add Datasets*** page from the ***Datasets*** option provided under a **Notebook**.
* Select the ***Data Sandbox*** option as ***Data Source***.
* Click the ***Upload*** option.

<figure><img src="/files/AjjEoG6xHSx4WY6Br7lN" alt=""><figcaption><p><em><strong>Upload option for Data Sandbox</strong></em></p></figcaption></figure>

* The ***Upload Data Sandbox*** window appears.
* Provide a Sandbox Name.
* Provide Description (it is optional).
* Use the ***Choose File*** option to select a file from the system.

<figure><img src="/files/lzVSGUOjOyWb6SGDqsLP" alt=""><figcaption></figcaption></figure>

* The selected file gets uploaded and the file name appears next to the ***Choose File*** option.
* Click the ***Save*** option to upload the selected file.

<figure><img src="/files/0rEQbFFyUtMQULhP7z1Y" alt=""><figcaption></figcaption></figure>

* After clicking the ***Save*** option progressive digits reaching 100 indicate completion of the upload file action.

<figure><img src="/files/CHYGSa3U6DzyIO6E2kmc" alt=""><figcaption></figcaption></figure>

* A notification message appears to inform completion of the action.
* The uploaded Sandbox file gets listed below.

<figure><img src="/files/bPEmbepd4BC66XypKOyi" alt=""><figcaption></figcaption></figure>

* Select the File using the checkbox.
* Click the ***Add*** option.

<figure><img src="/files/kqwW9QFiWdeVKFhy5Y0P" alt=""><figcaption><p><em><strong>Adding the Data Sandbox file</strong></em></p></figcaption></figure>

* A notification message appears.
* The uploaded Data Sandbox dataset gets added to the Notebook.

<figure><img src="/files/hJkjhCO9GW5n6pl1QRtb" alt=""><figcaption></figcaption></figure>

* Click on the ***More*** icon.
* The drop-down menu appears displaying the ***Preview*** and ***Data Preparation*** actions for the added dataset appears.

&#x20;     ![](/files/vvj2u9wPCxac6Kjwx6Yf)

## ​Reading Datasets

{% hint style="info" %}
*<mark style="color:green;">Please Note</mark>:* Using ***get\_data*** function datasets and data sandbox files (csv & xlsx files) can be read.
{% endhint %}

* Add a new Code cell to Notebook or access an empty Code cell.
* Select a dataset from the Datasets tab.
* The ***get\_data*** function appears in the code cell.

<figure><img src="/files/QG9tg6sDqbI30e5dNYS8" alt=""><figcaption></figcaption></figure>

* Provide the df (DataFrame) to print the data from the selected Dataset.
* Run the cell.
* The Data preview appears below after the cell run gets completed.

<figure><img src="/files/cRPcnj4Ool6pAvdYonOd" alt=""><figcaption></figcaption></figure>

## Project Level Dataset Tab&#x20;

The Data Sets or sandbox files added to a Notebook will also be listed under the Dataset tab provided under the same project. Hence, the added datasets will be available for all the Notebooks created or imported under the same project.

<figure><img src="/files/Gp2y9yQlxA0wcp9tAKZY" alt=""><figcaption><p><em><strong>Dataset Tab provided at Project Level</strong></em> </p></figcaption></figure>

{% hint style="info" %}
*<mark style="color:green;">Please Note:</mark> Refer the* [***Data Science Lab Quick Start Flow*** ](https://docs.bdb.ai/data-science-lab-3/data-science-lab-quick-start-flow)*page to get an overview of the **Data Science Lab** module in nutshell.*&#x20;
{% endhint %}


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://docs.bdb.ai/data-science-lab-3/project/tabs-for-a-data-science-lab-project/tabs-for-tensorflow-and-pytorch-environment/notebook/notebook-page/notebook-operations/datasets.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
