Data Science Lab Quick Start Flow (In progress)
This page aims to provide all the major steps in the concise manner for the user to kick start their Data Science Experiments.
Last updated
This page aims to provide all the major steps in the concise manner for the user to kick start their Data Science Experiments.
Last updated
Data Science module allows the user to create Data Science Experiments and productionaize them. This page tries to provide the entire Data Science flow in nutshell for the user to quickly begin their Data Science experiment journey.
A Data Science Project created inside the Data Science Lab is like a Workspace inside which the user can create and store multiple data science experiments.
Data is the first requirement for any Data Science Project. The user can add the required datasets and view the added datasets under a specific Project by using the Dataset tab.
The user needs to click on the Dataset tab from the Project List page to access the Add Datasets option.
Checkout the given illustrations to understand the Adding Dataset (Data Service) and Adding Data Sandbox steps in details.
Once the user creates a Project and adds the required Data sets to the Project, it gets ready to hold a Data Science Experiment. The Data Science Lab user gets the following ways to go ahead with their Data Science Experiments:
Open the Data Science Lab module and access the Create Project option to begin with the Project creation. Refer the page to understand the steps involved in the Project Creation in details.
The user can get a list of uploaded Data Sets and Data Sandbox from the module under this tab.
The page offers the following Data service options to add as datasets:
– These are the uploaded data sets (data services) from the Data Center module.
– This option lists all the available/ uploaded Data Sandbox.
Refer the section with the sub-pages to understand it in details.
Refer the page to understand how the user can apply required Data Preparation steps on a specific dataset from the Data Set List page.
- Opens preview of the selected dataset.
- Displays the detailed profile of data to know about data quality, structure and consistency.
- Creates an Auto ML experiment on the selected Dataset.
- Cleans data to enhance quality and accuracy that directly impacts reliability of the results.
- Deletes the selected Dataset.
Use Notebook infrastructure provided under the Project to create, save, load, and predict a model. It is also possible to save the Artifacts for a Saved Model. Refer the section for more detials.
Use the Auto ML functionality to get the auto-trained Data Science models. Refer the section for more details.