Create Project

This page explains how to create a Data Science Lab (DSL) Project.

What is a Project?

A Data Science Project created inside the Data Science Lab works like a workspace in which users can create and store multiple data science experiments and their associated artifacts.

Creating a new Project

Creating a Project
Creating a Repo Sync Project

Steps to create a new DSL Project

  • Navigate to the Home page of the Data Science Lab module.

  • Click the Create icon on the Home page.

  • The Create Project or Feature Store drawer opens.

  • Click the Create option provided for the Project.

  • The Create Project drawer opens, where you provide the information for a new Project.

  • Provide the following details for a new project:

    • Project Name: Give a name to the new project.

    • Project Description: Describe the project.

    • Select Algorithms: Select algorithms using the drop-down menu.

    • Environment: Select the environment to work in. The currently supported environments are Python TensorFlow, Python PyTorch, and PySpark.

      • Users who select the TensorFlow environment do not need to install packages like TensorFlow and Keras explicitly in the notebook. These packages can be imported inside the notebook.

      • Users who select the PyTorch environment do not need to install packages like Torch and Torchvision in the notebook. These packages can be imported inside the notebook.

    • Resource Allocation: Allocate the CPU/GPU and memory to be used by the Notebook container inside the project. The currently supported Resource Allocation options are Low, Medium, and High.

    • Idle Shutdown: Specify the idle time limit after which the notebook session is disconnected and the project is deactivated. To use the notebook again, the project must be activated. The supported Idle Shutdown options are 30m, 1h, and 2h.

    • External Libraries: Enter the names of the external libraries that must be installed in your DSL project/notebook (include the version number with the library name if a specific version is required). Separate the library names with commas only (no spaces). This is an optional field.

  • After you fill in the mandatory fields, the following modifiable fields appear with pre-selected values:

    • Image Name

    • Image Version

    • Limit

    • Memory

    • Request (CPU)

    • Memory

    • Git Project: Select a project from the drop-down menu.

    • Git Branch: Select a branch option from the drop-down menu (The supported branches are main, migration, and version).

  • GPU Type: Select the GPU type from the drop-down menu (Nvidia is the currently supported GPU type).

    • GPU Limit: Set the GPU limit using this field (This field appears only after the GPU Type option is selected).

  • Sync git repo at project creation: Select this checkbox to sync the Git repository while creating the DS Lab project.
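
The External Libraries field above expects library names separated by commas with no spaces. As a minimal sketch, the value could be prepared outside the product before pasting it into the field. The helper name `build_external_libraries` is hypothetical and not part of Data Science Lab, and the pip-style `name==version` pin is an assumption, since this page does not specify how a version should be written.

```python
import re

# Hypothetical helper; not part of the Data Science Lab product.
# Assumption: a version is pinned pip-style, e.g. "pandas==2.1.0".
_VALID_ENTRY = re.compile(r"^[A-Za-z0-9][A-Za-z0-9._-]*(==[A-Za-z0-9.]+)?$")


def build_external_libraries(entries):
    """Join library entries into one comma-separated string with no spaces."""
    cleaned = []
    for entry in entries:
        item = "".join(entry.split())  # remove every whitespace character
        if not item:
            continue  # ignore blank entries
        if not _VALID_ENTRY.match(item):
            raise ValueError(f"Unexpected library entry: {entry!r}")
        cleaned.append(item)
    return ",".join(cleaned)


print(build_external_libraries(["pandas == 2.1.0", " scikit-learn", "requests"]))
```

The printed string can be copied into the External Libraries field as-is. Entries that contain a comma or other unexpected characters raise a `ValueError` instead of being silently passed through.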

Please Note:

  • You can enable the Sync git repo at project creation option to make your DSL Project a Git Repo Sync Project. Repo Sync Projects are displayed in the Project list with a branch icon in their title.

  • You can configure Git access for a normal Data Science Lab project by configuring the Git Repository and Git Branch fields while creating a new project. Such projects display the branch icon without the drop-down option when the project is opened.

  • Click the Save option.

The Create Project Drawer
  • The confirmation message appears.

  • The newly created project is saved and appears in the Projects list.

The newly created Project gets added at the top of the Projects list
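
Once the project is active, a first notebook cell might confirm that the packages tied to the selected environment can be imported without installing them. The following is a sketch only: the mapping `ENVIRONMENT_PACKAGES` and the function `check_packages` are hypothetical names, and the import names (`tensorflow`, `keras`, `torch`, `torchvision`) are assumed from the packages mentioned on this page.

```python
import importlib.util

# Assumption: import names for the packages this page mentions per environment.
ENVIRONMENT_PACKAGES = {
    "Python TensorFlow": ["tensorflow", "keras"],
    "Python PyTorch": ["torch", "torchvision"],
}


def check_packages(environment):
    """Return {package: bool} showing which expected packages are importable."""
    return {
        name: importlib.util.find_spec(name) is not None
        for name in ENVIRONMENT_PACKAGES[environment]
    }


# Report availability without actually importing the (large) packages.
for package, available in check_packages("Python TensorFlow").items():
    print(f"{package}: {'available' if available else 'missing'}")
```

Using `importlib.util.find_spec` only looks the package up, so the check stays fast; a package reported as missing would need to be listed in the External Libraries field or installed in the notebook.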
