Data Science Lab
  • What is Data Science Lab?
  • Accessing the Data Science Lab Module
  • Data Science Lab Quick Start Flow
  • Project
    • Environments
    • Creating a Project
    • Project List
      • View
      • Keep Multiple Versions of a Project
      • Sharing a Project
      • Editing a Project
      • Activating a Project
      • Deactivating a Project
      • Deleting a Project
    • Tabs for a Data Science Lab Project
      • Tabs for TensorFlow and PyTorch Environment
        • Notebook
          • Ways to Access Notebook
            • Create
            • Import
              • Importing a Notebook
              • Pull from Git
          • Notebook Page
            • Preview Notebook
            • Notebook Cells
              • Using a Code Cell
              • Using a Markdown Cell
              • Using an Assist Cell
            • Renaming a Notebook
            • Resource Utilization Graph
            • Notebook Taskbar
            • Notebook Operations
              • Datasets
                • Copy Path (for Sandbox files)
              • Secrets
              • Algorithms
              • Transforms
              • Utility Notebook Operation
              • Models
                • Model Explainer
                • Registering & Unregistering a Model
                • Model Filter
              • Artifacts
              • Files
              • Variable Explorer
              • Writers
              • Find and Replace
            • Notebook Actions
          • Notebook List
            • Notebook List Actions
              • Export
                • Export to Pipeline
                • Export to GIT
              • Register as Job
              • Notebook Version Control
              • Sharing a Notebook
              • Deleting a Notebook
        • Dataset
          • Adding Data Sets
            • Data Sets
            • Data Sandbox
          • Dataset List Page
            • Preview
            • Data Profile
            • Create Experiment
            • Data Preparation
            • Delete
        • Utility
          • Pull from Git (Utility)
        • Model
          • Model Explainer
          • Share a Model
          • Import Model
          • Export to GIT
          • Register a Model
          • Unregister A Model
          • Register a Model as an API Service
            • Register a Model as an API
            • Register an API Client
            • Pass Model Values in Postman
          • AutoML Models
        • Auto ML
          • Creating AutoML Experiments
            • Creating an Experiment
          • AutoML List Page
            • View Report
              • Details
              • Models
                • View Explanation
                  • Model Summary
                  • Model Interpretation
                    • Classification Model Explainer
                    • Regression Model Explainer
                    • Forecasting Model Explainer
                  • Dataset Explainer
            • Delete
      • Tabs for PySpark Environment
        • Notebook
          • Ways to Access Notebook
            • Create
            • Import
              • Importing a Notebook
          • Notebook Page
            • Preview Notebook
            • Notebook Cells
              • Using a Code Cell
              • Using a Markdown Cell
              • Using an Assist Cell
            • Renaming a Notebook
            • Resource Utilization Graph
            • Notebook Taskbar
            • Notebook Operations
              • Datasets
                • Copy Path (for Sandbox files)
              • Secrets
              • Utility
              • Files
              • Variable Explorer
              • Writers
              • Find and Replace
            • Notebook Actions
          • Notebook List
            • Notebook List Actions
              • Export
                • Export to Pipeline
                • Export to GIT
              • Register as Job
              • Notebook Version Control
              • Sharing a Notebook
              • Deleting a Notebook
        • Dataset
          • Adding Data Sets
            • Data Sets
            • Data Sandbox
          • Dataset List Page
            • Preview
            • Data Profile
            • Data Preparation
            • Delete
        • Utility
  • Repo Sync Project
    • Environments
    • Creating a Repo Sync Project
    • Project List
      • View
      • Project Migration
      • Keep Multiple Versions of a Project
      • Sharing a Project
      • Editing a Project
      • Activating a Project
      • Deactivating a Project
      • Deleting a Project
    • Tabs for a Data Science Lab Project
      • Tabs for TensorFlow and PyTorch Environment
        • Notebook
          • Accessing the Notebook Tab
          • Adding a Folder or File
          • Notebook Page
            • Preview File
            • .ipynb Cells
              • Using a Code Cell
              • Using a Markdown Cell
              • Using an Assist Cell
            • Resource Utilization Graph
            • Notebook Taskbar
            • Operations for an .ipynb File
              • Datasets
                • Copy Path (for Sandbox files)
              • Secrets
              • Algorithms
              • Transforms
              • Models
                • Model Explainer
                • Registering & Unregistering a Model
                • Model Filter
              • Files
              • Variable Explorer
              • Writers
              • Find and Replace
            • Actions Icons for .ipynb File
          • File Options
            • Export
            • Register
            • Delete
          • Git Console
        • Dataset
          • Adding Data Sets
            • Data Sets
            • Data Sandbox
          • Dataset List Page
            • Preview
            • Data Profile
            • Create Experiment
            • Data Preparation
            • Delete
        • Model
          • Import Model
          • Model Explainer
          • Share a Model
          • Export to GIT
          • Register a Model
          • Unregister A Model
          • Register a Model as an API Service
            • Register a Model as an API
            • Register an API Client
            • Pass Model Values in Postman
          • AutoML Models
        • Auto ML
          • Creating AutoML Experiments
            • Creating an Experiment
          • AutoML List Page
            • Experiment Status
            • Actions
              • View Report
                • Details
                • Models
                  • View Explanation
                    • Model Summary
                    • Model Interpretation
                      • Classification Model Explainer
                      • Regression Model Explainer
                      • Forecasting Model Explainer
                    • Dataset Explainer
              • Delete
      • Tabs for PySpark Environment
        • Notebook
          • Accessing the Notebook Tab
          • Adding a Folder or File
          • Notebook Page
            • Preview a File
            • Cells for .ipynb Files
              • Using a Code Cell
              • Using a Markdown Cell
              • Using an Assist Cell
            • Resource Utilization Graph
            • Notebook Taskbar
            • Operations for an .ipynb File
              • Datasets
                • Copy Path (for Sandbox files)
              • Secrets
              • Files
              • Variable Explorer
              • Writers
              • Find and Replace
            • Actions for .ipynb Files
            • File Options
              • Export
              • Register
              • Delete
            • Git Console
        • Dataset
          • Adding Data Sets
            • Data Sets
            • Data Sandbox
          • Dataset List Page
            • Preview
            • Data Profile
            • Data Preparation
            • Delete
Powered by GitBook
On this page
  • Create a New Project
  • Container Status Message
  1. Repo Sync Project

Creating a Repo Sync Project

Create your experimental DS projects within the platform.

PreviousEnvironmentsNextProject List

Last updated 1 year ago

Pre-requisite: The users must have the following Admin level settings configured to access and use the Repo Syncs Project functionality inside the DS Lab module.

  • It is mandatory to configure the option before beginning with the Data Science Project creation.

  • Also, select the Algorithms by using the Algorithms field from the DS Lab Settings section which you wish to use for your DS Lab project.

  • The user must have the following settings done.

    • The token key has to be configured for the DS Lab module.

    • The repository and branch have to be specified to save the settings.

  • The user must complete the following :

    • Token key – bdbvcstoken

    • User id key - bdbvcsuserid

  • The user must do the following User level configuration to create a Repo Sync DS Lab project.

Create a New Project

  • Navigate to the Projects Page of the Data Science Lab module.

  • Click the Create Project option to create a new project.

  • A form opens to provide the Project-related information.

  • The next screen opens asking for the following details for a new project:

    • Project Name: Give a name to the new project.

    • Project Description: Describe the project.

    • Select Algorithms: Select algorithms using the drop-down menu.

    • Environment: Allows users to select the environment they want to work in. Currently, supported Python frameworks are Sklearn (default), TensorFlow, and PyTorch (The user can execute Sklearn commands by default in the notebook).

      • If the users select the TensorFlow environment, they do not need to install packages like the TensorFlow and Keras explicitly in the notebook. These packages can simply be imported inside the notebook.

      • If the users select the PyTorch environment, they do not need to install packages like the Torch and Torchvision explicitly in the notebook. These packages can simply be imported inside the notebook.

      The user can select an option out of the given choices: 1. Python Tensor Flow, 2. Python PyTorch

    • Resource Allocation: This allows the users to allocate CPU/ GPU and memory to be used by the Notebook container inside a given project. The currently supported Resource Allocation options are Low, Medium, and High.

    • Idle Shutdown: It allows the users to specify the idle time limit after which the notebook session will get disconnected, and the project will be deactivated. To use the notebook again, the project should be activated. The supported Idle Shutdown options are 30m, 1h, and 2h.

    • External Libraries: Mention the names of external libraries (if a specific version is required then mention the library name with version number) that need to be installed for use in your project /notebook. The names of the external libraries should be separated only by commas (without space) for this field.

  • After you fill in the mandatory fields the following modifiable fields appear with pre-selected values:

    • Image Name

    • Image Version

    • Limit

    • Memory

    • Request (CPU)

    • Memory

    • Git Project: Select a project from the drop-down menu.

    • Git Branch: Select a branch option from the drop-down menu (The supported branches are main, migration, and version).

    • GPU Type: Select GPU type from the drop-down menu (Currently we support nvidia as the GPU Type).

      • GPU Limit: Set the GPU limit using this field (This field appears only after GPU Type option is selected).

    • Sync git repo at project creation: Put a checkmark in the given checkbox to avail sync git repo while creating a DS Lab project.

  • Click the Save option.

  • The confirmation message appears.

  • The newly created project gets saved, and it appears on the screen.

Container Status Message

The user gets all the updates regarding container status through color coded message display for a specific DSL Project. After creating a new project and opening it the user gets to see various status messages on the top right side of the page.

Steps to see the container message:

  • Open an active Data Science Project.

  • The user gets redirected to create or import Notebook.

  • The container status message gets displayed on the top right side of this screen.

  • The following status messages get displayed till the container gets created and comes into the running status.

    • Container status message when container is not available. An error message also appears to inform user that the Project container is not up and running.

    • Container status message when container is getting created, and it is initializing.

    • Container status message when container is running.

Please Note: The user can click on the branch icon to get the latest branch related configuration.

Check out the given walk-through on how the Container Status message appears for a specific Data Science Lab Project.

Please Note: Refer the page to get an overview of the Data Science Lab module in nutshell.

Data Science Lab Quick Start Flow
DS Lab Settings
Version Control
Custom Field Settings
Git Token
Git Username
Creating a DSL Repo Sync Project
Displaying various container Status while using a Data Science Lab Project
Accessing the Create Project option
Mandatory fields for Project creation
More fields to create a new Project
The newly created Project gets listed under the Projects list
Project Container is inactive
Project container initializing
Project container is active