Algorithms
Get steps on how to do Algorithm Settings and Project level access to use Algorithms inside Notebook.
Using Algorithms in Data Science Lab Projects
The Algorithms functionality in Data Science Lab (DSLab) allows users to select pre-defined algorithms at the project level and apply them inside notebooks for model training, prediction, and evaluation. This feature provides a streamlined approach to implement machine learning workflows without writing boilerplate code.
Selecting Algorithms During Project Creation
Navigation path: Data Science Lab > Projects > Create > Algorithms
Open the Create Project page.
From the Algorithms drop-down menu, select the desired algorithm categories using the checkboxes.
Selected algorithm categories appear in the field, separated by commas.
The supported algorithm categories under Data Science Lab are Regression, Classification, Forecasting, Unsupervised Learning, and Natural Language Processing
Complete all other required project fields.
Click Save to create the project.
Prerequisite:
Activate the project to access notebook functionality.
Ensure Admin-level and Project-level settings are configured to enable algorithm access within notebooks.
Using Algorithms Inside a Notebook
Navigation path: Data Science Lab > Workspace > Left-side panel > Algorithms tab > Notebook Code cell > Algorithm sub-category
Open the Workspace tab inside the activated project.
Add a dataset and run it in a notebook.
Click the Algorithms icon from the left-side panel under Workspace.
Add a new code cell in the notebook.
A list of algorithms selected at the Project level is displayed.
Select the desired algorithm sub-category using a checkbox.
Pre-defined code for the selected algorithm type is automatically added to the code cell.
Define necessary variables in the code cell, such as:
Data columns
Target column
Run the code cell.
After execution, predictions or results based on the test dataset appear below the code cell.
Saving and Registering Algorithm-Based Models
After training, the model can be saved under the Models tab.
Algorithm-based models can be registered for use inside the Data Pipeline module.
Models can also be exported as API services.
Refer to Register a Model as an API Service for detailed instructions.
List of Available Algorithms
The Algorithms section provides pre-built solutions across five key categories:
Regression – Standard regression models for numerical prediction.
Unlock predictive insights with various regression techniques tailored for accurate data modeling. The supported Regression Algorithms within the Data Science Module are:
Linear Regression
SVR
KNN Regressor
Bagging Regressor
Decision Tree Regressor
Random Forest Regressor
Extremely Randomized Trees Regressor
AdaBoost Regressor
GBM Regressor
XGBoost Regressor
Classification – Supervised learning for categorical outcomes.
Leverage advanced classification algorithms to categorize data and enhance decision-making.
AdaBoost Classifier
Logistic Regression
Decision Tree Classifier
Random Forest Classifier
SVC
XGBoost Classifier
Bagging Classifier
GBM Classifier
Extremely Randomized Trees Classifier
Bayes Classifier
LGBM Classifier
Catboost Classifier
KNN Classifier
Forecasting – Time series prediction (requires admin enablement).
Accurately anticipate trends and future outcomes using cutting-edge forecasting algorithms.
ARIMA(X)
SARIMA (X)
Auto ARIMA
Exponential Smoothing
N-BEATS, Prophet
Random Forest
Unsupervised Learning – Clustering and dimensionality reduction (requires admin enablement).
These algorithms are mainly used to discover hidden patterns in data without pre-labeled outcomes.
Clustering
KMeans
KMeans++
Spectral Clustering
Agglomerative Clustering
DBSCAN
OPTICS
Anomaly Detection
Elliptic Envelope
Local Outlier Factor
One Class SVM
SGD One Class SVM
Isolation Forest
Natural Language Processing (NLP) – Text-based algorithms for language data (requires admin enablement).
Harness the power of NLP to derive meaningful insights from unstructured text data.
The user needs to apply all the listed NLP algorithms to perform text analysis and get meaningful output from it.
Sequence classification: Sentiment Analysis, Topic Labelling, Zero-shot Classification
Token Classification: Named Entity Recognition, Part of Speech Tagging
Summarization