Algorithms

Get steps on how to do Algorithm Settings and Project level access to use Algorithms inside Notebook.​

Pre-requisite:

  1. Configure the Algorithms using the Data Science Lab Settings from the Admin module to access them under the Data Science Lab Project creation.

  2. The user must select Algorithms while creating a Project to make them accessible for a Notebook within the Project.

The entire process to access the Algorithms option inside the DS Lab and actually create a model based on the Algorithm is a three-step process:

Please Note: The first two steps are prerequisites for the user to avail desired Algorithms inside their DS Lab Projects.

Admin Settings for Algorithms

  • Navigate to the Admin module.

  • Open the Data Science Settings option from the Configuration section of the Admin panel.

  • The Data Science Settings Information page opens.

  • Select the Algorithms using the drop-down option.

  • Click the Save option.

  • A confirmation message appears to inform about the Notebook details updates.

Please Note:

  • Regression & Classification - Default Algorithm types that Admin will enable for each Data Science Lab module user.

  • Forecasting, Unsupervised, Natural Language Processing - These algorithms will be disabled by default. As per the user's request, they will be enabled by the Admin.

Project Level Algorithm Selection

Once the Algorithm settings are configured in the Admin module, and the required Algorithms are selected while creating a Data Science Project, the user can access those Algorithms within a Notebook created under the same DSL Project.

Please Note: Once the Algorithm configuration is completed from the Admin and Project level the same set of Algorithms will be available for all the Notebooks which are part of that DSL Project.

  • Navigate to the Data Science Lab.

  • Click the Create option for Project.

  • The Create Project page appears.

  • Select the algorithms using the given checkboxes from the drop-down menu.

  • The selected Algorithms appear on the field separated by a comma.

  • Save the project.

Please Note: Provide all the required fields for the Project creation.

Using Algorithms inside a .ipynb File

Once the Algorithms are selected while creating a Project, those algorithms will be available for all the Notebooks created inside that project.

Using Algorithm script inside a DS Notebook
  • Navigate to the Workspace tab inside the same Project.

  • Add a dataset and run it.

  • Click the Algorithms tab.

  • Add a new code cell in the .ipynb file.

  • It will display the list of algorithms selected and added at the Project level. Select a sub-category of the Algorithm using a checkbox.

  • The pre-defined code for the selected algorithm type gets added to the code cell.

  • Define the necessary variables in the code cell. Define the Data and Target column in the auto-generated algorithm code.

  • Run the code cell.

  • After the code cell run is completed.

  • The test data predictions based on the train data appear below.

Please Note:

  • To see the output, you can run the cell containing the data frame details.

  • The model based on the Algorithm can be saved under the Models tab.

  • The algorithm-based models can be registered to be accessed inside the Data Pipeline module.

  • The model based on an Algorithm script can be registered as an API service. Refer to the Register a Model as an API Service section for more details.

List of Algorithms in Data Science Lab

The Algorithm section within the Workspace offers a wide array of powerful out-of-the-box solutions across five key categories:

Regression

Unlock predictive insights with various regression techniques tailored for accurate data modeling.

  • Linear Regression

  • SVR

  • KNN Regressor

  • Bagging Regressor

  • Decision Tree Regressor

  • Random Forest Regressor

  • Extremely Randomized Trees Regressor

  • AdaBoost Regressor

  • GBM Regressor

  • XGBoost Regressor

Classification

Leverage advanced classification algorithms to categorize data and enhance decision-making.

  • AdaBoost Classifier

  • Logistic Regression

  • Decision Tree Classifier

  • Random Forest Classifier

  • SVC

  • XGBoost Classifier

  • Bagging Classifier

  • GBM Classifier

  • Extremely Randomized Trees Classifier

  • Bayes Classifier

  • LGBM Classifier

  • Catboost Classifier

  • KNN Classifier

Forecasting

Accurately anticipate trends and future outcomes using cutting-edge forecasting algorithms.

  • ARIMA(X)

  • SARIMA (X)

  • Auto ARIMA

  • Exponential Smoothing

  • N-BEATS, Prophet

  • Random Forest

Unsupervised Learning

These algorithms are mainly used to discover hidden patterns in data without pre-labeled outcomes.

  • Clustering

    • KMeans

    • KMeans++

    • Spectral Clustering

    • Agglomerative Clustering

    • DBSCAN

    • OPTICS

  • Anomaly Detection

    • Elliptic Envelope

    • Local Outlier Factor

    • One Class SVM

    • SGD One Class SVM

    • Isolation Forest

Natural Language Processing (NLP)

Harness the power of NLP to derive meaningful insights from unstructured text data.

The user needs to apply all the listed NLP algorithms to perform text analysis and get meaningful output from it.

  • Sequence classification: Sentiment Analysis, Topic Labelling, Zero-shot Classification

  • Token Classification: Named Entity Recognition, Part of Speech Tagging

  • Summarization

Please Note: Access to the sub-categories Forecasting, Unsupervised Learning, and Natural Language Processing requires administrator enablement. By default, all users can view and access Regression and Classification algorithms.

Last updated