Data Science Lab
  • What is Data Science Lab?
  • Accessing the Data Science Lab Module
  • Homepage
  • List Projects
  • List Feature Stores
  • Create
    • Create Project
      • Container Status Message
    • Create Feature Store
  • Registered Models and APIs
  • Settings
  • Trash
  • Tabs for a DSL Project
    • Workspace
      • Workspace Folders
        • Repo Folder Attributes
          • Notebook Actions
            • Export
            • Register as Job
            • Notebook Version Control
            • Share
            • Delete
            • Information
        • Repo Folder Attributes for a Repo Sync Project
          • File Attributives
        • Utils Folder Attributes
          • Utility Actions
        • Files Attributes
      • Working with the Workspace tab
        • Create
        • Import
          • Importing Notebook
          • Pull from Git
        • Adding File and Folders
      • Linter
      • Git Console
    • Data
      • Adding Data
      • Data List Page
    • Model
      • Import Model
      • Explainer Generator
      • Export to GIT/ Model Migration
      • Model Explainer
      • Share a Model
      • Register a Model
      • Unregister a Model
      • Register a Model as an API Service
        • Register a Model as an API
        • Register an API Client
        • Pass Model Values in Postman
      • Delete Model
    • AutoML
      • Creating AutoML Experiment
      • AutoML List Page
        • View Explanation
          • Model Summary
          • Model Interpretation
            • Classification Model Explainer
            • Regression Model Explainer
            • Forecasting Model Explainer
          • Dataset Explainer
  • Data Science Notebook
    • Preview File
    • Save as Notebook
    • .ipynb File Cells
      • Using a Code Cell
      • Using a Markdown Cell
      • Using an Assist Cell
    • Resource Utilization Graph
    • Taskbar
    • Actions Icons
  • Model Creation using Data Science Notebook
  • Notebook Operations
    • Data
      • Copy Path Functionality
    • Secrets
    • Algorithms
    • Transforms
    • Artifacts
    • Variable Explorer
    • Writers
Powered by GitBook
On this page
  • Data Info
  • Variable Types
  • Warnings
  • Variables
  • Correlation
  • Missing Values
  • Sample
  • First rows
  • Last Rows
  1. Tabs for a DSL Project
  2. AutoML
  3. AutoML List Page
  4. View Explanation

Dataset Explainer

The Dataset Explainer tab provides a high-level preview of the dataset that has been used for the experiment. It redirects the user to the Data Profile page.

The Data Profile is displayed using various sections such as:

  • Data Set Info

  • Variable Types

  • Warnings

  • Variables

  • Correlations

  • Missing Values

  • Sample

Let us see each of them one by one.

Data Info

The Data Profile displayed under the Dataset Explainer section displays the following information for the Dataset.

  • Numbers of variables

  • Number of observations

  • Missing cells

  • Duplicate rows

  • Total size in memory

  • Average record size in memory

Variable Types

This section mentions variable types for the data set variables. The selected Data set contains the following variable types:

  • Numeric

  • Categorical

  • Boolean

  • Date

  • URL

  • Text (Unique)

  • Rejected

  • Unsupported

Warnings

This section informs user about the warnings for the selected dataset.

Variables

It lists all the variables from the selected Data Set with the following details:

  • Distinct count

  • Unique

  • Missing (in percentage)

  • Missing (in number)

  • Infinite (in percentage)

  • Infinite (in number)

  • Mean

  • Minimum

  • Maximum

  • Zeros (in percentage)

Correlation

It displays the variables in the correlation chart by using various popular methods.

Missing Values

This section provides information on the missing values through Count, Matrix, and Heatmap visualization.

  • Count: The count of missing values is explained through column chart.

  • Matrix

  • Heatmap

Sample

This section describes the first 10 and last 10 rows of the selected dataset as a sample.

First rows

Last Rows

PreviousForecasting Model ExplainerNextData Science Notebook

Last updated 1 year ago