Data Catalog Overview
Get an in-depth overview of a selected data asset, offering users comprehensive metadata and contextual information necessary for informed data usage and analysis.
The View Details page in the Data Catalog offers a comprehensive view of an individual data asset, consolidating all essential metadata, lineage, usage insights, update history, and governance information in a single, user-friendly interface. This page is designed to help users understand the structure, context, and relevance of the asset, enabling more accurate and confident data-driven decisions.
By selecting a Data Catalog entry, the View Details option becomes available.
The View Details option provides access to a comprehensive overview of the selected data asset, including its metadata, structure, lineage, and usage information.
Accessing the Overview Page
The following illustration displays the steps to access the View Details option for a dataset asset.
Navigate to the Data Catalog page.
The default asset list appears.
Use the search bar or select an asset type from the displayed list.
Access the View Details option for the selected asset. Click the option to open the details of the selected asset. E.g., the KPI_Tile_Data data set is displayed in the following image.
Click the drop-down icon for the selected data asset.
The View Details option appears for the selected data asset. Click on it.
The catalog overview page opens.
The Overview Page
The Overview page provides comprehensive metadata and structural details about a selected data asset within the Data Catalog. This interface helps users understand the context, ownership, structure, and lineage of the dataset for better data governance and usability.
The Catalog overview page can be divided into two sections:
Overview Section
Tabbed Navigation
Overview Section
This section displays high-level metadata of the selected asset:
Asset: Name of the dataset asset (e.g., KPI_Tile_Data).
Owner: Username of the individual responsible for the asset (e.g., vivekkumar).
Tags: User-defined labels to classify and organize the dataset.
Project Status: Indicates the progress status of the associated project (e.g., Work in Progress).
Asset Type: Specifies the kind of asset, such as 'Dataset'.
Created Date: Timestamp when the asset was created (e.g.,
2025-04-25 12:53:35
).Description: A brief narrative or note describing the purpose or contents of the dataset.
Adding Tags
Access the Tags field.
Insert a tag.
Click the checkmark.
A success message appears.
The tag will be added to the asset.
Adding Description
Access the Description field.
Insert an appropriate description for the selected data asset.
Click the checkbox option.
A success message ensures that the provided description is added to the concerned data asset.
Tabbed Navigation
The central area provides tabbed access to multiple views:
Columns (active tab): Displays the schema structure of the dataset.
Sample Data: Shows sample records for preview.
Details: Provides extended metadata and configuration details.
Lineage Graph: Visualizes data lineage and transformations.
Data Profile: Presents statistical summaries and quality metrics.
History: Logs changes or access events related to the dataset.

Columns
This is the default tab to open, it displays all the column names with Column Type and allows the Data Catalog users to modify or insert Comments and tags to the displayed columns.

Adding Tags
Navigate to the Columns tab for the selected Data Catalog.
Choose a column from the displayed list.
In the Tags section, click the insert icon to add a tag.
A success message will appear, confirming that the tag has been successfully added.
Inserting Comment
Navigate to the Columns Tab.
Select a column from the list. In this case, the "indicator with formatter" column is selected.
Enter a description in the Comment field corresponding to the selected column.
The comment helps provide context or usage information for the column.
Click the checkmark (✓) icon next to the inserted text.
A success message appears to confirm that the latest information update is saved.
The checkmark icon turns into the edit option, indicating the scope of editing the inserted comment.

Notifications
The user gets notified under the Columns tab if a column contains Personal Information. The user can mask the sensitive information to prevent it from being displayed in the data preview.
Navigate to the Columns tab.
Select a column from the displayed list of columns.
The notification will be displayed under the Comment section of the selected column.
Sample Data
This view allows users to quickly explore sample entries from the dataset and validate the structure, content, and any formatting applied to key metrics. Pagination is enabled at the bottom right, allowing navigation across data pages.

Details
The Details tab helps users verify source connectivity, understand dataset origin, and review last modification details for governance and auditing purposes.
Description: The asset is briefly described as 'test'.
Connector Type: Indicates the data source connection type used, which is 'clickhouse'.
Query: Displays the associated query. The query used here is:
SELECT * FROM KPI_Tile_Data
Publish Status: 'True' publish status indicates that the data set is published and available for consumption.
Database Name: Displays the source database name ('qa_test').
Host Name: Displays the database host ('clilckhouse.clickhouse').
Last Updated: Displays the timestamp of the last update.
Last Updated By: Displays the username who performed the last update.
Lineage Graph
The Lineage Graph tab provides a visual representation of the end-to-end data flow, allowing users to trace the origin and journey of a dataset. It helps users understand how data is sourced, transformed, and consumed.
Lineage Flow Structure (left to right):
Database(qa_test):
Represents the source database where the raw data resides.
Identified as the starting point of the concerned data asset.
Data Connector (ClickHouse_priyanka_new):
Serves as the connector used to extract data from the
qa_test
database.Enables secure and structured data transfer to the dataset layer.
Dataset (KPI_Tile_Data):
Refers to the dataset that is created using the connected source data.
Includes 5 attributes, indicated beneath the dataset name.
Acts as a curated, ready-to-use asset for further consumption.
Widget (KPI Tile):
Denotes the final consumption layer or dashboard widget that visualizes or utilizes the dataset.
This shows how the data ultimately supports business insights and decisions.
Zoom Controls:
A Zoom in/ Zoom out icon is provided on the top-right corner of the Lineage Graph tab.
Users can zoom in, zoom out, or reset to 100% for better visibility of the graph.
A full-screen view option is also available to enhance navigation.

Data Profile
The Data Profile tab provides a statistical summary and visual representation of the dataset's structure and content, helping users understand data quality and composition.
Dataset Statistics (Left Panel)
Types: Identifies the data types used in the dataset.
Number of Variables: Displays the total number of columns or features in the dataset.
Number of Observations: Represents the total number of rows or records.
Record Size: Displays the average size of each data record.
Missing Cells: Indicates the total number of empty or null cells.
Memory Size: Indicates the total memory consumed by the dataset (likely in KB).
Missing Cells (%): Indicates the percentage of missing data within the dataset.
Data Distribution Chart (Right Panel)
Displays a Column chart to visualize the frequency distribution of values across selected columns.
Helps identify dominant categories, outliers, or irregularities.
In the displayed chart:
The highest count (~9) is seen in the category "Secondary Series".
Other categories show lower frequencies, indicating less common data values.

History
The History tab presents a chronological timeline of all modifications made to the selected data asset, enabling auditability and change tracking.
Each entry on the left-hand vertical line represents a discrete update event.
Timestamp: Displays the exact date and time when the update occurred (e.g.,
2025-04-25 12:53:35
).User: Identifies the individual who performed the update (e.g., vivekkumar, priyanka S).
History Tab for Data Pipeline and Job
The History tab for a Pipeline or Job provides a chronological timeline of all major activities related to the asset, helping users to ensure traceability and transparency.
The displayed information includes:
Last Updated: Timestamp when the asset was last modified.
Last Activated: Timestamp indicating when the asset became active.
Last Deactivated: Indicates when the asset was disabled or made inactive.

Last updated