Data Catalog
Accelerate data discovery and understanding to locate and access the data you need for your decision-making processes.
Check out the given walk-through on Data Catalog creation and modification steps using the Data Center module.
Please Note: The following steps show how a newly added Data Set appears in the Data Catalog list when Data Set is selected as the Asset Type. The user can search and list other Asset Types in the same way.
Data Catalog acts as a metadata management tool, providing a comprehensive inventory of data sources, datasets, databases, tables, columns, and other related information.
The primary aim of a Data Catalog is to facilitate data discovery and understanding. It helps data users, such as data analysts, data scientists, and business users, locate and access the data they need for their analysis or decision-making processes. A Data Catalog provides detailed descriptions of data assets, indicating their structure, data types, relationships, and other relevant attributes. It may also include information about data lineage and data creation.
Data Catalogs typically offer search and browsing functionalities, allowing users to explore available datasets based on keywords, data domains, or specific criteria. They provide a user-friendly interface that allows users to view metadata, preview data samples, and assess the suitability of a dataset for their needs.
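To make the idea of a metadata inventory with keyword search concrete, here is a minimal sketch. It is not the Data Center module's implementation or API; the CatalogEntry class, its fields, the search_catalog function, and the sample entries are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class CatalogEntry:
    """Hypothetical metadata record for one asset (not the product's schema)."""
    name: str
    asset_type: str  # e.g. "Data Set", "Table", "Pipeline"
    description: str = ""
    columns: dict[str, str] = field(default_factory=dict)  # column name -> data type
    tags: list[str] = field(default_factory=list)


def search_catalog(entries: list[CatalogEntry], keyword: str) -> list[CatalogEntry]:
    """Return entries whose name, description, or tags contain the keyword."""
    needle = keyword.strip().lower()
    if not needle:
        # An empty search returns the full list.
        return list(entries)
    return [
        e for e in entries
        if needle in e.name.lower()
        or needle in e.description.lower()
        or any(needle in t.lower() for t in e.tags)
    ]


# Illustrative sample data only.
catalog = [
    CatalogEntry("sales_orders", "Data Set", "Monthly order totals",
                 {"order_id": "int", "amount": "float"}, ["finance"]),
    CatalogEntry("customers", "Table", "Customer master data",
                 {"customer_id": "int", "name": "str"}, ["crm"]),
]

matches = search_catalog(catalog, "order")
print([e.name for e in matches])  # prints ['sales_orders']
```

The search is a case-insensitive substring match over the name, description, and tags. A real catalog would likely use an index, but the lookup idea is the same.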
Navigate to the homepage of the Data Center module.
Open the Data Connectors list.
Click on the Create icon.
Select the New Data Set option from the Create icon.
The Data Set form opens for the new Data Set creation.
Provide the necessary information to create the Data Set (Data Set name, Description, and Query for the Data Set).
Click the Validate option to check the connection.
After getting the preview, click the Save option to save the Data Set.
The Data Set gets saved in the Data Sets list.
Now, open the Data Catalog list.
Search for the recently created Data Set using the Data Set option inside the Asset Type filter.
You can also use the Data Set name in the Search Data Catalog space to find it.
The newly created Data Set appears in the list.
The Data Catalog option appears at the bottom of the menu panel on the left side of the page inside the Data Center module.
Navigate to the Data Center Homepage.
Click the Data Catalog option.
The Data Catalog page opens, displaying a list of system-generated Data Catalogs.
The user can filter the Data Catalog list based on three parameters.
Assets: The supported Assets are Attribute, Component, Dashboard, Connector, Data Set, Data as API, Database, Data Store, Event, Pipeline, and Table. The user can filter the list by selecting any of these Assets.
Data Connectors: Filter by connector name.
Status: Filter by the Status of the Data Catalog.
The Asset filter contains the following options:
Attribute: It will list the column names of the tables from the database.
Component: It will filter the catalog list by displaying the names of the pipeline components.
Dashboard: It will list the dashboards created for a specific connector.
Connector: It will list the connectors created and used for the scenario (MySQL, MSSQL, MongoDBBi, MongoDB, Oracle, PostgreSQL).
Data Set: It will filter the catalog list by the created data sets based on the variety of data connectors.
Data as API: It will filter the catalog list by the available Data APIs from the Data Center.
Database: It will filter the catalog list with the available databases under the Data Center.
Data Store: It will filter the catalog list displaying data stores under the Data Center.
Event: It will filter the list displaying only the Events.
Pipeline: It will filter the catalog list displaying only the names of the Pipelines.
Table: It will filter the catalog list by table names.
The Data Connector filter contains the following options:
MongoDB
MongoDBBi
MySQL
MSSQL
Oracle
PostgreSQL
The Status filter contains the following options:
Published
Non-Published
Please Note: The user can search for a specific Data Catalog by using the Search Data Catalog space.
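The three filters described above can be thought of as optional conditions that are combined with a logical AND. The sketch below illustrates that idea under stated assumptions: the option names are copied from the lists above, while the CatalogItem class, the filter_catalog function, and the sample items are hypothetical and do not come from the Data Center module.

```python
from dataclasses import dataclass
from typing import Optional

# Option names copied from the filter lists in this section.
ASSET_TYPES = {
    "Attribute", "Component", "Dashboard", "Connector", "Data Set",
    "Data as API", "Database", "Data Store", "Event", "Pipeline", "Table",
}
CONNECTORS = {"MongoDB", "MongoDBBi", "MySQL", "MSSQL", "Oracle", "PostgreSQL"}
STATUSES = {"Published", "Non-Published"}


@dataclass(frozen=True)
class CatalogItem:
    """Hypothetical catalog row used only for this illustration."""
    name: str
    asset_type: str
    connector: str
    status: str


def filter_catalog(
    items: list[CatalogItem],
    asset_type: Optional[str] = None,
    connector: Optional[str] = None,
    status: Optional[str] = None,
) -> list[CatalogItem]:
    """Keep items matching every filter that is set; None means 'no filter'."""
    checks = (
        ("asset type", asset_type, ASSET_TYPES),
        ("connector", connector, CONNECTORS),
        ("status", status, STATUSES),
    )
    for label, value, allowed in checks:
        if value is not None and value not in allowed:
            raise ValueError(f"Unsupported {label}: {value!r}")
    return [
        item for item in items
        if (asset_type is None or item.asset_type == asset_type)
        and (connector is None or item.connector == connector)
        and (status is None or item.status == status)
    ]


# Illustrative sample data only.
items = [
    CatalogItem("sales_orders", "Data Set", "MySQL", "Published"),
    CatalogItem("orders", "Table", "MySQL", "Non-Published"),
    CatalogItem("events_raw", "Data Set", "MongoDB", "Non-Published"),
]

result = filter_catalog(items, asset_type="Data Set", status="Non-Published")
print([item.name for item in result])  # prints ['events_raw']
```

Leaving a filter unset (None) mirrors leaving that filter unselected in the UI, so calling filter_catalog(items) with no filters returns the whole list.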
Navigate to the Data Catalog list.
Select a Data Catalog from the list.
Click the Update Info icon.
The Edit Details dialog box opens.
Provide the required details:
Status (appears only for a Data Catalog that is filtered using the Data Connector or Data Set option)
Description
Data Steward
Enter Tag
Click the Update option to save the updated information.
A confirmation message appears, and the selected Data Catalog details get updated.
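The Edit Details step above can be sketched as a small update function. This is an illustration only, not the product's code: CatalogRecord, update_info, and the default status value are hypothetical, and the rule that Status is editable only for Connector and Data Set entries is one interpretation of the Status condition listed above.

```python
from dataclasses import dataclass, replace
from typing import Optional, Sequence

# Assumption: Status can be edited only for these asset types
# (an interpretation of the Status condition in the steps above).
STATUS_EDITABLE_TYPES = {"Connector", "Data Set"}
STATUSES = {"Published", "Non-Published"}


@dataclass(frozen=True)
class CatalogRecord:
    """Hypothetical record holding the fields shown in the Edit Details dialog."""
    name: str
    asset_type: str
    status: str = "Non-Published"  # assumed default, for illustration
    description: str = ""
    data_steward: str = ""
    tags: tuple[str, ...] = ()


def update_info(
    record: CatalogRecord,
    *,
    description: Optional[str] = None,
    data_steward: Optional[str] = None,
    tags: Optional[Sequence[str]] = None,
    status: Optional[str] = None,
) -> CatalogRecord:
    """Return a copy of the record with the supplied fields changed."""
    changes: dict[str, object] = {}
    if status is not None:
        if record.asset_type not in STATUS_EDITABLE_TYPES:
            raise ValueError(f"Status is not editable for {record.asset_type!r}")
        if status not in STATUSES:
            raise ValueError(f"Unsupported status: {status!r}")
        changes["status"] = status
    if description is not None:
        changes["description"] = description
    if data_steward is not None:
        changes["data_steward"] = data_steward
    if tags is not None:
        changes["tags"] = tuple(tags)
    return replace(record, **changes)


record = CatalogRecord("sales_orders", "Data Set")
updated = update_info(
    record,
    description="Monthly order totals",
    data_steward="A. Analyst",
    tags=["finance"],
    status="Published",
)
print(updated.status, updated.tags)  # prints Published ('finance',)
```

Returning a new record instead of mutating the old one keeps the original values available, which is convenient if the update has to be confirmed before it is saved.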
Navigate to the Data Catalog list.
Filter the Data Catalog list with the Data Set option.
Click the Details icon for a Data Catalog to show the mode details of the Data Set.
The Data Catalog Details page appears.
Select a specific Data Catalog entry.
Click the Update Info icon for a mode (the given image shows it for dashboards).
The Edit Details window opens.
You can modify the following information:
Description
Enter a tag
Click the Update option to save the updated information.
A confirmation message appears and the selected Data Catalog details get updated.