AI Agents
Get information on the intelligent component of the pipeline to automate data ingestion and related processes.
What is an AI Agent?
An AI Agent is a software program or system designed to operate autonomously or semi-autonomously by perceiving its environment, processing information, and taking actions to achieve defined goals. It can interact with humans, other systems, or the physical world.
AI Agents in Data Pipeline
Within a data pipeline, an AI agent operates as a sophisticated component that automates key processes such as data ingestion, processing, and transformation, as well as complex decision-making. By leveraging AI and machine learning methodologies, it augments the efficiency of the pipeline, enabling the extraction of critical insights, optimization of workflows, and the initiation of autonomous actions.
AI Agents act as intelligent orchestration layers—automating tasks such as ingestion, preprocessing, anomaly detection, and triggering downstream actions.
Role of an AI Agent in a Data Pipeline
Data Ingestion & Preprocessing
Collects data from multiple sources such as APIs, databases, and sensors.
Cleans, normalizes, and transforms raw data before storage or further processing.
Intelligent Data Processing
Detects anomalies, missing values, or inconsistencies.
Applies feature engineering or automated data labeling.
Automated Decision-Making
Executes ML model predictions for classification, clustering, or forecasting.
Triggers alerts or initiates downstream processes based on real-time insights.
Optimization & Orchestration
Dynamically adjusts pipeline parameters based on real-time conditions.
Improves data flow efficiency through intelligent resource allocation.
Self-Learning & Adaptation
Continuously improves accuracy and efficiency by learning from new data patterns.
Refines decisions through iterative feedback loops.
Agentic Execution Loop
AI Agents operate in a loop-based execution cycle:
Perception – Capture input data or environment signals.
Reasoning – Process information using AI/ML or LLM outputs.
Action – Call a function, trigger a tool, or run a workflow step.
Observation – Evaluate the results of the action.
Iteration – Repeat the loop until a satisfactory outcome is achieved.
This iterative cycle ensures adaptability and continuous improvement of pipeline workflows.
Please note: Agents are useful when you need an LLM to determine the workflow of an app. But they often overkill.
AI Agent Use Case
Check the illustration on how to use an AI Agent component in the pipeline workflow.
Please note: This use case demonstrates how to consume an exported Agent as a tool inside a pipeline workflow. Users can also configure an AI Agent component using the Data Pipeline interface.
Exporting an AI Agent as a Tool
The Export as Tool feature allows users to register an AI Agent script as a reusable tool within the Data Science Lab (DSL). This enables integration of agentic logic into other workflows and pipelines.
Prerequisites
Before exporting an AI Agent as a tool, ensure the following:
You have permission to access the DSL module.
At least one AI Agent is created under your user account.
Steps to Export an AI Agent as a Tool
Consuming an Exported AI Agent
Once an AI Agent script is successfully registered as a tool, it can be integrated into a Data Pipeline Workflow using the AI Agent component. This allows the exported tool to participate in automated data flows alongside other pipeline components.
Steps to Consume an Exported AI Agent
Access the Data Engineering Module
Navigate to the Data Engineering module.
Open the Pipelines list page.
Select a pipeline designed for agentic workflows.
Example: The
mail_agentpipeline containing an email listener and out-event components.
Please note: If no pipeline exists, create a new pipeline before proceeding.
More Configuration for Agent Component
Agent Configuration
Configure the following Meta Information for the dragged Agent Component:
Role: Define the role of the agent.
Description: Provide details clarifying the agent's task.
Select Agentic Project: Choose a registered agentic project from the dropdown list.
Task Details
Navigate to the Task Details section and configure:
Out Event: Select the out event from the dropdown.
Description: Enter clear task instructions for the agent to execute.
JSON Schema: Upload a JSON schema that defines the expected output format.
Click Save Component in Storage.
A success message confirms the component properties have been saved.
Monitor Execution
Component Status
Open the Component Status tab to check the execution status of the agentic tool.
Log Panel
Click the Logs icon in the toolbar to open the Log Panel.
Use the Refresh icon to update the logs.
A confirmation message indicates that the latest logs have been added.
Review the detailed runtime logs for the AI Agent in the Logs tab.



