Data Pipeline
  • Data Pipeline
    • About Data Pipeline
    • Design Philosophy
    • Low Code Visual Authoring
    • Real-time and Batch Orchestration
    • Event based Process Orchestration
    • ML and Data Ops
    • Distributed Compute
    • Fault Tolerant and Auto-recovery
    • Extensibility via Custom Scripting
  • Getting Started
    • Homepage
      • List Pipelines
      • Creating a New Pipeline
        • Adding Components to Canvas
        • Connecting Components
          • Events [Kafka and Data Sync]
        • Memory and CPU Allocations
      • List Jobs
      • Create Job
        • Job Editor Page
        • Task Components
          • Readers
            • HDFS Reader
            • MongoDB Reader
            • DB Reader
            • S3 Reader
            • Azure Blob Reader
            • ES Reader
            • Sandbox Reader
          • Writers
            • HDFS Writer
            • Azure Writer
            • DB Writer
            • ES Writer
            • S3 Writer
            • Sandbox Writer
            • Mongodb Writer
            • Kafka Producer
          • Transformations
        • PySpark Job
        • Python Job
      • List Components
      • Delete Orphan Pods
      • Scheduler
      • Data Channel
      • Cluster Event
      • Trash
      • Settings
    • Pipeline Workflow Editor
      • Pipeline Toolbar
        • Pipeline Overview
        • Pipeline Testing
        • Search Component in Pipelines
        • Push Pipeline (to VCS/GIT)
        • Pull Pipeline
        • Full Screen
        • Log Panel
        • Event Panel
        • Activate/Deactivate Pipeline
        • Update Pipeline
        • Failure Analysis
        • Pipeline Monitoring
        • Delete Pipeline
      • Component Panel
      • Right-side Panel
    • Testing Suite
    • Activating Pipeline
    • Monitoring Pipeline
  • Components
    • Adding Components to Workflow
    • Component Architecture
    • Component Base Configuration
    • Resource Configuration
    • Intelligent Scaling
    • Connection Validation
    • Readers
      • S3 Reader
      • HDFS Reader
      • DB Reader
      • ES Reader
      • SFTP Stream Reader
      • SFTP Reader
      • Mongo DB Reader
        • MongoDB Reader Lite (PyMongo Reader)
        • MongoDB Reader
      • Azure Blob Reader
      • Azure Metadata Reader
      • ClickHouse Reader (Docker)
      • Sandbox Reader
      • Azure Blob Reader
    • Writers
      • S3 Writer
      • DB Writer
      • HDFS Writer
      • ES Writer
      • Video Writer
      • Azure Writer
      • ClickHouse Writer (Docker)
      • Sandbox Writer
      • MongoDB Writers
        • MongoDB Writer
        • MongoDB Writer Lite (PyMongo Writer)
    • Machine Learning
      • DSLab Runner
      • AutoML Runner
    • Consumers
      • SFTP Monitor
      • MQTT Consumer
      • Video Stream Consumer
      • Eventhub Subscriber
      • Twitter Scrapper
      • Mongo ChangeStream
      • Rabbit MQ Consumer
      • AWS SNS Monitor
      • Kafka Consumer
      • API Ingestion and Webhook Listener
    • Producers
      • WebSocket Producer
      • Eventhub Publisher
      • EventGrid Producer
      • RabbitMQ Producer
      • Kafka Producer
    • Transformations
      • SQL Component
      • Dateprep Script Runner
      • File Splitter
      • Rule Splitter
      • Stored Producer Runner
      • Flatten JSON
      • Email Component
      • Pandas Query Component
      • Enrichment Component
      • Mongo Aggregation
      • Data Loss Protection
      • Data Preparation (Docker)
      • Rest Api Component
      • Schema Validator
    • Scripting
      • Script Runner
      • Python Script
        • Keeping Different Versions of the Python Script in VCS
    • Scheduler
  • Custom Components
  • Advance Configuration & Monitoring
    • Configuration
      • Default Component Configuration
      • Logger
    • Data Channel
    • Cluster Events
    • System Component Status
  • Version Control
  • Use Cases
Powered by GitBook
On this page
  • Scheduler as a Component
  • Cron Generator
  • Scheduler List Page
  1. Components

Scheduler

A task can be scheduled to automatically execute at a given scheduler time.

PreviousKeeping Different Versions of the Python Script in VCSNextCustom Components

Last updated 1 year ago

All component configurations are classified broadly into the following sections:

  • ​​

  • Meta Information

  • ​

Scheduler as a Component

Check out the walk-through on Scheduler to get an idea on how to configure & use it in a workflow.

All component configurations are classified broadly into the following sections:

  • Drag and drop the Scheduler component to the Workflow Editor.

  • Connect it with a reader or Data Loading component (Input event).

  • Click on the Scheduler component to get the configuration details.

  • The Basic Information tab opens by default.

    • Select an Invocation type from the drop-down menu to confirm the running mode of the reader component. The supported invocation type is Real-Time.

    • Deployment Type: It displays the deployment type for the component. This field comes pre-selected.

    • Container Image Version: It displays the image version for the docker container. This field comes pre-selected.

    • Failover Event: Select a failover event from the drop-down menu.

    • Batch Size (min 10): Provide the maximum number of records to be processed in one execution cycle (Min limit for this field is 10).

  • Open the Meta Information tab and provide the required details for the same.

  • Scheduler Name: Provide a name for the Scheduler.

  • Payload: Provide the Payload value in the predefined format.

  • Scheduler Time: Time generated based on user selection in Cron generator.

  • Cron Generator: A cron generator generates cron expression which is a string that represents a set of times at which a task will be executed. The user can set any time using the given units of time from Minutes to Year.

Please Note:

  • The different values for various time units will be as given below:

    • Hours(0-23)

    • Minute(0-59)

    • Seconds(0-59)

    • Months in words(January-December)

    • Months in digit(1-12)

    • Week day(Monday-Sunday)

    • Date of months(1-31) The Date of the months for the February month is up to 29.

Cron Generator

The supported units of time for generating a Cron expression are as given below:

Scheduler List Page

The scheduled pipelines are listed together with the scheduler details. It displays the meta-information filled in the scheduler component for the respective pipeline. The page also contains information on how many times the pipeline has been triggered and when the next time the scheduled component will get deployed.

Check out the given walk-through on the Scheduler List Page.

  • All the scheduled workflows get displayed on the Scheduler List Page of the Data Pipeline.

  • The user can access the Scheduler icon from the left-side panel of the Pipeline landing page.

  • By clicking on a Pipeline the right-side panel describes how many times the Pipeline has been triggered.

  • The user gets redirected to the concerned pipeline by clicking the Pipeline name from the Scheduler List page.

​Basic Information​
Resource Configuration​
Configuring & Using the Scheduler Component in the Data Pipeline Workflow
Scheduler List page
Schduler List