Back to Projects
2024Product Design & Engineering

Kestra Governance Assets

A comprehensive data governance feature enabling enterprises to track lineage and metadata of resources created and used by workflows. Complete product leadership from concept to client delivery.

Product DesignDSL DesignEnterprise SoftwareData GovernanceProduct Management
Kestra Governance Assets feature overview

Overview

Governance Assets represents one of the most significant enterprise features added to Kestra, addressing a critical need for data governance in modern data orchestration platforms. As enterprises scale their data operations, understanding what resources workflows create, modify, and consume becomes essential for compliance, auditing, and operational visibility.

This project required end-to-end product leadership: from initial concept and user research, through design and DSL specification, to engineering coordination, QA, and client presentations. I owned every aspect of the product development lifecycle, ensuring the feature met both technical requirements and enterprise customer needs.

The Challenge

Enterprise customers managing complex data pipelines needed visibility into:

  • Data Lineage: Understanding how data flows through workflows and what downstream resources depend on upstream outputs
  • Resource Tracking: Knowing which workflows create, modify, or consume specific data assets, tables, files, or services
  • Metadata Management: Attaching rich metadata to resources for documentation, compliance, and discovery
  • Impact Analysis: Identifying which workflows would be affected by changes to upstream data sources

The solution needed to integrate seamlessly with Kestra's declarative workflow DSL while providing powerful governance capabilities that enterprise customers expect from modern data platforms.

Product Leadership & Design

As the product owner, I led this feature from initial concept through delivery:

Concept & Research

Conducted user interviews with enterprise customers to understand governance pain points, analyzed competitive solutions, and defined the product vision and requirements.

DSL Design

Designed the declarative syntax for asset declaration, ensuring it felt natural within Kestra's workflow language while remaining powerful and flexible.

UI/UX Design

Created the interface for visualizing asset lineage, browsing metadata, and understanding resource relationships across workflows.

Specifications

Wrote detailed technical specifications covering API design, data models, storage requirements, and integration points with existing Kestra infrastructure.

DSL Design

One of the most critical aspects was designing a DSL that felt natural within Kestra workflows while providing the expressiveness needed for complex governance scenarios:

id: data_pipeline_assets
namespace: kestra.company.data

tasks:
  - id: create_staging_layer_asset
    type: io.kestra.plugin.jdbc.duckdb.Query
    sql: |
      CREATE TABLE IF NOT EXISTS trips AS
      select VendorID, passenger_count, trip_distance from sample_data.nyc.taxi limit 10;
    assets:
      inputs:
        - id: sample_data.nyc.taxi
      outputs:
          - id: trips
            namespace: "{{flow.namespace}}"
            type: io.kestra.plugin.ee.assets.Table
            metadata:
              model_layer: staging

  - id: for_each
    type: io.kestra.plugin.core.flow.ForEach
    values:
      - passenger_count
      - trip_distance
    tasks:
      - id: create_mart_layer_asset
        type: io.kestra.plugin.jdbc.duckdb.Query
        sql: SELECT AVG({{taskrun.value}}) AS avg_{{taskrun.value}} FROM trips;
        assets:
          inputs:
              - id: trips
          outputs:
              - id: avg_{{taskrun.value}}
                type: io.kestra.plugin.ee.assets.Table
                namespace: "{{flow.namespace}}"
                metadata:
                  model_layer: mart
pluginDefaults:
  - type: io.kestra.plugin.jdbc.duckdb
    values:
      url: "jdbc:duckdb:md:my_db?motherduck_token={{ secret('MOTHERDUCK_TOKEN') }}"
      fetchType: STORE

This example demonstrates a complete data pipeline with asset tracking across multiple layers: staging tables created from source data, and mart layer aggregations built from staging assets. The DSL supports dynamic asset IDs (using template variables), metadata attachment for documentation (like model_layer), and clear lineage tracking through inputs and outputs. The design prioritizes clarity and discoverability while maintaining the declarative philosophy of Kestra workflows.

Engineering Coordination

Working closely with the engineering team, I coordinated:

Architecture Planning

Designed the data model for asset storage, lineage graph representation, and metadata schema. Ensured scalability for enterprise-scale deployments with thousands of workflows and assets.

API Design

Specified REST APIs for asset CRUD operations, lineage queries, and metadata management, ensuring consistency with Kestra's existing API patterns.

Integration Points

Defined how assets integrate with workflow execution, task outputs, and the broader Kestra platform ecosystem.

Quality Assurance & Testing

I led comprehensive QA efforts to ensure the feature met enterprise-grade quality standards:

  • Functional Testing: Validated all asset operations, lineage tracking, and metadata management across various scenarios
  • Performance Testing: Ensured lineage queries and asset lookups perform efficiently at scale
  • Edge Cases: Tested complex scenarios including circular dependencies, orphaned assets, and concurrent modifications
  • User Acceptance: Coordinated beta testing with select enterprise customers to gather feedback before general availability

Client Presentations & Adoption

I presented the Governance Assets feature to enterprise customers, demonstrating:

Value Proposition

Showed how the feature addresses compliance requirements, improves operational visibility, and enables better data governance practices.

Live Demonstrations

Walked through real-world use cases, showing asset declaration, lineage visualization, and metadata management in action.

The feature received strong positive feedback from enterprise customers, with several adopting it immediately upon release. The presentations helped refine the feature based on real-world requirements and use cases.

Organization & Planning

Managing a feature of this scope required careful planning and coordination:

  • Roadmap Planning: Broken down the feature into manageable phases, prioritizing core functionality before advanced features
  • Sprint Coordination: Worked with engineering teams to plan sprints, manage dependencies, and ensure timely delivery
  • Stakeholder Management: Regular updates to leadership and sales teams on progress, timelines, and customer feedback
  • Documentation: Created comprehensive documentation, examples, and migration guides for users and customers

Impact & Results

Governance Assets became a key differentiator for Kestra in enterprise sales conversations, addressing a critical gap in the data orchestration market. The feature enables:

Compliance & Auditing

Enterprises can now track data lineage for regulatory compliance, understand data provenance, and maintain audit trails of resource usage across workflows.

Operational Excellence

Teams can quickly identify which workflows depend on specific data sources, enabling faster incident response and impact analysis for changes.

Data Discovery

Rich metadata and lineage visualization help data teams discover available resources, understand relationships, and make informed decisions about data usage.

Key Learnings

Leading this project end-to-end provided valuable insights into product development at scale:

  • DSL Design Matters: The declarative syntax needed to feel intuitive while remaining powerful—this required multiple iterations and user feedback cycles
  • Enterprise Requirements: Understanding the specific governance needs of enterprise customers was crucial—generic solutions wouldn't have sufficed
  • Cross-Functional Leadership: Successfully coordinating design, engineering, QA, and customer success required clear communication and shared vision
  • User-Centric Design: Presenting to customers early and often helped validate assumptions and refine the feature before full release

Explore the feature

Read the complete documentation and learn how to use Governance Assets in your workflows.

View Documentation