Edge Delta Architecture

Explore Edge Delta’s architecture for scalable telemetry pipelines, featuring Node, Gateway, and Coordinator Pipelines with agentless options and support for open source standards.

14 minute read

How the Pieces Fit Together

Edge Delta combines telemetry pipelines with AI teammates that investigate, correlate, and respond to operational signals. Pipelines are the data hygiene layer: they collect, parse, enrich, mask, and route telemetry before it reaches the platform or downstream tools. AI teammates then operate on that clean, structured data to detect anomalies, investigate incidents, and surface findings.

The data flow follows a continuous loop:

Pipelines collect logs, metrics, traces, and events from your infrastructure. They run at the edge (per-node agents), centrally (gateway aggregators), or as cloud-hosted services. Processors within each pipeline handle parsing, enrichment, masking, filtering, and routing before data leaves your environment.
Platform indexes the processed telemetry for interactive exploration, dashboarding, and alerting. Monitors detect anomalies and threshold breaches, routing alerts to humans or AI teammates.
AI Team receives alerts and events through connectors and monitors, then investigates autonomously. Specialized teammates correlate signals across data sources, build timelines, and surface findings in shared channels where humans review and approve next steps.
Connectors close the loop by integrating with incident management, source control, ticketing, and communication tools. Event connectors push external signals into Edge Delta; MCP connectors let AI teammates query and act on third-party systems.

The quality of AI Team investigations depends directly on the quality of the data pipelines produce. Well-configured pipelines ensure that teammates have access to clean, enriched, and relevant telemetry, while masking and filtering rules prevent sensitive data from reaching AI models or unauthorized destinations.

Pipeline Architecture

Edge Delta provides a modular telemetry processing system made up of pipelines and agents that work across Kubernetes and non-Kubernetes environments, including Linux, Docker, Amazon ECS, macOS, Windows, and more. Its architecture is designed to optimize observability and security data workflows with flexibility, scalability, and operational clarity.

Core Components

Agents and Integrations
Edge Delta deploys lightweight agents within a wide variety of environments such as Kubernetes, Linux, Docker, ECS, macOS, Windows, and OpenShift. It also supports SELinux-enforced clusters. These agents collect telemetry data from a variety of sources including infrastructure, applications, and cloud-native services. Optionally, Cloud Pipeline integrations allow for ingestion from agentless, serverless, or third party agents without the need to deploy Edge Delta agents in every environment.
Processing Pipelines
A pipeline in Edge Delta is a collection of agents that share a common configuration. You configure the pipeline, and it governs how telemetry data is parsed, filtered, enriched, and routed by all associated agents. Processing occurs within agents and includes metric extraction, anomaly detection, and pattern recognition — all in near real time. This ensures that only relevant and optimized data is forwarded.
Note: Deploy only one pipeline per host or cluster. Installing multiple pipelines on the same host causes duplicate data collection, increased resource consumption, and potential conflicts. If you need to process different data sources differently, configure multiple source nodes within a single pipeline rather than deploying separate pipelines.
Destinations
Data is routed to one or more destinations for storage, alerting, or further analysis. These include observability platforms, SIEM tools, object stores like Amazon S3, and Edge Delta’s own Observability Platform.

Pipeline Types

Node Pipeline

A Node Pipeline is deployed per cluster and runs agents on each node in a Kubernetes environment. These agents collect and process telemetry data locally, standardize it on open formats like OpenTelemetry and OCSF, extract insights, compress data, and generate alerts in real time. The data is then sent to the configured outputs such as third-party tools or to Edge Delta.

Node pipelines run as DaemonSets in Kubernetes, deploying one agent per node. They excel at collecting node-local data:

Container logs from /var/log
Host-level metrics
Kubernetes events
eBPF-based traces and service maps

Edge Delta supports multiple types of agents:

Processing Agent: Executes the pipeline logic.
Compactor Agent: Compresses and encodes telemetry data into efficient formats.
Rollup Agent: Aggregates metric data to reduce frequency and cardinality, improving performance and reducing storage needs.

Gateway Pipeline

A Gateway Pipeline is a shared pipeline that provides centralized aggregation and advanced processing in Kubernetes clusters. It collects telemetry data from Node Pipelines and external sources, performing service-level metric aggregation, log deduplication, and trace tail sampling. The Gateway sees the full picture of incoming telemetry data, making it ideal for holistic processing.

Gateway pipelines run as Deployments that scale horizontally based on load. They serve as centralized aggregation points for:

Receiving data from multiple node pipelines
Collecting telemetry from OTLP collectors and third-party agents
Performing cross-source operations like deduplication
Enabling trace tail sampling with full trace context
Calculating service-level metrics

It is typically deployed as a set of scalable agents and connected to Node Pipelines through dedicated input/output configurations.

When to use Gateway Pipelines for third-party agents: When ingesting telemetry from OTLP collectors or other third-party agents, send data directly to a gateway pipeline rather than routing through node pipelines. This avoids unnecessary hops and enables trace correlation. See Routing Third-Party Agents to Gateway Pipelines.

Coordinator Pipeline

The Coordinator Pipeline is a control component of the pipeline that manages backend communication and agent coordination within a cluster. Deployed as a singleton in each Kubernetes cluster, it manages communication between Edge Delta agents and the Observability Platform, reducing overhead in large environments. It handles tasks like discovery, agent grouping, and live capture coordination.

Coordinator pipelines:

Handle backend communication on behalf of node agents
Manage heartbeats and control messages
Perform leader election for processor agents
Coordinate live capture across multiple nodes

A coordinator is required for live tail to work in clusters with more than 20 nodes. For smaller clusters, a coordinator is optional but recommended for production deployments—it reduces backend overhead and improves live capture coverage across nodes.

Node Pipelines collect telemetry data at the source, Gateway Pipelines aggregate and process it at the cluster level, and Coordinator Pipelines manage environment-wide agent communication and control — together forming a robust foundation for efficient telemetry operations at scale.

With them, teams can intelligently collect, process, and route telemetry data to filter out noise and preserve high-value signals, reducing telemetry costs and enhancing downstream analysis.

Cloud Pipelines (Agentless Option)

Edge Delta also offers cloud-hosted pipelines that do not require teams to deploy agents into their infrastructure. These are ideal for serverless workloads (e.g., AWS Lambda), streaming platforms (e.g., Amazon Kinesis), IoT systems, or environments with tight security or resource constraints.

You can push telemetry data to Cloud Pipelines using HTTP, HTTPS, or gRPC, or configure them to pull data using supported source nodes like HTTP Pull. For third-party agent integration options, see Third Party Agents or Agentless.

This deployment model is fully managed by Edge Delta and requires no additional infrastructure provisioning on your side. For troubleshooting Cloud Pipelines, see Troubleshoot Cloud Pipelines.

Note: Gateway and Coordinator Pipelines are currently only supported within Kubernetes environments.

Data flow and egress

Edge Delta agents process telemetry at the source (on the node or within the cluster) before forwarding it to configured destinations. Processed telemetry leaves the customer environment and is sent to one or more of the following:

Third-party observability platforms, SIEM tools, and object stores (Splunk, Datadog, Amazon S3, etc.)
The Edge Delta Observability Platform (the SaaS backend)

Only the reduced and enriched output crosses network boundaries. Raw telemetry is processed locally, and pipeline processors control what leaves the environment:

Mask processor redacts sensitive fields and patterns before egress
Filter processor excludes entire log sources or event types
EDXEncrypt applies field-level encryption so downstream systems receive ciphertext
Sampling and routing rules control which data reaches which destination

These processors run within the agent before data is transmitted. You configure data protection once in the pipeline, and every destination (including AI teammates, third-party tools, and the Edge Delta platform) inherits those rules.

See Mask Processor, Filter Processor, EDXEncrypt and EDXDecrypt, and Strengthening Security and Compliance for configuration details.

Control plane and agent communication

Edge Delta agents initiate all connections outbound. Agents poll the backend for their pipeline configuration on a regular interval (pull-based). The Edge Delta backend cannot initiate connections into customer environments, cannot execute commands on customer hosts, and cannot read local file systems. This is why customers manage their own agent upgrades: Edge Delta cannot push updates or trigger actions on agents remotely.

Each agent authenticates to the backend using a pipeline-specific API key over TLS-encrypted connections. See Proxy Configuration for details on routing agent traffic through corporate proxies.

The following table summarizes what flows in each direction:

Direction	What is transmitted
Agent pulls from backend	Pipeline configuration, processor definitions, control signals (live capture coordination, leader election)
Agent sends to backend	Heartbeats, health metrics, processed telemetry (routed to Edge Delta destinations), anomaly patterns
Backend initiates to agent	Nothing. The backend cannot reach into customer environments.

All configuration changes made through the Edge Delta UI or API are logged with the identity of the user, the change, and a timestamp. See Configuration Audit Trail for details.

Choosing Pipeline Types

Use this decision matrix to select the right pipeline type:

Data Source	Recommended Pipeline	Reason
Container logs	Node Pipeline	Data is node-local
Host metrics	Node Pipeline	Data is node-local
Kubernetes events	Node Pipeline	Collected via K8s API
OTLP collectors	Gateway Pipeline	Pushed telemetry, needs aggregation
Third-party agents	Gateway Pipeline	Pushed telemetry, needs aggregation
Distributed traces	Gateway Pipeline	Requires full trace context for tail sampling
Service-level metrics	Gateway Pipeline	Requires cross-node visibility
Serverless functions (AWS Lambda)	Cloud Pipeline	Agentless, no infrastructure to deploy
Streaming platforms (Kinesis, Pub/Sub)	Cloud Pipeline	Cloud-hosted ingestion
IoT or constrained environments	Cloud Pipeline	Cannot deploy agents

DaemonSet vs Deployment: The node agent chart (edgedelta/edgedelta) supports a deployment.kind=Deployment option, but this is not recommended for network-based ingestion. If you need to receive data over the network (OTLP, syslog, HTTP, TCP, or UDP), deploy a Gateway Pipeline using the edgedelta/edgedelta-gateway chart. Gateway pipelines are purpose-built for aggregation, trace correlation, and horizontal scaling.

Cluster Name Assignment

The Kubernetes Cluster Name you specify when creating pipelines provides logical grouping in the UI. Pipelines sharing the same cluster name appear together in the Pipelines table when grouped by cluster.

When node and gateway pipelines share the same cluster name, the gateway automatically appears in the dropdown list when configuring the Gateway Connection destination, simplifying integration.

Deployment Patterns

Edge Delta supports deployment patterns ranging from single-node setups to global multi-region architectures. Each pattern combines pipeline types to match your infrastructure scale and operational requirements.

Pattern	Pipelines Used	Best For
Single Node	Node only	POC, dev sandboxes, k3s, lightweight VMs
Multi-Node Cluster	Node + Coordinator	Production clusters needing centralized control and live capture
Shared Platform	Node + Coordinator + Gateway	Multi-team clusters requiring deduplication and cluster-wide metrics
Multi-Cluster	Node per cluster + shared Gateway	Branch offices, retail sites, edge locations with central aggregation
Global Multi-Region	Node + Coordinator per region + regional/global Gateway	Geographic distribution with optional regional or global aggregation
Hybrid Cloud	Node at edge + cloud Gateway	On-prem clusters syncing to centralized cloud processing
High Compliance	Isolated Node + Coordinator + Gateway per environment	Strict isolation between dev, staging, production
Multi-Tenant (MSP)	Per-customer Node + Coordinator + centralized multi-tenant Gateway	Service providers managing multiple customer environments

For detailed diagrams and configuration guidance for each pattern, see Federation with Gateway Pipelines.

Pipeline Organization

Organize your pipelines for clarity and maintainability at scale. Edge Delta provides several mechanisms for managing configuration across multiple clusters and environments.

Organizational Strategies

Strategy	Approach	Best For
Packs	Reusable configuration modules shared across pipelines	Common processing logic across environments
Metadata Tags	Custom attributes added at deployment time (`edCustomTags`)	Identifying region, cluster, or stage in telemetry
Tag Overrides	Shared configuration with per-cluster naming (`edTagOverride`)	Same config across clusters with unique pipeline names

For detailed implementation of each strategy, see Multi-Tenant and Multi-Cluster Deployments.

Configuration Reuse with Packs

Packs allow you to define reusable configuration components:

Create environment-specific packs (DEV, UAT, PROD) with appropriate processing logic
Apply packs to multiple pipelines for consistent configuration
Modify a pack once to propagate changes across all pipelines using it
Combine with pipeline-specific nodes for regional customization

Pipeline Selection Wizard

Use this wizard to determine the right pipeline architecture for your deployment:

1 Approach

2 Details

3 Recommendation

How do you want to send data to Edge Delta?

Choose between installing an agent in your environment or sending data directly to Edge Delta's backend.

Your answers are private

Deployment Strategies

Operations and Monitoring

Monitoring and Visibility - Track pipeline health
Flow Control - Manage data volume dynamically
Troubleshooting and Diagnostics - Debug pipeline issues
Strengthening Security and Compliance - Data protection and audit controls
Proxy Configuration - Route agent traffic through corporate proxies

Configuration

Pipeline Settings - Agent configuration options
Effective Pipeline Design - Design patterns and best practices

Edge Delta Architecture

How the Pieces Fit Together

Pipeline Architecture

Core Components

Pipeline Types

Node Pipeline

Gateway Pipeline

Coordinator Pipeline

Cloud Pipelines (Agentless Option)

Data flow and egress

Control plane and agent communication

Choosing Pipeline Types

Cluster Name Assignment

Deployment Patterns

Pipeline Organization

Organizational Strategies

Configuration Reuse with Packs

Pipeline Selection Wizard

How do you want to send data to Edge Delta?

Where will you install the agent?

Do any of these apply to your deployment?

Describe your cluster

Do you need stateful processing or pull-based sources?

Do you need centralized aggregation?

Your Recommendation

Your Recommended Architecture

Your Recommendation

Cloud Pipeline

Your Recommendation

Ingestion Pipeline

Your Recommended Architecture

Deployment Strategies

Operations and Monitoring

Configuration

Edge Delta Architecture

How the Pieces Fit Together

Pipeline Architecture

Core Components

Pipeline Types

Node Pipeline

Gateway Pipeline

Coordinator Pipeline

Cloud Pipelines (Agentless Option)

Data flow and egress

Control plane and agent communication

Choosing Pipeline Types

Cluster Name Assignment

Deployment Patterns

Pipeline Organization

Organizational Strategies

Configuration Reuse with Packs

Pipeline Selection Wizard

How do you want to send data to Edge Delta?

Where will you install the agent?

Do any of these apply to your deployment?

Describe your cluster

Do you need stateful processing or pull-based sources?

Do you need centralized aggregation?

Your Recommendation

Your Recommended Architecture

Your Recommendation

Cloud Pipeline

Your Recommendation

Ingestion Pipeline

Your Recommended Architecture

Related Documentation

Deployment Strategies

Operations and Monitoring

Configuration

Edge Delta AI Assistant

Conversations

Hi! I'm your Edge Delta AI Assistant

Current Context