Filter Before You Forward with Edge Delta

Reduce unnecessary data transfer and storage by filtering and extracting data before transmitting it.

Overview

Filtering data before transmitting it is fundamental to efficient, cost-effective data processing pipelines. In Edge Delta, the Regex Filter node applies a regex pattern to each log and drops the logs that do not match, so only relevant logs are forwarded for further processing.
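The idea of forwarding only matching logs can be illustrated outside Edge Delta with a minimal Python sketch. It is not Edge Delta configuration or its implementation; the pattern, the `filter_logs` function, and the sample lines are all hypothetical and chosen only to show the pass-through behavior.

```python
import re
from typing import Iterable, Iterator

# Hypothetical pattern: keep only lines that mention an error or exception.
KEEP_PATTERN = re.compile(r"(?i)\b(error|exception)\b")


def filter_logs(lines: Iterable[str], pattern: re.Pattern = KEEP_PATTERN) -> Iterator[str]:
    """Yield only the lines that match the pattern; all other lines are dropped."""
    for line in lines:
        if pattern.search(line):
            yield line


# Made-up sample data for illustration.
sample = [
    "2024-01-01T00:00:00Z INFO health check ok",
    "2024-01-01T00:00:01Z ERROR failed to connect to db",
    "2024-01-01T00:00:02Z DEBUG cache hit",
    "2024-01-01T00:00:03Z WARN unhandled exception in worker",
]

# Only the ERROR line and the line mentioning "exception" survive the filter.
forwarded = list(filter_logs(sample))
```

Writing the filter as a generator means lines are evaluated one at a time, so nothing that fails the match needs to be buffered before the forwarding step.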

Bandwidth and Storage

Filtering data before forwarding it ensures that only relevant data reaches the next stages of the pipeline or external systems. Transmitting less data lowers bandwidth consumption and the costs associated with data transfer. Storing only the necessary data avoids paying for storage that would be wasted on irrelevant or redundant data. A smaller data volume also lessens the processing burden on downstream systems, which can reduce latency and lead to quicker insights and more timely alerting.
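A rough way to reason about the saving is to compare the byte volume before and after a filter. The sketch below is an illustration under stated assumptions: the `volume_report` helper and the `is_relevant` predicate are hypothetical, and the size of a UTF-8 encoded line is only a proxy for real transfer and storage costs, which also depend on compression, batching, and protocol overhead.

```python
from typing import Callable, Iterable, Tuple


def volume_report(lines: Iterable[str], keep: Callable[[str], bool]) -> Tuple[int, int, int]:
    """Return (total_bytes, forwarded_bytes, saved_bytes) for a batch of log lines."""
    total = 0
    forwarded = 0
    for line in lines:
        size = len(line.encode("utf-8"))  # payload size of this line in bytes
        total += size
        if keep(line):
            forwarded += size
    return total, forwarded, total - forwarded


def is_relevant(line: str) -> bool:
    """Hypothetical relevance rule: keep only lines that start with ERROR."""
    return line.startswith("ERROR")


# Made-up batch for illustration.
batch = ["INFO ok", "ERROR boom", "INFO ok"]
total_bytes, forwarded_bytes, saved_bytes = volume_report(batch, is_relevant)
```

Running the same report against a representative sample of production logs gives a first estimate of how much a candidate filter would reduce the forwarded volume.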

Data Quality

Filtering helps to remove noise from the data stream, allowing meaningful signals to stand out. Noise can consist of routine logs that do not contribute valuable insight into the system’s state or behavior. Focusing on the signals that matter, such as logs related to errors, exceptions, or significant events, gives operational teams a higher-quality data stream that is easier to act on.

Scalability

An optimized data flow allows for better resource management and scalability. As system loads grow, pipelines that filter before forwarding can absorb increased data volumes without a proportional increase in cost or a degradation in performance. This consideration is vital for organizations looking to scale their operations without sacrificing reliability or exceeding budget constraints.

Compliance and Security

Filtering out sensitive information before it is transmitted may be part of your policy for compliance with privacy laws and regulations, such as the General Data Protection Regulation (GDPR) or the California Consumer Privacy Act (CCPA). This preemptive approach to data handling can protect against accidental exposure of personal or sensitive data, reducing the risk of security breaches and regulatory non-compliance.
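As one illustration of handling sensitive data before it leaves the source, the following Python sketch masks email addresses in a log line. It is an assumption-laden example, not Edge Delta's mechanism: the `EMAIL` pattern is deliberately simple and will not cover every valid address, and `mask_emails` and the placeholder text are hypothetical names.

```python
import re

# Simplified, hypothetical email pattern; real-world matching may need to be stricter or broader.
EMAIL = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def mask_emails(line: str, placeholder: str = "<redacted-email>") -> str:
    """Replace every email address in the line with a placeholder before forwarding."""
    return EMAIL.sub(placeholder, line)


# Made-up log line for illustration.
masked = mask_emails("user alice@example.com logged in")
```

Masking keeps the rest of the log line available for troubleshooting. Where policy requires it, the alternative is to drop any line that contains a match instead of rewriting it.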

Implementing the “Filter Before You Forward” practice requires thoughtful configuration of the data processing nodes, regular expressions, and other filtering criteria to ensure the right balance between data breadth and focus. It is an iterative process, where the filters may need to be adjusted as the application environment and business requirements evolve.

Regex Filter Node

Route Node

Route Logs in a Branched Pipeline