date: 2020-04-16
title: New Dedupe Transform
description: Shed duplicate logs
authors: binarylogic
pr_numbers: 1848
release: 0.9.0
hide_on_release_notes: false
badges:
  type: new feature
  domains: sources
  sources: vector

For certain use cases, log deduplication is a useful tool. It helps preserve data integrity, and it protects against upstream mistakes that accidentally duplicate logs. Such a mistake can easily double (or more!) your log volume. To protect against this, you can use our new dedupe transform.

Get Started

Simply add the transform to your pipeline:

[transforms.my_transform_id]
  # General
  type = "dedupe" # required
  inputs = ["my-source-id"] # required

  # Fields
  fields.match = ["timestamp", "host", "message"] # optional, default

{{< success >}} The fields.match option lets you control which fields are compared to determine if events are equal. {{< /success >}}
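To make the role of fields.match concrete, here is a minimal Python sketch of field-based deduplication. It is an illustration only, not Vector's implementation: the `match_key` and `dedupe` helpers are hypothetical names, events are modeled as plain dictionaries, and a missing field is assumed to compare as `None`. The sketch also remembers every key it has seen, which a real transform would presumably need to bound.

```python
from typing import Any, Dict, Iterable, Iterator, List, Tuple

# Mirrors the default fields.match value shown in the config above.
DEFAULT_MATCH_FIELDS = ["timestamp", "host", "message"]


def match_key(event: Dict[str, Any], fields: List[str]) -> Tuple[Tuple[str, str], ...]:
    """Build a hashable key from only the fields being compared.

    Assumption: a missing field is treated as None. repr() is used so that
    unhashable values (lists, dicts) can still take part in the key.
    """
    return tuple((name, repr(event.get(name))) for name in fields)


def dedupe(
    events: Iterable[Dict[str, Any]],
    fields: List[str] = DEFAULT_MATCH_FIELDS,
) -> Iterator[Dict[str, Any]]:
    """Yield each event whose match key has not been seen before."""
    seen = set()  # unbounded in this sketch
    for event in events:
        key = match_key(event, fields)
        if key in seen:
            continue  # same values in every match field: drop as a duplicate
        seen.add(key)
        yield event


if __name__ == "__main__":
    sample = [
        {"timestamp": "2020-04-16T00:00:00Z", "host": "a", "message": "hello", "pid": 1},
        {"timestamp": "2020-04-16T00:00:00Z", "host": "a", "message": "hello", "pid": 2},
        {"timestamp": "2020-04-16T00:00:01Z", "host": "a", "message": "hello", "pid": 3},
    ]
    for kept in dedupe(sample):
        print(kept["pid"])
```

In this sketch the second sample event is dropped under the default fields because `pid` is not one of the compared fields. Narrowing the list to `["message"]` would collapse all three events into one, so the choice of fields directly controls how aggressive deduplication is.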