I have the following requirement:
I am able to write a function which consumes a PCollection does the necessary grouping, logic and filtering and emit events according to my business logic.
Questions:
Many thanks
PCollection
into the global window (using the regular Window.into()
) and dedupe using a GroupByKey
.PCollection
. In order to "do something for each event", all you need is a ParDo.of(whatever action you want)
applied to this collection. Triggers do something else: they control what happens when a new value V
arrives for a particular key K
in a GroupByKey<K, V>
: whether to drop the value, or buffer it, or to pass the buffered KV<K, Iterable<V>>
for downstream processing.