Version: 1.2

Basic Nodes

Nodes work with a data stream. They can produce, fetch, send, collect data or organize data flow. Each node has at least two parameters: Name and Description. Name has to be unique in a scenario. Description is a narrative of your choice.

Most nodes, with source and sink nodes being notable exceptions, have an input and at least one output flow.

Sinks and filters can be disabled by selecting the Disable checkbox.

Variable component

A Variable component is used to declare a new variable. In its simplest form, a variable declaration assigns the value of an expression to a new variable name. In this example, the event was read from a Kafka topic, so the #input variable stores its content, and its value is assigned to a newly declared myFirstVariable variable.

In the variable configuration form, Nussknacker infers the data type of the #input variable from the information already available to it.

In the next example, the #input variable is used to create an expression returning a boolean value. If the input Kafka topic contains JSON objects that have an operation field, the value of this field can be obtained in the following way:

#input.operation

Note that internally Nussknacker converts a JSON object into a SpEL map.

MapVariable

The specialized mapVariable component can be used to declare a map variable (an object in JSON).

The same can be achieved using a plain Variable component; just make sure to write a valid SpEL expression.

Filter

Filter passes records which satisfy the filtering condition. It can have one or two outputs.

filter graph

Records from the source which meet the filter's condition go to the true sink, and others go to the false sink.

filter graph single

Records from the source which meet the condition go to the blue sink, and others are filtered out.

filter window

The Expression field should contain the SpEL expression for the filtering conditions and should produce a boolean value.
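The routing described above can be modelled in a few lines of Python. This is only an illustrative sketch, not Nussknacker code: `run_filter`, the sample records and the lambda condition are hypothetical, and in a real scenario the condition is a SpEL expression evaluated by the engine.

```python
def run_filter(records, condition):
    """Route each record to the true or false output of a filter."""
    true_output, false_output = [], []
    for record in records:
        if condition(record):
            true_output.append(record)
        else:
            false_output.append(record)
    return true_output, false_output


# Hypothetical records; the lambda plays the role of the Expression field.
records = [
    {"operation": "buy", "amount": 10},
    {"operation": "sell", "amount": 5},
]
passed, rejected = run_filter(records, lambda r: r["operation"] == "buy")
print(passed)
print(rejected)
```

With a single output, the `rejected` list corresponds to the records that are filtered out.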

Split

Split node logically splits processing into two or more parallel branches. Each branch receives all records and processes them independently.

split graph

Every record from the source goes to sink 1 and sink 2. Split node doesn't have additional parameters.
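As a rough Python model of this behaviour (the function name `run_split` and the sample records are made up for illustration), every branch simply receives its own copy of the whole record sequence:

```python
def run_split(records, branch_count):
    """Give every branch its own copy of the full record sequence."""
    records = list(records)
    return [list(records) for _ in range(branch_count)]


# Two branches, as in the "sink 1" and "sink 2" example.
sink_1, sink_2 = run_split(["r1", "r2", "r3"], branch_count=2)
print(sink_1)
print(sink_2)
```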

Switch

Switch distributes incoming records among output branches in accordance with the filtering criteria configured in those branches.

switch graph

Each record from the source is tested against the conditions defined on the edges. If #color is blue, the record goes to the blue sink. If #color is green, the record goes to the green sink. For every other value, the record goes to the sink for others.

switch window

The Switch node takes two parameters: Expression and exprVal. Expression contains an expression which is evaluated for each record; the result is assigned to the variable configured in the exprVal entry field (#color in the example above).

switch_edge_condition

Each outgoing edge of the Switch node has a boolean expression attached to it; if the expression evaluates to true, the record is allowed to pass through this edge. The record goes to the first output with a matching condition. The order in which outgoing edges are matched is not guaranteed.

switch_edge_default

There can be at most one edge of type Default, and it gets all records that don't match any Condition edge.
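The following Python sketch models the routing rules above under stated assumptions: `run_switch` and the sink names are hypothetical, and the model checks conditions in dictionary order, whereas the real node does not guarantee any order. The example therefore uses mutually exclusive conditions.

```python
def run_switch(records, expression, conditions, default=None):
    """Send each record to the first edge whose condition matches.

    `expression` models the Expression parameter; its result plays the
    role of the exprVal variable and is passed to every edge condition.
    """
    outputs = {name: [] for name in conditions}
    if default is not None:
        outputs[default] = []
    for record in records:
        value = expression(record)
        for name, condition in conditions.items():
            if condition(value):
                outputs[name].append(record)
                break
        else:
            # No Condition edge matched: use the Default edge, if any.
            if default is not None:
                outputs[default].append(record)
    return outputs


records = [{"color": "blue"}, {"color": "green"}, {"color": "red"}]
routed = run_switch(
    records,
    expression=lambda r: r["color"],
    conditions={
        "blue sink": lambda color: color == "blue",
        "green sink": lambda color: color == "green",
    },
    default="sink for others",
)
print(routed)
```

Without a Default edge (`default=None` in this model), a record that matches no condition is dropped.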

ForEach

for_each

for-each transforms the stream so that subsequent nodes are executed once for every value in the list (possibly multiple times per incoming record). This node has two parameters:

Elements - list of values over which to loop. It can contain both fixed values and expressions evaluated during execution.
Output Variable Name - the name of the variable to which the current element's value will be assigned.

For example, when

Elements is {#input.value1, #input.value2}
Output Variable Name is outputVar

then the nodes that follow for-each will be executed twice, and the value of the current element can be referenced as #outputVar.
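In other words, for-each behaves like a flat-map over the stream. A minimal Python sketch of the example above, assuming the scenario context is modelled as a dictionary of variables (`run_for_each` and the sample values are hypothetical):

```python
def run_for_each(contexts, elements, output_var):
    """Emit one copy of the context per element, with the element bound
    to the output variable."""
    for context in contexts:
        for element in elements(context):
            yield {**context, output_var: element}


# One incoming record; Elements is {#input.value1, #input.value2}.
context = {"input": {"value1": "a", "value2": "b"}}
results = list(
    run_for_each(
        [context],
        elements=lambda ctx: [ctx["input"]["value1"], ctx["input"]["value2"]],
        output_var="outputVar",
    )
)
for result in results:
    print(result["outputVar"])
```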

Union

union_window

Union merges multiple branches into one branch. Events from the incoming branches are passed to the output branch without an attempt to combine or match them. The #input variable will no longer be available downstream of the union node; a new variable, defined in the union node, will be available instead.

Branch names visible in the node configuration form are derived from node names preceding the union node.

Example: union_example

Entry fields:

Output Variable Name - the name of the variable containing results of the merge (replacing previously defined variables, in particular #input).
Output Expression - there is one expression for each of the input branches. When there is an incoming event from a particular input branch, the expression defined for that branch is evaluated and passed to the output branch.

Please note that the #input variable used in the Output Expression field refers to the content of the respective incoming branch.
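A small Python model may make the per-branch Output Expression easier to picture. It is a sketch only: the branch names `orders` and `refunds`, the variable name `unionOutput` and the function `run_union` are invented for illustration.

```python
def run_union(events, output_expressions, output_var):
    """Pass events through one by one, applying the expression configured
    for the branch each event arrived on. No matching or combining."""
    for branch, input_value in events:
        expression = output_expressions[branch]
        yield {output_var: expression(input_value)}


# (branch name, content of #input on that branch), in arrival order.
events = [
    ("orders", {"orderId": 1, "total": 20}),
    ("refunds", {"refundId": 7, "amount": 5}),
    ("orders", {"orderId": 2, "total": 30}),
]
merged = list(
    run_union(
        events,
        output_expressions={
            "orders": lambda i: {"kind": "order", "value": i["total"]},
            "refunds": lambda i: {"kind": "refund", "value": i["amount"]},
        },
        output_var="unionOutput",
    )
)
for context in merged:
    print(context)
```

Each output context contains only the new variable, which mirrors the fact that #input is no longer available after the union.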

UnionMemo

(Streaming-Flink only)

union_memo_window

Like Union, the UnionMemo node merges branches into one branch. Events are emitted on every incoming event, and event time is inherited from the incoming event.

There are, however, important differences in the way UnionMemo works:

events from the incoming branches are matched together based on some key value
data that arrived from any of the incoming branches is memoized by the UnionMemo node for the duration defined in stateTimeout. If a new event arrives before stateTimeout elapses, the stateTimeout timer is reset

Example: union_memo_example

UnionMemo merges multiple branches into one stream. For each incoming branch two parameters are configured:

key - its value should be of type String; it defines how elements from branches will be matched together
value - the value of this expression will be put in the output variable under the same name as the branch id

The #input variable is no longer available downstream of the UnionMemo node; a new variable, whose name is defined by the "Output variable name" parameter, will be present instead:

{
  "key": `value of key expression for given event`,
  "branch1": `value of output expression if memoized, otherwise null`,
  "branch2": `value of output expression if memoized, otherwise null`,
  "branch3": `value of output expression if memoized, otherwise null`,
  ...
}
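The memoization rules can be approximated in Python as follows. This is a simplified model, not the Flink implementation: the class `UnionMemo`, its method names and the use of plain numbers as event times are assumptions, and expiry is checked lazily when the next event for a key arrives.

```python
class UnionMemo:
    """Keyed memo of the latest value per branch, with a resettable timeout."""

    def __init__(self, branches, state_timeout):
        self.branches = branches
        self.state_timeout = state_timeout
        self.state = {}  # key -> (expires_at, {branch: value})

    def on_event(self, branch, key, value, event_time):
        expires_at, memo = self.state.get(key, (None, {}))
        if expires_at is not None and event_time >= expires_at:
            memo = {}  # stateTimeout elapsed: memoized values are dropped
        memo = {**memo, branch: value}
        # Every event for the key resets the timeout.
        self.state[key] = (event_time + self.state_timeout, memo)
        # Branches with nothing memoized are reported as None (null).
        return {"key": key, **{b: memo.get(b) for b in self.branches}}


node = UnionMemo(branches=["branch1", "branch2"], state_timeout=10)
first = node.on_event("branch1", "k", 1, event_time=0)
second = node.on_event("branch2", "k", 2, event_time=5)
third = node.on_event("branch1", "k", 3, event_time=20)
print(first)
print(second)
print(third)
```

In this run the third event arrives after the timeout set by the second one, so the value memoized for `branch2` is no longer present.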

PreviousValue

(Streaming-Flink only)

previous_value_window

previousValue stores an arbitrary value for the given key. This element has two parameters:

groupBy - expression defining the key for which we compute the aggregate, e.g. #input.userId
value - stored value

For example, given a stream of events which contain users with their current location, when we set

groupBy is #input.userId
value is #input.location

then the value of the output variable is the previous location for the current user. If this is the first appearance of this user, the current location will be returned.
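The behaviour in this example can be sketched with a per-key dictionary. The class name `PreviousValue`, the method `on_event` and the sample data are hypothetical; the real node keeps this state in Flink.

```python
class PreviousValue:
    """Return the previously stored value for a key, then store the new one."""

    def __init__(self):
        self.state = {}  # key (groupBy) -> last stored value

    def on_event(self, key, value):
        # On the first appearance of a key, the current value is returned.
        previous = self.state.get(key, value)
        self.state[key] = value
        return previous


node = PreviousValue()
outputs = [
    node.on_event("user1", "Warsaw"),
    node.on_event("user1", "Krakow"),
    node.on_event("user2", "Berlin"),
    node.on_event("user1", "Gdansk"),
]
print(outputs)
```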

Delay

(Streaming-Flink only)

Holds each event in the node until the maximum event time ever seen by the delay node reaches event time + delay, that is, until event time + delay <= max (event time ever seen by the delay node).

The key parameter will be removed in a future release of Nussknacker; for the time being, configure it to #inputMeta.key.
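One way to picture the holding rule is the Python sketch below. It assumes that an event is released once the largest event time seen by the node is at least the event's own event time + delay; the class `Delay`, numeric event times and the omission of the key parameter are all simplifications made for illustration.

```python
import heapq


class Delay:
    """Hold events until the max event time seen reaches event time + delay."""

    def __init__(self, delay):
        self.delay = delay
        self.max_event_time = float("-inf")
        self.held = []  # min-heap of (release_time, sequence, event)
        self.sequence = 0  # tie-breaker that keeps arrival order

    def on_event(self, event, event_time):
        release_time = event_time + self.delay
        heapq.heappush(self.held, (release_time, self.sequence, event))
        self.sequence += 1
        self.max_event_time = max(self.max_event_time, event_time)
        released = []
        while self.held and self.held[0][0] <= self.max_event_time:
            released.append(heapq.heappop(self.held)[2])
        return released


node = Delay(delay=10)
steps = [
    node.on_event("a", event_time=0),
    node.on_event("b", event_time=5),
    node.on_event("c", event_time=10),
    node.on_event("d", event_time=30),
]
print(steps)
```

In this model an event is only released when a later event advances the maximum event time, so the last event stays held until more data arrives.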

DeadEnd

(Streaming-Flink only)

dead_end_window

dead-end is a special type of sink that sends your data into the void. It is handy when you want to end your scenario without specifying an exact data sink at the moment.

Periodic

(Streaming-Flink only)

periodic_window

This source sends a given number of events periodically. It is mainly used for testing.

This source has the following parameters:

period - specifies how often events will be sent
count - specifies the number of events that will be sent at every period
value - specifies data that event will hold
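How the three parameters interact can be illustrated with a small Python generator. This is a sketch under assumptions: `periodic_source` is a made-up name, `periods` is an extra argument added only so that the example terminates, and the first batch is assumed to be emitted at time 0.

```python
def periodic_source(period, count, value, periods):
    """Yield (emit_time, value) pairs: `count` events every `period`.

    `periods` limits the number of rounds so that the example terminates;
    it is not a parameter of the real source.
    """
    for tick in range(periods):
        for _ in range(count):
            yield (tick * period, value)


events = list(periodic_source(period=5, count=2, value="ping", periods=3))
print(events)
```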
