Construction
Graph construction start with a Graph object.
from asyncgraphs import Graph
g = Graph()
This graph acts as the handle for your entire ETL operation.
Adding a source
An ETL graph needs nodes and the first nodes that are needed are source nodes. More information on the operations that can be used as a source can be found on the source page.
Adding sources to the ETL flow is done by linking it to the Graph object. This can be done in a couple of ways. The following example shows them.
from asyncgraphs import Graph
g = Graph()
g.link_to([1, 2, 3])
g >> [4, 5, 6]
g | [7, 8, 9]
In the example above there are 3 different source nodes in the graph, each emitting values from a fixed list.
Adding a transform (or sink)
All nodes that are not a source are considered a transform. Transform nodes take in data and optionally emit data.
The same methods and operators can be used to attach transformations to a source.
from asyncgraphs import Graph
g = Graph()
source_ref = g | [1, 2, 3]
source_ref | (lambda x: x + 1) | print