Overview of Nodes¶
All nodes in a VOR Stream process run independent of each other and are connected only through a queue. The queue passes protocol buffers for each observation.
There are currently 4 different kinds of nodes:
- Input / Output (IO) Nodes
- Read or write to disk
- Are end points or sync points in the stream
- Take data from CSV, Hadoop, database, or AWS S3
- SQL Nodes
- Process queue data using SQL syntax
- Use a SQL SELECT statement to perform joins, filters, and GROUP BY clauses
- Automatically generate a Golang node
- SAS Nodes
- Execute arbitrary SAS code
- Partition and multi-thread the execution on an input and synchronize the input and output queues
- Automatically generate a Golang node
- Computational Nodes
Node CLI¶
Nodes are created by using the *.strm file, as explained in the Process section.