We have a complex flow with more than 50 processing nodes to transform huge amount of data and dump the data into a single Amazon S3 cluster. It is very difficult to understand if there is any issue in the data which is getting uploaded to S3 when the flow completes.
Is there any provision in Mosaic to look at the data which is getting processed at any of these 50 processing nodes?
This will save a lot of our time and efforts.
Yes, In mosaic there is a node called Peek which can be used to view a sample output of any processed node to which it is connected. The output can be viewed in the Data section in flow canvas, once the flow is executed successfully.
Following are the steps to use Peek node:
INPUT
and PROCESS
.PROCESS
section, enter the number of records to be displayed.OK
.Now to view the resultant data, once your flow is executed successfully follow below steps -
Run History
, click on your flow instance which will redirect you to the status view of your flow. You can recognize it by the green border that appears around the nodes.Peek
node. This will enable the Data
tab. Click on it to see your resultant data.