I have a question about implementing pipeline using Dataflow TPL library.
My case is that I have a software that needs to process some tasks concurrently. Processing looks like this: first we process album at global level, and then we go inside album and process each picture individually. Let's say that application has got processing slots and they are configurable (for the sake of example assume slots = 2). This means that application can process either:
a) two albums on the same time
b) one album + one photo from different album
c) two photos on the same time for same album
d) two photos on the same time for different albums
Currently I implemented process like this:
var albumTransferBlock = new TransformBlock<Album, Album>(ProcessAlbum,
new ExecutionDataflowBlockOptions { MaxDegreeOfParallelism = 2 });
ActionBlock<Album> photoActionBlock = new ActionBlock<Album>(ProcessPhoto);
albumTransferBlock.LinkTo(photoActionBlock);
Album ProcessAlbum(Album a)
{
return a;
}
void ProcessPhoto(Album album)
{
foreach (var photo in album)
{
// do some processing
}
}
The problem I have is that when I process 1 album at the time, application will never use two slots for processing photos. It meets all requirement except c)
Can anyone help me to solve this issue using DataFlow TPL?
I think I can answer myself. What I did is:
1) I created an interface IProcessor with method Process() 2) wrapped AlbumProcessing and PhotoProcessing with interface IProcessor 3) Created one ActionBlock that takes IProcessor as Input and executes Process method.
4) At the end of processing Album I am adding processing of all photos to ActionBlock.
This fulfills my requiremens in 100%. Maybe someone has some other solution?