An audio mixing effect would be an MFT with two or more inputs and one output that mixes the inputs. There is no stock transform for this either, but the question's description suggests that you are looking for an even more specialized transform.
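To make the mixing step concrete, here is a minimal sketch of what the core of such a transform might do once it has one buffer from each input. It is plain C++ and not Media Foundation code: `MixInputs`, the per-input gains, and the assumption of equally sized float PCM buffers are all hypothetical choices for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper, not part of Media Foundation: sums several equally
// sized float PCM buffers into one buffer, applying a gain to each input.
std::vector<float> MixInputs(const std::vector<std::vector<float>>& inputs,
                             const std::vector<float>& gains)
{
    assert(!inputs.empty() && inputs.size() == gains.size());
    const std::size_t sampleCount = inputs.front().size();
    std::vector<float> output(sampleCount, 0.0f);

    for (std::size_t i = 0; i < inputs.size(); ++i) {
        assert(inputs[i].size() == sampleCount);  // assumption: same length
        for (std::size_t n = 0; n < sampleCount; ++n)
            output[n] += gains[i] * inputs[i][n];
    }

    // Clamp to the nominal float PCM range [-1, 1] so a loud sum does not
    // overflow when it is later converted to an integer format.
    for (float& s : output)
        s = std::min(1.0f, std::max(-1.0f, s));
    return output;
}
```

In a real MFT this loop would run inside the output processing call on the locked media buffers; changing a gain between two calls is all that switching the mix amounts to.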
If your intent is to have one multichannel input and switch between its channels (this does not make much sense to me, but the description is not clear either), the stock Audio Resampler MFT can be used: it lets you select downmixing modes and switch between them on the fly, because its conversion matrix is configurable.
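As an illustration of what a conversion matrix does, the sketch below applies one to interleaved float samples. It does not call the Audio Resampler; `ApplyChannelMatrix` and the matrix layout (one row per output channel, one coefficient per input channel) are assumptions made for this example, so check the resampler documentation for the layout it actually expects.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical illustration of a channel conversion matrix. The layout
// (matrix[outputChannel][inputChannel]) is an assumption of this sketch.
std::vector<float> ApplyChannelMatrix(
    const std::vector<float>& interleavedIn,
    std::size_t inChannels,
    const std::vector<std::vector<float>>& matrix)
{
    assert(inChannels > 0 && interleavedIn.size() % inChannels == 0);
    const std::size_t outChannels = matrix.size();
    const std::size_t frames = interleavedIn.size() / inChannels;
    std::vector<float> out(frames * outChannels, 0.0f);

    for (std::size_t f = 0; f < frames; ++f) {
        for (std::size_t o = 0; o < outChannels; ++o) {
            assert(matrix[o].size() == inChannels);
            float acc = 0.0f;
            // Each output channel is a weighted sum of the input channels.
            for (std::size_t i = 0; i < inChannels; ++i)
                acc += matrix[o][i] * interleavedIn[f * inChannels + i];
            out[f * outChannels + o] = acc;
        }
    }
    return out;
}
```

With a stereo input, the single-row matrix `{1, 0}` selects the left channel and `{0.5, 0.5}` produces an equal downmix; switching channels then means swapping the matrix between two buffers.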
Nothing special is required for this type of transform to be real-time capable. However, depending on your situation, such a transform alone might still not satisfy your needs: the mixing transform sits in the pipeline ahead of some buffering, so a change in mixing may only take effect once the currently buffered data has been played out. I believe this should not be a problem, but if your understanding of real-time is truly instant, you might run into this as well.
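To get a feel for how large that delay can be, a rough estimate is the amount of audio already queued downstream divided by the sample rate. The helper below is a hypothetical sketch of that arithmetic only; how many frames are actually buffered depends on your renderer and pipeline, and is not something this code can discover.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: estimates how long a mixing change stays inaudible
// when `bufferedFrames` frames are already queued after the transform.
double ChangeDelayMs(std::uint64_t bufferedFrames, std::uint32_t sampleRate)
{
    assert(sampleRate > 0);
    return 1000.0 * static_cast<double>(bufferedFrames) / sampleRate;
}
```

For example, 4800 frames queued at 48000 Hz means the change is heard about 100 ms after it is made.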