Which format is preferable for a TFLite model: NCHW or NHWC?

On GPUs, NCHW is preferable because of its speed benefits, but which layout is preferable for mobile performance with a TFLite model? Converting from PyTorch to TFLite currently yields a working NCHW model, but is this layout optimal?

Solution

TensorFlow Lite's matrix multiplication library, which is used for 2D convolution and similar ops, prefers NHWC inputs.

The TFLite converter tries to automatically transform NCHW weights into the corresponding NHWC weights when those weights are constants, so that the model performs well on mobile. If the weights are not constants, the converter instead adds a transpose operator after the NCHW weights so that the NHWC-based 2D convolution algorithm can still be used on mobile.
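To make the layout difference concrete, here is a minimal NumPy sketch of the axis permutation that such a transpose performs. The helper names `nchw_to_nhwc` and `nhwc_to_nchw` are hypothetical and only for illustration; this is not the converter's own code, just the same reordering of axes applied to a dummy tensor.

```python
import numpy as np


def nchw_to_nhwc(x):
    """Move the channel axis last: (N, C, H, W) -> (N, H, W, C)."""
    return np.transpose(x, (0, 2, 3, 1))


def nhwc_to_nchw(x):
    """Move the channel axis back to position 1: (N, H, W, C) -> (N, C, H, W)."""
    return np.transpose(x, (0, 3, 1, 2))


# Dummy activation tensor: batch of 1, 3 channels, height 4, width 5 (NCHW).
batch = np.arange(1 * 3 * 4 * 5, dtype=np.float32).reshape(1, 3, 4, 5)

nhwc = nchw_to_nhwc(batch)
print(nhwc.shape)

# The same element is addressed as [n, c, h, w] in NCHW and [n, h, w, c] in NHWC.
assert nhwc[0, 2, 1, 0] == batch[0, 0, 2, 1]
```

When feeding a converted model, it helps to check the input shape the interpreter reports and permute the input accordingly, rather than assuming either layout.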
