When the outputs (prediction) are the probabilities coming from a Softmax function, and the training target is one-hot type, how do we compare those two different kinds of data to calculate the accuracy?
(the number of training data classified correctly) / (the number of the total training data) *100%
Usually, we assign the class label with highest probability in the output of the soft max function as the label.