neural-network conv-neural-network mnist

What do the P letters mean in a neural network layer scheme?

The Wikipedia article about the MNIST database says that the lowest error rate is achieved by a "committee of 35 convolutional networks" with the scheme:

1-20-P-40-P-150-10

What does this scheme mean?

The numbers are probably neuron counts. But what does the 1 mean then?

What do the P letters mean?

Solution

In this particular scheme, 'P' means a pooling layer.

So the basic structure is the following:

1: one grayscale input image
20: 20 images (feature maps) after the first convolution layer, one per filter
P: pooling layer
40: 40 outputs from the next convolution layer
P: pooling layer
150: either 150 small convolution outputs or simply 150 fully-connected neurons
10: 10 fully-connected output neurons

That's why it is written 1-20-P-40-P-150-10. It is not the best notation, but it is still fairly clear if you are familiar with CNNs.
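To make the reading rule concrete, here is a minimal sketch in plain Python that decodes such a scheme string into layer roles. The function name `describe_scheme` and the role labels are hypothetical, made up for illustration. The sketch also assumes that numbers before the last 'P' are convolution layers and that any number between the last 'P' and the output is left ambiguous, as with the 150 above.

```python
def describe_scheme(scheme):
    """Decode a scheme such as '1-20-P-40-P-150-10' into (role, size) pairs.

    Assumed reading rule (not an official notation):
      - first number: input channels (1 = one grayscale image)
      - 'P': pooling layer
      - numbers before the last 'P': convolution layers (number of feature maps)
      - numbers after the last 'P', except the final one: conv or fully connected
      - final number: fully-connected output neurons
    """
    tokens = scheme.split("-")
    # Index of the last pooling layer, or -1 if the scheme has none.
    last_pool = max((i for i, t in enumerate(tokens) if t == "P"), default=-1)

    layers = []
    for i, tok in enumerate(tokens):
        if tok == "P":
            layers.append(("pool", None))
        elif i == 0:
            layers.append(("input", int(tok)))
        elif i == len(tokens) - 1:
            layers.append(("output", int(tok)))
        elif i < last_pool:
            layers.append(("conv", int(tok)))
        else:
            layers.append(("conv_or_fc", int(tok)))
    return layers


if __name__ == "__main__":
    for role, size in describe_scheme("1-20-P-40-P-150-10"):
        print(role, size)
```

The scheme does not encode kernel sizes, strides, or the pooling type, so those cannot be recovered from the string alone.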

You can read more about the internal structure of CNNs in the foundational paper by Yann LeCun et al., "Gradient-Based Learning Applied to Document Recognition".
