Why does a very simple port of the official Keras MNIST example to TensorFlow 2.x result in a massive drop in accuracy?

Here is the MNIST example from the Keras documentation: https://keras.io/examples/mnist_cnn/

I put it into Google Colab, under TensorFlow 1.x, and it performs really well: https://colab.research.google.com/drive/15NW-lXhRUxqSCCygVxddXCo5ID7yF2iL

I made very simple changes to make it execute under TF 2.x: https://colab.research.google.com/drive/1ul-eFn1XRe9ta3cu5vHchaa4DxStRda_

It completely crushes performance! Accuracy drops like a rock!

What did I do wrong?

Solution

The difference is in the optimizers' default learning rates. tf.keras.optimizers.Adadelta defaults to a learning rate of 0.001, while keras.optimizers.Adadelta defaults to 1.0.
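To get a feel for how much that factor of 1000 matters, here is a toy sketch in plain Python (no TensorFlow). It assumes the learning rate simply scales the step computed by the Adadelta rule, and it minimizes f(x) = x**2 rather than training a network; the function name and the step count are made up for illustration.

```python
import math

def adadelta_minimize(lr, steps=200, rho=0.95, eps=1e-7, x0=1.0):
    """Run a scalar Adadelta loop on f(x) = x**2 and return the final x."""
    x = x0
    avg_sq_grad = 0.0   # running average of squared gradients
    avg_sq_delta = 0.0  # running average of squared (unscaled) updates
    for _ in range(steps):
        grad = 2.0 * x  # derivative of x**2
        avg_sq_grad = rho * avg_sq_grad + (1.0 - rho) * grad ** 2
        # Adadelta step: ratio of the two RMS terms times the gradient.
        delta = math.sqrt(avg_sq_delta + eps) / math.sqrt(avg_sq_grad + eps) * grad
        # Assumption: the learning rate only rescales the step.
        x -= lr * delta
        avg_sq_delta = rho * avg_sq_delta + (1.0 - rho) * delta ** 2
    return x

x_paper = adadelta_minimize(lr=1.0)    # standalone Keras default
x_small = adadelta_minimize(lr=0.001)  # tf.keras default
print("lr=1.0   ->", x_paper)
print("lr=0.001 ->", x_small)
```

With the smaller learning rate, x barely moves from its starting point in the same number of steps, which is consistent with the network appearing not to learn within the example's 12 epochs.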

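See the documentation for keras.optimizers and tf.keras.optimizers.Adadelta for more details. In particular, the TensorFlow page notes that a learning rate of 1.0 is needed to match the original Adadelta paper, so passing learning_rate=1.0 explicitly should bring the TF 2.x port back in line with the TF 1.x run.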
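A minimal sketch of that fix, assuming TensorFlow 2.x is installed. The tiny model here is only a stand-in so the snippet runs on its own; in the actual notebook you would keep the CNN from the Keras example and change only the optimizer argument.

```python
import tensorflow as tf

# Set the learning rate explicitly instead of relying on the default,
# which differs between standalone Keras and tf.keras.
optimizer = tf.keras.optimizers.Adadelta(learning_rate=1.0)

# Stand-in model (hypothetical); the real example builds a small CNN.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(28, 28, 1)),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(10, activation="softmax"),
])

model.compile(
    loss="categorical_crossentropy",
    optimizer=optimizer,
    metrics=["accuracy"],
)

# Confirm which learning rate the optimizer was configured with.
print(optimizer.get_config()["learning_rate"])
```

Spelling out the learning rate also protects the code against future changes to the defaults.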
