I have a neural network, trained on MNIST, with categorical cross entropy as its loss function.
For theoretical purposes, my output layer uses ReLU. Therefore a lot of its outputs are 0.
Now I stumbled across the following question:
Why don't I get a lot of errors, since there will certainly be a lot of zeros in my output, which the loss then takes the log of?
Here, for convenience, is the formula for categorical cross entropy:
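$$H(y, \hat{y}) = -\sum_{i} y_i \log(\hat{y}_i)$$

where $y$ is the one-hot encoded true label and $\hat{y}$ is the network's output vector.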
It's not documented at https://keras.io/losses/#categorical_crossentropy and it seems to depend on the backend, but I'm quite sure that they don't compute log(y) directly, but rather something like log(y + epsilon), where epsilon is a small constant that prevents log(0).
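A minimal NumPy sketch of what I assume the backend does (the exact safeguard may differ per backend, e.g. clipping the predictions instead of adding epsilon, and the constant value is just my assumption, analogous to `keras.backend.epsilon()`):

```python
import numpy as np

EPSILON = 1e-7  # assumed small constant, analogous to keras.backend.epsilon()

def categorical_crossentropy_safe(y_true, y_pred):
    """Categorical cross entropy with a guard against log(0).

    y_true: one-hot encoded labels, shape (batch, num_classes)
    y_pred: predicted values, shape (batch, num_classes)
    """
    # Clip predictions away from exact 0 (and 1) so the log stays finite;
    # adding EPSILON instead of clipping would serve the same purpose.
    y_pred = np.clip(y_pred, EPSILON, 1.0 - EPSILON)
    return -np.sum(y_true * np.log(y_pred), axis=-1)

# Example: a ReLU output containing exact zeros still gives a finite loss.
y_true = np.array([[0.0, 1.0, 0.0]])
y_pred = np.array([[0.0, 0.7, 0.3]])  # raw ReLU outputs with zeros
print(categorical_crossentropy_safe(y_true, y_pred))
```

With this guard, the zeros produced by the ReLU output layer never reach the log directly, which would explain why no errors show up.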