How can I implement the Kullback-Leibler loss in TensorFlow?

I need to minimize a KL loss in TensorFlow.

I tried the function tf.contrib.distributions.kl(dist_a, dist_b, allow_nan=False, name=None), but could not get it to work.

I tried to implement it manually:

def kl_divergence(p, q):
    return p * tf.log(p / q) + (1 - p) * tf.log((1 - p) / (1 - q))

Is it correct?

Solution

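What you have there is the element-wise KL divergence between Bernoulli distributions with parameters p and q: it treats every entry as an independent binary probability and is never reduced to a scalar. For the KL divergence between two discrete distributions, it should be something like: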

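import tensorflow as tf
def kl_divergence(p, q):
    # Sum p_i * log(p_i / q_i) over all elements (tf.math.log was tf.log in TF 1.x).
    return tf.reduce_sum(p * tf.math.log(p / q))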

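This assumes that p and q are both 1-D float tensors of the same shape, and that the values of each sum to 1. Every entry should also be strictly positive, because zeros in either tensor produce inf or NaN in the log term.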

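It also runs if p and q are equally sized mini-batches of 1-D tensors that obey the above constraints, but tf.reduce_sum without an axis argument then returns the total over the whole batch. Pass axis=-1 to get one value per example.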
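
For reference, here is a minimal sketch of a batched version, assuming TensorFlow 2.x with eager execution. The eps parameter and its default of 1e-7 are my own assumptions, not part of any TensorFlow API. It is there to avoid log(0) and division by zero, at the cost of a small bias when an entry is exactly 0.

```python
import tensorflow as tf

def kl_divergence(p, q, eps=1e-7):
    """KL(p || q) along the last axis: one value per row of a mini-batch."""
    p = tf.convert_to_tensor(p, dtype=tf.float32)
    q = tf.convert_to_tensor(q, dtype=tf.float32)
    # Clip both inputs so neither log(0) nor a division by zero can occur.
    # eps is an assumed tolerance; tune it for your data.
    p = tf.clip_by_value(p, eps, 1.0)
    q = tf.clip_by_value(q, eps, 1.0)
    return tf.reduce_sum(p * tf.math.log(p / q), axis=-1)

# Two example distributions per batch; each row sums to 1.
p = tf.constant([[0.5, 0.5], [0.9, 0.1]])
q = tf.constant([[0.5, 0.5], [0.5, 0.5]])

per_example = kl_divergence(p, q)    # shape (2,)
loss = tf.reduce_mean(per_example)   # scalar suitable for an optimizer
```

Averaging over the batch is one common choice for turning the per-example values into a loss. Summing would also work and only changes the effective learning rate. TensorFlow 2.x also ships tf.keras.losses.KLDivergence, which may be worth checking before hand-rolling a loss.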
