Why does TensorFlow Conv2D have two weights matrices?

I have a tf.keras.layers.Conv2D constructed like so:

>>> conv2d_layer = tf.keras.layers.Conv2D(filters=128, kernel_size=(3, 3), strides=2)

For reference that layer is part of a network where the prior layer is prior_layer = Conv2D(filters=64, kernel_size=(3, 3), strides=2).

When I call conv2d_layer.get_weights(), it returns a list with two entries:

>>> [w.shape for w in conv2d_layer.get_weights()]
[(3, 3, 64, 128), (128,)]

Why are there two np.ndarrays in conv2d_layer.get_weights()? What are their respective meanings?

Solution

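The first array, with shape (3, 3, 64, 128), is the convolution kernel of your Conv2D layer: a 3 x 3 spatial window, 64 input channels (the number of filters in the prior layer), and 128 output filters. The second array, with shape (128,), is the bias vector for the same layer, holding one value per filter.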
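To make the roles of the two arrays concrete, here is a minimal NumPy sketch of how a single output position could be computed from them. The arrays are random stand-ins with the shapes from the question, not the layer's actual weights, and the names `patch` and `output_pixel` are purely illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

# Shapes taken from the question: 3x3 window, 64 input channels, 128 filters.
kernel = rng.standard_normal((3, 3, 64, 128))  # (height, width, in_channels, filters)
bias = rng.standard_normal(128)                # one scalar per filter

# A single 3x3 patch of the 64-channel input (hypothetical random data).
patch = rng.standard_normal((3, 3, 64))

# One output position: for each filter, sum patch * kernel over height,
# width and input channels, then add that filter's bias.
output_pixel = np.tensordot(patch, kernel, axes=([0, 1, 2], [0, 1, 2])) + bias

# Total number of trainable parameters held by the two arrays.
n_params = kernel.size + bias.size

print(output_pixel.shape)  # (128,)
print(n_params)            # 73856 = 3*3*64*128 + 128
```

The layer applies this same computation at every spatial position, which is why the kernel shape does not depend on the input height or width.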

The documentation for get_weights describes the same pattern using a Dense layer as its example:

"For example, a Dense layer returns a list of two values: the kernel matrix and the bias vector."

Conv2D follows the same convention, with a 4-D kernel in place of the 2-D kernel matrix.
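The same idea can be sketched for the Conv2D layer from the question. This is only an illustration: the 32 x 32 spatial size passed to build is an arbitrary assumption (only the 64 input channels matter for the weight shapes), and the names `clone` and `no_bias` are hypothetical.

```python
import numpy as np
import tensorflow as tf

# Same layer as in the question; build it for 64 input channels so that
# the weights exist (the 32x32 spatial size is an arbitrary assumption).
conv2d_layer = tf.keras.layers.Conv2D(filters=128, kernel_size=(3, 3), strides=2)
conv2d_layer.build((None, 32, 32, 64))

# get_weights() returns [kernel, bias] as NumPy arrays.
kernel, bias = conv2d_layer.get_weights()
shapes = [kernel.shape, bias.shape]

# Copy both arrays into a second, identically configured layer.
clone = tf.keras.layers.Conv2D(filters=128, kernel_size=(3, 3), strides=2)
clone.build((None, 32, 32, 64))
clone.set_weights([kernel, bias])
same_kernel = np.array_equal(clone.get_weights()[0], kernel)

# With use_bias=False no bias vector is created, so only the kernel remains.
no_bias = tf.keras.layers.Conv2D(filters=128, kernel_size=(3, 3), strides=2, use_bias=False)
no_bias.build((None, 32, 32, 64))
n_arrays = len(no_bias.get_weights())

print(shapes)
print(same_kernel, n_arrays)
```

If you only want the kernel, passing use_bias=False when constructing the layer is one way to get a single-entry list from get_weights().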
