By following this tutorial
http://caffe.berkeleyvision.org/gathered/examples/siamese.html
I can build a Siamese network in Caffe that shares the weights of each layer between the two branches.
But I was wondering how Caffe actually updates those shared weights. To be specific, if we have
input1 -> conv1 (shared) -> output1
input2 -> conv1 (shared) -> output2 ===> contrastive loss (computed from output1 and output2),
then does Caffe simply sum the two gradients for conv1 coming from the first and the second branch?
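For reference, the weight sharing in the tutorial's prototxt is expressed by giving the parameter blobs of both branches the same names. Roughly like this (a condensed sketch; fillers, the remaining layers, and the ContrastiveLoss layer are omitted):

    layer {
      name: "conv1"
      type: "Convolution"
      bottom: "data"
      top: "conv1"
      # naming the parameter blobs is what makes them shared
      param { name: "conv1_w" lr_mult: 1 }
      param { name: "conv1_b" lr_mult: 2 }
      convolution_param { num_output: 20 kernel_size: 5 stride: 1 }
    }
    layer {
      name: "conv1_p"
      type: "Convolution"
      bottom: "data_p"
      top: "conv1_p"
      # same param names as above, so this branch reuses the same weights
      param { name: "conv1_w" lr_mult: 1 }
      param { name: "conv1_b" lr_mult: 2 }
      convolution_param { num_output: 20 kernel_size: 5 stride: 1 }
    }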
Thanks in advance for your response.
You are correct: the diffs (gradients) of shared weights (all parameters with the same name) are accumulated. Note that you cannot use different learning-rate multipliers (lr_mult) for shared weights. Other features such as momentum and weight decay should work as expected.
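To illustrate the lr_mult restriction with a hypothetical sketch (layer and param names are just examples): if the two branches gave the shared "conv1_w" blob different multipliers, as below, Caffe should reject the net when it is constructed, since shared parameters must use consistent lr_mult values.

    layer {
      name: "conv1"
      type: "Convolution"
      bottom: "data"
      top: "conv1"
      param { name: "conv1_w" lr_mult: 1 }  # owner of the shared blob
      convolution_param { num_output: 20 kernel_size: 5 }
    }
    layer {
      name: "conv1_p"
      type: "Convolution"
      bottom: "data_p"
      top: "conv1_p"
      param { name: "conv1_w" lr_mult: 2 }  # mismatch: not allowed for a shared parameter
      convolution_param { num_output: 20 kernel_size: 5 }
    }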