CNTK deployment for real time predictions

TensorFlow has a separate project for its production usage, as noted here, called TensorFlow Serving.

How should I use CNTK in a production environment, and how should I handle it's deployment? Hopefully one could deploy trained models in to a server/cluster that serves the model with RPC or HTTP REST API.

If no such tool exists, what should be the first steps to develop it and a good architecture to roll out on my own?

Solution

We do support serving CNTK models in a production environment. You can find information about model evaluation/inference: https://github.com/Microsoft/CNTK/wiki/CNTK-Evaluation-Overview. For deployment, you need deploy the dlls specified here. A tutorial for deploying CNTK in Azure is available here.

Why does 1.0/100.0 == 0.1/10.0 give True?
How to organize the receive msg and user current input in C network such that it's clean
Using inclusive scan syntax in OpenMP in the C language
Unable to understand context in book "OOP in C" by Axel Schreiner
How to dereference member from struct in which the definition is hid from user?
variable-length array in struct with TI compiler in C (socket programming)
A faster way to test 32bpp DDBs for a valid Alpha channel
How can I compile the zephyr example-application as a freestanding application?
Why does my forked process sometimes overwrite data in a file?
Why in C can I initialize more values than the size of an array?
_Generic in C needs typecasting?
Declaring/defining an unused variable changes the output from an unrelated variable
How to use gdb to explore the stack/heap?
How can I read an input string of unknown length?
Confused by difference between expression inside if and expression outside if
Fast Arc Cos algorithm?
Discrepancy of `unsigned long` size between llvm and gcc in riscv32
GCC (C) - error: 'x' redeclared as different kind of symbol
Why does GCC’s static analyser falsely warn that a pointer to an allocated memory block itself stored in an allocated memory block may leak?
c language gcc compiler *.i file #3 "" 2 what is this?
How to make clang compile to llvm IR
Hashtable implementation in C
EFI variables or config file on the EFI partition for EFI application?
Pack high bit of every byte in ARM, for 64 bytes like AVX512 vpmovb2m?
Calling an external C function (in a shared lib) from Perl with Inline::C does not work
Can a C program detect if its own source code has been modified?
Is this declaration UB?
Eclipse C/C++ how to find variable belongs to which struct quickly
Is this truly best way to delete last element in C?
flint/arb.h: No such file or directory