Semantic search engine with augmented categories

I'm building semantic search engine by encoding objects in the database (into 512-dim vectors), then encoding the query and finally using k-NN algorithm to find results. The result is good, but ..

I want to try augmenting my objects with additional categories from Wikipedia. So for each object I may get zero or more additional vectors (depending on how many words found in Wikipedia).

My idea is to use numpy.average on all encoded vectors (per object) and then use my regular k-NN search.

Is this an optimal approach? I feel averaging the vectors might not get accurate result.

Solution

numpy.average indeed works pretty well for this task. Also I'm satisfied with the approach overall. I hope this info will be handy for someone.

I want to install the "n" package and I get an error
n <version> command does not activate specified version
Change n install location
How to install a specific version of Node on Ubuntu/Debian?
Different node version for different projects, is there a way of telling node which version to use?
Install Node.js to install n to install Node.js?
How to select the latest node.js v6 version using n?
n-install: ERROR: GNU Make not found, which is required for operation
How to downgrade Node version with n
how switch to previous version in n (Node version manager)?
Automatically use the right version of Node for a package
internal/modules/cjs/loader.js:905 -> throw err;
Why doesn't "n" downgrade my node version on a Mac?
Node version manager
n failed to install/switch node in Linux?
vue command not found on Mac
How to uninstall n and all node versions installed by n
Angular CLI on HTTPS - can't install CI as root
n (node version manager): cannot create directory
npm module n emits errors
How to update npm permanently?
Cannot change nodejs version using n
upgrade nodejs to stable version
How should I install and use multiple versions of Node on the same production machine?