All of the words, those I use to train the word2vec model, must be in model.vocab, aren't they?

I use the next code to train the model:

norms_train = [ [''], [ u'word', u'to', u'learn', ... ], ...]
model = word2vec.Word2Vec(norms_train, size=100, window=10)

With procedure to check the results:

i, j = 0, 0
for text in norms_train:
    j += len(text)
    for word in text:
        if word not in model.vocab:
            i += 1
print i, '/', j

13129 / 185379

Solution

All words that you have used to train the Word2Vec model should be in model.vocab. There may be a threshold on the minimum number of occurrences of a word, that have to be present for it to be included in the model vocabulary.

I suppose the argument min_count is set to 5 by default i.e. if a word has occurred less than 5 times in the training data, that word would not be present in the model.vocab.

Change value in Ini to empty using Python configparser
Script works with error line but won't run corectly when bad line is removed
FFMPEG not saving logs when converting to audio format
Numerically obtaining response of a damped driven oscillator gives peak at wrong frequency
What is the deal with the pony in Python community?
'Area' object is not callable
ValueError: Attribute Users.request is required
How to place class in its own file when it appears to be inheriting from an instance?
Python 3.6 with pony 0.7 gives error on commit to oracle db
Why pyqt tablewigdet is only displaying row number and no data?
How to check if a value exists in a dictionary?
OpenCV: draw a rectangle around a region
Python, Tkinter, trying to pull random numbers from a list based off user input for number and have results open in mew window
How to remove repeated elements in a vector, similar to 'set' in Python
Understanding descriptor protocol for 'wrapper-descriptor' itself
Passing a NumPy 3d array to a C function with a triple pointer as an argument
How do I create variable variables?
Turn a tf.data.Dataset to a jax.numpy iterator
Django 5.1 + Postgresql (debian server)
Passing in arguments to Dependency function makes it recognized as a Query Parameter
How to pass parameters to an endpoint using `add_route()` in FastAPI?
How to make Depends optional in FastAPI?
Streaming multiple videos through FastAPI to Web Browser causes HTTP requests to stall
FastAPI error when handling file together with form-data defined in a Pydantic model
Use serial port in Python without installing external packages
How can i get the output to print 10 circles instead of 9 in python turtle module
Unable to install TA-Lib on Ubuntu
Unable to install Python without sudo access
Size of an open file object
two if or more with one ELSE