python tensorflow nlp artificial-intelligence word2vec

Pretrained (Word2Vec) embedding in Neural Networks

If I have to use pretrained word vectors as embedding layer in Neural Networks (eg. say CNN), How do I deal with index 0?

Detail:

We usually start with creating a zero numpy 2D array. Later we fill in the indices of words from the vocabulary. The problem is, 0 is already the index of another word in our vocabulary (say, 'i' is index at 0). Hence, we are basically initializing the whole matrix filled with 'i' instead of empty words. So, how do we deal with padding all the sentences of equal length?

One easy pop-up in mind is we can use the another digit=numberOfWordsInVocab+1 to pad. But wouldn't that take more size? [Help me!]

Solution

One easy pop-up in mind is we can use the another digit=numberOfWordsInVocab+1 to pad. But wouldn't that take more size?

Nope! That's the same size.

a=np.full((5000,5000), 7)
a.nbytes
200000000

b=np.zeros((5000,5000))
b.nbytes
200000000

Edit: Typo

How to pick just one item from a generator?
Python: Get unbound class method
global frame vs. stack frame
How to generate a snapshot of a field in a time step with VTK and Python
How to read the first letter from the last line in a txt file in python
How to control scientific notation in matplotlib?
Streamlit multiselect, if I don't select anything, doesn't show data frame
How to extend a class in python?
Is there a standard location to store function cache files in Python?
C++ function (Vectors) wrapped with Cython being around 4 times slower than equivalent Cython function (NumPy Arrays MemoryViews), with large arrays
Error in anyjson setup command: use_2to3 is invalid
Send paid media aiogram 3.10
Is there a workaround for adding Microsoft Word footnotes dynamically in Python?
Training a Keras model to identify leap years
Overload a method based on init variables
How do I create a constant in Python?
What is gettext_lazy on django for?
Pydantic - parse a list of objects from YAML configuration file
How to print stdout excerpt in IPython
What is the difference between Spyder and Jupyter?
How do I create a multiline plot using seaborn?
How to read the request body using orjson library in FastAPI?
Does iPython have built-in support for viewing a variable in pager?
cropping the image by removing the white spaces
Verbose level with argparse and multiple -v options
How to return data in JSON format using FastAPI?
Rounding a rational number to the nearest integer, with half-up
Python inspector ignores property return hint when using TypeVar
How to highlight values per column in Polars
Create arbitrary multidimensional zeros array