Search code examples
pythonfilemodelword2vecgensim

How to load a pre-trained Word2vec MODEL File and reuse it?


I want to use a pre-trained word2vec model, but I don't know how to load it in python.

This file is a MODEL file (703 MB). It can be downloaded here:
http://devmount.github.io/GermanWordEmbeddings/


Solution

  • just for loading

    import gensim
    
    # Load pre-trained Word2Vec model.
    model = gensim.models.Word2Vec.load("modelName.model")
    

    now you can train the model as usual. also, if you want to be able to save it and retrain it multiple times, here's what you should do

    model.train(//insert proper parameters here//)
    """
    If you don't plan to train the model any further, calling
    init_sims will make the model much more memory-efficient
    If `replace` is set, forget the original vectors and only keep the normalized
    ones = saves lots of memory!
    replace=True if you want to reuse the model
    """
    model.init_sims(replace=True)
    
    # save the model for later use
    # for loading, call Word2Vec.load()
    
    model.save("modelName.model")