I have a set of names, a fixed set of names which can extend up-to 50,000 names.
"John","Mike","Josh","Peter","Karl".
And I have a document, this document is dynamic. I need to find whether this document has the predefined name or not ?
Is defining everything as a entity in spacy nlp the right way to do it ?
I understand that your purpose is to look for known names (from a list) in a document.
It seems that Named Entity Recognition may not be useful to you.
Instead, a scalable approach to this problem can be Flashtext (https://github.com/vi3k6i5/flashtext).