Search code examples
machine-learningdeep-learningnlp

Validate first name and last name using a deep learning or machine learning model


I have around 1 million data points of first name and last name. These names could be valid ones , for example : 'David Beckham' or invalid like - 'rockstar123' or 'new mutant'. Is there any deep learning / ML model which will allow me to differentiate amongst the 2 ?


Solution

  • Option 1: Use any pre-trained Named Entity Recognition (NER) model. Example

    Option 2: Train your custom NER model. Under the hood, different NER models use different embeddings (GloVe, Transformer-based, etc.). Once you have decided the embeddings for your task, any binary classification model can output the probability (e.g., naive-bayes, logistic regression, SVM, neural network).

    Option 3: Don't use ML. For this simple task I would lean towards regex/rule-matching.