I am trying to find a TensorFlow/Keras implementation of the original BERT model trained using MLM/NSP. The official Google and HuggingFace implementations are very complex and have a lot of added functionality, but I only want to implement BERT to learn how it works.
Any leads would be helpful.
As mentioned in the comment, you can try the following TensorFlow implementation of MLP-BERT. It is a simplified version and comparatively easy to follow.
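If it helps while reading through a simplified implementation, here is a rough sketch of the MLM masking step on its own, in plain Python with no framework. It is not taken from the linked repository. It follows the 15% / 80-10-10 rule described in the BERT paper, but it selects each token independently rather than picking a fixed number of positions per sequence, and the function name `mask_tokens`, the `IGNORE_LABEL` value, and the token ids in the usage example are all assumptions made for illustration.

```python
import random

# Assumed sentinel for positions that should not contribute to the MLM loss.
IGNORE_LABEL = -100


def mask_tokens(token_ids, vocab_size, mask_id, special_ids=(),
                mask_prob=0.15, seed=None):
    """Return (inputs, labels) for masked language modelling.

    Each non-special token is selected with probability mask_prob.
    A selected token is replaced by mask_id 80% of the time, by a
    random vocabulary id 10% of the time, and left unchanged 10% of
    the time. labels holds the original id at selected positions and
    IGNORE_LABEL everywhere else.
    """
    rng = random.Random(seed)
    inputs = list(token_ids)
    labels = [IGNORE_LABEL] * len(inputs)

    for i, tok in enumerate(token_ids):
        # Never mask special tokens such as [CLS] or [SEP].
        if tok in special_ids or rng.random() >= mask_prob:
            continue
        labels[i] = tok  # the model must predict the original token here
        roll = rng.random()
        if roll < 0.8:
            inputs[i] = mask_id
        elif roll < 0.9:
            # Simplification: the random id may happen to be a special id.
            inputs[i] = rng.randrange(vocab_size)
        # Otherwise keep the original token in the input.

    return inputs, labels


if __name__ == "__main__":
    # Illustrative ids only: 101 and 102 stand in for [CLS] and [SEP],
    # 103 for [MASK].
    ids = [101, 7592, 2088, 2003, 2307, 102]
    inputs, labels = mask_tokens(ids, vocab_size=30522, mask_id=103,
                                 special_ids=(101, 102), seed=0)
    print(inputs)
    print(labels)
```

The `labels` list is what you would feed to a cross-entropy loss that skips the ignored positions. NSP is a separate binary classification head on top of the first token's output and is not covered by this sketch.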