Im a newbie in NLP and i was wondering if it is a good idea to summarize a document that has already been classified into a certain topic through methods such as LDA by considering the Word Embedding retrieved from Word2Vec and the topic-word distribution that has already been generated, to come up with a sentence scoring algorithm. Does this sound like a good approach for creating a summary of a document?
I would like to suggest you this post.
Instead of using Skip-Thought Encoder on the Step 4, you could use pre-trained Word2Vec model from Google or Facebook (check FastText documentation to see how to parse second model or to choose another language).
In general, you will have next steps:
I hope it will help. Good luck! :)