I am looking for rule-based methods or any other methods to identify all mentions in a text. I have found several libraries that give coreferences but no exact options for only mentions. What I want is something like below:
Input text:
[This painter]'s indulgence of visual fantasy, and appreciation of different historic architectural styles can be seen in his 1840 Architect's Dream. After a series of paintings on The Last of the Mohicans, [he] made a three year trip to Europe in 1829, but [he] is better known for a trip four years earlier in which [he] journeyed up the Hudson River to the Catskill Mountains. FTP, name [this painter of The Oxbow] and The Voyage of Life series.
*The square brackets highlight mentions.
How do I find mentions? Also, how is it different from coreferences? It would be really helpful if someone could post links to the concerned papers.
I think you can get what you want from the standard dcoref
annotator. Look at the annotation set by this annotator, CorefChainAnnotation
. This is a map from document entities to "coref chains."
Each CorefChain
can provide you with a list of mentions for the relevant entity in textual order.