My question here is that given a document d1 on the web and a document d2
how do I tell that d1 and d2 are semantically related. Are there some API's that can do some amount of natural language processing that might give me a hint as to d1 is a probably connected to d2.
I need it badly and uregently.Please Help!!
You can use special microformats. See more at http://microformats.org/
Simple example:
<a href="http://creativecommons.org/licenses/by/2.0/" rel="license">cc by 2.0</a>
Rel-License is one of several microformats. By adding rel="license" to a hyperlink, a page indicates that the destination of that hyperlink is a license for the current page.