A system and method of disambiguating entities in a computerized web search includes identifying a set of potential meanings for an entity; retrieving at least one web page having descriptions referencing the entity; establishing a base web page having a selected context for the entity; attributing dimensions of a vector space attributed to domains in the retrieved web page; and computing a probability of similarity between the referenced entity in the retrieved web page and the entity in the base web page. The method includes corresponding a similarity measure between the dimensions of the vector space attributed to domains in the retrieved web page and a likelihood of the retrieved web page referring to the entity in the base web page. The method further includes ranking web pages based on the computed probability of similarity.

 
Web www.patentalert.com

< Apparatus and method for interconnecting a processor to co-processors using shared memory

> Remote scoring and aggregating similarity search engine for use with relational databases

~ 00441