LINQS

STATISTICAL RELATIONAL LEARNING GROUP @ UMD



 

Query-Time Entity Resolution

ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - August 2006
Download the publication : kdd06.pdf [187Ko]  
The goal of entity resolution is to reconcile database references corresponding to the same real-world entities. Given the abundance of publicly available databases where entities are not resolved, we motivate the problem of quickly processing queries that require resolved entities from such `unclean' databases. We propose a two-stage collective resolution strategy for processing queries. We then show how it can be performed on-the-fly by adaptively extracting and resolving those database references that are the most helpful for resolving the query. We validate our approach on two large real-world publication databases where we show the usefulness of collective resolution and at the same time demonstrate the need for adaptive strategies for query processing. We then show how the same queries can be answered in real time using our adaptive approach while preserving the gains of collective resolution.

BibTex references

@InProceedings{bhattacharya:kdd06,
  author       = "Bhattacharya, Indrajit and Licamele, Louis and Getoor, Lise",
  title        = "Query-Time Entity Resolution",
  booktitle    = "ACM SIGKDD International Conference on Knowledge Discovery and Data Mining",
  month        = "August",
  year         = "2006",
}

Other publications in the database