The task of an Information Retrieval (IR) system is to retrieve documents from a large repository of data, which are relevant to a user query. This task is addressed by matching documents to queries on a term basis, and by ranking the documents accordingly. A core component of this process is the use of "term weights", which are weights representing how much a term contributes to the meaning of the text where it occurs.
Typical term weights are computed using lexical frequency statistics, i.e. word counts. This talk will present a different type of term weights, namely "graph based term weights". The computation of such weights involves modeling text as a graph, where vertices denote terms, and edges denote co-occurrence and grammatical relations between terms.
Modeling text as a graph is an interesting alternative to modeling text as a bag of words, and allows to compute term weights that contain statistical or linguistic relations as an integral part of their computation. Experimental evaluation confirms the usability of graph based term weights for IR systems.
Share this on
Graph Based Term Weights for Information Retrieval: Text as a Network of Words
Presenter(s) Dr Christina Lioma, Katholieke Universiteit Leuven
Seminar type Open Seminar Series
Location SEERC Seminar Room
Date and time 19/02/2009, 12:00 – 13:00
Website http://