Section: Partnerships and Cooperations
Inria Associate Teams
Participants : Francis Bach [correspondant] , Ronny Luss.
See also: http://www.di.ens.fr/~fbach/statweb.html
The goal of the proposed research is to provide web-based tools for the analysis and visualization of large corpora of text documents, with a focus on databases of news articles. We intend to use advanced algorithms, drawing from recent progresses in machine learning and statistics, to allow a user to quickly produce a short summary and associated timeline showing how a certain topic is described in news media. We are also interested in unsupervised learning techniques that allow a user to understand the difference between several different news sources, topics or documents.