Section: Partnerships and Cooperations
International Initiatives
Inria Associate Teams
STATWEB
- Title: Fast Statistical Analysis of Web Data via Sparse Learning
- International Partner (Institution - Laboratory - Researcher):
- See also: http://www.di.ens.fr/~fbach/statweb.html
- The goal of the proposed research is to provide web-based tools for the analysis and visualization of large corpora of text documents, with a focus on databases of news articles. We intend to use advanced algorithms, drawing on recent advances in machine learning and statistics, to allow a user to quickly produce a short summary and an associated timeline showing how a given topic is described in the news media. We are also interested in unsupervised learning techniques that allow a user to understand the differences between several news sources, topics, or documents.