Section: Partnerships and Cooperations
International Initiatives
INRIA Associate Teams
STATWEB
- Title: Fast Statistical Analysis of Web Data via Sparse Learning
- INRIA principal investigator: Francis Bach
- International Partner:
- Duration: 2011 - 2013
- See also: http://www.di.ens.fr/~fbach/statweb.html
- The goal of the proposed research is to provide web-based tools for the analysis and visualization of large corpora of text documents, with a focus on databases of news articles. We intend to use advanced algorithms, drawing on recent progress in machine learning and statistics, to let a user quickly produce a short summary and an associated timeline showing how a given topic is described in the news media. We are also interested in unsupervised learning techniques that help a user understand the differences between news sources, topics, or documents.