

Project Team Sierra


Overall Objectives
Application Domains
Bibliography


Section: Partnerships and Cooperations

International Initiatives

INRIA Associate Teams

STATWEB
  • Title: Fast Statistical Analysis of Web Data via Sparse Learning

  • INRIA principal investigator: Francis Bach

  • International Partner:

    • Institution: University of California Berkeley (United States)

    • Laboratory: EECS and IEOR Departments

  • Duration: 2011 - 2013

  • See also: http://www.di.ens.fr/~fbach/statweb.html

  • The goal of the proposed research is to provide web-based tools for the analysis and visualization of large corpora of text documents, with a focus on databases of news articles. We intend to use advanced algorithms, drawing on recent progress in machine learning and statistics, to let a user quickly produce a short summary and an associated timeline showing how a given topic is described in the news media. We are also interested in unsupervised learning techniques that help a user understand the differences among several news sources, topics, or documents.