<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1 plus MathML 2.0 plus SVG 1.1//EN" "http://www.w3.org/2002/04/xhtml-math-svg/xhtml-math-svg.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
  <head>
    <meta http-equiv="Content-Type" content="application/xhtml+xml; charset=utf-8"/>
    <title>Team:COMPSYS</title>
    <link rel="stylesheet" href="../static/css/raweb.css" type="text/css"/>
    <meta name="description" content="New Results - Stencil Accelerators"/>
    <meta name="dc.title" content="New Results - Stencil Accelerators"/>
    <meta name="dc.creator" content="Steven Derrien"/>
    <meta name="dc.creator" content="Sanjay Rajopadhye"/>
    <meta name="dc.creator" content="Tomofumi Yuki"/>
    <meta name="dc.subject" content=""/>
    <meta name="dc.publisher" content="INRIA"/>
    <meta name="dc.date" content="(SCHEME=ISO8601) 2016-01"/>
    <meta name="dc.type" content="Report"/>
    <meta name="dc.language" content="(SCHEME=ISO639-1) en"/>
    <meta name="projet" content="COMPSYS"/>
    <script type="text/javascript" src="https://raweb.inria.fr/rapportsactivite/RA2016/static/MathJax/MathJax.js?config=TeX-MML-AM_CHTML">
      <!--MathJax-->
    </script>
  </head>
  <body>
    <div class="tdmdiv">
      <div class="logo">
        <a href="http://www.inria.fr">
          <img style="align:bottom; border:none" src="../static/img/icons/logo_INRIA-coul.jpg" alt="Inria"/>
        </a>
      </div>
      <div class="TdmEntry">
        <div class="tdmentete">
          <a href="uid0.html">Team Compsys</a>
        </div>
        <span>
          <a href="uid1.html">Members</a>
        </span>
      </div>
      <div class="TdmEntry">Overall Objectives<ul><li><a href="./uid3.html">Introduction</a></li><li><a href="./uid5.html">General Presentation</a></li><li><a href="./uid10.html">Summary of Compsys I Achievements</a></li><li><a href="./uid14.html">Summary of Compsys II
Achievements</a></li><li><a href="./uid19.html">Summary of Compsys III
Achievements</a></li></ul></div>
      <div class="TdmEntry">Research Program<ul><li><a href="uid23.html&#10;&#9;&#9;  ">Architecture and Compilation Trends</a></li><li><a href="uid38.html&#10;&#9;&#9;  ">Code Analysis, Code Transformations, Code Optimizations</a></li><li><a href="uid41.html&#10;&#9;&#9;  ">Mathematical Tools</a></li></ul></div>
      <div class="TdmEntry">Application Domains<ul><li><a href="uid43.html&#10;&#9;&#9;  ">Compilers for Embedded Computing Systems</a></li><li><a href="uid44.html&#10;&#9;&#9;  ">Users of HPC Platforms and Scientific Computing</a></li></ul></div>
      <div class="TdmEntry">
        <a href="./uid48.html">Highlights of the Year</a>
      </div>
      <div class="TdmEntry">New Software and Platforms<ul><li><a href="uid67.html&#10;&#9;&#9;  ">Lattifold</a></li><li><a href="uid70.html&#10;&#9;&#9;  ">PolyOrdo</a></li><li><a href="uid72.html&#10;&#9;&#9;  ">OpenOrdo</a></li><li><a href="uid74.html&#10;&#9;&#9;  ">ppcg-paramtiling</a></li></ul></div>
      <div class="TdmEntry">New Results<ul><li><a href="uid78.html&#10;&#9;&#9;  ">Handling Polynomials for Program
Analysis and Transformation</a></li><li><a href="uid79.html&#10;&#9;&#9;  ">Static Analysis of OpenStream Programs</a></li><li><a href="uid80.html&#10;&#9;&#9;  ">Liveness Analysis in Explicitly-Parallel
Programs</a></li><li><a href="uid81.html&#10;&#9;&#9;  ">Extended Lattice-Based Memory Allocation</a></li><li class="tdmActPage"><a href="uid82.html&#10;&#9;&#9;  ">Stencil Accelerators</a></li><li><a href="uid83.html&#10;&#9;&#9;  ">Efficient Mapping of Irregular Memory Accesses on FPGA</a></li><li><a href="uid85.html&#10;&#9;&#9;  ">PolyApps</a></li></ul></div>
      <div class="TdmEntry">Bilateral Contracts and Grants with Industry<ul><li><a href="uid87.html&#10;&#9;&#9;  ">Bilateral Contracts with Industry</a></li><li><a href="uid88.html&#10;&#9;&#9;  ">Bilateral Grants with Industry</a></li></ul></div>
      <div class="TdmEntry">Partnerships and Cooperations<ul><li><a href="uid90.html&#10;&#9;&#9;  ">Regional Initiatives</a></li><li><a href="uid91.html&#10;&#9;&#9;  ">National Initiatives</a></li><li><a href="uid96.html&#10;&#9;&#9;  ">European Initiatives</a></li><li><a href="uid100.html&#10;&#9;&#9;  ">International Initiatives</a></li><li><a href="uid106.html&#10;&#9;&#9;  ">International Research Visitors</a></li></ul></div>
      <div class="TdmEntry">Dissemination<ul><li><a href="uid115.html&#10;&#9;&#9;  ">Promoting Scientific Activities</a></li><li><a href="uid128.html&#10;&#9;&#9;  ">Teaching - Supervision - Juries</a></li><li><a href="uid139.html&#10;&#9;&#9;  ">Popularization</a></li></ul></div>
      <div class="TdmEntry">
        <div>Bibliography</div>
      </div>
      <div class="TdmEntry">
        <ul>
          <li>
            <a id="tdmbibentyear" href="bibliography.html">Publications of the year</a>
          </li>
          <li>
            <a id="tdmbibentfoot" href="bibliography.html#References">References in notes</a>
          </li>
        </ul>
      </div>
    </div>
    <div id="main">
      <div class="mainentete">
        <div id="head_agauche">
          <small><a href="http://www.inria.fr">
	    
	    Inria
	  </a> | <a href="../index.html">
	    
	    Raweb 
	    2016</a> | <a href="http://www.inria.fr/en/teams/compsys">Presentation of the Team COMPSYS</a> | <a href="http://www.ens-lyon.fr/LIP/COMPSYS/index.html.en">COMPSYS Web Site
	  </a></small>
        </div>
        <div id="head_adroite">
          <table class="qrcode">
            <tr>
              <td>
                <a href="compsys.xml">
                  <img style="align:bottom; border:none" alt="XML" src="../static/img/icons/xml_motif.png"/>
                </a>
              </td>
              <td>
                <a href="compsys.pdf">
                  <img style="align:bottom; border:none" alt="PDF" src="IMG/qrcode-compsys-pdf.png"/>
                </a>
              </td>
              <td>
                <a href="../compsys/compsys.epub">
                  <img style="align:bottom; border:none" alt="e-pub" src="IMG/qrcode-compsys-epub.png"/>
                </a>
              </td>
            </tr>
            <tr>
              <td/>
              <td>PDF
</td>
              <td>e-Pub
</td>
            </tr>
          </table>
        </div>
      </div>
      <!--FIN du corps du module-->
      <br/>
      <div class="bottomNavigation">
        <div class="tail_aucentre">
          <a href="./uid81.html" accesskey="P"><img style="align:bottom; border:none" alt="previous" src="../static/img/icons/previous_motif.jpg"/> Previous | </a>
          <a href="./uid0.html" accesskey="U"><img style="align:bottom; border:none" alt="up" src="../static/img/icons/up_motif.jpg"/>  Home</a>
          <a href="./uid83.html" accesskey="N"> | Next <img style="align:bottom; border:none" alt="next" src="../static/img/icons/next_motif.jpg"/></a>
        </div>
        <br/>
      </div>
      <div id="textepage">
        <!--DEBUT2 du corps du module-->
        <h2>Section: 
      New Results</h2>
        <h3 class="titre3">Stencil Accelerators</h3>
        <p class="participants"><span class="part">Participants</span> :
	Steven Derrien [University of Rennes 1, Inria/CAIRN] , Sanjay Rajopadhye [Colorado State University] , Tomofumi Yuki.</p>
        <p>Stencil computations have been known to be an important class of programs for
scientific calculations. Recently, various architectures (mostly targeting
FPGAs) for stencils are being proposed as hardware accelerators with high
throughput and/or high energy efficiency. There are many different challenges
for such design: How to maximize compute-I/O ratio? How to partition the
problem so that the data fits on the on-chip memory? How to efficiently
pipeline? How to control the area usage? We seek to address these challenges by
combining techniques from compilers and high-level synthesis tools.</p>
        <p>One project in collaboration with the CAIRN team and Colorado State University
targets stencils with regular dependence patterns. Although many architectures
have been proposed for this type of stencils, most of them use a large number
of small processing elements (PE) to achieve high throughput. We are exploring
an alternative design that aims for a single, large, deeply-pipelined PE. The
hypothesis is that the pipelined parallelism is more area-efficient compared to
replicating small PEs. We have published a work-in-progress paper on this topic
at IMPACT'16 <a href="./bibliography.html#compsys-2016-bid18">[4]</a>.
</p>
      </div>
      <!--FIN du corps du module-->
      <br/>
      <div class="bottomNavigation">
        <div class="tail_aucentre">
          <a href="./uid81.html" accesskey="P"><img style="align:bottom; border:none" alt="previous" src="../static/img/icons/previous_motif.jpg"/> Previous | </a>
          <a href="./uid0.html" accesskey="U"><img style="align:bottom; border:none" alt="up" src="../static/img/icons/up_motif.jpg"/>  Home</a>
          <a href="./uid83.html" accesskey="N"> | Next <img style="align:bottom; border:none" alt="next" src="../static/img/icons/next_motif.jpg"/></a>
        </div>
        <br/>
      </div>
    </div>
  </body>
</html>
