CUTLASS 2.0 (#62)
CUTLASS 2.0 Substantially refactored for - Better performance, particularly for native Turing Tensor Cores - Robust and durable templates spanning the design space - Encapsulated functionality embodying modern C++11 programming techniques - Optimized containers and data types for efficient, generic, portable device code Updates to: - Quick start guide - Documentation - Utilities - CUTLASS Profiler Native Turing Tensor Cores - Efficient GEMM kernels targeting Turing Tensor Cores - Mixed-precision floating point, 8-bit integer, 4-bit integer, and binarized operands Coverage of existing CUTLASS functionality: - GEMM kernels targeting CUDA and Tensor Cores in NVIDIA GPUs - Volta Tensor Cores through native mma.sync and through WMMA API - Optimizations such as parallel reductions, threadblock rasterization, and intra-threadblock reductions - Batched GEMM operations - Complex-valued GEMMs Note: this commit and all that follow require a host compiler supporting C++11 or greater.
This commit is contained in:
@ -3,21 +3,23 @@
|
||||
<head>
|
||||
<meta http-equiv="Content-Type" content="text/xhtml;charset=UTF-8"/>
|
||||
<meta http-equiv="X-UA-Compatible" content="IE=9"/>
|
||||
<meta name="generator" content="Doxygen 1.8.14"/>
|
||||
<meta name="viewport" content="width=device-width, initial-scale=1"/>
|
||||
<title>Cutlass: Class Members - Functions</title>
|
||||
<meta name="generator" content="Doxygen 1.8.11"/>
|
||||
<title>CUTLASS: Class Members - Functions</title>
|
||||
<link href="tabs.css" rel="stylesheet" type="text/css"/>
|
||||
<script type="text/javascript" src="jquery.js"></script>
|
||||
<script type="text/javascript" src="dynsections.js"></script>
|
||||
<link href="search/search.css" rel="stylesheet" type="text/css"/>
|
||||
<script type="text/javascript" src="search/searchdata.js"></script>
|
||||
<script type="text/javascript" src="search/search.js"></script>
|
||||
<script type="text/javascript">
|
||||
$(document).ready(function() { init_search(); });
|
||||
</script>
|
||||
<script type="text/x-mathjax-config">
|
||||
MathJax.Hub.Config({
|
||||
extensions: ["tex2jax.js"],
|
||||
jax: ["input/TeX","output/HTML-CSS"],
|
||||
});
|
||||
</script><script type="text/javascript" async src="http://cdn.mathjax.org/mathjax/latest/MathJax.js"></script>
|
||||
</script><script type="text/javascript" src="http://cdn.mathjax.org/mathjax/latest/MathJax.js"></script>
|
||||
<link href="doxygen.css" rel="stylesheet" type="text/css" />
|
||||
</head>
|
||||
<body>
|
||||
@ -26,8 +28,9 @@
|
||||
<table cellspacing="0" cellpadding="0">
|
||||
<tbody>
|
||||
<tr style="height: 56px;">
|
||||
<td id="projectlogo"><img alt="Logo" src="cutlass-logo-small.png"/></td>
|
||||
<td id="projectalign" style="padding-left: 0.5em;">
|
||||
<div id="projectname">Cutlass
|
||||
<div id="projectname">CUTLASS
|
||||
</div>
|
||||
<div id="projectbrief">CUDA Templates for Linear Algebra Subroutines and Solvers</div>
|
||||
</td>
|
||||
@ -36,22 +39,80 @@
|
||||
</table>
|
||||
</div>
|
||||
<!-- end header part -->
|
||||
<!-- Generated by Doxygen 1.8.14 -->
|
||||
<!-- Generated by Doxygen 1.8.11 -->
|
||||
<script type="text/javascript">
|
||||
/* @license magnet:?xt=urn:btih:cf05388f2679ee054f2beb29a391d25f4e673ac3&dn=gpl-2.0.txt GPL-v2 */
|
||||
var searchBox = new SearchBox("searchBox", "search",false,'Search');
|
||||
/* @license-end */
|
||||
</script>
|
||||
<script type="text/javascript" src="menudata.js"></script>
|
||||
<script type="text/javascript" src="menu.js"></script>
|
||||
<script type="text/javascript">
|
||||
/* @license magnet:?xt=urn:btih:cf05388f2679ee054f2beb29a391d25f4e673ac3&dn=gpl-2.0.txt GPL-v2 */
|
||||
$(function() {
|
||||
initMenu('',true,false,'search.php','Search');
|
||||
$(document).ready(function() { init_search(); });
|
||||
});
|
||||
/* @license-end */</script>
|
||||
<div id="main-nav"></div>
|
||||
<div id="navrow1" class="tabs">
|
||||
<ul class="tablist">
|
||||
<li><a href="index.html"><span>Main Page</span></a></li>
|
||||
<li><a href="modules.html"><span>Modules</span></a></li>
|
||||
<li><a href="namespaces.html"><span>Namespaces</span></a></li>
|
||||
<li class="current"><a href="annotated.html"><span>Classes</span></a></li>
|
||||
<li><a href="files.html"><span>Files</span></a></li>
|
||||
<li>
|
||||
<div id="MSearchBox" class="MSearchBoxInactive">
|
||||
<span class="left">
|
||||
<img id="MSearchSelect" src="search/mag_sel.png"
|
||||
onmouseover="return searchBox.OnSearchSelectShow()"
|
||||
onmouseout="return searchBox.OnSearchSelectHide()"
|
||||
alt=""/>
|
||||
<input type="text" id="MSearchField" value="Search" accesskey="S"
|
||||
onfocus="searchBox.OnSearchFieldFocus(true)"
|
||||
onblur="searchBox.OnSearchFieldFocus(false)"
|
||||
onkeyup="searchBox.OnSearchFieldChange(event)"/>
|
||||
</span><span class="right">
|
||||
<a id="MSearchClose" href="javascript:searchBox.CloseResultsWindow()"><img id="MSearchCloseImg" border="0" src="search/close.png" alt=""/></a>
|
||||
</span>
|
||||
</div>
|
||||
</li>
|
||||
</ul>
|
||||
</div>
|
||||
<div id="navrow2" class="tabs2">
|
||||
<ul class="tablist">
|
||||
<li><a href="annotated.html"><span>Class List</span></a></li>
|
||||
<li><a href="classes.html"><span>Class Index</span></a></li>
|
||||
<li><a href="inherits.html"><span>Class Hierarchy</span></a></li>
|
||||
<li class="current"><a href="functions.html"><span>Class Members</span></a></li>
|
||||
</ul>
|
||||
</div>
|
||||
<div id="navrow3" class="tabs2">
|
||||
<ul class="tablist">
|
||||
<li><a href="functions.html"><span>All</span></a></li>
|
||||
<li class="current"><a href="functions_func.html"><span>Functions</span></a></li>
|
||||
<li><a href="functions_vars.html"><span>Variables</span></a></li>
|
||||
<li><a href="functions_type.html"><span>Typedefs</span></a></li>
|
||||
<li><a href="functions_enum.html"><span>Enumerations</span></a></li>
|
||||
<li><a href="functions_eval.html"><span>Enumerator</span></a></li>
|
||||
</ul>
|
||||
</div>
|
||||
<div id="navrow4" class="tabs3">
|
||||
<ul class="tablist">
|
||||
<li><a href="functions_func.html#index_a"><span>a</span></a></li>
|
||||
<li class="current"><a href="functions_func_b.html#index_b"><span>b</span></a></li>
|
||||
<li><a href="functions_func_c.html#index_c"><span>c</span></a></li>
|
||||
<li><a href="functions_func_d.html#index_d"><span>d</span></a></li>
|
||||
<li><a href="functions_func_e.html#index_e"><span>e</span></a></li>
|
||||
<li><a href="functions_func_f.html#index_f"><span>f</span></a></li>
|
||||
<li><a href="functions_func_g.html#index_g"><span>g</span></a></li>
|
||||
<li><a href="functions_func_h.html#index_h"><span>h</span></a></li>
|
||||
<li><a href="functions_func_i.html#index_i"><span>i</span></a></li>
|
||||
<li><a href="functions_func_k.html#index_k"><span>k</span></a></li>
|
||||
<li><a href="functions_func_l.html#index_l"><span>l</span></a></li>
|
||||
<li><a href="functions_func_m.html#index_m"><span>m</span></a></li>
|
||||
<li><a href="functions_func_n.html#index_n"><span>n</span></a></li>
|
||||
<li><a href="functions_func_o.html#index_o"><span>o</span></a></li>
|
||||
<li><a href="functions_func_p.html#index_p"><span>p</span></a></li>
|
||||
<li><a href="functions_func_q.html#index_q"><span>q</span></a></li>
|
||||
<li><a href="functions_func_r.html#index_r"><span>r</span></a></li>
|
||||
<li><a href="functions_func_s.html#index_s"><span>s</span></a></li>
|
||||
<li><a href="functions_func_t.html#index_t"><span>t</span></a></li>
|
||||
<li><a href="functions_func_u.html#index_u"><span>u</span></a></li>
|
||||
<li><a href="functions_func_v.html#index_v"><span>v</span></a></li>
|
||||
<li><a href="functions_func_w.html#index_w"><span>w</span></a></li>
|
||||
<li><a href="functions_func_0x7e.html#index_0x7e"><span>~</span></a></li>
|
||||
</ul>
|
||||
</div>
|
||||
</div><!-- top -->
|
||||
<!-- window showing the filter options -->
|
||||
<div id="MSearchSelectWindow"
|
||||
@ -70,28 +131,43 @@ $(function() {
|
||||
<div class="contents">
|
||||
 
|
||||
|
||||
<h3><a id="index_b"></a>- b -</h3><ul>
|
||||
<h3><a class="anchor" id="index_b"></a>- b -</h3><ul>
|
||||
<li>back()
|
||||
: <a class="el" href="classcutlass_1_1Array_3_01T_00_01N_00_01false_01_4.html#a693677ee48012a4d013d55741d38764e">cutlass::Array< T, N, false ></a>
|
||||
, <a class="el" href="classcutlass_1_1Array_3_01T_00_01N_00_01true_01_4.html#a6c81a715431cf5a772c2273362df97fd">cutlass::Array< T, N, true ></a>
|
||||
</li>
|
||||
<li>batch()
|
||||
: <a class="el" href="structcutlass_1_1gemm_1_1GemmCoord.html#ac441e93411ed6954a93ab36f0c747d26">cutlass::gemm::GemmCoord</a>
|
||||
: <a class="el" href="structcutlass_1_1gemm_1_1BatchedGemmCoord.html#a40582b341f6916b17105377a64743682">cutlass::gemm::BatchedGemmCoord</a>
|
||||
</li>
|
||||
<li>BatchedGemmCoord()
|
||||
: <a class="el" href="structcutlass_1_1gemm_1_1BatchedGemmCoord.html#ae1065cdcd7d6d99f971cba5c2565fe7d">cutlass::gemm::BatchedGemmCoord</a>
|
||||
</li>
|
||||
<li>BatchedReduction()
|
||||
: <a class="el" href="structcutlass_1_1reduction_1_1BatchedReduction.html#a9d76da3dcf4d8ec0cfeb2134f73ea22b">cutlass::reduction::BatchedReduction< BatchedReductionTraits_ ></a>
|
||||
</li>
|
||||
<li>begin()
|
||||
: <a class="el" href="structcutlass_1_1PredicateVector.html#a649045d8224514a4c28bcaf4b247b4a5">cutlass::PredicateVector< kPredicates_, kPredicatesPerByte_, kPredicateStart_ ></a>
|
||||
, <a class="el" href="structcutlass_1_1TensorRefArray.html#a6b0f0d9cef4a2f3f4a8bf6c192a282db">cutlass::TensorRefArray< Storage_, Rank_, MapFunc_, StorageRank_, Index_, LongIndex_ ></a>
|
||||
, <a class="el" href="structcutlass_1_1TensorRefBatchStrided.html#a4f4a2f860cc10688ee27cc9ce1df1015">cutlass::TensorRefBatchStrided< Storage_, Rank_, MapFunc_, StorageRank_, Index_, LongIndex_ ></a>
|
||||
: <a class="el" href="classcutlass_1_1Array_3_01T_00_01N_00_01false_01_4.html#a6e9dbf4a486f07dc72dd5140a7628971">cutlass::Array< T, N, false ></a>
|
||||
, <a class="el" href="classcutlass_1_1Array_3_01T_00_01N_00_01true_01_4.html#acf5a84cce457d31be7d30c57ab52f64c">cutlass::Array< T, N, true ></a>
|
||||
, <a class="el" href="classcutlass_1_1library_1_1Manifest.html#aa8a131b4258bfda04fdba4449520c587">cutlass::library::Manifest</a>
|
||||
, <a class="el" href="structcutlass_1_1PredicateVector.html#a649045d8224514a4c28bcaf4b247b4a5">cutlass::PredicateVector< kPredicates_, kPredicatesPerByte_, kPredicateStart_ ></a>
|
||||
</li>
|
||||
<li>beta()
|
||||
: <a class="el" href="classcutlass_1_1gemm_1_1LinearScalingDevicePtr_1_1Params.html#a0938bcb61d2572d5cf6cf2de95d11816">cutlass::gemm::LinearScalingDevicePtr< Scalar_, FragmentMultiplyAdd_ >::Params</a>
|
||||
<li>bitcast()
|
||||
: <a class="el" href="structcutlass_1_1half__t.html#acb746c82bd4dd496f79b7e611e3653dd">cutlass::half_t</a>
|
||||
</li>
|
||||
<li>block_shape()
|
||||
: <a class="el" href="classcutlass_1_1reduction_1_1kernel_1_1ReduceSplitK.html#af788ae48c72021b8ce49da15dfa72be3">cutlass::reduction::kernel::ReduceSplitK< Shape_, OutputOp_, ReductionOp_, PartitionsPerStage ></a>
|
||||
</li>
|
||||
<li>BlockForEach()
|
||||
: <a class="el" href="structcutlass_1_1reference_1_1device_1_1BlockForEach.html#a161e212b9b7ddbac36888de97538e106">cutlass::reference::device::BlockForEach< Element, Func ></a>
|
||||
, <a class="el" href="structcutlass_1_1reference_1_1host_1_1BlockForEach.html#aa2e578397b5cd68214736c2437f92480">cutlass::reference::host::BlockForEach< Element, Func ></a>
|
||||
</li>
|
||||
</ul>
|
||||
</div><!-- contents -->
|
||||
<!-- start footer part -->
|
||||
<hr class="footer"/><address class="footer"><small>
|
||||
Generated on Fri Oct 26 2018 14:53:41 for Cutlass by  <a href="http://www.doxygen.org/index.html">
|
||||
Generated by  <a href="http://www.doxygen.org/index.html">
|
||||
<img class="footer" src="doxygen.png" alt="doxygen"/>
|
||||
</a> 1.8.14
|
||||
</a> 1.8.11
|
||||
</small></address>
|
||||
</body>
|
||||
</html>
|
||||
|
||||
Reference in New Issue
Block a user