  • PLASMA

     


    Latest version

    • Added LU inversion functions PLASMA_zgetri, PLASMA_zgetri_Tile and PLASMA_zgetri_Tile_Async, which use the recursive parallel panel implementation of LU factorization.
    • The Householder reduction trees for QR and LQ factorizations now work in the general case, not only on matrices where M is a multiple of MB.
    • Matrix generation has been changed in all timing, testing and example files to use a parallel initialization that distributes the data better across the architecture, especially for the Tile interface. “numactl” is no longer required.
    • Timing routines can now generate DAGs with the --dag option, and traces with the --trace option if EZTRACE is present.
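To make the second item concrete: when M is not a multiple of MB, the last row of tiles is shorter than the rest. Below is a minimal Python sketch of that split, assuming MB is the number of rows per tile. The function name `tile_row_heights` is made up for illustration and is not part of PLASMA.

```python
def tile_row_heights(m, mb):
    """Split m matrix rows into tiles of at most mb rows each.

    Returns the height of every tile, top to bottom. Only the last
    tile can be shorter, and only when m is not a multiple of mb.
    """
    if m < 0 or mb <= 0:
        raise ValueError("m must be >= 0 and mb must be > 0")
    full_tiles, remainder = divmod(m, mb)
    heights = [mb] * full_tiles
    if remainder:
        heights.append(remainder)  # the partial tile of the general case
    return heights


if __name__ == "__main__":
    print(tile_row_heights(8, 4))   # M is a multiple of MB
    print(tile_row_heights(10, 4))  # general case with a short last tile
```

For M = 10 and MB = 4 this gives two full tiles and one tile of 2 rows, which is the case the reduction trees previously could not handle.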


     Purpose of PLASMA (from the package README)

       The main purpose of PLASMA is to address the performance shortcomings
       of the LAPACK and ScaLAPACK libraries on multicore processors and
       multi-socket systems of multicore processors and their inability to
       efficiently utilize accelerators such as Graphics Processing Units
       (GPUs). PLASMA provides routines to solve dense general systems of
       linear equations, symmetric positive definite systems of linear
       equations and linear least squares problems, using LU, Cholesky, QR and
       LQ factorizations. Real arithmetic and complex arithmetic are supported
       in both single precision and double precision.

       PLASMA has been designed to supersede LAPACK and ScaLAPACK, principally
       by restructuring the software to achieve much greater efficiency, where
       possible, on modern computers based on multicore processors. PLASMA
       also relies on new or improved algorithms. Currently, however, PLASMA
       does not serve as a complete replacement of LAPACK due to limited
       functionality. Specifically, PLASMA does not support band matrices and
       does not solve eigenvalue and singular value problems. Also, PLASMA
       does not replace ScaLAPACK as software for distributed memory
       computers, since it only supports shared-memory machines.
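The README mentions solving symmetric positive definite systems with a Cholesky factorization. As a reminder of what that means numerically, here is a small, unoptimized pure-Python sketch. It is not PLASMA code and does not use the PLASMA API; the names `cholesky` and `solve_spd` are made up for illustration.

```python
import math


def cholesky(a):
    """Return lower-triangular L with a = L * L^T (a must be SPD)."""
    n = len(a)
    l = [[0.0] * n for _ in range(n)]
    for j in range(n):
        # Diagonal entry: what is left of a[j][j] after earlier columns.
        d = a[j][j] - sum(l[j][k] ** 2 for k in range(j))
        if d <= 0.0:
            raise ValueError("matrix is not positive definite")
        l[j][j] = math.sqrt(d)
        # Entries below the diagonal in column j.
        for i in range(j + 1, n):
            s = a[i][j] - sum(l[i][k] * l[j][k] for k in range(j))
            l[i][j] = s / l[j][j]
    return l


def solve_spd(a, b):
    """Solve a x = b for SPD a via Cholesky and two triangular solves."""
    l = cholesky(a)
    n = len(b)
    # Forward substitution: L y = b.
    y = [0.0] * n
    for i in range(n):
        y[i] = (b[i] - sum(l[i][k] * y[k] for k in range(i))) / l[i][i]
    # Backward substitution: L^T x = y.
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (y[i] - sum(l[k][i] * x[k] for k in range(i + 1, n))) / l[i][i]
    return x


if __name__ == "__main__":
    # 4x + 2y = 10, 2x + 3y = 8 has the solution x = 1.75, y = 1.5.
    print(solve_spd([[4.0, 2.0], [2.0, 3.0]], [10.0, 8.0]))
```

A library like PLASMA performs the same mathematical steps, but on tiles of the matrix and in parallel across cores.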
                                                               
     Installing PLASMA


    1. Preparation

    Install BLAS

    Install CBLAS

    Install LAPACK

    2. Edit make.inc

    Build

    3. Testing

    timing.c contains the timing and flops functions.
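The sketch below illustrates what "timing and flops" amounts to: measure the elapsed time, then divide an operation count by it to get a rate. It does not reproduce timing.c. The function names are made up, and the operation counts are only the usual leading-order terms (n^3/3 for Cholesky, 2n^3/3 for LU), which is an assumption about what is being measured.

```python
import time


def flops_cholesky(n):
    """Leading-order flop count of an n x n Cholesky factorization."""
    return n ** 3 / 3.0


def flops_lu(n):
    """Leading-order flop count of an n x n LU factorization."""
    return 2.0 * n ** 3 / 3.0


def gflops(flop_count, seconds):
    """Convert an operation count and an elapsed time into Gflop/s."""
    if seconds <= 0.0:
        raise ValueError("elapsed time must be positive")
    return flop_count / seconds / 1e9


def time_call(fn, *args):
    """Run fn(*args) once and return (result, elapsed seconds)."""
    start = time.perf_counter()
    result = fn(*args)
    return result, time.perf_counter() - start


if __name__ == "__main__":
    # Time a placeholder workload; a real run would time the factorization.
    _, elapsed = time_call(sum, range(1_000_000))
    print("elapsed seconds:", elapsed)
    print("Gflop/s for LU with n=1000 at that time:",
          gflops(flops_lu(1000), elapsed))
```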


    Open questions:

    1. When testing PLASMA, how do I set BLAS to 1 (presumably the number of BLAS threads)?
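One possible answer, assuming the question is about limiting BLAS to a single thread: many multithreaded BLAS builds read their thread count from an environment variable. The sketch below sets the commonly used ones before launching a test program. Which variable actually takes effect depends on the BLAS library that was linked, so treat this as a guess about the setup, not a confirmed fix. The helper names and the `./my_plasma_test` path are hypothetical.

```python
import os
import subprocess

# Thread-count variables read by OpenMP runtimes, Intel MKL and OpenBLAS.
BLAS_THREAD_VARS = ("OMP_NUM_THREADS", "MKL_NUM_THREADS", "OPENBLAS_NUM_THREADS")


def single_threaded_blas_env(base_env=None):
    """Return a copy of the environment with BLAS limited to one thread."""
    env = dict(os.environ if base_env is None else base_env)
    for name in BLAS_THREAD_VARS:
        env[name] = "1"
    return env


def run_with_sequential_blas(command):
    """Run a command (list of strings) with single-threaded BLAS settings."""
    return subprocess.run(command, env=single_threaded_blas_env(), check=True)


if __name__ == "__main__":
    env = single_threaded_blas_env()
    for name in BLAS_THREAD_VARS:
        print(name, "=", env[name])
    # Hypothetical test binary; replace with the actual program to run:
    # run_with_sequential_blas(["./my_plasma_test"])
```

Linking against a sequential build of the BLAS library, if one is available, would have the same effect without any environment variables.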

  • Original article: https://www.cnblogs.com/catkins/p/5270789.html