Compiling and Linking with NVCC

https://devblogs.nvidia.com/separate-compilation-linking-cuda-device-code/

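1. Compile:
```
# Object files that make up the application.
objects = main.o particle.o v3.o

# Linking through nvcc performs the device link step implicitly.
# sm_20 (Fermi) is the architecture used in the original example; CUDA 9 and
# later no longer support it, so substitute the architecture of your GPU.
all: $(objects)
	nvcc -arch=sm_20 $(objects) -o app

# -x cu treats the .cpp files as CUDA sources; -dc emits relocatable device code.
%.o: %.cpp
	nvcc -x cu -arch=sm_20 -I. -dc $< -o $@

clean:
	rm -f *.o app
```

2. Link:

To do the final link with a host linker such as g++ instead of nvcc, first run an explicit device link step with -dlink, then link its output together with the object files and the CUDA runtime: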
```
nvcc -arch=sm_20 -dlink v3.o particle.o main.o -o gpuCode.o
```
```
g++ gpuCode.o main.o particle.o v3.o -lcudart -o app
```

NVCC compile options that control precision
```
--use_fast_math (-use_fast_math)
    Make use of fast math library. '--use_fast_math' implies '--ftz=true --prec-div=false
    --prec-sqrt=false --fmad=true'.

--ftz {true|false} (-ftz)
    This option controls single-precision denormals support. '--ftz=true' flushes
    denormal values to zero and '--ftz=false' preserves denormal values. '--use_fast_math'
    implies '--ftz=true'.
    Default value: false.

--prec-div {true|false} (-prec-div)
    This option controls single-precision floating-point division and reciprocals.
    '--prec-div=true' enables the IEEE round-to-nearest mode and '--prec-div=false'
    enables the fast approximation mode. '--use_fast_math' implies '--prec-div=false'.
    Default value: true.

--prec-sqrt {true|false} (-prec-sqrt)
    This option controls single-precision floating-point square root. '--prec-sqrt=true'
    enables the IEEE round-to-nearest mode and '--prec-sqrt=false' enables the
    fast approximation mode. '--use_fast_math' implies '--prec-sqrt=false'.
    Default value: true.

--fmad {true|false} (-fmad)
    This option enables (disables) the contraction of floating-point multiplies
    and adds/subtracts into floating-point multiply-add operations (FMAD, FFMA,
    or DFMA). '--use_fast_math' implies '--fmad=true'.
    Default value: true.
```
Original article: https://www.cnblogs.com/luoyinjie/p/12305335.html