When I read the "Directable, High-Resolution Simulation of Fire on the GPU", it said that it used a GPU-based Jacobi iterator to solve the non-divergence equation for velocity field, so I made one based on CUBLAS. I think it should be much faster when the linear system is quite large. The next problem is GPU wavelet transform, there are still lot's of things to learn.
![](https://images.cnblogs.com/cnblogs_com/jedimaster/icon/7zip-48x48.png)