Developed a CUDA version of the FDTD method and achieved a speedup 40x. Implemented on a NVIDIA Quadro FX 3800 GPU, which has 192 SPs, 1GB global memory, and a memory bandwidth of 51.2 GB/s.
This paper develops two local mesh-free methods for designing stencil weights and spatial discretization, respectively, for parabolic partial differential equations (PDEs) of ...
The decomposition of portfolio risks in terms of the underlying assets, which are extremely important for risk budgeting, asset allocation and risk monitoring, is well described by risk contributions.