多流向演算法GPU並行化

zhanlijun發表於2013-11-22

和導師在Computers & Geosciences上發表的關於多流向演算法GPU並行化的文章(SCI, IF=1.834)。

論文：http://sourcedb.igsnrr.cas.cn/zw/lw/201207/P020120717506311161951.pdf

As one of the important tasks in digital terrain analysis, the calculation of flow accumulations from gridded digital elevation models (DEMs) usually involves two steps in a real application: (1) using an iterative DEM preprocessing algorithm to remove the depressions and flat areas commonly contained in real DEMs, and (2) using a recursive flow-direction algorithm to calculate the flow accumulation for every cell in the DEM. Because both algorithms are computationally intensive, quick calculation of the flow accumulations from a DEM (especially for a large area) presents a practical challenge to personal computer (PC) users. In recent years, rapid increases in hardware capacity of the graphics processing units (GPUs) provided in modern PCs have made it possible to meet this challenge in a PC environment. Parallel computing on GPUs using a compute-unified-device-architecture (CUDA) programming model has been explored to speed up the execution of the single-flow-direction algorithm (SFD). However, the parallel implementation on a GPU of the multiple-flow-direction (MFD) algorithm, which generally performs better than the SFD algorithm, has not been reported. Moreover, GPU-based parallelization of the DEM preprocessing step in the flow-accumulation calculations has not been addressed. This paper proposes a parallel approach to calculate flow accumulations (including both iterative DEM preproces- sing and a recursive MFD algorithm) on a CUDA-compatible GPU. For the parallelization of an MFD algorithm (MFD-md), two different parallelization strategies using a GPU are explored. The first parallelization strategy, which has been used in the existing parallel SFD algorithm on GPU, has the problem of computing redundancy. Therefore, we designed a parallelization strategy based on graph theory. The application results show that the proposed parallel approach to calculate flow accumula- tions on a GPU performs much faster than either sequential algorithms or other parallel GPU-based algorithms based on existing parallelization strategies.

GPU程式設計(四):並行規約優化
2019-02-17
GPU程式設計並行優化
並行多工學習論文閱讀（二）同步和非同步優化演算法
2021-10-30
並行非同步優化演算法
使用MPI並行化遺傳演算法框架GAFT
2019-02-12
並行演算法框架
GPU的並行運算與CUDA的簡介
2019-01-26
GPU並行
26、多執行緒與並行
2020-10-17
執行緒並行
模型並行-Gpipe演算法
2024-12-09
模型並行演算法
MinkowskiEngine多GPU訓練
2021-01-04
GPU
pangrank演算法--PageRank演算法並行實現
2021-09-09
演算法並行
並行排序演算法：雙調排序
2024-11-15
並行排序演算法
並行Louvain社群檢測演算法
2021-12-12
並行AI演算法
Tensorflow多GPU使用詳解
2018-08-03
GPU
PyTorch中的多程序並行處理
2024-07-07
PyTorch並行
Pytorch：單卡多程式並行訓練
2023-01-24
PyTorch並行
用Dask並行化特徵工程！
2018-08-20
並行特徵工程
wsl docker裡執行ollama並使用nvidia gpu的一些記錄
2024-08-04
DockerGPU
任務排程的並行演算法
2018-04-03
並行演算法
從偽並行的 Python 多執行緒說起
2019-02-19
並行Python執行緒
C#多執行緒（四）並行程式設計篇之結構化
2022-12-18
C#執行緒並行行程程式設計
多執行緒並行執行，然後彙總結果
2019-01-18
執行緒並行
淺談GPU虛擬化技術(四)- GPU分片虛擬化
2018-06-11
GPU
淺談GPU虛擬化技術（四）-GPU分片虛擬化
2018-06-04
GPU
【python隨筆】之【多程式並行統計多個cvs檔案行數】
2020-10-24
Python並行
並行化最佳化KD樹演算法：使用C#實現高效的最近鄰搜尋
2024-03-10
並行演算法C#
C＃並行，多執行緒程式設計並行集合和PLINQ的例項講解
2019-01-09
並行執行緒程式設計
Pytorch使用資料並行，單機多卡
2020-05-14
PyTorch並行
演算法金 | 最難的來了：超引數網格搜尋、貝葉斯最佳化、遺傳演算法、模型特異化、Hyperopt、Optuna、多目標最佳化、非同步並行最佳化
2024-07-09
演算法模型非同步並行
淺談GPU虛擬化技術：GPU圖形渲染虛擬化
2018-06-11
GPU
PARL1.1一個修飾符實現並行強化學習演算法
2019-04-28
並行強化學習演算法
Java多執行緒並行處理任務的實現
2019-04-20
Java執行緒並行
第19節從庫MTS多執行緒並行回放（一）
2020-01-09
執行緒並行
第20節從庫MTS多執行緒並行回放（二）
2020-01-09
執行緒並行
二十：從庫MTS多執行緒並行回放（二）（筆記）
2019-07-09
執行緒並行筆記
十九：從庫MTS多執行緒並行回放（一）（筆記）
2019-07-09
執行緒並行筆記
C#多執行緒開發-任務並行庫04
2021-09-09
C#執行緒並行
多專案並行時人員怎麼分配
2021-07-19
並行
GPU開啟持久化模式
2024-06-12
GPU持久化模式
在no_ui中使用多程式實現多賬戶並行執行，並分配各自獨立的工作環境和策略
2021-12-10
UI並行
HTTP流量是如何流向代理的？
2022-05-27
HTTP
首個GPU高階語言，大規模並行就像寫Python，已獲8500 Star
2024-05-20
GPU並行Python

多流向演算法GPU並行化

相關文章