PMO OpenIR  > 南极天文中心
Large-scale parallelization based on CPU and GPU cluster for cosmological fluid simulations
Meng, Chen1,2; Wang, Long1; Cao, Zongyan1,3; Feng, Long-long4; Zhu, Weishan4
2015-03-30
Source PublicationCOMPUTERS & FLUIDS
Volume110Pages:152-158
AbstractWe present our parallel implementation for large-scale cosmological simulations of 3D supersonic fluids based on CPU and GPU clusters. Our developments are based on a CPU code named WIGEON. It is shown that, compared to the original sequential Fortran code, a speedup of 19-31 (depending on the specific GPU card) can be achieved on single GPU. Furthermore, our results show that the pure MPI parallelization scales very well up to 10 thousand CPU cores. In addition, a hybrid CPU/GPU parallelization scheme is introduced and a detailed analysis of the speedup and the scaling on the different number of CPU/GPU units are presented (up to 256 GPU cards due to computing resource limitation). Our high scalability and speedup rely on the domain decomposition approach, optimization of the algorithm and a series of techniques to optimize the CUDA implementation, especially in the memory access pattern on CPU. We believe this hybrid MPI + CUDA code can be an excellent candidate for 10 Peta-scale computing and beyond. (C) 2014 Elsevier Ltd. All rights reserved.
KeywordCosmological Hydrodynamics Weno Gpu Hierarchical Memory Heterogeneous Large-scale
Subject Area天文和天体物理
WOS HeadingsScience & Technology ; Technology
DOI10.1016/j.compfluid.2014.04.006
WOS KeywordSCHEMES
Indexed BySCI
Language英语
WOS Research AreaComputer Science ; Mechanics
WOS SubjectComputer Science, Interdisciplinary Applications ; Mechanics
WOS IDWOS:000350535100016
Citation statistics
Cited Times:3[WOS]   [WOS Record]     [Related Records in WOS]
Document Type期刊论文
Identifierhttp://libir.pmo.ac.cn/handle/332002/14928
Collection南极天文中心
Affiliation1.Chinese Acad Sci, Supercomp Ctr, Comp Network Informat Ctr, Beijing 100190, Peoples R China
2.Univ Chinese Acad Sci, Beijing 100049, Peoples R China
3.Chinese Acad Sci, Natl Astron Observ, Beijing 100012, Peoples R China
4.Chinese Acad Sci, Purple Mt Observ, Nanjing 210008, Jiangsu, Peoples R China
Recommended Citation
GB/T 7714
Meng, Chen,Wang, Long,Cao, Zongyan,et al. Large-scale parallelization based on CPU and GPU cluster for cosmological fluid simulations[J]. COMPUTERS & FLUIDS,2015,110:152-158.
APA Meng, Chen,Wang, Long,Cao, Zongyan,Feng, Long-long,&Zhu, Weishan.(2015).Large-scale parallelization based on CPU and GPU cluster for cosmological fluid simulations.COMPUTERS & FLUIDS,110,152-158.
MLA Meng, Chen,et al."Large-scale parallelization based on CPU and GPU cluster for cosmological fluid simulations".COMPUTERS & FLUIDS 110(2015):152-158.
Files in This Item:
File Name/Size DocType Version Access License
2015084.pdf(1324KB) 开放获取Apache LicenseView Application Full Text
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Meng, Chen]'s Articles
[Wang, Long]'s Articles
[Cao, Zongyan]'s Articles
Baidu academic
Similar articles in Baidu academic
[Meng, Chen]'s Articles
[Wang, Long]'s Articles
[Cao, Zongyan]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Meng, Chen]'s Articles
[Wang, Long]'s Articles
[Cao, Zongyan]'s Articles
Terms of Use
No data!
Social Bookmark/Share
File name: 2015084.pdf
Format: Adobe PDF
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.