Powerpack: Energy profiling and analysis of high-performance systems and applications R Ge, X Feng, S Song, HC Chang, D Li, KW Cameron IEEE Transactions on Parallel and Distributed Systems 21 (5), 658-671, 2009 | 538 | 2009 |
{Zero-offload}: Democratizing {billion-scale} model training J Ren, S Rajbhandari, RY Aminabadi, O Ruwase, S Yang, M Zhang, D Li, ... 2021 USENIX Annual Technical Conference (USENIX ATC 21), 551-564, 2021 | 371 | 2021 |
Destiny: A tool for modeling emerging 3d nvm and edram caches M Poremba, S Mittal, D Li, JS Vetter, Y Xie 2015 Design, Automation & Test in Europe Conference & Exhibition (DATE …, 2015 | 200 | 2015 |
A survey of architectural approaches for managing embedded DRAM and non-volatile on-chip caches S Mittal, JS Vetter, D Li IEEE Transactions on Parallel and Distributed Systems 26 (6), 1524-1537, 2014 | 199 | 2014 |
Hybrid MPI/OpenMP power-aware computing D Li, BR de Supinski, M Schulz, K Cameron, DS Nikolopoulos 2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010 | 196 | 2010 |
Processing-in-memory for energy-efficient neural network training: A heterogeneous approach J Liu, H Zhao, MA Ogleari, D Li, J Zhao 2018 51st Annual IEEE/ACM International Symposium on Microarchitecture …, 2018 | 152 | 2018 |
Classifying soft error vulnerabilities in extreme-scale scientific applications using a binary instrumentation tool D Li, JS Vetter, W Yu SC'12: Proceedings of the International Conference on High Performance …, 2012 | 125 | 2012 |
Unimem: Runtime data managementon non-volatile memory-based heterogeneous main memory K Wu, Y Huang, D Li Proceedings of the International Conference for High Performance Computing …, 2017 | 121 | 2017 |
Enabling and exploiting flexible task assignment on GPU through SM-centric program transformations B Wu, G Chen, D Li, X Shen, J Vetter Proceedings of the 29th ACM on International Conference on Supercomputing …, 2015 | 119 | 2015 |
Identifying opportunities for byte-addressable non-volatile memory in extreme-scale scientific applications D Li, JS Vetter, G Marin, C McCurdy, C Cira, Z Liu, W Yu 2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012 | 97 | 2012 |
The tradeoffs of fused memory hierarchies in heterogeneous computing architectures KL Spafford, JS Meredith, S Lee, D Li, PC Roth, JS Vetter Proceedings of the 9th conference on Computing Frontiers, 103-112, 2012 | 87 | 2012 |
Strategies for energy-efficient resource management of hybrid programming models D Li, BR De Supinski, M Schulz, DS Nikolopoulos, KW Cameron IEEE Transactions on parallel and distributed Systems 24 (1), 144-157, 2012 | 86 | 2012 |
PORPLE: An extensible optimizer for portable data placement on GPU G Chen, B Wu, D Li, X Shen 2014 47th Annual IEEE/ACM International Symposium on Microarchitecture, 88-100, 2014 | 82 | 2014 |
Exploring hybrid memory for GPU energy efficiency through software-hardware co-design B Wang, B Wu, D Li, X Shen, W Yu, Y Jiao, JS Vetter Proceedings of the 22nd international conference on Parallel architectures …, 2013 | 82 | 2013 |
Sentinel: Efficient tensor migration and allocation on heterogeneous memory systems for deep learning J Ren, J Luo, K Wu, M Zhang, H Jeon, D Li 2021 IEEE International Symposium on High-Performance Computer Architecture …, 2021 | 74 | 2021 |
Performance analysis and characterization of training deep learning models on mobile device J Liu, J Liu, W Du, D Li 2019 IEEE 25th International Conference on Parallel and Distributed Systems …, 2019 | 68 | 2019 |
Runtime data management on non-volatile memory-based heterogeneous memory for task-parallel programs K Wu, J Ren, D Li SC18: International Conference for High Performance Computing, Networking …, 2018 | 65 | 2018 |
Fauce: fast and accurate deep ensembles with uncertainty for cardinality estimation J Liu, W Dong, Q Zhou, D Li Proceedings of the VLDB Endowment 14 (11), 1950-1963, 2021 | 62 | 2021 |
Smart-PGSim: Using neural network to accelerate AC-OPF power grid simulation W Dong, Z Xie, G Kestor, D Li SC20: International Conference for High Performance Computing, Networking …, 2020 | 59 | 2020 |
Hm-ann: Efficient billion-point nearest neighbor search on heterogeneous memory J Ren, M Zhang, D Li Advances in Neural Information Processing Systems 33, 10672-10684, 2020 | 59 | 2020 |