Fengguang Song
Title
Cited by
Year
Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems
F Song, A YarKhan, J Dongarra
High Performance Computing, Networking, Storage and Analysis (SC), 2009 …, 2009
Cited by: 147 · Year: 2009
Enabling and Scaling Matrix Computations on Heterogeneous Multi-Core and Multi-GPU Systems
F Song, S Tomov, J Dongarra
ICS 2012, 2012
Cited by: 142* · Year: 2012
An algebra for cross-experiment performance analysis
F Song, F Wolf, N Bhatia, J Dongarra, S Moore
International Conference on Parallel Processing, 2004. ICPP 2004., 63-72, 2004
Cited by: 78 · Year: 2004
A Scalable Framework for Heterogeneous GPU-Based Clusters
F Song, J Dongarra
SPAA 2012, 2012
Cited by: 70 · Year: 2012
Scalable tile communication-avoiding QR factorization on multicore cluster systems
F Song, H Ltaief, B Hadri, J Dongarra
SC'10: Proceedings of the 2010 ACM/IEEE International Conference for High …, 2010
Cited by: 51 · Year: 2010
Correcting Soft Errors Online in Fast Fourier Transform
X Liang, J Chen, D Tao, S Li, P Wu, H Li, K Ouyang, Y Liu, F Song, ...
SC'17, 2017
Cited by: 45 · Year: 2017
L2 cache modeling for scientific applications on chip multi-processors
F Song, S Moore, J Dongarra
2007 International Conference on Parallel Processing (ICPP 2007), 51-51, 2007
Cited by: 41 · Year: 2007
Analytical modeling and optimization for affinity based thread scheduling on multicore systems
F Song, S Moore, J Dongarra
IEEE Cluster Computing, 2009., 1-10, 2009
Cited by: 39 · Year: 2009
Feedback-directed thread scheduling with memory considerations
F Song, S Moore, J Dongarra
HPDC 2007, 97-106, 2007
Cited by: 28 · Year: 2007
Experiments with Strassen’s algorithm: from sequential to parallel
F Song, J Dongarra, S Moore
Parallel and Distributed Computing and Systems 2 (3), 2006
Cited by: 27 · Year: 2006
Scaling Up Matrix Computations on Shared-Memory Manycore Systems with 1000 CPU Cores
F Song, J Dongarra
The 28th ACM International Conference on Supercomputing (ICS'14), 2014
Cited by: 21 · Year: 2014
Performance instrumentation and compiler optimizations for MPI/OpenMP applications
O Hernandez, F Song, B Chapman, J Dongarra, B Mohr, S Moore, F Wolf
International Workshop on OpenMP, 267-278, 2005
Cited by: 19 · Year: 2005
Performance analysis and optimization of in-situ integration of simulation with data analysis: zipping applications up
Y Fu, F Li, F Song, Z Chen
The 27th International Symposium on High-Performance Parallel and …, 2018
Cited by: 18 · Year: 2018
Automatic experimental analysis of communication patterns in virtual topologies
N Bhatia, F Song, F Wolf, J Dongarra, B Mohr, S Moore
2005 International Conference on Parallel Processing (ICPP'05), 465-472, 2005
Cited by: 17 · Year: 2005
A scalable approach to solving dense linear algebra problems on hybrid CPU‐GPU systems
F Song, J Dongarra
Concurrency and Computation: Practice and Experience 27 (14), 3702-3723, 2015
Cited by: 16 · Year: 2015
KV-Cache: A Scalable High-Performance Web-Object Cache for Manycore
D Waddington, J Colmenares, J Kuang, F Song
The 6th ACM/IEEE International Conference on Utility and Cloud Computing …, 2013
Cited by: 16 · Year: 2013
OpenGraphGym: A parallel reinforcement learning framework for graph optimization problems
W Zheng, D Wang, F Song
International conference on computational science, 439-452, 2020
Cited by: 15 · Year: 2020
Automating the Large-Scale Collection and Analysis of Performance Data on Linux Clusters
P Mucci, J Dongarra, S Moore, F Song, F Wolf, R Kufrin
Proceedings of the 5th LCI International Conference on Linux Clusters: The …, 2004
Cited by: 15 · Year: 2004
An extended roofline model with communication-awareness for distributed-memory HPC systems
D Cardwell, F Song
Proceedings of the International Conference on High Performance Computing in …, 2019
Cited by: 12 · Year: 2019
Building a Scientific Workflow Framework to Enable Real-time Machine Learning and Visualization
F Li, F Song
Concurrency and Computation: Practice and Experience, 2018
Cited by: 11 · Year: 2018