Fengguang Song
Cited by
Cited by
Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems
F Song, A YarKhan, J Dongarra
High Performance Computing, Networking, Storage and Analysis (SC), 2009 …, 2009
Enabling and Scaling Matrix Computations on Heterogeneous Multi-Core and Multi-GPU Systems
F Song, S Tomov, J Dongarra
ICS 2012, 2012
An algebra for cross-experiment performance analysis
F Song, F Wolf, N Bhatia, J Dongarra, S Moore
International Conference on Parallel Processing, 2004. ICPP 2004., 63-72, 2004
A Scalable Framework for Heterogeneous GPU-Based Clusters
F Song, J Dongarra
SPAA 2012, 2012
Scalable tile communication-avoiding QR factorization on multicore cluster systems
F Song, H Ltaief, B Hadri, J Dongarra
SC'10: Proceedings of the 2010 ACM/IEEE International Conference for High …, 2010
Correcting Soft Errors Online in Fast Fourier Transform
X Liang, J Chen, D Tao, S Li, P Wu, H Li, K Ouyang, Y Liu, F Song, ...
SC'17, 2017
L2 cache modeling for scientific applications on chip multi-processors
F Song, S Moore, J Dongarra
2007 International Conference on Parallel Processing (ICPP 2007), 51-51, 2007
Analytical modeling and optimization for affinity based thread scheduling on multicore systems
F Song, S Moore, J Dongarra
IEEE Cluster Computing, 2009., 1-10, 2009
Feedback-directed thread scheduling with memory considerations
F Song, S Moore, J Dongarra
HPDC 2007, 97-106, 2007
Experiments with strassen’s algorithm: from sequential to parallel
F Song, J Dongarra, S Moore
Parallel and Distributed Computing and Systems 2 (3), 2006
Scaling Up Matrix Computations on Shared-Memory Manycore Systems with 1000 CPU Cores
F Song, J Dongarra
The 28th ACM International Conference on Supercomputing (ICS'14), 2014
Performance instrumentation and compiler optimizations for MPI/OpenMP applications
O Hernandez, F Song, B Chapman, J Dongarra, B Mohr, S Moore, F Wolf
International Workshop on OpenMP, 267-278, 2005
Automatic experimental analysis of communication patterns in virtual topologies
N Bhatia, F Song, F Wolf, J Dongarra, B Mohr, S Moore
2005 International Conference on Parallel Processing (ICPP'05), 465-472, 2005
A scalable approach to solving dense linear algebra problems on hybrid CPU‐GPU systems
F Song, J Dongarra
Concurrency and Computation: Practice and Experience 27 (14), 3702-3723, 2015
Performance analysis and optimization of in-situ integration of simulation with data analysis: zipping applications up
Y Fu, F Li, F Song, Z Chen
The 27th International Symposium on High-Performance Parallel and …, 2018
Automating the Large-Scale Collection and Analysis of Performance Data on Linux Clusters
P Mucci, J Dongarra, S Moore, F Song, F Wolf, R Kufrin
Proceedings of the 5th LCI International Conference on Linux Clusters: The …, 2004
Opengraphgym: a parallel reinforcement learning framework for graph optimization problems
W Zheng, D Wang, F Song
International conference on computational science, 439-452, 2020
KV-Cache: A Scalable High-Performance Web-Object Cache for Manycore
D Waddington, J Colmenares, J Kuang, F Song
The 6th ACM/IEEE International Conference on Utility and Cloud Computings …, 2013
LBM-IB: A parallel library to solve 3D fluid-structure interaction problems on manycore systems
P Nagar, F Song, L Zhu, L Lin
2015 44th International Conference on Parallel Processing, 51-60, 2015
Interactive 3D Simulation for Fluid-Structure Interactions Using Two GPUs
B Zigon, L Zhu, F Song
Journal of Supercomputing, 2017
The system can't perform the operation now. Try again later.
Articles 1–20