Task-based FMM for multicore architectures E Agullo, B Bramas, O Coulaud, E Darve, M Messner, T Takahashi SIAM Journal on Scientific Computing 36 (1), C66-C93, 2014 | 92 | 2014 |
Task‐based FMM for heterogeneous architectures E Agullo, B Bramas, O Coulaud, E Darve, M Messner, T Takahashi Concurrency and Computation: Practice and Experience 28 (9), 2608-2629, 2016 | 56 | 2016 |
A novel hybrid quicksort algorithm vectorized using AVX-512 on Intel Skylake B Bramas arXiv preprint arXiv:1704.08579, 2017 | 43 | 2017 |
Bridging the gap between OpenMP and task-based runtime systems for the fast multipole method E Agullo, O Aumage, B Bramas, O Coulaud, S Pitoiset IEEE Transactions on Parallel and Distributed Systems 28 (10), 2794-2807, 2017 | 34 | 2017 |
Fast sorting algorithms using AVX-512 on Intel Knights Landing B Bramas arXiv preprint arXiv:1704.08579 305, 315, 2017 | 23 | 2017 |
ScalFMM: A Generic Parallel Fast Multipole Library P Blanchard, B Bramas, O Coulaud, E Darve, L Dupuy, A Etcheverry, ... Computational Science and Engineering (CSE), 2015 | 21 | 2015 |
Optimized M2L kernels for the Chebyshev interpolation based fast multipole method M Messner, B Bramas, O Coulaud, E Darve arXiv preprint arXiv:1210.7292, 2012 | 21 | 2012 |
Optimization of a discontinuous Galerkin solver with OpenCL and StarPU B Bramas, P Helluy, L Mendoza, B Weber International Journal on Finite Volumes 15 (1), 1-19, 2020 | 16* | 2020 |
Pipelining the fast multipole method over a runtime system E Agullo, B Bramas, O Coulaud, E Darve, M Messner, T Toru arXiv preprint arXiv:1206.0115, 2012 | 15 | 2012 |
Computing the sparse matrix vector product using block-based kernels without zero padding on processors with AVX-512 instructions B Bramas, P Kus PeerJ Computer Science 4, e151, 2018 | 13 | 2018 |
Optimization and parallelization of the boundary element method for the wave equation in time domain B Bramas Bordeaux, 2016 | 13 | 2016 |
Matrices over runtime systems at exascale E Agullo, G Bosilca, B Bramas, C Castagnede, O Coulaud, E Darve, ... 2012 SC Companion: High Performance Computing, Networking Storage and …, 2012 | 11 | 2012 |
Matrices over runtime systems at exascale E Agullo, G Bosilca, B Bramas, C Castagnede, O Coulaud, E Darve, ... 2012 SC Companion: High Performance Computing, Networking Storage and …, 2012 | 11 | 2012 |
Task-based fast multipole method for clusters of multicore processors E Agullo, B Bramas, O Coulaud, M Khannouz, L Stanisic Inria Bordeaux Sud-Ouest, 2017 | 10 | 2017 |
Improving parallel executions by increasing task granularity in task-based runtime systems using acyclic DAG clustering B Bramas, A Ketterlin PeerJ Computer Science 6, e247, 2020 | 9 | 2020 |
Inastemp: A novel intrinsics-as-template library for portable simd-vectorization B Bramas Scientific Programming 2017, 2017 | 9 | 2017 |
Shape-and scale-dependent coupling between spheroids and velocity gradients in turbulence N Pujara, JA Arguedas-Leiva, CC Lalescu, B Bramas, M Wilczek Journal of Fluid Mechanics 922, R6, 2021 | 8 | 2021 |
Impact study of data locality on task-based applications through the Heteroprio scheduler B Bramas PeerJ Computer Science 5, e190, 2019 | 8 | 2019 |
Increasing the degree of parallelism using speculative execution in task-based runtime systems B Bramas PeerJ Computer Science 5, e183, 2019 | 8 | 2019 |
Design of a sound system to increase emotional expression impact in human-robot interaction B Bramas, YM Kim, DS Kwon 2008 international conference on control, automation and systems, 2732-2737, 2008 | 7 | 2008 |