Follow
Yi Yang
Yi Yang
Verified email at nec-labs.com - Homepage
Title
Cited by
Cited by
Year
A gpgpu compiler for memory optimization and parallelism management
Y Yang, P Xiang, J Kong, H Zhou
ACM SIGPLAN Notices 45 (6), 86-97, 2010
4132010
Optimizing memory efficiency for deep convolutional neural networks on GPUs
C Li, Y Yang, M Feng, S Chakradhar, H Zhou
SC'16: Proceedings of the International Conference for High Performance …, 2016
1392016
CPU-Assisted GPGPU on Fused CPU-GPU Architectures
Y Yang, P Xiang, M Mantor, H Zhou
1342012
Accelerating deep neural network training with inconsistent stochastic gradient descent
L Wang, Y Yang, R Min, S Chakradhar
Neural Networks 93, 219-229, 2017
1162017
Warp-level divergence in GPUs: Characterization, impact, and mitigation
P Xiang, Y Yang, H Zhou
2014 IEEE 20th International Symposium on High Performance Computer …, 2014
922014
CUDA-NP: Realizing nested thread-level parallelism in GPGPU applications
Y Yang, H Zhou
ACM SIGPLAN Notices 49 (8), 93-106, 2014
912014
Memory efficiency for convolutional neural networks operating on graphics processing units
Y Yang, C Li, M Feng, S Chakradhar
US Patent 10,489,703, 2019
782019
Blasx: A high performance level-3 blas library for heterogeneous multi-gpu computing
L Wang, W Wu, Z Xu, J Xiao, Y Yang
Proceedings of the 2016 International Conference on Supercomputing, 1-11, 2016
772016
Shared Memory Multiplexing: A Novel Way to Improve GPGPU Throughput
Y Yang, P Xiang, M Mantor, N Rubin, H Zhou
Proceedings of the 21st international conference on Parallel architectures …, 2012
77*2012
Locality principle revisited: A probability-based quantitative approach
S Gupta, P Xiang, Y Yang, H Zhou
Journal of Parallel and Distributed Computing, 2013
582013
Locality Principle Revisited: A Probability-Based Quantitative Approach
S Gupta, P Xiang, Y Yang, H Zhou
IEEE International Parallel & Distributed Processing Symposium, 995 - 1009, 2012
582012
Accelerating MATLAB image processing toolbox functions on GPUs
J Kong, M Dimitrov, Y Yang, J Liyanage, L Cao, J Staples, M Mantor, ...
Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics …, 2010
572010
Exploiting uniform vector instructions for GPGPU performance, energy efficiency, and opportunistic reliability enhancement
P Xiang, Y Yang, M Mantor, N Rubin, LR Hsu, H Zhou, M Mantor, N Rubin
ICS, 433-442, 2013
472013
Automatic data placement into GPU on-chip memory resources
C Li, Y Yang, Z Lin, H Zhou
2015 IEEE/ACM International Symposium on Code Generation and Optimization …, 2015
422015
Understanding the tradeoffs between software-managed vs. hardware-managed caches in GPUs
C Li, Y Yang, H Dai, S Yan, F Mueller, H Zhou
2014 IEEE International Symposium on Performance Analysis of Systems and …, 2014
422014
A unified optimizing compiler framework for different GPGPU architectures
Y Yang, P Xiang, J Kong, M Mantor, H Zhou
ACM Transactions on Architecture and Code Optimization (TACO) 9 (2), 1-33, 2012
392012
Tasks integrated networks: Joint detection and retrieval for image search
L Zhang, Z He, Y Yang, L Wang, X Gao
IEEE Transactions on Pattern Analysis and Machine Intelligence 44 (1), 456-473, 2020
382020
Fixing Performance Bugs: An Empirical Study of Open-Source GPGPU Programs
Y Yang, P Xiang, M Mantor, H Zhou
International Conference on Parallel Processing, 2012
342012
Accelerating deep neural network training with inconsistent stochastic gradient descent
W Linnan, Y Yang, R Min, S Chakradhar
US Patent 10,572,800, 2020
262020
A case for a flexible scalar unit in SIMT architecture
Y Yang, P Xiang, M Mantor, N Rubin, L Hsu, Q Dong, H Zhou
2014 IEEE 28th International Parallel and Distributed Processing Symposium …, 2014
232014
The system can't perform the operation now. Try again later.
Articles 1–20