Selected Publications

Conference Papers
 

Optimizing Data Permutations for SIMD Devices
(slides)
PLDI 2006. (Gang Ren, Peng Wu, and David Padua)

Optimizing Compiler for the CELL Processor
(slides)
PACT 2005. (A. Eichenberger et al)
 
An Empirical Study On the Vectorization of Multimedia Applications for Multimedia Extensions
IPDPS 2005. (Gang Ren, Peng Wu, and David Padua)
 
An Integrated Simdization Framework Using Virtual Vectors (slides)
ICS 2005. (Peng Wu, Alex. Eichenberger, Amy Wang, and Peng Zhao)
 
Efficient SIMD Code Generation for Runtime Alignment and Length Conversion (slides)
CGO 2005. (Peng Wu, Alex Eichenberger, Amy Wang)
 
Vectorization for SIMD architectures with alignment constraints (slides)
PLDI 2004. (Alex Eichenberger, Peng Wu, and Kevin O'Brien)
 
A Comparison of Empirical and Model-driven Optimization
PLDI 2003. (Kamen Yotov et al)
 
Instance-wise Points-to Analysis for Loop-based Dependence Testing
ICS 2002. (Peng Wu, Paul Feautrier, David Padua, and Zehra Sura)
 
Monotonic Evolution: an Alternative to Induction Variable Substitution for Dependence Analysis
ICS 2001. (Peng Wu, Albert Cohen and Jay Hoelfinger)
 
Induction Variable Analysis without Idiom Recognition: Beyond Monotonicity
LCPC 2001. (Peng Wu, Albert Cohen and David Padua)
 
Containers on the Parallelization of General-purpose Java Programs
PACT 1999. (Peng Wu and David Padua)
 
Journal Articles
 
Using advanced compiler technology to exploit the performance of the Cell Broadband Engine (TM) architecture (link)
IBM System Journal, Vol 45, Num 1, 2006.
T. Chen, A. E. Eichenberger, K. K. O'Brien, K. O'Brien, P. H. Oden, D. A. Prener, A. D. Shepherd, J. C. Shepherd, B. So, Z. Sura, A. K. Wang, P. Wu, T. Zhang, P. Zhao, M. Gschwind, R. Archambault, Y. Gao, and R. Koo
 
Design and Exploitation of a High-Performance SIMD Floating Point Unit for BlueGene/L (link)
IBM System Journal, Vol 49, Num 2/3, Pages 377, 2005
S. Chatterjee, L. R. Bachega, P. Bergner, K. A. Dockser, J. A. Gunnels, M. Gupta, F. G. Gustavson, C. A. Lapkowski, G. K. Liu, M. Mendell, R. Nair, C. D. Wait, T. J. C. Ward, P. Wu
 
The NINJA Project: Making Java Work for High Performance Numerical Computing
Communications of the ACM, Vol. 44, No. 10, Pages 102-109, Oct 2001.
Jose Moreira, Samuel Midkiff, Manish Gupta, Pedra Artigas, Peng Wu, and Gheorghe Almasi
 

PLDI is the Conference on Programming Language Design and Implementation; IPDPS is International Parallel and Distributed Processing Symposium; ICS is the International Conference on Supercomputing; PACT is the International Conference on Parallel Architectures and Compilation Techniques; CGO is the International Symposium on Code Generation and Optimization; LCPC is the International Workshop on Lanuages and Compilers for Parallel Processing.

IEEE - Copyright © 1990-2001 by IEEE. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit. To copy otherwise, to republish, to post on servers, or to redistribute to lists, requires prior specific permission and/or a fee.

ACM - Copyright © 1990-2001 by Association for Computing Machinery, Inc. Permission to make digital or hard copies of part of all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage. To copy otherwise, to republish, to post on servers, or to redistribute to lists, requires prior specific permission and/or a fee.