Selected Publications

Conference Papers
 
Reducing Memory Ordering Overheads in Software Transactional Memory
CGO 2009. (Michael Spear, Maged Michael, Michael Scott, and Peng Wu)
 
Compiler-driven Dependence Profiling To Guide Program Parallelization
LCPC 2008. (Peng Wu, Arun Kejariwal, and Calin Cascaval)

Optimizing Data Permutations for SIMD Devices
(slides)
PLDI 2006. (Gang Ren, Peng Wu, and David Padua)

Optimizing Compiler for the CELL Processor
(slides)
PACT 2005. (A. Eichenberger et al)
 
An Empirical Study On the Vectorization of Multimedia Applications for Multimedia Extensions
IPDPS 2005. (Gang Ren, Peng Wu, and David Padua)
 
An Integrated Simdization Framework Using Virtual Vectors (slides)
ICS 2005. (Peng Wu, Alex. Eichenberger, Amy Wang, and Peng Zhao)
 
Efficient SIMD Code Generation for Runtime Alignment and Length Conversion (slides)
CGO 2005. (Peng Wu, Alex Eichenberger, Amy Wang)
 
Vectorization for SIMD architectures with alignment constraints (slides)
PLDI 2004. (Alex Eichenberger, Peng Wu, and Kevin O'Brien)
 
A Comparison of Empirical and Model-driven Optimization
PLDI 2003. (Kamen Yotov et al)
 
Instance-wise Points-to Analysis for Loop-based Dependence Testing
ICS 2002. (Peng Wu, Paul Feautrier, David Padua, and Zehra Sura)
 
Monotonic Evolution: an Alternative to Induction Variable Substitution for Dependence Analysis
ICS 2001. (Peng Wu, Albert Cohen and Jay Hoelfinger)
 
Induction Variable Analysis without Idiom Recognition: Beyond Monotonicity
LCPC 2001. (Peng Wu, Albert Cohen and David Padua)
 
Containers on the Parallelization of General-purpose Java Programs
PACT 1999. (Peng Wu and David Padua)
 
Journal Articles
Software Transactional Memory: Why is it only a research toy?
Journal of Communications of ACM, Nov, 2008.
Calin Cascaval, Colin Blundell, Maged Michael, Trey Cain, and Peng Wu
 
Compiler and Runtime Techniques for Software Transactional Memory Optimization
Journal of Concurrency and Computation: Practice and Experience, July, 2008.
Peng Wu, Maged Michael, Christoph von Praun, Takuya Nakaike, Rajesh Bordawekar, Harold Cain, Calin Cascaval, Siddhartha Chatterjee, Stefanie Chiras, Rui Hou, Mark Mergen, Xiaowei Shen, Michael Spear, Hua Yong Wang, and Kun Wang
Using advanced compiler technology to exploit the performance of the Cell Broadband Engine (TM) architecture (link)
IBM System Journal, Vol 45, Num 1, 2006.
T. Chen, A. E. Eichenberger, K. K. O'Brien, K. O'Brien, P. H. Oden, D. A. Prener, A. D. Shepherd, J. C. Shepherd, B. So, Z. Sura, A. K. Wang, P. Wu, T. Zhang, P. Zhao, M. Gschwind, R. Archambault, Y. Gao, and R. Koo
 
Design and Exploitation of a High-Performance SIMD Floating Point Unit for BlueGene/L (link)
IBM System Journal, Vol 49, Num 2/3, Pages 377, 2005
S. Chatterjee, L. R. Bachega, P. Bergner, K. A. Dockser, J. A. Gunnels, M. Gupta, F. G. Gustavson, C. A. Lapkowski, G. K. Liu, M. Mendell, R. Nair, C. D. Wait, T. J. C. Ward, P. Wu
 
The NINJA Project: Making Java Work for High Performance Numerical Computing
Communications of the ACM, Vol. 44, No. 10, Pages 102-109, Oct 2001.
Jose Moreira, Samuel Midkiff, Manish Gupta, Pedra Artigas, Peng Wu, and Gheorghe Almasi

PLDI is the Conference on Programming Language Design and Implementation; IPDPS is International Parallel and Distributed Processing Symposium; ICS is the International Conference on Supercomputing; PACT is the International Conference on Parallel Architectures and Compilation Techniques; CGO is the International Symposium on Code Generation and Optimization; LCPC is the International Workshop on Lanuages and Compilers for Parallel Processing.

IEEE - Copyright © 1990-2001 by IEEE. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit. To copy otherwise, to republish, to post on servers, or to redistribute to lists, requires prior specific permission and/or a fee.

ACM - Copyright © 1990-2001 by Association for Computing Machinery, Inc. Permission to make digital or hard copies of part of all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage. To copy otherwise, to republish, to post on servers, or to redistribute to lists, requires prior specific permission and/or a fee.