*Domain Decomposition: Parallel Multilevel Methods for
Elliptic Partial Differential Equations*, B. F. Smith, P. E. Bjørstad, and **W. D. Gropp**, Cambridge University Press, 1996.
BibTeX

*MPI - The Complete Reference:
Volume 2, The MPI-2 Extensions*, **William Gropp**, Steven Huss-Lederman, Andrew Lumsdaine, **Ewing Lusk**, Bill Nitzberg, William Saphir, and Marc Snir, MIT Press, 1998.
BibTeX

*Using MPI: Portable Parallel Programming with the Message
Passing Interface, \rm 2nd edition*, **William Gropp**, **Ewing Lusk**, and Anthony Skjellum, MIT Press, 1999.
BibTeX

*Using MPI-2: Advanced Features of the Message-Passing Interface*, **William Gropp**, **Ewing Lusk**, and **Rajeev Thakur**, MIT Press, 1999.
BibTeX

*Sourcebook of Parallel Computing*, Jack Dongarra, Ian Foster, Geoffrey Fox, William
Gropp, Ken Kennedy, Linda Torczon, and Andy White, Morgan Kaufmann, 2003.
BibTeX

*Beowulf Cluster Computing with Linux*, **William Gropp**, **Ewing Lusk**, and Thomas Sterling, MIT Press, 2003.
BibTeX

*OpenMP in the Petascale Era – 7th International Workshop
on OpenMP, IWOMP 2011, Chicago, IL, USA, June 13-15, 2011.
Proceedings*, 2011.
BibTeX

*Using Advanced MPI: Modern Features of the
Message-Passing Interface*, **William Gropp**, Torsten Hoefler, **Rajeev Thakur**, and **Ewing Lusk**, MIT Press, 2014.
BibTeX

*Using MPI: Portable Parallel Programming with the
Message-Passing Interface, \rm 3rd edition*, **William Gropp**, **Ewing Lusk**, and Anthony Skjellum, MIT Press, 2014.
BibTeX

Brett Bode, Michelle Butler, Thom Dunning, Torsten Hoefler, William Kramer, **William Gropp**, and Wen-mei Hwu, *The Blue Waters Super-System for Super-Science*, Taylor and Francis, 2013, in *Contemporary High Performance Computing: From Petascale Toward Exascale*.
BibTeX

**William Gropp**, *Parallel Computer Architectures*, Morgan Kaufmann, 2003, in *Sourcebook of Parallel Computing*.
BibTeX

Ian Foster, **William Gropp**, and Carl Kesselman, *Message Passing and Threads*, Morgan Kaufmann, 2003, in *Sourcebook of Parallel Computing*.
BibTeX

**Rajeev Thakur** and **William Gropp**, *Parallel I/O*, Morgan Kaufmann, 2003, in *Sourcebook of Parallel Computing*.
BibTeX

**William Gropp**, *The 2-D Poisson Problem*, Morgan Kaufmann, 2003, in *Sourcebook of Parallel Computing*.
BibTeX

Satish Balay, **William Gropp**, Lois Curfman
McInnes, and Barry F. Smith, *Software for the Scalable Solution of Partial
Differential Equations*, Morgan Kaufmann, 2003, in *Sourcebook of Parallel Computing*.
BibTeX

**William Gropp**, *So You Want to Use a Cluster*, MIT Press, 2003, in *Beowulf Cluster Computing with Linux*.
BibTeX

**Ewing Lusk**, **William Gropp**, and Ralph Butler, *An Introduction to Writing Parallel Programs*, MIT Press, 2003, in *Beowulf Cluster Computing with Linux*.
BibTeX

**William Gropp** and **Ewing Lusk**, *Parallel Programming with MPI*, MIT Press, 2003, in *Beowulf Cluster Computing with Linux*.
BibTeX

**William Gropp** and **Ewing Lusk**, *Advanced Topics in MPI Programming*, MIT Press, 2003, in *Beowulf Cluster Computing with Linux*.
BibTeX

**Rajeev Thakur**, **William Gropp**, and **Ewing Lusk**, *ADIO: A Framework for High-Performance, Portable
Parallel I/O*, MIT Press, 2004, in *Scalable Input/Output*.
BibTeX

**William D. Gropp**, *Issues in Accurate and Reliable Use of Parallel
Computing in Numerical Programs*, SIAM, 2005, in *Accuracy and Reliability in Scientific
Computing*.
BibTeX

Ricky A. Kendall, Masha Sosonkina, William
D. Gropp, Robert W. Numrich, and Thomas Sterling, *Parallel Programming Models Applicable to Cluster
Computing and Beyond*, Springer, 2006, in *Numerical Solution of Partial Differential Equations
on Parallel Computers*.
BibTeX

**William D. Gropp** and Andrew Lumsdaine, *Parallel Tools and Environments: A Survey*, SIAM, 2006, in *Parallel Processing for Scientific Computing*.
BibTeX

Boyana Norris, Albert Hartono, and **William Gropp**, *Annotations for Productivity and Performance Portability*, Chapman & Hall / CRC Press, Taylor and Francis Group, Preprint ANL/MCS-P1392-0107, 2007, in *Petascale Computing: Algorithms and Applications*.
BibTeX

*A Science-Based Case For Large-Scale Simulation, Volume 2*, David Keyes, Philip Colella, Thom H. Dunning, and **William D. Gropp**, DRAFT, Office of Science, U.S. Department of Energy, SEP, 2004.
BibTeX

*Workshop on High-Productivity Programming Languages
and Models*, Hans P. Zima, Report of the workshop, 2004.
BibTeX

*International Workshop on Advanced Computational
Materials Science: Application to Fusion and
Generation-IV Fission Reactors*, Also ORNL/TM-2004/132, 2004.
BibTeX

*Workshop on Simulation and Modeling for Advanced Nuclear Energy Systems*, Phillip Finck, David Keyes, and Rick Stevens, CoAuthored Section 3.4, Software Tools and
Environments, with Robert Armstrong. Available as
www.mcs.anl.gov/anes/SMANES/gnep06-final.pdf., 2006.
BibTeX

*Future Directions for NSF Advanced Computing Infrastructure to Support U.S. Science and Engineering in 2017–2020: Interim Report*, National Research Council, The National Academies Press, 2014.
BibTeX

*Future Directions for NSF Advanced Computing Infrastructure to Support U.S. Science and Engineering in 2017–2020*, National Academies of Sciences, Engineering, and Medicine, The National Academies Press, 2016.
BibTeX

*Solving PDEs on loosely-coupled parallel
processors*, **William D. Gropp**, Parallel Computing, 5, 1-2, 165–173, Proceedings of the international conference on vector
and parallel computing—issues in applied research and
development (Loen, 1986), 1987.
BibTeX

*A comparison of domain decomposition techniques for
elliptic partial differential equations and their
parallel implementation*, David E. Keyes and **William D. Gropp**, SIAM Journal on Scientific and Statistical Computing, 8, 2, S166–S202, Reprinted in Selected Papers from the Second
Conference on Parallel Processing for Scientific
Computing (C. W. Gear & R. G. Voigt, eds., SIAM, 1987), 1987.
BibTeX

*Local uniform mesh refinement with moving grids*, **William D. Gropp**, SIAM Journal on Scientific and Statistical Computing, 8, 3, 292–304, 1987.
BibTeX

*Local Uniform Mesh Refinement on Loosely-Coupled
Parallel Processors*, **W. Gropp**, I. J. Comp. Math. Appl., 15, 375–389, 1988.
BibTeX

*Complexity of parallel implementation of domain
decomposition techniques for elliptic partial
differential equations*, **William D. Gropp** and David E. Keyes, SIAM Journal on Scientific and Statistical Computing, 9, 2, 312–326, 1988.
BibTeX

*Recursive Mesh Refinement on Hypercubes*, **William D. Gropp** and I. C. F. Ipsen, Nordisk Tidskr. Informationsbehandling (BIT), 29, 186–211, 1989.
BibTeX

*Domain Decomposition on Parallel Computers*, **William D. Gropp** and David E. Keyes, Impact Comput. Sci. Eng., 1, 421–439, 1989.
BibTeX

*Domain decomposition techniques for the parallel
solution of nonsymmetric systems of elliptic boundary
value problems*, David E. Keyes and **William D. Gropp**, Applied Numerical Mathematics: Transactions of IMACS, 6, 4, 281–301, 1990.
BibTeX

*Krylov Methods Preconditioned with Incompletely
Factored Matrices on the CM-2*, H. Berryman, J. Saltz, **W. Gropp**, and R.
Mirchandaney, Journal of Parallel and Distributed Computing, 8, 2, 186–190, 1990.
BibTeX

*A Parallel Version of the Fast Multipole Method*, Leslie Greengard and **William D. Gropp**, Computers and Mathematics with Applications, 20, 63–71, 1990.
BibTeX

*Computational Fluid Dynamics on Parallel Processors*, **William D. Gropp** and Edward Smith, Computers and Fluids, 18, 289–304, 1990.
BibTeX

*Convergence rate estimate for a domain decomposition
method*, Xiao-Chuan Cai, **William D. Gropp**, and David E.
Keyes, Numerische Mathematik, 61, 2, 153–169, 1992.
BibTeX

*Domain decomposition with local mesh refinement*, **W. D. Gropp** and D. E. Keyes, SIAM J. Sci. Stat. Comput., 13, 967–993, 1992.
BibTeX

*Parallel Performance of Domain-Decomposed
Preconditioned Krylov Methods for PDEs with Locally
Uniform Refinement*, **W. D. Gropp** and D. E. Keyes, SIAM Journal on Scientific and Statistical
Computing, 13, 128–145, 1992.
BibTeX

*Domain decomposition methods in computational fluid
dynamics*, **W. D. Gropp** and D. E. Keyes, Int. J. Numer. Meth. Fluids, 14, 147–165, 1992.
BibTeX

*The Parallel Scalability of the Spectral Transform
Method*, I. Foster, **W. Gropp**, and R. Stevens, Monthly Weather Review, 120, 5, 835–850, 1992.
BibTeX

*Parallel Programming Tools for Distributed Memory Computers*, **William Gropp**, High Performance Computing: Grand Challenges in
Computer Simulation, Adrian Tentner, 166–169, 1993.
BibTeX

*A Comparison of Some Domain Decomposition and ILU
Preconditioned Iterative Methods for Nonsymmetric
Elliptic Problems*, Xiao-Chuan Cai,

*MPI: A Message Passing Interface Standard*, Message Passing Interface Forum, International Journal of Supercomputer Applications, 8, 3/4, 159–416, 1994.
BibTeX

*Solution of dense systems of linear equations
arising from integral equation formulations*, K. Forsman, **W. Gropp**, L. Kettunen, D. Levine, and J. Salonen, IEEE Antennas and Propagation Magazine, 96–100, 1995.
BibTeX

*Experiences with the IBM SP1*, **W. D. Gropp** and **E. Lusk**, IBM Systems Journal, 34, 2, 249–262, 1995.
BibTeX

*Early applications in the Message-Passing Interface
(MPI)*, Anthony Skjellum, **Ewing Lusk**, and **William Gropp**, International Journal of Supercomputer Applications
and High Performance Computing, 9, 2, 79–94, 1995.
BibTeX

*Numerical Simulation of Vortex Dynamics in
High- T_c Superconductors*, William D. Gropp, Hans Kaper, G. Leaf, D. Levine, V. Vinokur, and M. Palumbo, J. Comp. Physics, 123, 254–266, 1996.
BibTeX

*A high-performance, portable implementation of the
MPI message passing interface standard*, **W. Gropp**, **E. Lusk**, N. Doss, and A. Skjellum, Parallel Computing, 22, 6, 789–828, 1996.
BibTeX

*An Experimental Evaluation of the Parallel I/O
Systems of the IBM SP and Intel Paragon Using a
Production Application*, **R. Thakur**, **W. Gropp**, and **E. Lusk**, Lecture Notes in Computer Science, 1127, 1996.
BibTeX

*The Design of Data-Structure-Neutral Libraries for
the Iterative Solution of Sparse Linear Systems*, Barry Smith and **William Gropp**, Scientific Programming, 5, 329–336, 1996.
BibTeX

*A high-performance MPI implementation on a
shared-memory vector supercomputer*, **W. Gropp** and **E. Lusk**, Parallel Computing, 22, 11, 1513–1526, 1997.
BibTeX

*Sowing MPICH: A Case Study in the Dissemination of
a Portable Environment for Parallel Scientific
Computing*, **W. Gropp** and **E. Lusk**, The International Journal of Supercomputer
Applications and High Performance Computing, 11, 2, 103–114, 1997.
BibTeX

*MPI2: A Message Passing Interface Standard*, Message Passing Interface Forum, High Performance Computing Applications, 12, 1–2, 1–299, 1998.
BibTeX

*I/O in Parallel Applications: The Weakest Link*, **Rajeev Thakur**, **Ewing Lusk**, and **William Gropp**, The International Journal of High Performance Computer Applications, 12, 4, part 2, 389–395, 1998.
BibTeX

*Parallel Newton-Krylov-Schwarz Algorithms for the
Transonic Full Potential Equation*, X-C Cai, **William D. Gropp**, David E. Keyes, R. G. Melvin, and D. P. Young, SIAM Journal of Scientific Computing, 19, 246–265, Also ICASE report TR 96-39., 1998.
BibTeX

*A Wide-Area
Implementation of the Message Passing Interface*, I. Foster, J. Geisler, **W. Gropp**, N. Karonis, **E. Lusk**, G. Thiruvathukal, and S. Tuecke, Parallel Computing, 24, 12–13, 1735–1749, 1998.
BibTeX

*Parallel Computation of Three-dimensional Nonlinear
Magnetostatic Problems*, David Levine, **William Gropp**, Kimmo Forsman, and Lauri
Kettunen, Concurrency Practice and Experience, 11, 2, 109–120, 1999.
BibTeX

*Toward Scalable Performance Visualization with Jumpshot*, Omer Zaki, **Ewing Lusk**, **William Gropp**, and Deborah
Swider, High Performance Computing Applications, 13, 2, 277–288, 1999.
BibTeX

*Globalized Newton-Krylov-Schwarz Algorithms
and Software for Parallel Implicit CFD*, **William Gropp**, David E. Keyes, Lois C. McInnes, and M. D. Tidriri, High Performance Computing Applications, 14, 2, 102–136, 2000.
BibTeX

*High Performance Parallel Implicit CFD*, **W. D. Gropp**, D. K. Kaushik, D. E. Keyes, and B. F. Smith, Parallel Computing, 27, 4, 337–362, 2001.
BibTeX

*Components and Interfaces of a Process Management System for
Parallel Programs*, Ralph Butler, **William Gropp**, and **Ewing Lusk**, Parallel Computing, 27, 11, 1417–1429, 2001.
BibTeX

*Optimizing Noncontiguous Accesses in MPI-IO*, **Rajeev Thakur**, **William Gropp**, and **Ewing Lusk**, Parallel Computing, 28, 1, 83–105, 2002.
BibTeX

*Special Issue: Cluster 2001*, Mark Baker, Daniel Katz, **William Gropp**, and Thomas
Sterling, Concurrency and Computation: Practice and Experience, 15, 7–8, 623–624, 2003.
BibTeX

*Fault Tolerance in MPI Programs*, **William D. Gropp** and **Ewing Lusk**, International Journal of High Performance Computer Applications, 18, 3, 363–372, 2004.
BibTeX

*Evaluating structured I/O methods for parallel file systems*, Avery Ching, Alok N. Choudhary, Wei-keng Liao, Robert B. Ross, and **William Gropp**, International Journal of High Performance Computing and Networking, 2, 2/3/4, 133–145, 2004.
BibTeX

*Optimization of Collective Communication Operations
in MPICH*, **Rajeev Thakur**, Rolf Rabenseifner, and **William Gropp**, International Journal of High Performance Computer Applications, 19, 1, 49–66, 2005.
BibTeX

*Design and Implementation of Message-Passing Services
for the Blue Gene/L Supercomputer*, George Almási, Charles Archer, Jose
G. Castaños, J. A. Gunnels, C. Chris Erway, Philip
Heidelberger, Xavier Martorell, Jose
E. Moreira, Kurt Pinnow, Joe Ratterman, Burkhard Steinmacher-Burow, **William Gropp**, and Brian Toonen, IBM Journal of Research and Development, 49, 2/3, 393–406, Available at
http://www.research.ibm.com/journal/rd49-23.html, 2005.
BibTeX

*Optimizing the Synchronization Operations in MPI
One-Sided Communication*, **Rajeev Thakur**, **William Gropp**, and Brian Toonen, High Performance Computing Applications, 19, 2, 119–128, 2005.
BibTeX

*A Portable Method for Finding User Errors in the
Usage of MPI Collective Operations*, Christopher Falzone, Anthony Chan, **Ewing Lusk**, and **William Gropp**, International Journal of High Performance Computing
Applications, 21, 2, 155–165, 2007.
BibTeX

*Electron Injection by a Nanowire in the Bubble Regime*, Baifei Shen, Yuelin Li, Karoly Nemeth, Hairong Shang, Yong-chul Chae, Robert Soliday, Robert Crowell, Edward Frank, William
Gropp, and John Cary, Physics of Plasmas, 14, 2007.
BibTeX

*An Efficient Format for Nearly Constant-Time Access to
Arbitrary Time Intervals in Large Trace Files*, Anthony Chan, **William Gropp**, and **Ewing Lusk**, Scientific Programming, 16, 2, 155–165, 2008.
BibTeX

*Test Suite for Evaluating Performance of multithreaded
MPI communication*, **Rajeev Thakur** and
**William Gropp**, Parallel Computing, 35, 608–617, 2009.
BibTeX

*Toward Exascale Resilience*, Cappello, Franck, Geist, Al, Gropp, Bill, Kale,
Laxmikant, Kramer, Bill, and Snir, Marc, International Journal of High Performance Computing Applications, 23, 4, 374–388, 2009.
BibTeX

*On the Need for a Consortium of Capability Centers*, Gropp, William and Snir, Marc, International Journal of High Performance Computing Applications, 23, 4, 413–420, 2009.
BibTeX

*Toward message passing for a million processes: characterizing
MPI on a massive scale Blue Gene/P*, Pavan Balaji, Anthony Chan, **Rajeev Thakur**, **William Gropp**, and Ewing L. Lusk, Computer Science - R&D, 24, 1-2, 11–19, 2009.
BibTeX

*Software for Petascale Computing Systems*, Gropp, William D., IEEE Computing in Science and Engineering, 11, 5, 17–21, 2009.
BibTeX

*Self-Consistent MPI Performance Guidelines*, Jesper Larsson Träff, **William D. Gropp**, and **Rajeev Thakur**, IEEE Transactions on Parallel and Distributed Systems, 21, 5, 698–709, 2009.
BibTeX

*Formal methods applied to high-performance computing
software design: a case study of MPI one-sided
communication-based locking*, Salman Pervez, Ganesh Gopalakrishnan, Robert
M. Kirby, **Rajeev Thakur**, and **William Gropp**, Software Practice and Experience, 40, 1, 23–42, 2010.
BibTeX

*Fine-Grained Multithreading Support for Hybrid Threaded MPI
Programming*, Balaji, Pavan, Buntinas, Darius, Goodell, David, Gropp, William, and Thakur, Rajeev, International Journal of High Performance Computing Applications, 24, 1, 49–57, 2010.
BibTeX

*Teaching parallel programming: a roundtable discussion*, Mellor-Crummey, J., Gropp, W., and Herlihy, M., XRDS: Crossroads, The ACM Magazine for Students, 17, 1, 28–30, 2010.
BibTeX

*The Importance of Non-Data-Communication Overheads in MPI*, Balaji, Pavan, Chan, Anthony, Gropp, William, Thakur, Rajeev, and Lusk, Ewing, International Journal of High Performance Computing Applications, 24, 1, 5–15, 2010.
BibTeX

*A Pipelined Algorithm for Large, Irregular All-Gather Problems*, Träff, Jesper Larsson, Ripke, Andreas, Siebert,
Christian, Balaji, Pavan, Thakur, Rajeev, and Gropp, William, International Journal of High Performance Computing Applications, 24, 1, 58–68, 2010.
BibTeX

*Optimizing Sparse Data Structures for Matrix-vector Multiply*, Dahai Guo and **William Gropp**, International Journal of High Performance Computing
Applications, 25, 1, 115–131, 2011.
BibTeX

*The International Exascale Software Project Roadmap*, Dongarra, Jack, Beckman, Pete, Moore, Terry, Aerts,
Patrick, Aloisio, Giovanni, Andre, Jean-Claude, Barkai, David, Berthou, Jean-Yves, Boku,
Taisuke, Braunschweig, Bertrand, Cappello,
Franck, Chapman, Barbara, Chi, Xuebin, Choudhary, Alok, Dosanjh, Sudip, Dunning, Thom, Fiore, Sandro, Geist, Al, Gropp, William, Harrison, Robert, Hereld, Mark, Heroux,
Michael, Hoisie, Adolfy, Hotta, Koh, Jin, Zhong, Ishikawa, Yutaka, Johnson, Fred, Kale, Sanjay, Kenway, Richard, Keyes, David, Kramer, Bill, Labarta, Jesus, Lichnewsky,
Alain, Lippert, Thomas, Lucas, Bob, Maccabe, Barney, Matsuoka, Satoshi, Messina,
Paul, Michielse, Peter, Mohr, Bernd, Mueller, Matthias S., Nagel, Wolfgang E., Nakashima, Hiroshi, Papka, Michael E, Reed,
Dan, Sato, Mitsuhisa, Seidel, Ed, Shalf,
John, Skinner, David, Snir, Marc, Sterling,
Thomas, Stevens, Rick, Streitz, Fred, Sugar, Bob, Sumimoto, Shinji, Tang, William, Taylor, John, Thakur, Rajeev, Trefethen,
Anne, Valero, Mateo, van der Steen, Aad, Vetter, Jeffrey, Williams, Peg, Wisniewski,
Robert, and Yelick, Kathy, International Journal of High Performance Computing
Applications, 25, 1, 3–60, 2011.
BibTeX

*EcoG: A Power-Efficient GPU Cluster Architecture for
Scientific Computing*, Showerman, M., Enos, J., Steffen, C., Treichler,
S., Gropp, W., and Hwu, W.-m.W., Computing in Science Engineering, 13, 2, 83–87, 2011.
BibTeX

*MPI on Millions of Cores*, Pavan Balaji, Darius Buntinas, David Goodell, **William Gropp**, Torsten Hoefler, Sameer
Kumar, **Ewing Lusk**, **Rajeev Thakur**, and Jesper
Larsson Träff, Parallel Processing Letters, 21, 1, 45–60, 2011.
BibTeX

*Formal analysis of MPI-based parallel programs*, Ganesh Gopalakrishnan, Robert M. Kirby, Stephen F. Siegel, **Rajeev Thakur**, **William Gropp**, Ewing L. Lusk, Bronis R. de Supinski, Martin Schulz, and Greg Bronevetsky, Commun. ACM, 54, 12, 82–91, 2011.
BibTeX

*Best algorithms + best computers = powerful match*, Gropp, William, Commun. ACM, 55, 5, 100–100, 2012.
BibTeX

*Applications of the streamed storage format for sparse matrix operations*, Guo, D. and Gropp, W., International Journal of High Performance Computing Applications, 2013.
BibTeX

*Multiphysics simulations: Challenges and opportunities*, Keyes, David E, McInnes, Lois C, Woodward, Carol, Gropp, William, Myra, Eric, Pernice, Michael, Bell, John, Brown, Jed, Clo, Alain, Connors, Jeffrey, Constantinescu, Emil, Estep,
Don, Evans, Kate, Farhat, Charbel, Hakim,
Ammar, Hammond, Glenn, Hansen, Glen, Hill,
Judith, Isaac, Tobin, Jiao, Xiangmin, Jordan, Kirk, Kaushik, Dinesh, Kaxiras,
Efthimios, Koniges, Alice, Lee, Kihwan, Lott, Aaron, Lu, Qiming, Magerlein, John, Maxwell, Reed, McCourt, Michael, Mehl, Miriam, Pawlowski, Roger, Randles, Amanda P, Reynolds, Daniel, Rivière, Beatrice, Rüde,
Ulrich, Scheibe, Tim, Shadid, John, Sheehan, Brendan, Shephard, Mark, Siegel,
Andrew, Smith, Barry, Tang, Xianzhu, Wilson, Cian, and Wohlmuth, Barbara, International Journal of High Performance Computing Applications, 27, 1, 4–83, 2013.
BibTeX

*MPI + MPI: a new hybrid approach to parallel programming
with MPI plus shared memory*, T. Hoefler, J. Dinan, D. Buntinas, P. Balaji, B. Barrett, R. Brightwell, **W. Gropp**, V. Kale, and **R. Thakur**, Journal of Computing, doi: 10.1007/s00607-013-0324-2, 2013.
BibTeX

*Applications of the streamed storage format for sparse matrix operations*, Guo, Dahai and Gropp, William, International Journal of High Performance Computing Applications, 28, 1, 3–12, 2014.
BibTeX

*Special issue: SC13 - The International Conference for High Performance Computing, Networking, Storage and Analysis*, **William Gropp** and Satoshi Matsuoka, Scientific Programming, 57–58, 2014.
BibTeX

*Enabling the environmentally clean air transportation of the
future: a vision of computational fluid dynamics in
2030*, Slotnick, Jeffrey P, Khodadoust, Abdollah, Alonso,
Juan J, Darmofal, David L, Gropp, William D, Lurie, Elizabeth A, Mavriplis, Dimitri J, and Venkatakrishnan, Venkat, Philosophical Transactions of the Royal Society A:
Mathematical, Physical and Engineering Sciences, 372, 2022, 20130317, @Article{slotnick2014enabling, 2014.
BibTeX

*Collective Algorithms for Multiported Torus Networks*, Sack, Paul and Gropp, William, ACM Trans. Parallel Comput., 1, 2, 12:1–12:33, 2015.
BibTeX

*Remote Memory Access Programming in MPI-3*, Torsten Hoefler, James Dinan, **Rajeev Thakur**, Brian Barrett, Pavan Balaji, **William Gropp**, and Keith D. Underwood, ACM Trans. Parallel Comput., 2, 2, 9:1–9:26, 2015.
BibTeX

*A hybrid format for better performance of sparse
matrix-vector multiplication on a GPU*, Guo, Dahai, Gropp, William, and Olson, Luke N, International Journal of High Performance Computing Applications, 2015.
BibTeX

*Reducing Parallel Communication in Algebraic Multigrid through Sparsification*, Amanda Bienz, Robert D. Falgout, **William Gropp**, Luke N. Olson, and Jacob B. Schroder, SIAM Journal on Scientific Computing, 38, 5, S332-S357, 2016.
BibTeX

*An implementation and evaluation of the MPI 3.0 one-sided communication interface*, Dinan, James, Balaji, Pavan, Buntinas, Darius, Goodell, David, Gropp, William, and Thakur, Rajeev, Concurrency and Computation: Practice and Experience, 28, 17, 4385–4404, cpe.3758, 2016.
BibTeX

*Rethinking key-value store for parallel I/O optimization*, Anthony Kougkas, Hassan Eslami, Xian-He Sun, **Rajeev Thakur**, and **William Gropp**, The International Journal of High Performance Computing Applications, 31, 4, 335–356, 2017.
BibTeX

*Eliminating contention bottlenecks in multithreaded
MPI*, Hoang-Vu Dang, Marc Snir, and **William Gropp**, Parallel Computing, 69, 1–23, 2017.
BibTeX

*Numerical Linear Algebra on Workstations*, **W. D. Gropp**, Proc. Army Research Office Workshop on Microcomputers
in Scientific Computing, 1985.
BibTeX

*A System for Numerical Linear Algebra*, **William D. Gropp**, New Computing Environments: Microcomputers in
Large-Scale Computing, A. Wouk, 26–38, 1987.
BibTeX

*Local Uniform Mesh Refinement on Parallel Processors*, **William D. Gropp**, Large Scale Scientific Computing, P. Deuflhard and B.~Enquist, 1987.
BibTeX

*Adaptive Methods for Hyperbolic Problems on Local
Memory Parallel Processors*, **William D. Gropp**, Numerical Algorithms for Modern Computer Architectures, M. H. Schultz, 77–84, 1988.
BibTeX

*Computational Fluid Dynamics on Parallel Processors*, **William Gropp** and Edward Smith, 1st National Fluid Dynamics Congress, Part 1, 612–619, 1988.
BibTeX

*A Highly Parallel Method for an Underwater Acoustics
Problem*, **William D. Gropp** and Martin Schultz, Proceedings of the Fourth International Conference
on Supercomputing, Santa Clara, California, 1989.
BibTeX

*Krylov Methods and the CM/2*, H. S.~Barryman, William D.~Gropp, and J.~Saltz, Proceedings of the Fourth International Conference
on Supercomputing, Santa Clara, California, 1989.
BibTeX

*CLAM: A Programming Language for Interactive
Supercomputing and Visualization*, **William D. Gropp** and David Foulser, Proceedings of the Fourth International Conference
on Supercomputing, Santa Clara, California, 1989.
BibTeX

*Dynamic Grid Manipulation for PDEs on Hypercube
Parallel Processors*, **William D. Gropp**, Parallel Processing and Medium-Scale Multiprocessors, A. Wouk, 192–203, 1989.
BibTeX

*A Parallel Version of the Fast Multipole Method*, Leslie Greengard and **William D. Gropp**, Proceedings of the 3rd Conference on Parallel
Processing for Scientific Computing, Gary Rodrigue, 213–222, 1989.
BibTeX

*A Gray Code Scheme for Local Uniform Mesh Refinement
on Hypercubes*, **W. D. Gropp** and I. C. F. Ipsen, Parallel Processing for Scientific Computing:
Proceedings of the Third SIAM Conference on Parallel
Processing for Scientific Computing, Los Angeles,
California, December 1–4, 1987, Garry Rodrigue, 202–206, 1987.
BibTeX

*Parallel Domain Decomposition with Local Mesh
Refinement*, **William D. Gropp** and David E. Keyes, Proceedings of the 4th Conference on Parallel
Processing for Scientific Computing, Danny C. Sorensen, Jack Dongarra, Paul Messina, and Robert G. Voigt, 295–296, 1989.
BibTeX

**W. D. Gropp** and D. E. Keyes, *Domain decomposition on parallel computers*, SIAM, 1989, in *Domain Decomposition Methods*.
BibTeX

*Domain decomposition techniques for large sparse
nonsymmetric systems arising from elliptic problems
with first–order terms*, D. E. Keyes, **W. D. Gropp**, and A. Ecder, Proceedings of a Symposium on the Solution of Super
Large Problems in Computational Mechanics, J. H. Kane and A. D. Carlson, 1989.
BibTeX

D. E. Keyes and **W. D. Gropp**, *Domain decomposition techniques for nonsymmetric
systems of equations: examples from computational fluid
dynamics*, SIAM, 1989, in *Domain Decomposition Methods*.
BibTeX

*Balanced Divide-and-Conquer Algorithms for the
Fine-Grained Parallel Direct Solution of Dense and
Banded Triangular Linear Systems and their Connection
Machine Implementation*, Z. George Mou, David E. Keyes, and William D.
Gropp, Proceedings of the 4th Conference on Parallel
Processing for Scientific Computing, Danny C. Sorensen, Jack Dongarra, Paul Messina, and Robert G. Voigt, 386–387, 1989.
BibTeX

*CLAM and CLAMShell: An Interactive Front-End for
Parallel Computing and Visualization*, D. E. Foulser and **W. D. Gropp**, Proceedings of the 1990 International Conference on
Parallel Processing. Volume 3: Algorithms and
Architectures, Pen-Chung Yew, 35–43, 1990.
BibTeX

*A domain decomposition method with locally uniform
mesh refinement*, **W. D. Gropp** and D. E. Keyes, Third International Symposium on Domain Decomposition
Methods for Partial Differential Equations, T. F. Chan, R. Glowinski, J. Périaux, and O.
B. Widlund, 115–129, 1990.
BibTeX

**William D. Gropp** and Martin H. Schultz, *High Performance Parabolic Equation Solvers*, Elsevier Science Pub., 1990, in *Computational Acoustics*.
BibTeX

*Parallel Computing and the Solution of Partial
Differential Equations (abstract)*, **William Gropp**, Transactions of the American Nuclear Society, Irene O. Macke, 269, Invited Paper, 1990.
BibTeX

*Visual Artifacts in Boundary Conditions*, **William Gropp**, Proceedings for the ICASE Workshop on Heterogeneous
Boundary Conditions, A. Louise Perkins and Jeffrey S. Scroggs, 1–3, 1991.
BibTeX

*Domain Decomposed Preconditionings for Transport Operators*, Tony F. Chan, **William Gropp**, and David E. Keyes, Proceedings for the ICASE Workshop on Heterogeneous
Boundary Conditions, A. Louise Perkins and Jeffrey S. Scroggs, 12–30, 1991.
BibTeX

*Parallel Scalability of the Spectral Transform
Method*, I. Foster, **W. Gropp**, and R. Stevens, Proceedings of the 5th SIAM Conference on Parallel
Processing for Scientific Computing, Jack Dongarra, Ken Kennedy, Paul Messina, Danny C. Sorensen, and Robert G. Voigt, 307–314, 1991.
BibTeX

*Parallel domain decomposition and the solution of
nonlinear systems of equations*, **W. D. Gropp** and D. E. Keyes, Fourth International Symposium on Domain Decomposition
Methods for Partial Differential Equations, R. Glowinski, Yu. A. Kuznetsov, G. A. Meurant, J. Périaux, and O. B. Widlund, 373–381, 1991.
BibTeX

*Domain-decomposable preconditioners for second–order
upwind discretizations of multicomponent systems*, D. E. Keyes and **W. D. Gropp**, Fourth International Symposium on Domain Decomposition
Methods for Partial Differential Equations, R. Glowinski, Yu. A. Kuznetsov, G. A. Meurant, J. Périaux, and O. B. Widlund, 129–139, 1991.
BibTeX

*A Comparison of Some Domain Decomposition Algorithms
for Nonsymmetric Elliptic Problems*, Xiao-Chuan Cai, **William D. Gropp**, and David E.
Keyes, Fifth International Symposium on Domain Decomposition
Methods for Partial Differential Equations, Tony F. Chan, David E. Keyes, Gérard A.
Meurant, Jeffrey S. Scroggs, and Robert G. Voigt, 1992.
BibTeX

**William D. Gropp** and David E. Keyes, *Domain Decomposition as a Mechanism for Using
Asymptotic Methods*, Kluwer, 1992, in *Asymptotic and Numerical Methods for Partial
Differential Equations with Critical Parameters*.
BibTeX

*Parallel Computing and Domain Decomposition*, **William D. Gropp**, Fifth International Symposium on Domain Decomposition
Methods for Partial Differential Equations, Tony F. Chan, David E. Keyes, Gérard A.
Meurant, Jeffrey S. Scroggs, and Robert G. Voigt, 1992.
BibTeX

**William D. Gropp** and David E. Keyes, *Semi-structured Refinement and Parallel Domain
Decomposition Methods*, MIT Press, 1992, in *Unstructured Scientific Computation on Multiprocessors*.
BibTeX

*Developing Applications for a Heterogeneous
Computing Environment*, Ralph Butler, **William D. Gropp**, and **Ewing Lusk**, Proc. Workshop on Heterogeneous Processing, 77–83, 1993.
BibTeX

*Parallel Programming Tools for Distributed-Memory Computers*, **William Gropp**, Proc. of the 1993 SCS Simulation Multiconference, 1993.
BibTeX

*Parallel Solution of the Three-Dimensional,
Time-Dependent Ginzburg-Landau Equation*, N. Galbreath, **W. Gropp**, D. Gunter, D. Leaf, and D. Levine, Proceedings of the 6th SIAM Conference on Parallel
Processing for Scientific Computing, Linda R. Petzold, Richard F. Sincovec, David E. Keyes, Michael R. Leuze, and Daniel A. Reed, 160–164, 1993.
BibTeX

*Applications-Driven Parallel I/O*, N. Galbreath, **W. Gropp**, and D. Levine, Proceedings of Supercomputing '93, 462–471, Reprinted in the book ``High Performance Storage and Parallel I/O''
(http://www.buyya.com/superstorage/, 2001, pages
539–547), 1993.
BibTeX

*Solutions of TEAM Problem #13 Using Integral
Equations in a Sequential and Parallel Computing
Environment*, L. Kettunen, K. Forsman, D. Levine, and **W. Gropp**, Proceedings of the Miami TEAM Workshop, 1993.
BibTeX

*Parallel implicit methods for aerodynamics*, X.-C. Cai, **W. D. Gropp**, D. E. Keyes, and M. D.
Tidriri, Domain Decomposition Methods in Scientific and
Engineering Computing: Proceedings of the Seventh
International Conference on Domain Decomposition, 465–470, 1994.
BibTeX

*Newton-Krylov-Schwarz Methods in CFD*, X.-C. Cai, **William D. Gropp**, David E. Keyes, and M. D. Tidriri, Proceedings of the International Workshop on
Numerical Methods for the Navier-Stokes Equations, F. Hebeker and R. Rannacher, 17–30, 1994.
BibTeX

*Implicit domain decomposition algorithms for steady,
compressible aerodynamics*, **W. D. Gropp**, D. E. Keyes, and J. S. Mounts, Domain Decomposition Methods in Science and
Engineering: The Sixth International Conference on
Domain Decomposition, 203–213, 1994.
BibTeX

*Scalable, extensible, and portable numerical
libraries*, **W. Gropp** and B. Smith, Proceedings of the Scalable Parallel Libraries
Conference, October 6–8, 1993, Mississippi State,
Mississippi, 87–93, 1994.
BibTeX

*The MPI communication library: its design and a
portable implementation*, **W. Gropp** and **E. Lusk**, Proceedings of the Scalable Parallel Libraries
Conference, October 6–8, 1993, Mississippi State,
Mississippi, 160–165, 1994.
BibTeX

*Scalable Unix Tools on Parallel Processors*, **W. Gropp** and **E. Lusk**, Proceedings of the Scalable High-Performance
Computing Conference, May 23–25, 1994, Knoxville,
Tennessee, 56–62, 1994.
BibTeX

*Experiences with domain decomposition in three
dimensions: overlapping Schwarz methods*, **W. D. Gropp** and B. F. Smith, Domain Decomposition Methods in Science and
Engineering: The Sixth International Conference on
Domain Decomposition, 323–333, 1994.
BibTeX

*Solutions of TEAM Problems 13 and 20 Using a Volume
Integral Formulation*, L. Kettunen, K. Forsman, D. Levine, and William
D. Gropp, Proceedings of Aix-les-Bains TEAM workshop, 1994.
BibTeX

*Computational Electromagnetics and Parallel Dense
Matrix Computations*, Kimmo Forsman, **William Gropp**, Lauri Kettunen, and David Levine, Proceedings of the 27th Conference on Parallel
Processing for Scientific Computing, {Bailey, David H., Bjørstad, Petter E., Gilbert,
John E., Mascagni, Michael V., Schreiber, Robert S.,
Simon, Horst D., Torczon, Virginia J.} and Layne T.
Watson, 225–230, 1995.
BibTeX

**W. D. Gropp**, D. E. Keyes, and M. D. Tidriri, *Parallel implicit solvers for steady, compressible
aerodynamics*, Elsevier Science Publishers B.V. (North–Holland), 1995, in *Parallel Computational Fluid Dynamics*.
BibTeX

*Dynamic process management in an MPI setting*, **W. Gropp** and **E. Lusk**, Proceedings / Seventh IEEE Symposium on Parallel and
Distributed Processing, October 25–28, 1995, San
Antonio, Texas, 530–534, IEEE catalog number 95TB8131., 1995.
BibTeX

*Implementing MPI: the 1994 MPI Implementors'
Workshop*, **W. Gropp** and **E. Lusk**, Proceedings of the 1994 Scalable Parallel Libraries
Conference: October 12–14, 1994, Mississippi State
University, Mississippi, 55–59, 1995.
BibTeX

*MPE graphics: scalable X11 graphics in MPI*, **W. Gropp**, E. Karrels, and **E. Lusk**, Proceedings of the 1994 Scalable Parallel Libraries
Conference: October 12–14, 1994, Mississippi State
University, Mississippi, 49–54, 1995.
BibTeX

*A Taxonomy of Programming Models for Symmetric
Multiprocessors and SMP Clusters*, **William D. Gropp** and Ewing L. Lusk, Programming Models for Massively Parallel Computers, W. K. Giloi, S. Jahnichen, and B. D. Shriver, 2–7, 1995.
BibTeX

*Scalable Libraries for Solving Systems of Nonlinear
Equations and Unconstrained Minimization Problems*, **William D. Gropp**, Lois Curfman McInnes, and Barry Smith, Proceedings of the 1994 Scalable Parallel Libraries
Conference: October 12–14, 1994, Mississippi State
University, Mississippi, 60–67, 1995.
BibTeX

*The MPI Message-Passing Interface Standard: Overview
and Status*, **W. Gropp** and **E. Lusk**, High performance computing: technology, methods, and
applications (Advanced workshop, June 1994, Cetraro,
Italy), Grandinetti et al, 265–270, 1995.
BibTeX

**William D. Gropp** and Barry Smith, *Parallel Domain Decomposition Software*, SIAM, 1995, in *Domain-Based Parallelism and Problem Decomposition
Methods in Computational Science and Engineering*.
BibTeX

*An Abstract-Device Interface for Implementing Portable
Parallel-I/O Interfaces*, **Rajeev Thakur**, **William Gropp**, and **Ewing Lusk**, Proceedings of Frontiers '96: The Sixth Symposium on
the Frontiers of Massively Parallel Computation, 180–187, 1996.
BibTeX

*MPI-2: extending the Message-Passing Interface*, A. Geist, **W. Gropp**, S. Huss-Lederman, A.
Lumsdaine, **E. Lusk**, W. Saphir, T. Skjellum, and M. Snir, Euro-Par '96 parallel processing: second
International Euro-Par Conference, Lyon, France,
August 26–29, 1996: proceedings, Luc Bouge, P. Fraigniaud, A. Mignotte, and Y.
Robert, 128–135, 1996.
BibTeX

*Why we couldn't use numerical libraries for PETSc*, **William D. Gropp**, Proceedings of the IFIP TC2/WG2.5 Working Conference
on the Quality of Numerical Software, Assessment and
Enhancement, Ronald F. Boisvert, 249–254, 1997.
BibTeX

*An Experimental Evaluation of the Parallel I/O
Systems of the IBM SP and Intel Paragon Using a
Production Application*, **Rajeev Thakur**, **William Gropp**, and **Ewing Lusk**, Proceedings of the Third International Conference of
the Austrian Center for Parallel Computation (ACPC), 24–35, 1996.
BibTeX

S. Balay, **W. D. Gropp**, L. C. McInnes, and B. F. Smith, *Efficient Management of Parallelism in
Object-Oriented Numerical Software Libraries*, Birkhauser Press, 1997, in *Modern Software Tools in Scientific Computing*.
BibTeX

**William D. Gropp**, *An Introduction to Performance Debugging For
Parallel Computers*, Kluwer Academic Publishers, 1997, in *Parallel Numerical Algorithms*.
BibTeX

*Optimization environments and the NEOS Server*, **W. Gropp** and Jorge Morè, Approximation Theory and Optimization:
Tributes to M. J. D. Powell, M. D. Buhmann and A. Iserles, 167–182, 1997.
BibTeX

*Parallel Implicit PDE Computations: Algorithms and
Software*, **William D. Gropp**, D.E. Keyes, L.C. McInnes, and M.D. Tidriri, Proceedings of Parallel CFD'97, 333–344, 1997.
BibTeX

*Why are PVM and MPI so Different?*, **William D. Gropp** and **Ewing Lusk**, Recent Advances in Parallel Virtual Machine and Message Passing Interface, Marian Bubak, Jack Dongarra, and Jerzy Wa\'sniewski, 3–10, 4th European PVM/MPI Users' Group Meeting, Cracow, Poland,
November 1997, 1997.
BibTeX

*Performance Driven Programming Models*, **William D. Gropp**, Massively Parallel Programming Models (MPPM-97), 61–67, November 12-14, 1997; London; Third working conference, 1997.
BibTeX

*Which comes first: The Architecture or the
Algorithm? (abstract)*, **William D. Gropp**, Innovative Architectures for Future Generation
High-Performance Processors and Systems, A. Veidenbaum and K. Joe, 13, 1998.
BibTeX

*Data Sieving and Collective I/O in ROMIO*, **Rajeev Thakur**, **William Gropp**, and **Ewing Lusk**, Proceedings of the 7th Symposium on the Frontiers of Massively
Parallel Computation, 182–189, 1999.
BibTeX

*Exploiting Existing Software in Libraries: Successes, Failures, and
Reasons Why*, **William Gropp**, Object Oriented Methods for Interoperable Scientific and
Engineering Computing, Michael Henderson, Christopher Anderson, and Stephen L. Lyons, 21–29, 1999.
BibTeX

*A Microkernel Design for Component-based Numerical Software
Systems*, Satish Balay, **William Gropp**, Lois Curfman McInnes, and Barry Smith, Object Oriented Methods for Interoperable Scientific and
Engineering Computing, Michael Henderson, Christopher Anderson, and Stephen L. Lyons, 60–69, Also ANL/MCS-P727-0998, 1998.
BibTeX

*A Case for Using MPI's Derived Datatypes to Improve I/O
performance*, **Rajeev Thakur**, **Ewing Lusk**, and **William Gropp**, Proceedings of SC98: High Performance Networking and
Computing, 1998.
BibTeX

*Improving the performance of MPI Derived Datatypes*, **William Gropp**, **Ewing Lusk**, and Debbie Swider, Proceedings of the Third MPI Developer's and User's
Conference, Anthony Skjellum, Purushotham V. Bangalore, and Yoginder
S. Dandass, 25–30, 1999.
BibTeX

*On Implementing MPI-IO Portably and with High Performance*, **Rajeev Thakur**, **William Gropp**, and **Ewing Lusk**, Proceedings of the 6th Workshop on I/O in Parallel and
Distributed Systems, 23–32, 1999.
BibTeX

*Towards Realistic Performance Bounds for Implicit CFD
Codes*, **W. D. Gropp**, D. K. Kaushik, D. E. Keyes, and B. F. Smith, Proceedings of Parallel CFD'99, 241–248, 1999.
BibTeX

*Reproducible Measurements of MPI Performance
Characteristics*, **William D. Gropp** and **Ewing Lusk**, Recent Advances in Parallel Virtual Machine
and Message Passing Interface, Jack Dongarra, Emilio Luque, and Tomàs Margalef, 11–18, 6th European PVM/MPI Users' Group Meeting, Barcelona, Spain,
September 1999, 1999.
BibTeX

*A Standard Interface for Debugger Access to Message Queue
Information in MPI*, James Cownie and **William Gropp**, Recent Advances in Parallel Virtual Machine
and Message Passing Interface, Jack Dongarra, Emilio Luque, and Tomàs Margalef, 51–58, 6th European PVM/MPI Users' Group Meeting, Barcelona, Spain,
September 1999, 1999.
BibTeX

*Achieving High Sustained Performance in an Unstructured Mesh
CFD Application*, W. K. Anderson, **William D. Gropp**, D. K. Kaushik, D. E. Keyes, and B. F. Smith, Proceedings of the ACM/IEEE SC99 Conference on High
Performance Networking and Computing, CDROM. Also at http://portal.acm.org and ICASE Report
No. 2000-2, 1999.
BibTeX

*Infrastructure and Interfaces for Large-Scale
Numerical Software*, Lori A. Freitag, **William Gropp**, Paul
D. Hovland, Lois C. McInnes, and Barry F. Smith, Proceedings of PDPTA 1999, 2657–2664, 1999.
BibTeX

*Exploiting Hierarchy in Parallel Computer Networks to
Optimize Collective Operation Performance*, Nicholas T. Karonis, Bronis R. de Supinski, Ian Foster, **William Gropp**, **Ewing Lusk**, and John Bresnahan, Fourteenth International Parallel and Distributed Processing
Symposium, 377–384, 2000.
BibTeX

*A Scalable Process-Management Environment for Parallel
Programs*, R. Butler, **W. Gropp**, and **E. Lusk**, Recent Advances in Parallel Virutal Machine and Message
Passing Interface, Jack Dongarra, Peter Kacsuk, and Norbert Podhorszki, 168–175, 2000.
BibTeX

*Runtime Checking of Datatype Signatures in MPI*, **William D. Gropp**, Recent Advances in Parallel Virutal Machine and Message
Passing Interface, Jack Dongarra, Peter Kacsuk, and Norbert Podhorszki, 160–167, 7th European PVM/MPI Users' Group Meeting, 2000.
BibTeX

*Analyzing the Parallel Scalablity of an Implicit
Unstructured Mesh CFD Code*, **W. D. Gropp**, D. K. Kaushik, B. F. Smith, and D. E. Keyes, High Performance Computing – HiPC2000, Mateo Valero, Viktor K. Prasanna, and Sriram Vajapeyam, 395–404, 2000.
BibTeX

*Solving CFD Problems with Open Source Parallel Libraries*, **William Gropp**, Applied Parallel Computing: New Paradigms for HPC in
Industry and Academia, Tor Sorevik, Fredrik Manne, Randi Moe, and Assefaw
Hadish Gebremedhin, 52, (Abstract), 2000.
BibTeX

*From Trace Generation to Visualization: A
Performance Framework for Distributed Parallel
Systems*, C. Eric Wu, Anthony Bolmarcich, Marc Snir, David Wootton, Farid Parpia, Anthony Chan, Ewing L. Lusk, and **William Gropp**, Proceedings of SC2000, 2000.
BibTeX

*PETSc and Overture: Lessons Learned Developing
an Interface between Components*, Kristopher R. Buschelman, **William Gropp**, Lois
C. McInnes, and Barry F. Smith, The Architecture of Scientific Software 2000, 57–68, 2000.
BibTeX

*MPICH-GQ: Quality of Service for Message Passing Programs*, Alain Roy, Ian Foster, **William Gropp**, Nicholas
Karonis, Volker Sander, and Brian Toonen, Proceedings of SC2000, 2000.
BibTeX

*Performance Modeling and Tuning of an Unstructured Mesh
CFD Application*, **William D. Gropp**, Dinesh K. Kaushik, David
E. Keyes, and Barry F. Smith, Proceedings of SC2000, 2000.
BibTeX

*Efficient Communication Across the Internet in Wide-Area
MPI*, Rajkumar Vinkat, Philip M. Dickens, and **William Gropp**, Proceedings of Parallel and Distributed Processing
Techniques and Applications, 2001.
BibTeX

*Interfacing Parallel Jobs to Process Managers*, Brian Toonen, David Ashton, **Ewing Lusk**, Ian Foster, **William Gropp**, Edgar Gabriel, Ralph Butler, and Nicholas Karonis, Proceedings of the 10th IEEE International Symposium on High
Performance Distributed Computing, 431–432, 2001.
BibTeX

*Performance of Tightly Coupled Linux Cluster Simulation
using PETSc of Reaction and Transport Processes During
Corrosion Pit Initiation*, Eric Webb, Jay Alameda, **William Gropp**, Joshua Gray, and Richard Alkire, Proceedings of Linux Clusters: the HPC Revolution, Urbana, IL, 2001.
BibTeX

*Advanced Cluster Programming with MPI*, **William Gropp**, 2001 IEEE International Conference on Cluster Computing
(CLUSTER 2001), 8-11 October 2001, Newport Beach, CA, USA, 453, 2001.
BibTeX

*Scalable Unix Commands for Parallel Processors: A
High-Performance Implementation*, Emil Ong, **Ewing Lusk**, and **William Gropp**, Recent Advances in Parallel Virtual Machine and
Message Passing Interface, Y. Cotronis and J. Dongarra, 410–418, 8th European PVM/MPI Users' Group Meeting, 2001.
BibTeX

*Challenges and Successes in Achieving the Potential of MPI*, **William Gropp**, Recent Advances in Parallel Virtual Machine and
Message Passing Interface, Y. Cotronis and J. Dongarra, 7, 2001.
BibTeX

*Latency, Bandwidth, and Concurrent Issue Limitations in
High-Performance CFD*, **W. D. Gropp**, D. K. Kaushik, D. E. Keyes, and B. F. Smith, Proceedings of the First MIT Conference on
Computational Fluid and Solid Mechanics, 2001.
BibTeX

*Single Precision Incomplete LU Factorization for
Incompressible Fluid Flow Applications on Pentium III
Processors in PETSc*, Kristopher R. Buschelman, **William Gropp**, and Barry
F. Smith, Abstract for poster presented at the 2001 International
Conference On
Preconditioning Techniques For Large Sparse Matrix Problems
In Industrial Applications, 2001.
BibTeX

*Learning from the Success of MPI*, **William D. Gropp**, High Performance Computing – HiPC 2001, Burkhard Monien, Viktor K. Prasanna, and Sriram Vajapeyam, 81–92, 8th International Conference, 2001.
BibTeX

*An Evaluation of a User-Level Data Transfer Mechanism for
High-Performance Networks*, Philip M. Dickens and **William D. Gropp**, Proceedings of HPDC'02, 255–264, 2002.
BibTeX

*High Performance Wide Area Data Transfers over High
Performance Networks*, Philip M. Dickens, **William Gropp**, and Paul R. Woodward, Proceedings of IPDPS 2002, 2002.
BibTeX

*Goals Guiding Design: PVM and MPI*, **William D. Gropp** and **Ewing Lusk**, Proceedings of IEEE Cluster, **William Gropp**, Rob Pennington, Dan Reed, Mark Baker, Maxine Brown, and Rajkumar Buyya, 257–265, 2002.
BibTeX

*Prototype of AM3: Active Mapper and Monitoring Module for
Myrinet Environment*, Seongbok Baik, Cynthia S. Hood, and **William D. Gropp**, HSLN (High-Speed Local Networks) workshop, 703–707, 2002.
BibTeX

*Noncontiguous I/O through PVFS*, A. Ching, A. Choudhary, W.-K. Liao, R. Ross, and **W. Gropp**, Proceedings of IEEE Cluster, **William Gropp**, Rob Pennington, Dan Reed, Mark Baker, Maxine Brown, and Rajkumar Buyya, 405–414, 2002.
BibTeX

*Building Library Components that Can Use Any MPI
Implementation*, **William D. Gropp**, Recent Advances in Parallel Virtual Machine and Message
Passing Interface, Dieter Kranzlmüller, Peter Kacsuk, Jack Dongarra, and Jens Volkert, 280–287, 9th European PVM/MPI Users' Group Meeting, Linz, Austria, 2002.
BibTeX

*MPI on the Grid*, **William Gropp** and
Ewing L. Lusk, Recent Advances in Parallel Virtual Machine and Message
Passing Interface, Dieter Kranzlmüller, Peter Kacsuk, Jack Dongarra, and Jens Volkert, 12, 2002.
BibTeX

*MPICH2: A New Start for MPI Implementations*, **William Gropp**, Recent Advances in Parallel Virtual Machine and Message
Passing Interface, Dieter Kranzlmüller, Peter Kacsuk, Jack Dongarra, and Jens Volkert, 7, 2002.
BibTeX

*Toward Understanding Soft Faults in High Performance Cluster
Networks*, Jeffrey J. Evans, Seongbok Baik, Cynthia S. Hood, and **William Gropp**, Proceedings of the 8th IFIP/IEEE International Symposium on
Integrated Network Management, 117–120, 2003.
BibTeX

*Trends in High Performance Computing*, **William Gropp**, High Performance Computing with QCDOC and BlueGene, 91–97, Abstract and six major slides from the presentation, 2003.
BibTeX

*Noncontiguous I/O Accesses Through MPI-IO*, A. Ching, A. Choudhary, K. Coloma, W.-K. Liao, R. Ross, and **W. Gropp**, Proceedings of the 3rd IEEE/ACM International
Symposium on Cluster Computing and the Grid (CCGrid2003), 104–111, 2003.
BibTeX

*Future Developments in MPI*, **William Gropp**, Recent Advances in Parallel Virtual Machine and Message
Passing Interface, Jack Dongarra, Domenico Laforenza, and Salvatore Orlando, 15–15, 10th European PVM/MPI User's Group Meeting, Venice, Italy, 2003.
BibTeX

*Improving the Performance of Collective Operations in
MPICH*, **Rajeev Thakur** and **William Gropp**, Recent Advances in Parallel Virtual Machine and Message
Passing Interface, Jack Dongarra, Domenico Laforenza, and Salvatore Orlando, 257–267, 10th European PVM/MPI User's Group Meeting, Venice, Italy, 2003.
BibTeX

*MPI on BlueGene/L: Designing an Efficient General
Purpose Messaging Solution for a Large Cellular System*, G. Almási, C. Archer, J. G. Castaños, M. Gupta, X. Martorell, J. E. Moreira, **W. D. Gropp**, S. Rus, and B. Toonen, Recent Advances in Parallel Virtual Machine and Message
Passing Interface, Jack Dongarra, Domenico Laforenza, and Salvatore Orlando, 352–361, 10th European PVM/MPI User's Group Meeting, Venice, Italy, 2003.
BibTeX

*Implementing Fast and Reusable Datatype Processing*, R. Ross, N. Miller, and **W. D. Gropp**, Recent Advances in Parallel Virtual Machine and Message
Passing Interface, Jack Dongarra, Domenico Laforenza, and Salvatore Orlando, 404–413, 10th European PVM/MPI User's Group Meeting, Venice, Italy, 2003.
BibTeX

*High-Level Programming in MPI*, **William Gropp** and Ewing L. Lusk, Recent Advances in Parallel Virtual Machine and Message
Passing Interface, Jack Dongarra, Domenico Laforenza, and Salvatore Orlando, 27, 2003.
BibTeX

*Exploring the Relationship Between Parallel Application
Run-Time Variability and Network Performance*, Jeffrey Evans, Cynthia Hood, and **William Gropp**, Workshop on High-Speed Local Networks (HSLN), IEEE
Conference on Local Computer Networks (LCN), 538–547, 2003.
BibTeX

*Parallel netCDF: A High-Performance Scientific
I/O Interface*, J. Li, W. Liao, A. Choudhary, R. Ross, **R. Thakur**, **W. Gropp**, R. Latham, A. Siegel, B. Gallagher, and M. Zingale, Proceedings of SC2003, 2003.
BibTeX

*Efficient Structured Data Access in Parallel File Systems*, A. Ching, A. Choudhary, W.-K. Liao, R. Ross, and **W. Gropp**, Proceedings of IEEE Cluster, 2003.
BibTeX

*Implementing MPI on the BlueGene/L Supercomputer*, George Almasi, Charles Archer, Jose
G. Castanos, C. Chris Erway, Philip
Heidelberger, Xavier Martorell, Jose
E. Moreira, Kurt Pinnow, Joe Ratterman, Nils Smeds, Burkhard Steinmacher-Burow, **William Gropp**, and Brian Toonen, Proceedings of EuroPar2004, 833–845, Selected as distinquished paper, 2004.
BibTeX

*MPI and High Productivity Programming*, **William Gropp**, Recent Advances in Parallel Virtual Machine and Message
Passing Interface, Dieter Kranzlmüller, Peter Kacsuk, and Jack Dongarra, 7, 11th European PVM/MPI User's Group Meeting, Budapest, Hungary, 2004.
BibTeX

*Minimizing Synchronization Overhead in the
Implementation of MPI One-Sided Communication*, **Rajeev Thakur**, **William Gropp**, and Brian Toonen, Recent Advances in Parallel Virtual Machine and Message
Passing Interface, Dieter Kranzlmüller, Peter Kacsuk, and Jack Dongarra, 57–67, 11th European PVM/MPI User's Group Meeting, Budapest, Hungary, 2004.
BibTeX

*Efficient Implementation of MPI-2 Passive
One-Sided Communication on InfiniBand Clusters*, W. Jiang, J. Liu, H.-W. Jin, D. K. Panda, D. Buntinas, **Rajeev Thakur**, and **William Gropp**, Recent Advances in Parallel Virtual Machine and Message
Passing Interface, Dieter Kranzlmüller, Peter Kacsuk, and Jack Dongarra, 68–76, 11th European PVM/MPI User's Group Meeting, Budapest, Hungary, 2004.
BibTeX

*Design and Implementation of MPICH2 over Infiniband with
RDMA support*, Jiuxing Liu, Weihang Jiang, Pete Wyckoff, Dhabaleswar K. Panda, David Ashton, Darius Buntinas, **William Gropp**, and Brian Toonen, Proceedings of IPDPS 2004, 2004.
BibTeX

*Providing Efficient I/O Redundancy in MPI Environments*, **William Gropp**, Robert Ross, and Neill Miller, Recent Advances in Parallel Virtual Machine and Message
Passing Interface, Dieter Kranzlmüller, Peter Kacsuk, and Jack Dongarra, 77–86, 11th European PVM/MPI User's Group Meeting, Budapest, Hungary, 2004.
BibTeX

*MPI and High Productivity Programming*, **William D. Gropp**, Recent Advances in Parallel Virtual Machine and Message
Passing Interface, Dieter Kranzlmüller, Peter Kacsuk, and Jack Dongarra, 7, 11th European PVM/MPI User's Group Meeting,
Budapest, Hungary, 2004.
BibTeX

*Collective Error Detection for MPI Collective Operations*, Chris Falzone, Anthony Chan, **Ewing Lusk**, and William
Gropp, Recent Advances in Parallel Virtual Machine and Message
Passing Interface, Beniamino Di Martino, Dieter Kranzluüller, and Jack
Dongarra, 138–147, 12th European PVM/MPI User's Group Meeting, Sorrento, Italy, 2005.
BibTeX

*Designing a Common Communication Subsystem*, Darius Buntinas and **William Gropp**, Recent Advances in Parallel Virtual Machine and Message
Passing Interface, Beniamino Di Martino, Dieter Kranzluüller, and Jack
Dongarra, 156–166, 12th European PVM/MPI User's Group Meeting, Sorrento, Italy, 2005.
BibTeX

*An Evaluation of Implementation Options for MPI One-Sided
Communication*, **William Gropp** and **Rajeev Thakur**, Recent Advances in Parallel Virtual Machine and Message
Passing Interface, Beniamino Di Martino, Dieter Kranzluüller, and Jack
Dongarra, 415–424, 12th European PVM/MPI User's Group Meeting, Sorrento, Italy, 2005.
BibTeX

*Towards a Productive MPI Environment (abstract)*, **William Gropp**, Recent Advances in Parallel Virtual Machine and Message
Passing Interface, Beniamino Di Martino, Dieter Kranzluüller, and Jack
Dongarra, 4, 12th European PVM/MPI User's Group Meeting, Sorrento, Italy, 2005.
BibTeX

*Collective Communication on Architectures that Support
Simultaneous Communication over Multiple Links*, Ernie Chan, **William Gropp**, **Rajeev Thakur**, and Robert van de Geijn, Proceedings of the 2006 ACM SIGPLAN Symposium on
Principles and Practice of Parallel Programming, 2–11, 2006.
BibTeX

*Design and Evaluation of Nemesis, a scalable, low-latency,
message-passing communication subsystem*, Darius Buntinas, Guillaume Mercier, and **William Gropp**, Proceedings of the 6th IEEE International
Symposium on Cluster Computing and the Grid (CCGrid2006), Stephen John Turner, Bu Sung Lee, and Wentong Cai, 521–530, 2006.
BibTeX

*Issues in Developing a Thread-Safe MPI
Implementation*, **William D. Gropp** and **Rajeev Thakur**, Recent Advances in Parallel Virtual Machine
and Message Passing Interface, Bernd Mohr, Jesper Larsson Träff, Joachim
Worringen, and Jack Dongarra, 12–21, 2006.
BibTeX

*Formal Verification of Programs that Use MPI
One-Sided Communication*, Salman Pervez, Ganesh Gopalakrishnan, Robert
M. Kirby, **Rajeev Thakur**, and **William D. Gropp**, Recent Advances in Parallel Virtual Machine
and Message Passing Interface, Bernd Mohr, Jesper Larsson Träff, Joachim
Worringen, and Jack Dongarra, 30–39, 2006.
BibTeX

*Implementation and Shared-Memory Evaluation of
MPICH2 over the Nemesis Communication Subsystem*, Darius Buntinas, Guillaume Mercier, and William
D. Gropp, Recent Advances in Parallel Virtual Machine
and Message Passing Interface, Bernd Mohr, Jesper Larsson Träff, Joachim
Worringen, and Jack Dongarra, 86–95, 2006.
BibTeX

*An Interface to Support the Identification of
Dynamic MPI 2 Processes for Scalable Parallel Debugging*, Christopher Gottbrath, Brian Barrett, William
D. Gropp, Ewing "Rusty" Lusk, and Jeff Squyres, Recent Advances in Parallel Virtual Machine
and Message Passing Interface, Bernd Mohr, Jesper Larsson Träff, Joachim
Worringen, and Jack Dongarra, 115–122, 2006.
BibTeX

*Automatic Memory Optimization for Improving MPI
Derived Datatype Performance*, Surendra Byna, Xian-He Sun, **Rajeev Thakur**, and **William D. Gropp**, Recent Advances in Parallel Virtual Machine
and Message Passing Interface, Bernd Mohr, Jesper Larsson Träff, Joachim
Worringen, and Jack Dongarra, 238–246, 2006.
BibTeX

*Parallel Implicit Solution of Diffusion-limited
Radiation Transport*, **William D. Gropp**, Dinesh K. Kaushik, David
E. Keyes, and Barry F. Smith, Domain Decomposition Methods in Science and
Engineering XVI, Olof B. Widlund and David E. Keyes, 579–586, 2006.
BibTeX

*Grid-based Image Registration*, **William Gropp**, Eldad Haber, Stefen Heldmann, David Keyes, Neill Miller, Jennifer Schopf, and Tianzhi Yang, Grid-Based Problem Solving Environments, Patrick W. Gaffney and James C. T. Pool, 435–448, IFIP International Federation for Information
Processing, Volume 239, 2007.
BibTeX

*Observations on WoCo9*, **William Gropp**, Grid-Based Problem Solving Environments, Patrick W. Gaffney and James C. T. Pool, 451–453, IFIP International Federation for Information
Processing, Volume 239, 2007.
BibTeX

*Self-consistent MPI Performance Requirements*, Jesper Larsson Träff, **William Gropp**, and **Rajeev Thakur**, PVM/MPI, 36–45, 2007.
BibTeX

*Test Suite for Evaluating Performance of MPI Implementations
That Support MPI\_THREAD\_MULTIPLE*, **Rajeev Thakur** and
**William Gropp**, 46–55, 2007.
BibTeX

*Extending the MPI-2 Generalized Request Interface*, Robert Latham, **William Gropp**, Robert Ross, and **Rajeev Thakur**, 223–232, 2007.
BibTeX

*Revealing the Performance of MPI RMA Implementations*, **William D. Gropp** and
**Rajeev Thakur**, 272–280, 2007.
BibTeX

*Practical Model-Checking Method for Verifying Correctness
of MPI Programs*, Salman Pervez, Ganesh Gopalakrishnan, Robert M. Kirby, Robert Palmer, **Rajeev Thakur**, and **William Gropp**, 344–353, 2007.
BibTeX

*Nonuniformly Communicating Noncontiguous Data: A
Case Study with PETSc and MPI*, Pavan Balaji, Darius Buntinas, S. Balay, B.
Smith, **Rajeev Thakur**, and **William Gropp**, 1–10, 2007.
BibTeX

*Advanced Flow-control Mechanisms for the Sockets
Direct Protocol over InfiniBand*, Pavan Balaji, S. Bhagvat, Dhabaleswar K. Panda, **Rajeev Thakur**, and **William Gropp**, 73, 2007.
BibTeX

*Open Issues in MPI Implementation*, **Rajeev Thakur** and **William Gropp**, Advances in Computer Systems Architecture, 12th
Asia-Pacific Conference, ACSAC 2007, Seoul, Korea,
August 23-25, 2007, Proceedings, Lynn Choi, Yunheung Paek, and Sangyeun Cho, 327–338, 2007.
BibTeX

*Scaling Science Applications on Blue Gene*, **William D. Gropp**, Wolfgang Frings, Marc-André Hermanns, Ed Jedlicka, Kirk
E. Jordan, Fred Mintzer, and Boris Orth, Parallel Computing: Architectures, Algorithms, and Applications, Christian Bischof, Martin Bücker, Paul
Gibbon, Gerhard Joubert, Thomas Lippert, Bernd Mohr, and Frans Peters, 583–584, Summary of the Mini-Symposium, 2007.
BibTeX

*Recent Advances in Parallel Virtual Machine and Message
Passing Interface, 14th European PVM/MPI User's Group Meeting,
Paris, France, September 30 - October 3, 2007, Proceedings*, 2007.
BibTeX

*21th International Parallel and Distributed Processing Symposium
(IPDPS 2007), Proceedings, 26-30 March 2007, Long Beach,
California, USA*, 2007.
BibTeX

*2007 International Conference on Parallel Processing (ICPP
2007), September 10-14, 2007, Xi-An, China*, 2007.
BibTeX

*Parallel I/O Prefetching Using MPI File Caching
and I/O Signatures*, Suren Byna, Yong Chen, **W. D. Gropp**, Xian-He
Sun, and **Rajeev Thakur**, Proceedings of SC08, 2008.
BibTeX

*Hiding I/O Latency with Pre-execution Prefetching
for Parallel Applications*, Suren Byna, Yong Chen, **W. D. Gropp**, Xian-He
Sun, and **Rajeev Thakur**, Proceedings of SC08, Finalist for Best Paper and Best Student Paper., 2008.
BibTeX

*Communication Analysis of Parallel 3D FFT for Flat
Cartesian Meshes on Large Blue Gene Systems*, Anthony Chan, Pavan Balaji, **William Gropp**, and **Rajeev Thakur**, 15th IEEE International Conference on High
Performance Computing, 422–429, 2008.
BibTeX

*Improving the Performance of Tensor Matrix Vector
Multiplication in Cumulative Reaction Probability
Based Quantum Chemistry Codes*, Dinesh Kaushik, **William Gropp**, Michael Minkoff, and Barry Smith, 15th IEEE International Conference on High
Performance Computing, 120–130, 2008.
BibTeX

*MPI and Hybrid Programming Models for Petascale Computing*, **William D. Gropp**, PVM/MPI, 6–7, 2008.
BibTeX

*Non-data-communication Overheads in MPI: Analysis on Blue
Gene/P*, Pavan Balaji, Anthony Chan, **William Gropp**, **Rajeev Thakur**, and Ewing L. Lusk, PVM/MPI, 13–22, 2008.
BibTeX

*A Simple, Pipelined Algorithm for Large, Irregular All-gather
Problems*, Jesper Larsson Träff, Andreas Ripke, Christian Siebert, Pavan Balaji, **Rajeev Thakur**, and **William Gropp**, PVM/MPI, 84–93, 2008.
BibTeX

*Toward Efficient Support for Multithreaded MPI Communication*, Pavan Balaji, Darius Buntinas, David Goodell, **William Gropp**, and **Rajeev Thakur**, PVM/MPI, 120–129, 2008.
BibTeX

*Self-consistent MPI-IO Performance Requirements and
Expectations*, **William D. Gropp**, Dries Kimpe, Robert Ross, **Rajeev Thakur**, and Jesper Larsson Träff, PVM/MPI, 167–176, 2008.
BibTeX

*Implementing Efficient Dynamic Formal Verification Methods
for MPI Programs*, Sarvani S. Vakkalanka, Michael Delisi, Ganesh Gopalakrishnan, Robert M. Kirby, **Rajeev Thakur**, and **William Gropp**, PVM/MPI, 248–256, 2008.
BibTeX

*A Formal Approach to Detect Functionally Irrelevant Barriers
in MPI Programs*, Subodh Sharma, Sarvani S. Vakkalanka, Ganesh Gopalakrishnan, Robert M. Kirby, **Rajeev Thakur**, and **William Gropp**, PVM/MPI, 265–273, 2008.
BibTeX

*Recent Advances in Parallel Virtual Machine and Message
Passing Interface, 15th European PVM/MPI Users' Group Meeting,
Dublin, Ireland, September 7-10, 2008. Proceedings*, 2008.
BibTeX

*MPI on a Million Processors*, Balaji, Pavan, Buntinas, Darius, Goodell, David, Gropp, William, Kumar, Sameer, Lusk, Ewing, Thakur, Rajeev, and Träff, Jesper Larsson, Proceedings of the 16th European PVM/MPI Users' Group
Meeting on Recent Advances in Parallel Virtual
Machine and Message Passing Interface, 20–30, 2009.
BibTeX

*MPI at Exascale: Challenges for Data Structures and
Algorithms*, Gropp, William, Proceedings of the 16th European PVM/MPI Users' Group
Meeting on Recent Advances in Parallel Virtual
Machine and Message Passing Interface, 3, 2009.
BibTeX

*Processing MPI Datatypes Outside MPI*, Ross, Robert, Latham, Robert, Gropp, William, Lusk, Ewing, and Thakur, Rajeev, Proceedings of the 16th European PVM/MPI Users' Group
Meeting on Recent Advances in Parallel Virtual
Machine and Message Passing Interface, 42–53, 2009.
BibTeX

*Hierarchical Collectives in MPICH2*, Zhu, Hao, Goodell, David, Gropp, William, and Thakur, Rajeev, Proceedings of the 16th European PVM/MPI Users' Group
Meeting on Recent Advances in Parallel Virtual
Machine and Message Passing Interface, 325–326, 2009.
BibTeX

*Natively Supporting True One-Sided Communication in MPI on
Multi-core Systems with InfiniBand*, Santhanaraman, G., Balaji, P., Gopalakrishnan, K., Thakur, R., Gropp, W., and Panda, D. K., CCGRID '09: Proceedings of the 2009 9th IEEE/ACM
International Symposium on Cluster Computing and the
Grid, 380–387, 2009.
BibTeX

*Investigating High Performance RMA Interfaces for the
MPI-3 Standard*, Tipparaju, Vinod, Gropp, William, Ritzdorf, Hubert, Thakur, Rajeev, and Träff, Jesper L., ICPP '09: Proceedings of the 2009 International
Conference on Parallel Processing, 293–300, 2009.
BibTeX

*An Adaptive Performance Modeling Tool for GPU
Architectures*, Sara S. Baghsorkhi, Matthieu Delahaye, Sanjay J.
Patel, **William D. Gropp**, and Wen-mei W. Hwu, Proceedings of the 15th ACM SIGPLAN Symposium on Principles
and Practice of Parallel Programming, PPOPP 2010, Bangalore,
India, January 9-14, 2010, R. Govindarajan, David A. Padua, and Mary W. Hall, 105–114, 2010.
BibTeX

*An introductory exascale feasibility study for FFTs and multigrid*, Gahvari, H. and Gropp, W., Parallel Distributed Processing (IPDPS), 2010 IEEE
International Symposium on, 1–9, 2010.
BibTeX

*A Scalable MPI\_Comm\_split Algorithm for Exascale Computing*, Sack, Paul and Gropp, William, Recent Advances in the Message Passing Interface, Keller, Rainer, Gabriel, Edgar, Resch, Michael, and Dongarra, Jack, 1–10, 2010.
BibTeX

*Toward Performance Models of MPI Implementations for
Understanding Application Scaling Issues*, Hoefler, Torsten, Gropp, William, Thakur, Rajeev, and Träff, Jesper, Recent Advances in the Message Passing Interface, Keller, Rainer, Gabriel, Edgar, Resch, Michael, and Dongarra, Jack, 21–30, 2010.
BibTeX

*Enabling Concurrent Multithreaded MPI Communication on
Multicore Petascale Systems*, Dózsa, G\'abor, Kumar, Sameer, Balaji, Pavan, Buntinas, Darius, Goodell, David, Gropp,
William, Ratterman, Joe, and Thakur, Rajeev, Recent Advances in the Message Passing Interface, Keller, Rainer, Gabriel, Edgar, Resch, Michael, and Dongarra, Jack, 11–20, 2010.
BibTeX

*PMI: A Scalable Parallel Process-Management Interface for
Extreme-Scale Systems*, Balaji, Pavan, Buntinas, Darius, Goodell, David, Gropp, William, Krishna, Jayesh, Lusk, Ewing, and Thakur, Rajeev, Recent Advances in the Message Passing Interface, Keller, Rainer, Gabriel, Edgar, Resch, Michael, and Dongarra, Jack, 31–41, 2010.
BibTeX

*Load Balancing for Regular Meshes on SMPs with MPI*, Kale, Vivek and Gropp, William, Recent Advances in the Message Passing Interface, Keller, Rainer, Gabriel, Edgar, Resch, Michael, and Dongarra, Jack, 229–238, 2010.
BibTeX

*Minimizing MPI Resource Contention in Multithreaded Multicore
Environments*, David Goodell, Pavan Balaji, Darius Buntinas, Gabor
Dozsa, **William Gropp**, Sameer Kumar, Bronis
R. de Supinski, and **Rajeev Thakur**, IEEE International Conference on Cluster Computing, 1–8, 2010.
BibTeX

*Performance modeling as the key to extreme scale
computing*, **William D. Gropp**, Proceedings of the 25th International Conference on
Supercomputing, 2011, Tucson, AZ, USA, May 31 -
June 04, 2011, David K. Lowenthal, Bronis R. de Supinski, and Sally
A. McKee, 213, 2011.
BibTeX

*Scalable Memory Use in MPI: A Case Study with
MPICH2*, David Goodell, **William Gropp**, Xin Zhao, and **Rajeev Thakur**, Recent Advances in the Message Passing Interface -
18th European MPI Users' Group Meeting, EuroMPI
2011, Santorini, Greece, September 18-21, 2011.
Proceedings, Yiannis Cotronis, Anthony Danalis, Dimitrios S.
Nikolopoulos, and Jack Dongarra, 140–149, 2011.
BibTeX

*Multi-core and Network Aware MPI Topology
Functions*, Mohammad J. Rashti, Jonathan Green, Pavan Balaji, Ahmad Afsahi, and **William Gropp**, Recent Advances in the Message Passing Interface -
18th European MPI Users' Group Meeting, EuroMPI
2011, Santorini, Greece, September 18-21, 2011.
Proceedings, Yiannis Cotronis, Anthony Danalis, Dimitrios S.
Nikolopoulos, and Jack Dongarra, 50–60, 2011.
BibTeX

*Performance Expectations and Guidelines for MPI
Derived Datatypes*, **William Gropp**, Torsten Hoefler, **Rajeev Thakur**, and Jesper Larsson Träff, Recent Advances in the Message Passing Interface -
18th European MPI Users' Group Meeting, EuroMPI
2011, Santorini, Greece, September 18-21, 2011.
Proceedings, Yiannis Cotronis, Anthony Danalis, Dimitrios S.
Nikolopoulos, and Jack Dongarra, 150–159, 2011.
BibTeX

*LACIO: A New Collective I/O Strategy for
Parallel I/O Systems*, Yong Chen, Xian-He Sun, **Rajeev Thakur**, Philip
C. Roth, and **William D. Gropp**, IPDPS, 794–804, 2011.
BibTeX

*Architectural Constraints to Attain 1 Exaflop/s for
Three Scientific Application Classes*, Abhinav Bhatele, Pritish Jetley, Hormozd Gahvari, Lukasz Wesolowski, **William D. Gropp**, and Laxmikant V. Kalé, IPDPS, 80–91, 2011.
BibTeX

*Modeling the performance of an algebraic multigrid
cycle on HPC platforms*, Hormozd Gahvari, Allison H. Baker, Martin Schulz, Ulrike Meier Yang, Kirk E. Jordan, and William
Gropp, Proceedings of the 25th International Conference on
Supercomputing, 2011, Tucson, AZ, USA, May 31 -
June 04, 2011, David K. Lowenthal, Bronis R. de Supinski, and Sally
A. McKee, 172–181, 2011.
BibTeX

*Avoiding hot-spots on two-level direct networks*, Abhinav Bhatele, Nikhil Jain, **William D. Gropp**, and Laxmikant V. Kalé, Conference on High Performance Computing Networking, Storage
and Analysis, SC 2011, Seattle, WA, USA, November 12-18,
2011, Scott Lathrop, Jim Costa, and William Kramer, 76, 2011.
BibTeX

*Weighted locality-sensitive scheduling for mitigating noise
on multi-core clusters*, Kale, V., Bhatele, A., and Gropp, W.D., High Performance Computing (HiPC), 2011 18th
International Conference on, 1–10, 2011.
BibTeX

*Faster topology-aware collective algorithms through non-minimal communication*, Sack, Paul and Gropp, William, Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming, 45–54, Best Paper, 2012.
BibTeX

*Hybrid Static/dynamic Scheduling for Already Optimized Dense
Matrix Factorization*, Donfack, Simplice, Grigori, Laura, Gropp, William
D., and Kale, Vivek, Proceedings of the 2012 IEEE 26th International Parallel
and Distributed Processing Symposium, 496–507, 2012.
BibTeX

*Leveraging MPI's One-Sided Communication Interface for
Shared-Memory Programming*, Hoefler, Torsten, Dinan, James, Buntinas, Darius, Balaji, Pavan, Barrett, Brian, Brightwell,
Ron, Gropp, William, Kale, Vivek, and Thakur,
Rajeev, Recent Advances in the Message Passing Interface, Träff, Jesper, Benkner, Siegfried, and Dongarra, Jack, 132–141, 2012.
BibTeX

*MPI 3 and Beyond: Why MPI Is Successful and What
Challenges It Faces*, Gropp, William, Recent Advances in the Message Passing Interface, Träff, Jesper, Benkner, Siegfried, and Dongarra, Jack, 1–9, 2012.
BibTeX

*Efficient Multithreaded Context ID Allocation in MPI*, Dinan, James, Goodell, David, Gropp, William, Thakur, Rajeev, and Balaji, Pavan, Recent Advances in the Message Passing Interface, Träff, Jesper, Benkner, Siegfried, and Dongarra, Jack, 57–66, 2012.
BibTeX

*Adaptive Strategy for One-Sided Communication in MPICH2*, Zhao, Xin, Santhanaraman, Gopalakrishnan, and Gropp, William, Recent Advances in the Message Passing Interface, Träff, Jesper, Benkner, Siegfried, and Dongarra, Jack, 16–26, 2012.
BibTeX

*Modeling the Performance of an Algebraic Multigrid Cycle
Using Hybrid MPI/OpenMP*, Hormozd Gahvari, **William Gropp**, Kirk E. Jordan, Martin Schulz, and Ulrike Meier Yang, ICPP, 128–137, 2012.
BibTeX

*41st International Conference on Parallel Processing, ICPP
2012, Pittsburgh, PA, USA, September 10-13, 2012*, 2012.
BibTeX

*A Decoupled Execution Paradigm for Data-Intensive High-End Computing*, Chen, Y., Chen, C., Sun, X.H., Gropp, W.D., and Thakur, R., Cluster Computing (CLUSTER), 2012 IEEE International Conference on, 200–208, 2012.
BibTeX

*A Case for Optimistic Coordination in HPC Storage Systems*, Philip Carns, Kevin Harms, Dries Kimpe, Robert Ross, Justin Wozniak, Lee Ward, Matthew Curry, Ruth Klundt, Geoff Danielson, Cengiz Karakoyunlu, John Chandy, Bradley Settlemeyer, and **William Gropp**, High Performance Computing, Networking Storage and Analysis, SC Companion:, 48–53, 2012.
BibTeX

*Toward Asynchronous and MPI-Interoperable Active Messages*, Xin Zhao, Buntinas, D., Zounmevo, J., Dinan, J., Goodell, D., Balaji, P., Thakur, R., Afsahi,
A., and Gropp, W., Cluster, Cloud and Grid Computing (CCGrid), 2013 13th
IEEE/ACM International Symposium on, 87–94, 2013.
BibTeX

*Parallel Adaptive Deflated GMRES*, Wakam, Désir\'e Nuentsa, Erhel, Jocelyne, and Gropp, William D., Domain Decomposition Methods in Science and Engineering XX, Bank, Randolph, Holst, Michael, Widlund, Olof, and Xu, Jinchao, 631–638, 2013.
BibTeX

*Analysis of topology-dependent MPI performance on
Gemini networks*, Antonio J. Peña, Ralf G. Correa Carvalho, James Dinan, Pavan Balaji, **Rajeev Thakur**, and **William Gropp**, 20th European MPI Users's Group Meeting, EuroMPI
'13, Madrid, Spain - September 15 - 18, 2013, Jack Dongarra, Javier García Blas, and Jes\'us
Carretero, 61–66, 2013.
BibTeX

*Performance Analysis of the Lattice Boltzmann Model
Beyond Navier-Stokes*, Amanda Peters Randles, Vivek Kale, Jeff Hammond, **William Gropp**, and Efthimios Kaxiras, IPDPS, 1063–1074, 2013.
BibTeX

*Systematic Reduction of Data Movement in Algebraic
Multigrid Solvers*, Hormozd Gahvari, **William Gropp**, Kirk E. Jordan, Martin Schulz, and Ulrike Meier Yang, IPDPS Workshops, 1675–1682, 2013.
BibTeX

*Locality-Optimized Mixed Static/Dynamic Scheduling for
Improving Load Balancing on SMPs*, Vivek Kale, Amanda Peters Randles, and William D.
Gropp, 21st European MPI Users' Group Meeting,
EuroMPI/ASIA '14, Kyoto, Japan - September 09 - 12,
2014, Jack Dongarra, Yutaka Ishikawa, and Atsushi Hori, 115, 2014.
BibTeX

*Rethinking Key-value Store for Parallel I/O Optimization*, Yin, Yanlong, Kougkas, Antonios, Feng, Kun, Eslami, Hassan, Lu, Yin, Sun, Xian-He, Thakur, Rajeev, and Gropp, William, Proceedings of the 2014 International Workshop on Data Intensive Scalable Computing Systems, 33–40, 2014.
BibTeX

*Nonblocking Epochs in MPI One-sided Communication*, Zounmevo, Judicael A., Zhao, Xin, Balaji, Pavan, Gropp, William, and Afsahi, Ahmad, Proceedings of the International Conference for High
Performance Computing, Networking, Storage and
Analysis, 475–486, Best paper finalist, 2014.
BibTeX

*Algebraic Multigrid on a Dragonfly Network: First
Experiences on a Cray XC30*, Gahvari, Hormozd, Gropp, William, Jordan, Kirk E., Schulz, Martin, and Yang, Ulrike Meier, High Performance Computing Systems. Performance
Modeling, Benchmarking, and Simulation, Jarvis, Stephen A., Wright, Steven A., and Hammond, Simon D., 3–23, 2015.
BibTeX

*Towards a More Fault Resilient Multigrid Solver*, Calhoun, Jon, Olson, Luke, Snir, Marc, and Gropp, William D., Proceedings of the High Performance Computing Symposium, 2015.
BibTeX

*Runtime Support for Irregular Computation in
MPI-Based Applications*, Xin Zhao, Pavan Balaji, and **William Gropp**, CCGrid, 701–704, 2015.
BibTeX

*A Multiplatform Study of I/O Behavior on Petascale
Supercomputers*, Huong Luu, Marianne Winslett, **William Gropp**, Robert B. Ross, Philip H. Carns, Kevin Harms, Prabhat, Surendra Byna, and Yushu Yao, Proceedings of the 24th International Symposium on
High-Performance Parallel and Distributed Computing,
HPDC 2015, Portland, OR, USA, June 15-19, 2015, Thilo Kielmann, Dean Hildebrand, and Michela
Taufer, 33–44, 2015.
BibTeX

*DAME: A Runtime-Compiled Engine for Derived
Datatypes*, Tarun Prabhu and **William Gropp**, Proceedings of the 22nd European MPI Users' Group
Meeting, EuroMPI 2015, Bordeaux, France, September
21-23, 2015, Jack J. Dongarra, Alexandre Denis, Brice Goglin, Emmanuel Jeannot, and Guillaume Mercier, 4:1–4:10, 2015.
BibTeX

*Composing Low-Overhead Scheduling Strategies for
Improving Performance of Scientific Applications*, Vivek Kale and **William D. Gropp**, OpenMP: Heterogenous Execution and Data Movements -
11th International Workshop on OpenMP, IWOMP 2015,
Aachen, Germany, October 1-2, 2015, Proceedings, Christian Terboven, Bronis R. de Supinski, Pablo
Reble, Barbara M. Chapman, and Matthias S.
Müller, 18–29, 2015.
BibTeX

*Efficient Disk-to-disk Sorting: A Case Study in the
Decoupled Execution Paradigm*, Eslami, Hassan, Kougkas, Anthony, Kotsifakou, Maria, Kasampalis, Theodoros, Feng, Kun, Lu, Yin, Gropp, William, Sun, Xian-He, Chen, Yong, and Thakur, Rajeev, Proceedings of the 2015 International Workshop on
Data-Intensive Scalable Computing Systems, 2:1–2:8, 2015.
BibTeX

*Building Performance Transportable Codes for Extreme Scale*, **William Gropp**, 2015 SIAM Conference on Computational Science and
Engineering, 287, Poster in Minisymposterium 204: CSE Software, 2015.
BibTeX

*Towards Millions of Communicating Threads*, Dang, Hoang-Vu, Snir, Marc, and Gropp, William, Proceedings of the 23rd European MPI Users' Group Meeting, 1–14, 2016.
BibTeX

*Modeling MPI Communication Performance on SMP Nodes: Is It Time to Retire the Ping Pong Test*, Gropp, William, Olson, Luke N., and Samfass, Philipp, Proceedings of the 23rd European MPI Users' Group Meeting, 41–50, 2016.
BibTeX

*Scalable Non-blocking Preconditioned Conjugate Gradient Methods*, Eller, Paul R. and Gropp, William, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 18:1–18:12, 2016.
BibTeX

*Towards a More Complete Understanding of SDC
Propagation*, Jon Calhoun, Marc Snir, Luke N. Olson, and **William D. Gropp**, HPDC, H. Howie Huang, Jon B. Weissman, Adriana
Iamnitchi, and Alexandru Iosup, 131–142, 2017.
BibTeX

*A DSL for Performance Orchestration*, Thiago Santos Faria Xavier Teixeira, David Padua, and **William Gropp**, 26th International Conference on Parallel Architectures and Compilation
Techniques, PACT 2017, Portland, OR, USA, September 9-13, 2017, 372, 2017.
BibTeX

*High Performance Computing Systems. Performance
Modeling, Benchmarking,
and Simulation - 5th International Workshop, PMBS
2014, New Orleans,
LA, USA, November 16, 2014. Revised Selected Papers*, 2015.
BibTeX

*Numerical Solution of Transport Equations*, **William D. Gropp**, Stanford University, STAN-CS-81-888, Ph.D. Thesis, 1981.
BibTeX

*Local Uniform Mesh Refinement for Elliptic Partial
Differential Equations*, **William D. Gropp**, Yale University, Department of Computer Science, YALE/DCS/RR-278, 1983.
BibTeX

*The Computer as an Aid in the Asymptotic Estimation
of Integrals*, G. W. Hedstrom and **William D. Gropp**, Lawrence Livermore National Laboratory, UCRL-87297, 1983.
BibTeX

*A High Performance Bulk Memory System*, W. D.~Gropp, J.~J.~O'Donnell, S.~T.~O'Donnell, M. H.~Schultz, and B.~Weston, Yale University, Department of Computer Science, YALE/DCS/RR-311, 1984.
BibTeX

*Local Uniform Mesh Refinement with Moving Grids*, **W. D. Gropp**, Yale University, YALEU/DCS/RR-313, 1984.
BibTeX

*Local Uniform Mesh Refinement on Loosely-Coupled
Parallel Processors*, **W. D. Gropp**, Yale University, YALEU/DCS/RR-352, 1984.
BibTeX

*A Comparison of Domain Decomposition Techiques for
Elliptic Partial Differential Equations and Their
Parallel Implementation*, D. E. Keyes and **W. D. Gropp**, Comput. Sci. Dept., Yale Univ., YALEU/DCS/RR-448, 1985.
BibTeX

*Dynamic Grid Manipulation for PDE's on Hypercube
Parallel Processors*, **W. Gropp**, Department of Computer Science, Yale University, YALEU/DCS/RR-458, 1986.
BibTeX

*Computational Fluid Dynamics on Parallel Processors*, **W. D. Gropp** and E. B. Smith, Department of Computer Science, Yale University, YALEU/DCS/RR-570, 1987.
BibTeX

*A Domain Decomposition Technique on a Loosely
Coupled Array of Processors*, R. H.~Herbin, W.~D.~Gropp, D.~E.~Keyes, and V.~Sonnad, IBM Kingston, KGN-124, 1987.
BibTeX

*Recursive Mesh Refinement on Hypercubes*, **W. Gropp** and I. Ipsen, Department of Computer Science, Yale University, YALE/DCS/RR-616, 1988.
BibTeX

*A Parallel Version of the Fast Multipole Method*, L. Greengard and W.~Gropp, Yale University, Department of Computer Science, YALE/DCS/RR-640, 1988.
BibTeX

*Krylov methods preconditioned with incompletely
factored matrices on the CM-2*, H. Berryman, J. Saltz, **W. Gropp**, and R.
Mirchandaney, NASA Langley Research Center, ICASE, 89-54, Also Yale University YALE/DCS/RR-685, March 1989, 1989.
BibTeX

*Domain Decomposition on Parallel Computers*, **William D. Gropp** and David Keyes, Yale University, Department of Computer Science, YALE/DCS/RR-723, 1989.
BibTeX

*Domain Decomposition with Local Mesh refinement*, David E. Keyes and **William D. Gropp**, Yale University, YALEU/DCS/RR-726, 1989.
BibTeX

*Parallel Performance of
Domain-decomposed Preconditioned Krylov Methods for
PDEs with Adaptive Refinement*, **William D. Gropp** and David Keyes, Yale University, Department of Computer Science, YALE/DCS/RR-773, Also ANL Preprint MCS-P147-0490, May 1990, 1990.
BibTeX

*Parallel Domain decomposition and the Solution of
Nonlinear Systems of Equations*, **William D. Gropp** and David E. Keyes, Mathematics and Computer Science Division, Argonne
National Laboratory, MCS-P186-1090, 1990.
BibTeX

*Domain-decomposable Preconditioners for Second-order
Upwind Discretizations of Multicomponent Systems*, David E. Keyes and **William D. Gropp**, Mathematics and Computer Science Division, Argonne
National Laboratory, MCS-187-1090, 1990.
BibTeX

*Convergence Rate Estimate for a Domain Decomposition Method*, X.-C. Cai, **William D. Gropp**, and David E. Keyes, Yale University, Department of Computer Science, YALE/DCS/RR-827, also ANL Preprint MCS-P202-1290, January 1991, 1991.
BibTeX

*Domain Decomposition with Local Mesh Refinement*, **William D. Gropp** and David E. Keyes, ICASE, 91-19, 1991.
BibTeX

*Domain Decomposition Methods in Computational Fluid
Dynamics*, **William D. Gropp** and David E. Keyes, ICASE, 91-20, Also ANL Preprint MCS-P210-0191, April 1991, 1991.
BibTeX

*Parallel Computing and Domain Decomposition*, **William D. Gropp**, Mathematics and Computer Science Division, Argonne
National Laboratory, MCS-P257-0891, 1991.
BibTeX

*A Test Implementation of the MPI Draft
Message-Passing Standard*, **William Gropp** and **Ewing Lusk**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL-92/47, 1992.
BibTeX

*Experiences with Domain Decomposition in Three
Dimensions: Overlapping Schwarz Methods*, **William D. Gropp** and Barry F. Smith, Mathematics and Computer Science Division, Argonne
National Laboratory, Appeared in the Proceedings of the Sixth
International Symposium on Domain Decomposition
Methods, 1992.
BibTeX

*Domain Decomposition as a Mechanism for Using
Asymptotic Methods*, David E. Keyes and **William D. Gropp**, Mathematics and Computer Science Division, Argonne
National Laboratory, MCS-P322-0892, 1992.
BibTeX

*Early Experiences with the IBM SP-1*, **William Gropp** (ed.), Mathematics and Computer Science Division, Argonne
National Laboratory, ANL-MCS-TM-177, 1993.
BibTeX

*Early Experiences with the IBM SP1 and the
High-Performance Switch*, **William Gropp**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL-93/41, 1993.
BibTeX

*The design of data-structure-neutral libraries for the
iterative solution of sparse linear systems*, **William D. Gropp** and Barry F. Smith, Argonne National Laboratory, MCS-P356-0393, 1993.
BibTeX

*Numerical Simulation of Vortex Dynamics in
High- T_c Superconductors*, William D. Gropp, Hans Kaper, G. Leaf, D. Levine, V. Vinokur, and M. Palumbo, Mathematics and Computer Science Division, Argonne
National Laboratory, MCS-P476-1094, 1994.
BibTeX

*Integral Equations in Nonlinear 3d Magnetostatics*, L. Kettunen, K. Forsman, D. Levine, and **W. Gropp**, Mathematics and Computer Science Division, Argonne
National Laboratory, MCS-P460-0894, 1994.
BibTeX

*Early applications in the Message Passing Interface
(MPI)*, Anthony Skjellum, **Ewing Lusk**, and **William Gropp**, Department of Computer Science, Mississippi State
University, 1994.
BibTeX

*An Introduction to Performance Debugging for Parallel
Computers*, **William Gropp**, Argonne National Lab, MCS-P500-0295, 1995",
abstract-url = "http://www.mcs.anl.gov/division/publications/abstracts95.html.
BibTeX

*Using the Scalable Nonlinear Equations Solvers
Package*, **William D. Gropp**, Lois Curfman McInnes, and Barry F.
Smith, Argonne National Lab, ANL/MCS-TM-193, 1995.
BibTeX

*Solution of Dense Systems of Linear Equations
Arising from Integral Equation Formulations*, K. Forsman, **W. Gropp**, L. Kettunen, D. Levine, and J. Salonen, Mathematics and Computer Science Division, Argonne
National Laboratory, MCS-P538-0895, 1995.
BibTeX

*I/O Characterization of a Portable Astrophysics
Application on the IBM SP and Intel Paragon*, **Rajeev Thakur**, **Ewing Lusk**, and **William Gropp**, Mathematics and Computer Science Division, Argonne
National Laboratory, MCS-P534-0895, Revised October 1995, 1995.
BibTeX

*An Experimental Evaluation of the Parallel I/O
Systems of the IBM SP and Intel Paragon Using a
Production Application*, **Rajeev Thakur**, **William Gropp**, and **Ewing Lusk**, Argonne National Laboratory, MCS-P569–0296, 1996.
BibTeX

*An Abstract-Device Interface for Implementing Portable
Parallel-I/O Interfaces*, **Rajeev Thakur**, **William Gropp**, and **Ewing Lusk**, Argonne National Laboratory, Mathematics and Computer
Science Division, MCS-P592-0596, 1996.
BibTeX

*Optimization Environments and the NEOS Server*, **William D. Gropp** and Jorge Moré, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P654-0397, Also CRPC-TR97708 and available at
http://www.crpc.rice.edu/softlib/TRs_online.html, 1997.
BibTeX

*Why are PVM and MPI so different?*, **William D. Gropp** and **Ewing Lusk**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P667-0697, 1997.
BibTeX

*I/O in Parallel Applications: The Weakest Link*, **Rajeev Thakur**, **Ewing Lusk**, and **William Gropp**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P700-1197, Appeared in IJSA, 1997.
BibTeX

*A Case for Using MPI's Derived Datatypes to Improve I/O
performance*, **Rajeev Thakur**, **Ewing Lusk**, and **William Gropp**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P717-0598, Appeared at Supercomputing'98., 1998.
BibTeX

*Data Sieving and Collective I/O in ROMIO*, **Rajeev Thakur**, **William Gropp**, and **Ewing Lusk**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P723-0898, Submitted to Frontiers'99., 1998.
BibTeX

*A Microkernel Design for Component-based Numerical Software Systems*, Satish Balay, **William Gropp**, Lois Curfman McInnes, and Barry Smith, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P727-0998, 1998.
BibTeX

*Globalized Newton-Krylov-Schwarz Algorithms and Software for Parallel Implicit CFD*, **William Gropp**, David E. Keyes, Lois C. McInnes, and M. D. Tidriri, ICASE, 98-24, Also NASA/CR-1998-208435, 1998.
BibTeX

*Achieving High Performance with MPI-IO*, **Rajeev Thakur**, **William Gropp**, and **Ewing Lusk**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P742-0299, 1999.
BibTeX

*Infrastructure and Interfaces for Large-Scale Numerical
Software*, Lori Freitag, **William Gropp**, Paul Hovland, Lois
Curfman McInnes, and Barry Smith, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P751-0599, 1999.
BibTeX

*Reproducible Measurements of MPI Performance
Characteristics*, **William Gropp** and **Ewing Lusk**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P755-0699, 1999.
BibTeX

*A Standard Interface for Debugger Access to Message Queue
Information in MPI*, James Cownie and **William Gropp**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P754-0699, 1999.
BibTeX

*Toward Scalable Performance Visualization with Jumpshot*, Omer Zaki, **Ewing Lusk**, **William Gropp**, and Deborah
Swider, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P763-0699, 1999.
BibTeX

*Achieving High Sustained Performance in an Unstructured Mesh
CFD Application*, W. Kyle Anderson, **William D. Gropp**, Dinesh Kaushik, David E. Keyes, and Barry F. Smith, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P776-0899, Appeared in Proceedings of SC99, 1999.
BibTeX

*Globalized Newton-Krylov-Schwarz Algorithms and
Software for parallel implicit CFD*, **William Gropp**, David Keyes, Lois McInnes, and M. D. Tidiri, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P788-0100, Appeared in High Performance Computing Applications, 2000.
BibTeX

*Exploiting Hierarchy in Parallel Computer Networks to
Optimize Collective Operation Performance*, N. T. Karonis, B. R. de Supinski, I. Foster, **W. Gropp**, **E. Lusk**, and J. Bresnahan, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P788-0200, 2000.
BibTeX

*Runtime Checking of Datatype Signatures in MPI*, **William D. Gropp**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P826-0500, 2000.
BibTeX

*A Scalable Process-Management Environment for Parallel
Programs*, Ralph Butler, **William Gropp**, and **Ewing Lusk**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P812-0400, 2000.
BibTeX

*Performance Modeling and Tuning of an Unstructured Mesh
CFD Application*, **William D. Gropp**, Dinesh K. Kaushik, David E. Keyes, and Barry F. Smith, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P833-0700, 2000.
BibTeX

*Software for the Scalable Solution of PDEs*, Satish Balay, **William Gropp**, Lois Curfman McInnes, and Barry F. Smith, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P834-0700, 2000.
BibTeX

*Parallel I/O*, **Rajeev Thakur** and **William Gropp**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P837-0700, 2000.
BibTeX

*MPICH-GQ: Quality of Service for Message Passing Programs*, Alain Roy, Ian Foster, **William Gropp**, Nicholas
Karonis, Volker Sander, and Brian Toonen, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P838-0700, 2000.
BibTeX

*Understanding the Parallel Scalability of an Implicit
Unstructured Mesh CFD Code*, **W. D. Gropp**, D. K. Kaushik, D. E. Keyes, and B. F. Smith, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P845-0900, 2000.
BibTeX

*Latency, Bandwidth, and Concurrent Issue Limitations in
High-Performance CFD*, **W. D. Gropp**, D. K. Kaushik, D. E. Keyes, and B. F. Smith, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P850-1000, 2000.
BibTeX

*High Performance Parallel Implicit CFD*, **W. D. Gropp**, D. K. Kaushik, D. E. Keyes, and B. F. Smith, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P863-1200, 2000.
BibTeX

*Components and Interfaces of a Process Management System for
Parallel Programs*, Ralph Butler, **William Gropp**, and **Ewing Lusk**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P872-0201, 2001.
BibTeX

*Scalable Unix Commands for Parallel Processors: A
High-Performance Implementation*, Emil Ong, **Ewing Lusk**, and **William Gropp**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P885-0601, 2001.
BibTeX

*Learning from the Success of MPI*, **William Gropp**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P903-0801, 2001.
BibTeX

*Building Library Components That Can Use Any MPI
Implementation*, **William Gropp**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P956-0502, 2002.
BibTeX

*Scalable Log Files for Parallel Program Trace Data*, Anthony Chan, **William Gropp**, and **Ewing Lusk**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-TM-256, 2002.
BibTeX

*A Multilevel Approach to Topology-Aware Collective
Operations in Computational Grids*, Nicholas T. Karonis, Bronis de Supinski, Ian Foster, **William Gropp**, **Ewing Lusk**, and Sebastien Lacour, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P948-0402, 2002.
BibTeX

*Improving the Performance of MPI Derived Datatypes by
Optimizing Memory-Access Cost*, Surendra Byna, **William Gropp**, Xian-He Sun, and Rajeev
Thakur, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P1045-0403, 2003.
BibTeX

*Cray X1 Evaluation*, A. S. Bland, J. J. Dongarra, J. B. Drake, T. H. Dunigan, Jr., T. H. Dunning, Jr., A. Geist, B. Gorda, **W. D. Gropp**, R. J. Harrison, R. Kendall, D. Keyes, J. A. Nichols, L. Oliker, H. Simon, R. Stevens, J. B. White, III, P. H. Worley, and T. Zacharia, Oak Ridge National Laboratory, ORNL/TM-2003/67, 2003.
BibTeX

*Improving the Performance of Collective Operations in
MPICH*, **Rajeev Thakur** and **William Gropp**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P1038-0403, Appeared in Euro PVMMPI'03, 2003.
BibTeX

*Implementing Fast and Reusable Datatype Processing*, R. Ross, N. Miller, and **W. D. Gropp**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P1068-0703, Appeared in Euro PVMMPI'03, 2003.
BibTeX

*Design and Implementation of MPICH2 over Infiniband with
RDMA support*, Jiuxing Liu, Weihang Jiang, Pete Wyckoff, Dhabaleswar K. Panda, David Ashton, Darius Buntinas, **William Gropp**, and Brian Toonen, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P1103-1003, 2003.
BibTeX

*High Performance MPI-2 One-Sided Communication
over InfiniBand*, Weihang Jiang, Jiuxing Liu, Hyun-Wook Jin, Dhabaleswar K. Panda, **William Gropp**, and **Rajeev Thakur**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P1119-0104, 2004.
BibTeX

*Evaluating Structured I/O Methods for Parallel File
Systems*, Avery Ching, Alok Choudhary, Wei-keng Liao, Robert
Ross, and **William Gropp**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P1125-0204, To appear in IJHPCN, 2004.
BibTeX

*Optimization of Collective Communication Operations
in MPICH*, **Rajeev Thakur**, Rolf Rabenseifner, and **William Gropp**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P1140-0304, 2004.
BibTeX

*Fault Tolerance in MPI Programs*, **William Gropp** and **Ewing Lusk**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P1154-0404, 2004.
BibTeX

*Minimizing Synchronization Overhead in the
Implementation of MPI One-Sided Communication*, **Rajeev Thakur**, **William Gropp**, and Brian Toonen, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P1158-0504, 2004.
BibTeX

*Efficient Implementation of MPI-2 Passive
One-Sided Communication on InfiniBand Clusters*, Weihang Jiang, Kiuxing Liu, Hyun-Wook Jin, Dhabaleswar K. Panda, Darius Buntinas, Rajeev
Thakur, and **William Gropp**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P1164-0504, 2004.
BibTeX

*Providing Efficient I/O Redundancy in MPI Environments*, **William Gropp**, Robert Ross, and Neill Miller, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P1178-0604, 2004.
BibTeX

*The Design and Implementation of Message Passing
Services for the BlueGene/L Supercomputer*, George Almási, Charles Archer, Jos\'e
G. Castaños, John Gunnels, Chris Erway, Philip Heidelberger, Xavier Martorell, José
E. Moreira, Kurt Pinnow, Joe Ratterman, Burkhard Steinmacher-burow, **William Gropp**, and Brian Toonen, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P1183-0604, 2004.
BibTeX

*Issues in Accurate and Reliable Use of Parallel
Computing in Numerical Programs*, **William D. Gropp**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P1193-0804, 2004.
BibTeX

*Optimizing the Synchronization Operations in MPI
One-Sided Communication*, **Rajeev Thakur**, **William Gropp**, and Brian Toonen, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P1232-0205, 2005.
BibTeX

*Implementing MPI-IO Atomic Mode without File
System Support*, Robert Ross, Robert Latham, **William Gropp**, **Rajeev Thakur**, and Brian Toonen, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-P1235-0305, 2005.
BibTeX

*Understanding the Requirements Imposed by
Programming Model Middleware on a Common
Communication Subsystem*, Darius Buntinas and **William Gropp**, Argonne National Laboratory, ANL/MCS-TM-284, 2005.
BibTeX

*The Design and Evaluation of Nemesis, a Scalable
Low-Latency Message-Passing Communication Subsystem*, Darius Buntinas, Guillaume Mercier, and William
Gropp, Argonne National Laboratory, ANL/MCS-TM-292, 2005.
BibTeX

*Data Transfers Between Processes in an SMP System:
Performance Study and Application to MPI*, Darius Buntinas, Guillaume Mercier, and William
Gropp, Argonne National Laboratory, ANL/MCS-P1306-1105, Submitted to International Conference on Parallel and
Processing (ICPP) 2006., 2005.
BibTeX

*Parallel Tools and Environments: A Survey*, **William Gropp** and Andrew Lumsdaine, Argonne National Laboratory, ANL/MCS-P1342-0406, To appear in a SIAM volume of work presented at the SIAM Parallel Processing Conference in 2004, 2006.
BibTeX

*Annotations for Productivity and Performance Portability*, Boyana Norris, Albert Hartono, and **William Gropp**, Argonne National Laboratory, ANL/MCS-P1392-0107, 2007.
BibTeX

*Multiphysics Simulations: Challenges and Opportunities*, David E. Keyes, Lois Curfman McInnes, Carol Woodward, **William D. Gropp**, Eric Myra, Michael Pernice, John Bell, Jed Brown, Alain Clo, Jeffrey Connors, Emil Constantinescu, Don Estep, Kate Evans, Charbel Farhat, Ammar Hakim, Glenn Hammond, Glen Hansen, Judith Hill, Tobin Isaac, Xiangmin Jiao, Kirk Jordan, Dinesh Kaushik, Efthimios Kaxiras, Alice Koniges, Kihwan Lee, Aaron Lott, Qiming Lu, John Magerlein, Reed Maxwell, Michael McCourt, Miriam Mehl, Roger Pawlowski, Amanda Peters, Daniel Reynolds, Beatrice Riviere, Ulrich Rüde, Tim Scheibe, John Shadid, Brendan Sheehan, Mark Shephard, Andrew Siegel, Barry Smith, Xianzhu Tang, Cian Wilson, and Barbara Wohlmuth, Argonne National Laboratory, ANL/MCS-TM-321, Workshop Report, Park City, Utah, July 30 - August 6, 2011, sponsored by the Institute for Computing in Science (ICiS), 2012.
BibTeX

*A Case for Optimistic Coordination in HPC Storage Systems*, Carns, P., Harms, K., Kimpe, D., Wozniak, J.M., Ross, R., Ward, L., Curry, M., Klundt,
R., Danielson, G., Karakoyunlu, C., Chandy, J., Settlemyer, B., and Gropp, W., Oak Ridge National Laboratory (ORNL), 2012.
BibTeX

*CFD Vision 2030 Study: A Path to Revolutionary
Computational Aerosciences*, Jeffrey Slotnick, Abdollah Khodadoust, Juan
Alonso, David Darmofal, **William Gropp**, Elizabeth Lurie, and Dimitri Mavriplis, NASA, NASA/CR-2014-218178, 2014.
BibTeX

*PETSc Users Manual*, Satish Balay, Shrirang Abhyankar, Mark F. Adams, Jed Brown, Peter Brune, Kris
Buschelman, Lisandro Dalcin, Victor Eijkhout, William D. Gropp, Dinesh Kaushik, Matthew G. Knepley, Lois Curfman McInnes, Karl
Rupp, Barry F. Smith, Stefano Zampini, and Hong
Zhang, Argonne National Laboratory, ANL-95/11 - Revision 3.6, 2015.
BibTeX

*BDEC Pathways to Convergence: Toward a Shaping Strategy
for a Future Software and Data Ecosystem for Scientific Inquiry*, ICL-UT-17-08, 2017.
BibTeX

*Users Manual for the Chameleon Parallel Programming Tools*, **William D. Gropp** and Barry Smith, Mathematics and Computer Science Division,
Argonne National Laboratory, ANL-93/23, 1993.
BibTeX

*Users Manual for KSP:
Data-Structure-Neutral Codes Implementing Krylov Space Methods*, **William D. Gropp** and Barry Smith, Mathematics and Computer Science Division,
Argonne National Laboratory, ANL-93/30, 1993.
BibTeX

*Users Guide for the ANL IBM SP1*, **William D. Gropp**, **Ewing Lusk**, and Steven Pieper, Mathematics and Computer Science Division, Argonne National Laboratory, ANL/MCS-TM-198, 1994.
BibTeX

*Users Guide for the ANL IBM SPx*, **William D. Gropp** and **Ewing Lusk**, Mathematics and Computer Science Division,
Argonne National Laboratory, ANL/MCS-TM-199, 1994.
BibTeX

*Users Manual for doctext: Producing Documentation
from C Source Code*, **William D. Gropp**, Mathematics and Computer Science Division, Argonne National Laboratory, ANL/MCS-TM 206, 1995.
BibTeX

*Users Manual for tohtml:
Producing True Hypertext Documents from LaTeX*, **William D. Gropp**, Mathematics and Computer Science Division, Argonne National Laboratory, ANL/MCS-TM 207, 1995.
BibTeX

*Users Manual for bfort:
Producing Fortran Interfaces to C Source Code*, **William D. Gropp**, Mathematics and Computer Science Division, Argonne National Laboratory, ANL/MCS-TM 208, 1995.
BibTeX

*Installation Guide for mpich, a Portable
Implementation of MPI*,

*User's Guide for mpich, a Portable
Implementation of MPI*,

*PETSc 2.0 Users Manual*, Satish Balay, **William Gropp**, Lois Curfman McInnes, and Barry Smith, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL-95/11, 1997.
BibTeX

*Users Guide for ROMIO: A High-Performance, Portable
MPI-IO Implementation*, **Rajeev Thakur**, **William Gropp**, and **Ewing Lusk**, Mathematics and Computer Science Division, Argonne
National Laboratory, ANL/MCS-TM-234, 1997.
BibTeX

*Commodity Software?*, **William Gropp**, ``Head Node'' article, 2004.
BibTeX

*Performance Modeling for Systematic Performance Tuning*, **William D. Gropp**, Torsten Hoefler, and Marc Snir, Abstract only., 2011.
BibTeX