\A PARTNERSHIP FOR ADVANCED COMPUTATIONAL
INFMSTRUCTURE PROGRAM
Y4,SCI 2:104/47
Partnership for ftdvinced Conputatio. . .
jijiaRING
BEFORE THE
SUBCOMMITTEE ON BASIC RESEAKCH
OF THE
COMMITTEE ON SCIENCE
U.S. HOUSE OF REPRESENTATIVES
ONE HUNDRED FOURTH CONGRESS
SECOND SESSION
MARCH 19, 1996
[No. 47]
Printed for the use of the Committee on Science
OEPaSffORY '
SEP 1 8 1996
BnsrOWP«BUCLlBBAP'
U.S. GOVERNMENT PRINTING OFFICE
26-018 CC WASHINGTON : 1996
For sale by the U.S. Government Printing Office
Superintendent of Documents, Congressional Sales Office, Washington, DC 20402
ISBN 0-16-052912-3
PARTNERSHIP FOR ADVANCED COMPUTATIONAL
INFRASTRUCTURE PROGRAM
Y 4. SCI 2:104/47
Partnership for Advanced Conputatio .. .
nr^ARING
BEFORE THE
SUBCOMMITTEE ON BASIC RESEAKCH
OF THE
COMMITTEE ON SCIENCE
U.S. HOUSE OP REPRESENTATIVES
ONE HUNDRED FOURTH CONGRESS
SECOND SESSION
MARCH 19, 1996
[No. 47]
Printed for the use of the Committee on Science
SUPEBIfiTfNDEKTOFOOCOriL
DEWSiTORy
SEP 1 8 1996
BOSTON P«8UCLIBBAPV
U.S. GOVERNMENT PRINTING OFFICE
26-018 CC WASHINGTON : 1996
For sale by the U.S. Government Printing Office
Superintendent of Document.s. Congressional Sales Office. Washington. DC 20402
ISBN 0-16-052912-3
COMMITTEE ON SCIENCE
ROBERT S. WALKER,
F. JAMES SENSENBRENNER, Jr.,
Wisconsin
SHERWOOD L. BOEHLERT, New York
HARRIS W. FAWELL, Illinois
CONSTANCE A. MORELLA, Maryland
CURT WELDON, Pennsylvania
DANA ROHRABACHER, California
STEVEN H. SCHIFF, New Mexico
JOE BARTON, Texas
KEN CALVERT, California
BILL BAKER, California
ROSCOE G. BARTLETT, Maryland
VERNON J. EHLERS, Michigan**
ZACH WAMP, Tennessee
DAVE WELDON, Florida
LINDSEY 0. GRAHAM, South Carolina
MATT SALMON, Arizona
THOMAS M. DAVIS, Virginia
STEVE STOCKMAN, Texas
GIL GUTKNECHT, Minnesota
ANDREA H. SEASTRAND, California
TODD TL^RT, Kansas
STEVE LARGENT, Oklahoma
VAN HILLEARY, Tennessee
BARBARA CUBIN, Wyoming
MARK ADAM FOLEY, Florida
SUE MYRICK, North Carolina
Pennsylvania, Chairman
GEORGE E. BROWN, Jr., California RMM*
HAROLD L. VOLKMER, Missouri
RALPH M. HALL, Texas
BART GORDON, Tennessee
JAMES A. TRAFICANT, Jr., Ohio
JOHN S. TANNER, Tennessee
TIM ROEMER, Indiana
ROBERT E. (Bud) CRAMER, Jr., Alabama
JAMES A. BARCL\., Michigan
PAUL McHALE, Pennsylvania
JANE HARMAN, California
EDDIE BERNICE JOHNSON, Texas
DAVID MINGE, Minnesota
JOHN W. OLVER, Massachusetts
ALCEE L. HASTINGS, Florida
LYNN N. RIVERS, Michigan
KAREN McCarthy, Missouri
MIKE WARD, Kentucky
ZOE LOFGREN, California
LLOYD DOGGETT, Texas
MICHAEL F. DOYLE, Pennsylvania
SHEILA JACKSON LEE, Texas
WILLIAM P. LUTHER, Minnesota
David D. Clement, Chief of Staff and Chief Counsel
Barry Beringer, General Counsel
TiSH Schwartz, Chief Clerk and Administrator
Robert E. Palmer, Democratic Staff Director
Subcommittee on Basic Research
STEVEN SCHIFF, New Mexico, Chairman
JOE BARTON, Texas
BILL BAKER, California
VERNON J. Ehlers, Michigan
GIL GUTKNECHT, Minnesota
CONSTANCE A. MORELLA, Maryland
CURT WELDON, Pennsylvania
ROSCOE G. BARTLETT, Maryland
ZACH WAMP, Tennessee
DAVE WELDON, Florida
LINDSEY O. GRAHAM, South Carolina
VAN HILLEARY, Tennessee
SUE MYRICK, North Carolina
ROBERT E. (Bud) CRAMER, JR., Alabama
ALCEE L. HASTINGS, Florida
LYNN N. RIVERS, Michigan
LLOYD DOGGETT, Texas
WILLIAM P. LUTHER, Minnesota
JOHN W. OLVER, Massachusetts
ZOE LOFGREN, California
MICHAEL F. DOYLE, Pennsylvania
SHEILA JACKSON LEE, Texas
HAROLD L. VOLKMER, Missouri
BART GORDON, Tennessee
*Ranking Minority Member
**Vice Chairman
(II)
CONTENTS
WITNESSES
Page
March 19, 1996:
Dr. Paul Young, Assistant Director for CISE, National Science Founda-
tion, Arlington, Virginia 7
Dr. Edward Hayes, Chairman, Report on the Task Force on the Future
of NSF Supercomputing Centers Program, and Vice President for Re-
search, Ohio State University, Columbus, Ohio 26
Dr. Malvin Kalos, Director, Cornell Theory Center, Ithaca, New York 102
Dr. Larry Smarr, Director, National Center for Supercomputing Applica-
tions at UIUC (NCSA), Champaign, Illinois 107
Dr. Michael Levine and Dr. Ralph Roskies, Scientific Directors, Pitts-
burgh Supercomputing Center, Pittsburgh, Pennsylvania 168
Dr. Douglas Pewitt, Acting Director, San Diego Supercomputing Center,
La Jolla, California 169
Dr. Mary Vernon, Department of Computer Sciences and Engineering,
University of Washington, Seattle, Washington 188
Dr. Kelvin Droegemeier, CAPS Director, University of Oklahoma, Nor-
man, Oklahoma 192
Dr. Douglas Gale, Assistant Vice President of Information Systems and
Services, George Washington University, Washington, D.C 198
APPENDIX
Opening Statements:
Congressman Steve Schiff, Chairman, Subcommittee on Basic Research ... 204
Congressman Robert E. (Bud) Cramer, Jr., Ranking Minority Member,
Subcommittee on Basic Research 205
Congressman Michael F. Doyle 207
Revised statement for the record submitted by Dr. Ralph Roskies, Scientific
Director, Pittsburgh Supercomputing Center 207
Answers to questions submitted to Dr. Paul Young, Assistant Director for
CISE, National Science Foundation, by Members of the Subcommittee on
Basic Research 209
(III)
PARTNERSHIP FOR ADVANCED COMPUTA-
TIONAL INFRASTRUCTURE PROGRAM
TUESDAY, MARCH 19, 1996
U.S. House of Representatives,
Committee on Science,
Subcommittee on Basic Research,
Washington, DC.
The Subcommittee met at 1 p.m. in Room 2318 of the Rayburn
House Office Building, the Honorable Steven H. Schiff, Chairman
of the Subcommittee, presiding.
Mr. Schiff. The Subcommittee will please come to order.
Today the Subcommittee will exercise its oversight responsibil-
ities for the National Science Foundation by receiving testimony on
the solicitation of the Partnership for Advanced Computational In-
frastructure Program.
As this Congress continues to find new ways to balance the budg-
et and reduce the size of government, agencies are looking to maxi-
mize the value of each taxpayer dollar.
Of the government agencies, the National Science Foundation is
one of the best Federal agencies at running a lean and efficient or-
ganization. I extend my compliments to Dr. Lane and all of his em-
ployees.
This Congress provided the NSF with an increase in FY 1996
funding over FY 1995, but hard choices must still be made on fund-
ing priorities and the direction programs should be taking as we
head into the 21st Century.
One of those programs undergoing review is NSF's
Supercomputer Center program. NSF is making tough decisions on
the direction of the Supercomputer Centers, their funding, and
what role they should play in research applications.
The Partnership for Advanced Computational Infrastructure Pro-
gram builds on and replaces the current NSF Supercomputer Cen-
ters program established in 1985. The new program will focus on
taking advantage of newly emerging opportunities in high-perform-
ance computing and communications.
The Subcommittee is interested in assessing the major accom-
plishments of the NSF's Supercomputing Centers program over the
last 10 years, as well as the proposed restructuring of the program.
Additionally, the subcommittee desires to receive any comments
or specific suggestions from witnesses on the recompetition process
which NSF has developed for the program.
To outline a brief history of the Supercomputer program, the
NSF Supercomputing Centers Program was established in 1984 fol-
(1)
lowing strong expressions of the need for such computing resources
for the academic research community.
During a review in 1990, there was a distinct effort to expand
outreach services with initial efforts intended to forge closer ties to
industries that could profit from exposure to high-performance
computing and to include the community at large.
The Task Force Report we will receive testimony on today is the
latest of many studies of the Centers. The Director of the National
Science Foundation established the Task Force on the Future of
the NSF Supercomputer Centers Program in December 1994.
The Task Force was asked to analyze various alternatives for the
continuation, restructuring, or phaseout of NSF's current
Supercomputing Centers Program or the development of similar fu-
ture programs, and to make recommendations among the alter-
natives.
In January the National Science Board adopted the recommenda-
tion for a new competition of the Centers, and NSF announced the
Partnership for Advanced Computational Infrastructure.
Preliminary solicitations are due April 15th, 1996. The final
awards will be announced in the fall of 1996.
To learn more about the partnership program, it is a pleasure to
welcome Dr. Paul Young, the Assistant Director for Computer and
Information Science and Engineering at the National Science Foun-
dation.
We also welcome Dr. Ed Hayes, Chairman of the Task Force on
the Future of NSF Supercomputing Centers Program who rec-
ommended the National Science Board have a recompetition of the
existing Supercomputer Centers.
For our second panel we will welcome the directors of the four
Centers. Dr. Malvin Kalos of the Cornell Theory Center; Dr. Larry
Smarr of the National Center for Computing Applications; Dr.
Douglas Pewitt of the San Diego Supercomputing Center; and from
the Pittsburgh Supercomputing Center we have Dr. Michael Levine
and Dr. Ralph Roskies.
On our third panel we will hear from supercomputer researchers
Dr. Mary Vernon and Dr. Kelvin Droegemeir. We will hear also
from Dr. Douglas Gale who will present the views of organizations
which want to be involved in the new supercomputing program at
NSF.
Before calling the first panel up to testify, I first want to wel-
come officially here at the Subcommittee — I do not know that we
have done it on the record in the Full Committee yet — but welcome
officially to the Subcommittee my new Ranking Member from Ala-
bama, Congressman Cramer, and you are recognized for whatever
remarks you might like to make at this time.
Mr. Cramer. Thank you, Mr. Chairman. I will look forward to
working with you and the Committee Members both on my side of
the aisle and your side of the aisle, as well.
I am pleased to join the chairman in welcoming our distin-
guished witnesses this afternoon. I congratulate him on calling the
hearing on this important subject matter, the Future of the Na-
tional Science Foundation's Supercomputer Centers.
Today we are reviewing an NSF program which this subcommit-
tee has supported and encouraged since its inception ten years ago.
Hearings in the early 1980s highlighted concerns from the aca-
demic research community that researchers had very limited access
to advanced scientific computers. This was of concern because high-
performance computers were beginning to show enormous promise
as tools for attacking previously intractable problems. They had
opened entirely new areas of scientific inquiry.
At the outset, the Supercomputer Centers Program simply pro-
vided computer time to academic researchers at a few existing
supercomputer sites. That is not the way it works anymore.
The program was subsequently enlarged in scope to accelerate
the development and use of new hardware and software for science
and engineering applications, and to expand the number of re-
searchers skilled in the use of advanced computing technologies.
The NSF Supercomputer Centers have now become important in-
tellectual centers driving the rapid progress of scientific computing.
The impact of this program is summed up very well in a recent
report of the Task Force on the Future of the NSF Supercomputer
Centers Program.
Thorough review and assessment of the Supercomputer Centers
Program after 10 years is both reasonable and expected. NSF has
instituted two major external reviews over the past four years pro-
viding many opportunities for the affected research community to
express their views.
The restructuring plan for the Centers program, which the sub-
committee will hear more about today, has resulted from the rec-
ommendations of the Hayes Task Force. I am pleased that Dr.
Hayes is present to discuss them.
I would like to call attention to one particular recommendation
of this Hayes Task Force, which was. That the Leading Edge Com-
puter Center should be partnered with Regional Computer Centers
for maximum benefit to the academic research community.
We have such a regional computer center in my Congressional
District in Huntsville, Alabama, and I would like to submit for the
record a written statement from the Alabama Supercomputer Au-
thority, which indicates some of the activities of the Center and in-
dicates how regional centers can contribute to the new NSF Part-
nerships Program.
In light of the past performance of the Supercomputer Centers,
any proposal for restructuring the program must lay down a con-
vincing set of arguments that change will be beneficial.
This is particularly true since there seems to be agreement that
a principal role of the program remains unchanged. That is, to pro-
vide access to high-end computing infrastructure for the academic
science and engineering community.
I look forward to review of this Supercomputer Centers Program,
and in particular to a discussion of how the program may be
strengthened and how the proposed changes will affect the re-
search community, which has come to rely on this important source
for the conduct of science and engineering research.
I look forward to the subcommittee hearing today.
Mr. SCHIFF. Thank you, Mr. Cramer.
I want, before recognizing other Members, I just want to say
briefly that I want to affirm to you publicly what I have said pri-
vately.
During the six years I served on the Science Committee under
the Democratic Majority, I always felt treated very fairly by the
committee chairman, and by the subcommittee chairs, and I always
felt that partisanship never entered into this Committee, and that
if we had disagreements they were on the merits as we saw mat-
ters, and I want to assure you of my best intention to continue that
precedent.
With that
Mr. Cramer. Thank you.
Mr. SCHIFF. I note that we are joined by our Ranking Democratic
Member, Congressman Brown from California, who I recognize for
as much time as he wishes to consume.
[Bells ring.]
Mr. Brown. Thank you, Mr. Chairman. I will not consume a
great deal of time
[Bells begin to ring again.]
[Laughter.]
Mr. Brown. Uh-oh.
Mr. SCHIFF. I think that is a recess.
Mr. Brown. Two weeks ago there was a weekend program to
wire up as many schools as possible in California to the Internet,
which I thought was a marvelous PR thing, and actually did hook
up quite a few schools.
Yesterday morning I visited one of those schools and talked to an
8th grade class, or rather watched them operating their computers
and accessing the Worldwide Web and doing a lot of other things
that I thought were incomprehensible, or probably would not ever
have occurred.
This morning I met with the representatives of most of the major
cable companies in California.
Now I cite these three things to illustrate the changes that have
taken place in the relatively recent past.
Many of these changes I think can be attributed to this program
which has developed the capability for making better use of com-
puters in our institutions of higher learning, as well as our institu-
tions of lower learning.
It is one of the marvels of my experience here in Congress to
have been a part of this development over these last 10 or 15 years.
The next 10 or 15 years can be equally exciting. I won't be here
for most of that time, but many of you will and I urge you to take
the same care and concern over the further development of these
capabilities that have proven to be such a successful activity, and
one in which the committee has had a fairly important role over
the last 10 or 15 years.
It can give you great satisfaction in your work, I can assure you.
Thank you, Mr. Chairman.
Mr. SCHIFF. Thank you, Mr. Brown.
Does any other Member seek recognition for any opening state-
ment?
Mr. Ehlers?
Mr. Ehlers. Thank you, Mr. Chairman. Just briefly, several
items.
First of all, I am very impressed with the panel you have assem-
bled and I look forward to the hearing. I apologize that I have to
go to the Floor at some point to defend a bill that is up for consid-
eration.
The Chairman of the House Oversight Committee is tied up in
another committee, so I have to go there to defend it, and I apolo-
gize to the panel when I am absent.
The other point I would make is. A few weeks ago I circulated
a letter among my colleagues in the matter and in just a few days
received 89 signatures asking that we immediately put in place
full-year funding for the National Science Foundation.
I was assured that this would receive serious consideration, and
in fact it did, but it has been held up because there is an effort to
provide full-year funding for every part of the government that is
not fully funded at this point.
So that issue is in abeyance.
I simply want to get this on the record. Once again, if for some
reason we are not able to complete work on the Fiscal 1986 budget
soon, I believe it imperative that we provide full-year funding for
the National Science Foundation as soon as possible, since they op-
erate largely through grants, and many grants are being held up
pending completion of the budget, and many of the individuals in
this room, plus thousands of others throughout the country, are
anxiously awaiting confirmation of their grguits and final award of
the grants.
I think it is extremely important for us to recognize as a Con-
gress and as a Nation the important role that science plays in the
infrastructure for our economic engine. What we are about to hear
today says the same thing.
The Ranking Member of the Full Committee, Mr. Brown, for
years has been a strong advocate not only of science but particu-
larly of the supercomputing centers, as well as coordination of com-
puting activities among various agencies.
We are trying to carry on that mandate, and it has become ex-
tremely difficult because of the uncertainty of funding.
Now, Mr. Chairman, I believe it is very important that we get
the funding issue settled as soon as possible and ensure that we
continue as a Nation to have the computing capability we need to
carry on the scientific enterprise in this Nation.
I thank you for holding the hearing, and thank you especially for
assembling the panel that you have.
Thank you.
Mr. SCHIFF. Thank you, Mr. Ehlers.
As one of many of the dozens of lawyers in Congress, I am
pleased to have a real scientist here on the subcommittee.
Before I recognize more Members, I want to add two things very
briefly. The first is. I join you in the belief that if we are not able
to fund all agencies for the balance of the fiscal year, that we
should identify and fund the National Science Foundation sepa-
rately and would support that, and I appreciate your leadership in
it.
The second is, as you mentioned you have to go to the House
Floor in a bit to speak on behalf of a bill, so do I. I want to express
to those of you who are here who may never have seen a Congres-
sional hearing before, those of you who have been here before are
quite accustomed to the fact that we schedule several things at any
one time.
Those of you who have never been to a hearing before I under-
stand can be a little disconcerted at our coming and going, but
what I want to emphasize is that the main purpose of a hearing
is to record the information so in due course it is made available
to all Members of Congress.
And of course the most important person at the hearing, there-
fore, is the lady who is taking down everything that we are saying
back and forth because, regardless of who is sitting in this chair
or the other chairs, all that information gets across to the entire
Congress.
So I want to assure you of the importance we place on this hear-
ing, even if at times we may have to personally leave it for a bit.
Let me now recognize the gentleman from Pennsylvania, Con-
gressman Doyle.
Mr. Doyle. Thank you, very much.
Mr. Chairman, I want to thank you for holding today's hearing
on the National Science Foundation's Supercomputing Centers Pro-
gram.
As a Member who is fortunate enough to have one of these cen-
ters in my District, I am well aware of the contribution that they
have made and will continue to make to our Nation's technological
infrastructure.
I do want to start by welcoming our witness from the Pittsburgh
Center, Dr. Ralph Roskies. He, along with many other dedicated in-
dividuals, has made the Pittsburgh Supercomputing Center an
internationally recognized success.
The PSC has made meaningful contributions in many areas-no-
tably in assisting NIH research, developing weather models for the
National Weather Service, and helping EPA with its Air Quality
Assessments.
Today's hearing examines the status of all the Supercomputing
Centers, which I believe is a timely and worthwhile undertaking.
We have before us three relatively new reports, the most recent
of which, the Hayes Report, calls for some rather significant action
to be taken on this program.
I am hopeful that today's hearing will give us an opportunity to
understand why the Hayes Report calls for a departure from what
has been an unquestionably successful program.
Again, I am pleased that we are having this hearing. I hope it
will allow us to understand the significance of this program prior
to acting in any way to dismantle it.
Thank you very much, Mr. Chairman.
Mr. SCHIFF. Thank you, Mr. Doyle. I wanted to appreciate your
efforts on behalf of supercomputing both on behalf of the Pitts-
burgh Center and on behalf of the issue generally. It has been a
good contribution to this subcommittee.
Mr. Doyle. Thank you, Mr. Chairman.
Mr. SCHIFF. Now I want to say that all Members' statements,
without objection, will be made part of the record but if any other
Member desires to make an opening oral statement I will recognize
them at this time.
[No response.]
Mr. SCHIFF. I see no requests, so I am going to invite our first
panel.
Dr. Young and Dr. Hayes, please joint us at the witness table.
Gentlemen, while you are getting set up there, let me say that
your complete written statements will be made a permanent part
of this record and invite you to summarize or proceed as you think
best.
Dr. Young, please proceed.
STATEMENT OF DR. PAUL YOUNG, ASSISTANT DIRECTOR FOR
COMPUTER AND INFORMATION SCIENCE AND ENGINEER-
ING, NATIONAL SCIENCE FOUNDATION
Dr. Young. All right. I appreciate that, and I
Mr. SCHIFF. I think your microphone may not be on.
Dr. Young. Thank you. I appreciate that, and I will limit myself
to oral comments that I've made.
Chairman Schiff, Members, staff, guests:
I am Paul Young, Assistant Director of the National Science
Foundation's Computer and Information Science and Engineering.
I would like to begin by commenting how much I, and I am sure
all members of the Foundation and members of the science and en-
gineering communities generally, appreciate the comments of sup-
port and the efforts by Mr. Ehlers, by chairman Schiff, and by this
committee as a whole in support of both the 1996 budget, and basic
research and science and engineering generally.
My Directorate at the Foundation supports both research by, and
infrastructure, for the scientific and engineering communities. I am
here today to discuss our new program, Partnerships in Advanced
Computational Infrastructure, which is a follow-on to the very suc-
cessful NSF Supercomputer Centers Program begun in 1984.
Our research programs in computer science and engineering sup-
port basic research in computing information and communications.
Our organization's other infi-astructure program is in networking,
a program which has led to the current development of the
Internet, a subject about which you see daily references in the
press and elsewhere, and indeed Mr. Brown referenced the current
linking of the California schools, and we are very enthused about
that as well.
It is interesting to note that our two infi-astructure programs
began in the same NSF office about 1984 and have continued to
support and complement each other to this day.
I am going to first talk a bit about the background that led to
the establishment of the Supercomputer Centers Program, and
then briefly review some of its successes and the reasons for them.
I will then discuss the process that led to the restructured pro-
gram and its goals.
And finally, I would like to lay out the process and time line for
putting our new program in place and our commitment to a smooth
transition, a transition which will minimize the impact on the com-
munity of scientists and engineers who depend on us to support
their research.
When I finish, I would like to leave you with the following points:
First, the Supercomputer Centers Program was created in re-
sponse to a critical need of the research and education commu-
8
nities. It has not only succeeded in meeting that need, it has gone
well beyond.
Second, the science and engineering enabled by the program is
of extremely high quality. The need for continued support remains,
and is in fact increasing as computation becomes increasingly im-
portant in more and more research areas.
Third, advances in technology will now enable us to restructure
the program to involve a broader spectrum of individuals and insti-
tutions, chosen in a competition where the best ideas will be sought
and implemented.
Fourth, we at NSF are committed to a fair and open competition
using the best expertise in industry, academia, and government to
review proposals.
And finally, every effort will be made to maintain quality service
to the research community during the transition to the new pro-
gram.
The history of the Supercomputer Centers Program can be traced
back to the efforts of a number of people who realized that com-
putation could provide an important tool for research in science
and engineering, who also realized that advanced computation was
not generally available to the research communities that NSF tra-
ditionally supports.
In the early 1980s, NASA and DOE scientists had access to ad-
vanced computational tools for programmatic needs. NSF research-
ers whose work was related to these programmatic agency goals
could, in some cases get, access, but others could not.
For example, there were cases at that time where the only way
that NSF researchers could do their computational work was to go
overseas.
The realization of the need for computation to support a broad
range of research in science and engineering and the support of
many in Congress and in the Administration led to the establish-
ment of, first an access program; and then the NSF Supercomputer
Centers in the 1984-85 time range.
To say that this program has been a success is truly an under-
statement. Things that were not even imagined at the beginning
have been accomplished in the fields of science and engineering
that no one envisioned as having any need for computation have
been positively impacted.
It is often said that success has many parents and failure is an
orphan. If this can be quantified, then the degree of success must
be related to the number of people who have been a part of the en-
terprise over the last 11 years.
There are a multitude. Some of them are here. But since the in-
ception of the program, more than 20,000 people have used the pro-
gram in support of their research, and over 100 industrial firms
have used the program for training and to test the applicability of
supercomputing for their firms.
In one recent year alone, over 20,000 people had contact with the
Centers' education and training programs. Let me give a few exam-
ples of technology and research activities associated with the pro-
gram, but remind you that later speakers will continue to provide
instances of major successes in research and education.
One important far-reaching development in the success of the
Centers was the reahzation that researchers needed access from
their home institutions. Originally, several of the Centers even ran
their own networks; but as interest broadened, it became clear that
a national network was required.
The Centers were the original nodes on the NSFNet's National
Network, and the original nodes on the high-speed Net backbone,
and as such they played a major role in driving the networking
technologies needed for today's Internet.
Today they are playing a similar role in the evolution of NSF's
experimental, very high-speed network, the VBNS.
Another tool that assumed importance came from the realization
that the results of complex calculations could not be understood
from conventional computer output. The program has dramatically
enhanced the development of software tools to visualize the results
of calculations.
With the network connection, access to high-speed computers,
and visualization tools, computational scientists and engineers
were empowered to follow their imagination to new results and to
new understandings.
What are some of those research areas? And how is the field de-
veloping?
As one example, Gerry Ostriker, a cosmologist, who is now the
Provost at Princeton University, in briefing the National Science
Board and the Hayes Task Force, said that years ago he believed
that computers were having a negative impact on his area of re-
search. Promising graduate students would get intrigued with at-
tacking fi-ont-line physics problems with underpowered computers
and software, and then get absorbed in the technology.
Since then, the power of the machines and the understanding of
programming them have made enormous strides. Today, Ostriker
is one of the principal investigators in a team that has made im-
portant progress in modeling the formation of the universe.
The models are now so good that they can differentiate between
competing theories of the universe, and focus both theoretical and
observational work on the most fruitful lines of inquiry.
Another rapidly growing area of computation is in biology, where
physiologically interesting molecules can be successfully modelled
by computational methods and their behavior understood.
It is more and more likely that biologically active molecules — for
example, new drugs — can be designed and their behavior, both
good and bad, predicted with computational models.
Climate and meteorology research continue to benefit from com-
puting. Later this afternoon you will hear about storm prediction.
Work is also going on trying to understand the effects of green-
house gases on long-term climate change. These calculations re-
quire a sophisticated understanding of the interaction of the atmos-
phere and the ocean, which will have important other major bene-
fits to all of us.
Every time we think we understand which fields can benefit from
computational results, a new threshold in computing capability is
crossed and another field blossoms. As our progress has continued,
it can be truly said that computation has become a full partner in
10
the scientific method, along with experimentation, theory, and ob-
servation.
In 1993 a blue-ribbon panel chaired by Lou Branscomb issued a
report on The Future of Computation and the Tools Needed To Ex-
ploit It. Along with a truly ringing endorsement of the accomplish-
ments of the NSF's Centers Program, this report called for distrib-
uting the technology across a pyramid of capability from the apex
to work stations.
At the time of the Branscomb report, the budget of the national
High Performance Computing and Communications Program was
predicted to double in five years. While the Branscomb Report was
well received, budget realities prevailed.
The National Science Board, when confi-onted with a proposal to
continue the Centers without recompetition, was reluctant. Later
that year, the Board approved a two-year extension to the program
while a task force chaired by Ed Hayes evaluated a direction that
was more consistent both with budget predictions and with a rap-
idly evolving technology.
The Task Force on The Future of the NSF Supercomputers Pro-
gram met and conferred during calendar 1995, issuing a final re-
port to the NSF Director in September. The Task Force was an
independent group with many sectors represented, and it dealt
with the really hard issues.
The next speaker, Ed Hayes, was chairman of that Task Force
and will speak to its conclusions and the report. Based on the
Hayes Report and on deliberations within the Foundation, NSF
management decided in late 1995 to put forth a new program to
the National Science Board which was approved at their December
1995 meeting.
It is designed to officially take advantage of new partnerships,
explicitly including regional and state partnerships, and these in
turn take advantage of and drive both scalable parallel computing
and new advanced networking capabilities. The program solicita-
tion for the new program was released on the Worldwide Web just
prior — literally, the day of — the government shutdown on Decem-
ber 15th, 1995.
Later, a formal printed solicitation and a "Dear Colleague" letter
were issued, and a series of informational meetings were held on
both Coasts and in the Midwest.
As evidence of the accessibility of this information, and in spite
of the shutdown, our records show that over 1000 accesses were
made to the Web information before the shutdown ended, and that
over 3000 accesses have been made to the Hayes report and the so-
licitation from over 2000 distinct Internet addresses.
The current Centers Program and the new Partnerships Program
are designed to support the academic research community. As a
general rule, proposals and principal investigators at NSF are
drawn from academia. This is pointed out in the solicitation.
However, NSF is open to good ideas from any sector, and the cur-
rent program involves industry successfully in a variety of ways.
We will be pleased to see such involvement continue in the future,
and we will carefully review any new proposal that meets NSF's
broad acceptance criteria.
11
The time line and the process for a fair evaluation of proposals
is painstakingly detailed in our written testimony, and the solicita-
tion and the executive summary of the Hayes report are also in-
cluded.
In my opinion, the NSF has listened carefully to its constituents,
has paid close attention to the scientific needs of the Nation, and
is going forward with the program designed to capitalize on tech-
nical and budget realities.
We feel very confident that the new proposals we receive will be
very innovative, and they will be carefully reviewed using NSF's
best traditions of peer review.
We expect to go forward with a program that the Nation can be
proud of, and one that will continue the unparalleled successes of
the past.
Thank you, Mr. Chairman.
[The prepared statement of Dr. Young follows:]
12
Testimony for
Paul R. Young, Assistant Director
Computer and Information Science and Engineering
National Science Foundation
before the
Basic Science Subcommittee
House Committee on Science
March 19, 1996
Mr. Chairman and Members of the Subcommittee, thank you for the
opportunity to testify here today on NSF's new program of Partnerships for
Advanced Computational Infrastructure (PACI).
I am Paul Young, Assistant Director for NSF's Directorate for Computer and
Information Science and Engineering (CISE).
The new PACI program, which replaces the successful NSF Supercomputer
Centers program, is one important element in the Foundation's
implementation of the National High Performance Computing and
Communications (HPCC) Program.
At NSF the HPCC Program focuses on advancing the full range of advanced
computing, communications, information technologies, and infrastructure,
and supports fundamental research in these areas of computer science and
engineering, as well as the great diversity of applications in all areas of science
and engineering research which require HPCC. Thus, the research in
computer science and engineering supported in CISE is the prerequisite to the
development of future high performance computing, communications and
information systems, and forms the basis of an advanced National
Information Infrastructure. CISE also provides computing and
communications infrastructure for all research and education supported by
the Foundation through support for the Internet and the Supercomputer
Centers.
In this context, we expect that the new PACI Program will benefit from, and
provide new opportunities in, future high performance computing and
communications activities throughout the nation.
NSF's Early Participation in High Performance Computing
The National Science Foundation is now a major partner in providing the
nation's high performance computing infrastructure, but this was not always
the case. In the early 1970s the NSF ceased its support of campus computing
centers, and by the mid-1970s there were no "supercomputers" generally
13
a\-ailable to the academic community on any campus. Although computers of
this capability were available through other government agency (DoE and
NASA) laboratories, NSF did not play a role. As a consequence, most
academic researchers did not have the ability to perform computational
research on anything other than a departmental minicomputer, thereby
limiting the scope of research in many fields of science and engineering.
This lack of access to high performance computing was noted in the early
1980's in of a growing number of reports. For example a report to the NSF
Division of Phvsics Advisory Committee in March 1981 entitled "Prospectus
for Computational Physics", edited by William Press, identified a "crisis" in
computational physics, and recommended support for facilities. Subsequent
to this report a joint agency study, "Large Scale Computing in Science and
Engineering", edited by Peter Lax, appeared in December 1982 and acted as the
catalyst for NSF's reemergence in the support of high performance
computing. The Lax Report presented four recommendations for a
government-wide program:
• Increased access to regularly upgraded supercomputing facilities via high
bandwidth networks
• Increased research in computational mathematics, software, and
algorithms
• Training of personnel in scientific computing
• R&D for new supercomputer systems
The key suggestions contained in the Lax Report were studied by an internal
NSF working group, and the findings were issued in July 1983 as "A National
Computing Environment for Academic Research", a report edited by M.
Bardon and K. Curtis. The report studied NSF supported scientists' needs for
academic computing, and validated the conclusions of the Lax Report for the
NSF supported research community. The findings of Bardon/Curtis
reformulated the four recommendations of the Lax Report into a six point
implementation plan for the NSF. Part of this action plan was a
recommendation to establish ten academic supercomputer centers.
The immediate NSF response was to set up a means for academic researchers
to have access, at existing sites, to the most powerful computers of the day.
This was an interim step prior to a solicitation for the formation of academic
supercomputer centers directly supported by the NSF. By 1987, five NSF
Supercomputer Centers had been established, and all had completed at least
one year of operation.
During this early phase the Centers were essentially isolated "islands of
supercomputing" whose role was to provide supercomputer access to the
academic community. This aspect of the Centers' activities has changed
considerably. The NSF concept of the Centers' activities was mandated to be
much broader, as indicated by the Center's original objectives including:
14
• Access to state of the art supercomputers
• Nurture computational science and engmeermg m all fields
• Traming of computational scientists and engineers
• Encourage collaboration among researchers in academia, industry and
government
In 1988-1989 NSF conducted a review to determine whether support was
justified bevond 1990. In developing proposals, the NSF Centers were advised
to increase their scope of responsibilities. Quoting from the solicitation:
"To insure the long term health and value of a supercomputer center, an
intellectual environment, as well as first class service, is necessary. Centers
should identify an intellectual component and research agenda".
In 1989 NSF approved continuation through 1995 of the Cornell Theory
Center, the National Center for Supercomputing Applications, the Pittsburgh
Supercomputing Center, and the San Diego Supercomputer Center. Support
for the John von Neumann Center located at Princeton University vvas not
continued.
Scientific accomplishments of the Program
The Centers have fostered fundamental advances in our understanding of
science and engineering, expanded the use of high-end computing in new
disciplines, enabled the major paradigm shift to the acceptance of
computational science as a full partner in the scientific method, and
facilitated the education of a new generation of computational scientists and
engineers in support of that shift.
Several common themes emerge from examples of the impact of
supercomputing.
• First, the rapid growth of supercomputing together with its availability to
the research community have enabled computational science and
engineering to contribute to significant advances in a very wide and still
growing set of scientific and engineering fields.
• Second, high performance computing is making it possible to perform
complex simulations in three dimensions, rather than just two. This
important shift has dramatically enhanced the usefulness of
computational approaches, and will continue to do so at least through the
coming decade.
• Third, supercomputer-based simulations have combined multiple
disciplines and different physical phenomena to yield new scientific
discoveries and understanding.
• Last, increases in supercomputing capability and advances in
computational techniques are beginning to enable computer-based
15
bimulations to predict new scientific advances and to make new
discoveries.
In later testimony at this hearing the current Center Directors will highlight
the scientific and engineering accomplishments of the program in the last 11
vears.
Relevant Reports
Report of the Blue Ribbon Panel
Following the renewal of four of the Centers in 1990, the National Science
Board (NSB) asked the director of NSF to appoint a blue ribbon panel "... to
investigate the future changes in the overall scientific environment due [to]
the rapid advances occurring in the field of computers and scientific
computing." The resulting report, "From Desktop to Teraflop: Exploiting the
U.S. Lead in High Performance Computing," was edited by Lewis Branscomb,
and was presented to the NSB in October, 1993.
This report pointed to the Foundation's accomplishments in the seven years
since the initial implementation of the recommendations of the Lav Rr^port
on high performance computing (HPC) and the establishment of the
Supercomputer Centers. The report asserted that the NSF Centers had created
an enthusiastic and demanding set of sophisticated users who make
fundamental advancements in their scientific and engineering disciplines
through the application of rapidly evolving high performance computing
technology. Other measures of success cited include the thousands of
researchers and engineers who have gained experience in HPC, and the
extraordinary technical progress in realizing new computing environments.
The report noted that, through the NSF program and those of sister agencies,
the U. S. enjoys a substantial lead in computational science and in the
emergmg, enabling technologies. It called for NSF to capitalize on this lead,
which not only offers scientific preeminence, but also aids the associated
industrial lead in many growing world markets.
Primary recommendations included the following:
• The NSF should retain the Centers and reaffirm their mission with an
understanding that they now participate in a much richer
computational infrastructure than existed at their formation.
• The NSF should assist the university community in acquiring mid-
range systems to support scientific and engineering computation and to
break down the software barriers associated with massively parallel
systems.
• The NSF should initiate an interagency plan to provide a balanced
teraflop system, with appropriate software and computational tools, at
the apex of the computational pyramid.
16
These recommendations and the accompanying challenges could be
summarized as calling for a broad based infrastructure and research program
that would not only support the range of computational needs required by the
existing user base, but would also broaden that base in terms of the range of
capabilities, expertise, and disciplines supported.
Report of the NRC-HPCC Committee
In 1994, Congress asked the National Research Council (NRC) to examine the
status of the High Performance Computing and Communications Initiative.
The NRC committee stated that it "believes that strong public support for a
broadly based research program in information technology is vital to
maintaining U.S. leadership in information technology". While the
committee did not make explicit recommendations for funding levels or
management structures for the Supercomputer Centers program, it did say:
"The committee recognizes that advanced computation is an
important tool for scientists and engineers and that support for
adequate computer access must be a part of the NSF research program
in all disciplines. The committee also sees value in providing large-
scale, centralized computing, storage, and visualization resources that
can provide unique capabilities. How such resources should be funded
and what the long term role of the Centers should be with respect to
both new and maturing computing architectures are critical questions
that NSF should reexamine in detail, perhaps via the newly
announced Ad Hoc Task Force on the Future of the NSF
Supercomputer Centers Program."
The Task Force on the Future of the NSF Supercomputer Centers Program
The Task Force met during NSF's 1995 Fiscal Year to advise NSF on several
important issues related to the review and management of the NSF
Supercomputer Centers program. The Task Force was charged to analyze
various alternatives for the continuation, restructuring, or phase-out of
NSF's current Supercomputer Centers program, or the development of
similar future program(s), and to make recommendations among the
alternatives.
The final report of the Task Force was presented to the NSF Director on
September 15, 1995. The following language is excerpted from the executive
summary of the report, which is appended to this testimony.
The Task Force believes that the future for computational science and
engineering can be as bright or even brighter than in the past decade. If we
seize the opportunity, over the next decade we can make major progress on
multiple fronts.
There will be
• opportunities for exciting applications of our nation's exponentially
increasing computational capacity, for example:
17
- more complete models, and hence deeper understanding of
physical systems by moving to three and higher dimensions;
- progress in computational tools to aid drug and protein design;
- computational predictions of scientifically and commercially
significant materials;
- multidisciplinary models of physical systems (e.g., combining
fluid dynamics and electromagnetic models of the heart);
- increased interconnectivity of supercomputers and high impact
instrumentation; and
- models of anatomical and physiological processes leading to nezv
insights of benefit to human health.
• more quantitative computational results in unanticipated areas.
• more explosive groxvth of communications as a component of the
computational science and engineering paradigm; and, importantly,
• continued progress in the tools and methods for developing code that
is both portable and yet takes advantage of unique parallel
architectures.
These advances will not automatically become available to American
researchers. To position the U.S. academic community to participate in the
exciting research possibilities enabled by these developments, the Task Force
has the follozving recommendations leading to a restructured Centers
program.
In order to maintain world leadership in computational science and
engineering, NSF should continue to maintain a strong, viable Advanced
Scientific Computing Centers program, whose mission is:
• providing access to high-end computing infrastructure for the
academic scientific and engineering community;
• partnering with universities, states, and industry to facilitate and
enhance that access;
• supporting the effective use of such infrastructure through training,
consulting, and related support services;
• being a vigorous early user of experimental and emerging high
performance technologies that offer high potential for advancing
computational science and engineering;
• facilitating the development of the intellectual capital required to
maintain world leadership.
18
NSF should abSiire that the Centers program provides national "Leading-edge
>ites" that have a balanced set of high-end hardware capabilities, coupled with
appropriate staff and software, needed for continued rapid advancement in
computational science and engineering.
NSF, through its Centers program, should assure that each leading-edge site is
partnered with experimental facilities at universities, NSF research centers,
and/or national and regional high performance computing centers.
Appropriate funding should be provided for the partnership sites.
NSF should announce a new competition of the High Performance
Computing Centers program that would permit funding of selected sites for a
period of five years. If regular reviews of the Program and the selected sites
are favorable, it should be possible to extend initial awards for an additional
five years without a full competition.
The Centers program should continue to support need-based research in
support of the program's mission, but should not provide direct support for
independent research.
NSF should increase the involvement of NSF's directorates in the process of
allocating service units at the Centers.
NSF should provide leadership in working toward the development of
interagency plans for deploying balanced systems at the apex of the
computational pyramid and ensuring access to these systems for academic
researchers.
National Science Board Actions
In October of 1994, the National Science Board (NSB) approved a two year
"extension of the original Centers program (through FY 1997) in order for the
Task force to complete its work and to allow time for a new competition, if
NSF recommended one and the NSB agreed.
The Task Force report was presented to the NSF Director on September 15,
1995 and to the NSB at the October 1995 meeting on the Committee on Plans
and Policy. The NSF recommendations to the NSB were scheduled for
presentation at the November meeting, but were postponed to December by
the first government shutdown. At their December meeting the NSB
approved a new competition for a program called Partnerships in Advanced
Computational Infrastructure (PACI).
The Program solicitation for the Partnerships meeting was released on the
World Wide Web (WWW) on December 15, 1995, immediately prior to the
second FY96 shutdown. Since that time a printed solicitation, a dear colleague
letter and a series of informational meetings have been held to insure that
the community was aware of the program and had all the required
information to propose to participate. In addition, a set of frequently asked
questions and their answers has been posted on the WWW and kept
19
continuously updated. The complete program solicitation is appended to this
testimony.
Partnerships program competition
Mission
As part of its strategic plan, NSF in a Changing World, a key NSF goal is to
enable the United States to uphold a position of world leadership in science
and engineering research and education. In order to maintain world
leadership in computational science and engineering, NSF intends to create
an adyanced national computational infrastructure whose overall mission is
to:
• provide, facilitate, and enhance access to high performance computational
infrastructure for the U. S. academic, scientific, and engineering
communities by partnering with universities, states, and the private
sector;
• promote vigorous early use of experimental and emerging high
performance computational and associated communications technologies
that offer high potential for advancing science and engineering;
• enable the effective use of such infrastructure and technologies through
education, training, consulting, and related support services, including
appropriate software development, experimentation, and support;
• foster interdisciplinary research in science and engineering;
• facilitate the development of the intellectual capital required to maintain
world leadership in computational science and engineering; and
• broaden the base for the nation's advanced computational and
communications infrastructure.
Solicitation
The program solicitation for the Partnerships for Advanced Computational
Infrastructure program, builds on and replaces the current NSF
Supercomputer Centers program, and focuses on taking advantage of newly
emerging opportunities in high performance computing and
communications. The new program will provide flexibility, both to adapt to
rapidly evolving circumstances and to meet the need for high-end
computation, in order to enable continued world leadership in
computational science and engineering. The program will provide access for
researchers to high performance computing systems as its core, with
associated highly trained staff and researchers necessary to develop and
optimize their use. The emergence of scalable parallel systems, high
performance networking and high bandwidth, large capacity mass storage
systems creates the opportunity for a national infrastructure consisting of a
number of geographically distributed sites strongly coupled to high-end
20
computational resources and to each other \'ia high-speeci communication
networks.
NSF envisions an Advanced Computational Infrastructure consisting of one
or more leading-edge sites together with cooperating partners. Leading-edge
sites are expected to maintain high-end hardware systems that are one to two
orders of magnitude more capable than those typically available at a major
research university. These systems should be balanced in terms of processor
^peed, memorv, and storage systems, and should be accompanied by
appropriate staff, software and high speed communications capability. The
partners will, in the aggregate, complete the overall infrastructure by, among
other things, (a) facilitating research and experimentation with new hardware
and software, including appropriate support technologies such as
visualization and mass storage, (b) providing scalable resources for
applications and applications development that can be best done on mid-level
systems, (c) providing access to unique experimental systems and facilities,
and (d) promoting education and training.
Smce the Partnerships for Advanced Computational Infrastructure Program
will primarily support academic research, it is expected that proposals will
come from, and the partnership's cooperative agreement will be with, a U. S.
academic institution. However, a successful proposal must involve multiple
partners who can be, but are not limited to:
• universities, including research groups within universities;
• NSF-funded Centers and facilities, such as. Science and Technology
Centers, Supercomputer Centers, Engineering Research Centers, and
Industry-University Cooperative Research Centers;
• research and educational consortia, organizations, and groups;
• regional and state-supported high-performance computing centers;
• private sector organizations; and
• federal laboratories.
A full description of organizations and individuals who are eligible to submit
proposals to NSF and the conditions under which they may compete, are
given in the NSF Grant Proposal Guide (GPG) (NSF 95-27).
Review Process
Overview
The review process will involve three stages; preproposals, full proposals and
site visits. The preproposals, due April 15, 1996, will be evaluated to
determine who will be encouraged to submit a full proposal. The intent of
the preproposal round is to give the community an opportunity to put forth
ideas, to provide proposers with information that will assist in developing a
21
stronger proposal, and to identity at an early stage, preproposals that are not
expected to be competitive. Feedback will be provided to all preproposal
submitters; they may submit a full proposal regardless of the opinions
expressed by the reviewers.
Full proposals are due September 1, 1996. Both preproposals and full
proposals will be evaluated by external panels. In late fall, 1996,
recommended proposals will be subjected to a site visit (but not necessarily all
sites in a given partnership) in order to clarify issues raised during the panel
review, and to explore additional matters as needed. A summary panel,
made up of the chairs of the site visit teams and the chair of the full proposal
review panel, will meet after the site visits.
Criteria
As with all proposals to NSF, these proposals will be subject to the four
standard review criteria, i.e. the quality of the proposed scientific effort,
competence of the investigators, relevance of the research, and impact on the
infrastructure of science and engineering. However, for this particular
solicitation, impact on infrastructure and competence of investigators will be
especially important. Thus both preproposals and final proposals for the
Partnerships for Advanced Computational Infrastructure program will be
subject to four, more specific, criteria dealing with, the effectiveness of the
overall partnership in addressing the program's mission, the quality of the
individual partners, the effectiveness of the proposed management, and the
degree of financial leverage. While these four, more specific, criteria just
mentioned are listed in priority order (with the first being the most
important), all will be considered by the panels in arriving at
recommendations. Finally, the summary panel will be asked to advise NSF
on what each proposed partnership can contribute to the technical diversity
and balance of the program as a whole.
Management
The review process will be managed by the Division of Advanced Scientific
Computing (ASC). In order to assure that all NSF directorates are involved
in the process, an NSF internal PACI Review Committee will be established
chaired by the Centers Program Director in ASC. Each directorate will be
invited to appoint one or more representatives to the committee. Ideally,
there should be strong overlap between the membership of the High
Performance Computing and Communications Coordinating Committee and
the Review Committee. The Review Committee will have the following
responsibilities:
• Nominate panelists and site visitors for the review processes outlined
below.
• Participate in review panels, including serving as moderators of possible
hubpanels.
10
22
• Represent their directorate on site visits to candidate partnerships.
• Pro\ide a knowledgeable link between their directorate and the process.
Structure of Proposal Review Panels
Panelists will be chosen from a variety of sectors: academia, the private sector,
national laboratories, and other government agencies. Since the academic
community will be the major focus of the PACI Program, subject to possible
conflict of interests, a majority of reviewers will be chosen from this sector.
As described above, recommendations from all NSF directorates for possible
panelists will be solicited through the PACI Review Committee. Reviewers
will be chosen from the computational science and engineering and
computer science and engineering communities whose areas of expertise
represent those of users of the infrastructure, enablers of the underlying
technologies, and managers of large facilities, and encompass both research
and educational needs for the PACI program.
Preproposal panel procedure
Written reviews of each preproposal will be provided by at least 3 readers, and
a summary of the subpanel discussion of each preproposal will be recorded on
a panel review form which highlights the major review criteria. The task of
the subpanels will be to recommend whether a full proposal is to be
encouraged or not. A target number of proposals to be encouraged will be
suggested to the panels. All reviews and the panel summary will be provided
to the PI.
No PI or named collaborator will be chosen to be a reviewer (panelist) due to
disqualifying conflict of interest. Reviewers from the same institution may
have to be used due to the large number of partners expected in the
submissions, but an attempt will be made to segregate these into subpanels
considering preproposals not involving their institution; if this is not
possible, the reviewer will be absent from the discussion of any preproposal
involving his/her institution.
Full proposal panel procedure
The panelists will be chosen from the original subpanels reviewing the
preproposals, from the pool of submitters of preproposals not submitting full
proposals, and from the community at large. While there will be some
overlap between the panels reviewing the preproposals and the final
proposals, the panelists will be instructed that any recommendations based on
the preproposal review were not intended to be binding on the proposer and
should not be considered in reviewing the final proposal. Preproposal
reviews and recommendations will not be available to the panelists.
Each reviewer will be provided with all of the proposals and a specially
prepared review form which will reflect the major review criteria. Each
proposal will have at least 3 written reviews prepared by reviewers prior to
the panel discussion; each panelist will be assigned approximately 4 of the
11
23
proposals tor which he/she must prepare written reviews. These written
reviews will include ratings categorizing the assigned proposals into the top
25"/u, the next IS"";., and the remainder. All proposals will then be discussed in
the panel. The results of the panel discussions will be recommendations
which rate the proposals as "definitely site visit", "possibly site visit", and "do
not consider further". All reviews and a panel summary will be provided to
the PI.
It is expected that conflicts will be less than in the preproposal stage, and any
that arise will be handled in a similar manner.
Site visit procedure
NSF staff will review the recommendations for site visits. In order that the
portfolio for site visits assures consideration of a balanced program, NSF may
possibly augment the "definitely visit" group with some from the "possibly
visit" group.
All proposals recommended will be reviewed by a team of site visitors
consisting of previous panelists augmented by other reviewers as appropriate.
Partnerships to be visited will be provided with a set of questions prior to the
visit. NSF staff members, including members of the PACI Review
Committee described above, will be invited to participate in all site visits as
observers. Each site visit team will have a chair designated by NSF. Each
visitor will go on at least two site visits (preferably more) in order to have an
overview of the possible varieties of partnerships available. The
recommendations of the team will be to either fund, fund if possible, or not
fund the partnership.
Upon completion of all site visits, a panel of the site visit chairs, chaired by
the chair of the proposal review panel, will meet at NSF to review the overall
recommendations made by the site visit teams, and to provide NSF with
several options (e.g., one large partnership, or several smaller partnerships)
which meet the overall mission of the PACI Program.
Management
Active management of the program by the Foundation is essential, since the
new program has been designed to provide a national infrastructure that is
built on one or more multi-institutional partnerships. Because of the
complexity and distributed nature of the new program, both external and
internal management issues must be considered. The management model
draws on experience gained with a number of other multi-institution NSF
programs, but management of this program is still expected to offer unique
challenges.
Each partnership will work with NSF to develop criteria by which it can be
evaluated. These criteria will be used in all formal reviews. Each partnership
will submit an annual program plan which summarizes its progress and
describes activities and budgets for the coming year. This may include
12
24
proposed changes in membership, research, service, networking, computing
bvstems, or funding changes for its members. These plans will be
indi\-iduaily reviewed by an external program review panel appointed by
NSF which will provide recommendations to NSF regarding funding levels,
addition of or deletion of partners, new opportunities, etc. Although they
will not be members of the panel, it is expected that representatives of all
partners in each partnership will be at these reviews and that their locations
may vary between partnership sites and NSF.
To provide flexibility in meeting special needs, funding can be provided for
partnerships having a membership that may change over the duration of the
ceioperative agreement; thus initial proposals involving partners for a shorter
period are encouraged, where appropriate. Furthermore the partnership can
propose, at any time, a change in the number or composition of its partner
sites. Normally such proposals will be part of the annual program plan
review process and would be considered along with other substantive issues.
Possible justifications for such a change might be the occurrence of an
unusual opportunity, completion of a partner's task, changes or relocation of
a critical person, etc.
More frequent informal monitoring will be carried out by NSF staff. Each
partnership will be required to contribute to a database relating to usage,
users, training, outreach, education, industrial interactions, and other
significant data relating to the operation of the program. NSF will also
require periodic meetings of the Pi's and co Pi's of the partnerships for overall
program coordination. This will permit the Foundation to better understand
and monitor both the operation of a partner within the partnership, the
partnership itself, and the program as a whole.
A single institution, through the PI, is expected to be responsible for overall
management and leadership of the partnership as a whole, and each partner
site will have a local director. Partnerships will have a policy and operations
management committee that will meet regularly. Its membership will
consist of a senior representative from each site. NSF will host an annual
meeting of representatives from all the partnerships to discuss program wide
coordination.
Every partnership will be encouraged to appoint an independent external
advisory committee which will provide oversight and guidance necessary for
the management of the partnership. This committee will include users of the
partnership resources and others who are in a position to evaluate the
activity and provide feedback to its management. This group will be selected
in consultation with program management staff and will meet at least once a
year.
13
25
The Transition
The transition from the old Centers Program to the new Partnerships
Program will take place beginning immediately after Board approval of the
new partnerships.
Phase in of a new partnership is expected to occur over a one year period.
Partnership plans and proposed budgets must be prepared with enough
flexibility to allow the phasing in of partnership resources during the first
vear, as negotiated with NSF.
It is critical that this transition be conducted smoothly to ensure that there is
continuity in the availability of high performance computing resources to
support the research of the academic science and engineering community.
The final transition plan will be designed in accordance with the principles
below:
• minimum disruption for the scientific research community;
• deferral of all FY97 Supercomputer Centers program expenditures
possible, until awards for the new program are announced;
• rapid phase out of old participants; rapid phase in of new partners;
• immediate and active NSF involvement in the transition process.
Summary
At the conclusion of this exercise, NSF will continue world class support of
computational science and engineering for at least another 5 years. During the
fourth year of the program an evaluation will determine whether to continue
without recompetition for another five years at which the program will be
"sunset". Before this occurs another review will determine the future for
continued NSF support of the infrastructure.
14
26
Mr. SCHIFF. Well, Dr. Hayes, you got quite an introduction from
the first witness, so with that I will call upon you, please.
STATEMENT OF DR. EDWARD HAYES, CHAIRMAN, REPORT ON
THE TASK FORCE ON THE FUTURE OF NSF
SUPERCOMPUTING CENTERS PROGRAM AND VICE PRESI-
DENT FOR RESEARCH, OHIO STATE UNIVERSITY, COLUM-
BUS, OHIO
Dr. Hayes. Thank you, Mr. Chairman.
I would also like to add my thanks to you and to the Committee
for their strong interest in the Supercomputing Centers Program
and their bipartisan support over the years for the science pro-
grams funded by the Federal Government.
In addition to being the chair of the Task Force, my da5rtime job
is vice president for research at the Ohio State University; and my
research interests are in the area of computational chemistry.
I am very pleased to have this opportunity to talk with you today
about the new partnerships for Advanced Computational Infra-
structure, a Program that was recently announced and that Paul
has been talking about.
My objective is to give you a context as well as a brief overview
of the Task Force Report. A key message that I would like to leave
with the Committee today is how pleased the Task Force is with
the process that NSF has followed in setting the directions for the
future of its highly successful Supercomputing Centers Program.
The members of the Task Force are listed in my prepared re-
marks. This is a group with exceptional talent and breadth of expe-
rience. It made the job of the chair quite enjoyable, as well as chal-
lenging.
All are knowledgeable about one or more aspects of high-perform-
ance computing. They are all independent-minded and committed
to doing their work in a professional way. They are pleased, and
we were pleased to be able to report the unanimous view of what
steps should be recommended to the National Science Foundation
on the future of the program.
In essence, we found that the Supercomputing Centers Program
has been a critical component of the strong U.S. position in science
and technology, and we strongly recommended that it be continued
but with important changes.
With the issuance of the new Partnerships Program announce-
ment, the National Science Foundation initiated the important pro-
posal development stage. At this stage, various groups of univer-
sities and other potential partners are putting their best ideas to-
gether to see if they will be able to convince their peers that they
have the best plans to enable the future of computational science
and engineering.
While our Task Force set the stage for the final program an-
nouncement, and the program announcement set the stage for
these proposal development efforts, there is still significant room
for creativity on the part of the Partnership Centers.
The Task Force clearly understood the importance of not tying
the hands of creative people at the proposal development stage.
From this perspective, we are pleased with the clarity of focus ex-
27
emplified in the program announcement, as well as the appro-
priateness of the selection criteria.
The competitive responses often bring out important new ideas,
insights, and commitments that cannot be accurately forecast in
advance. The final stage of the process will begin after the propos-
als are received and the merit review begins.
The Task Force has a very high level of confidence in Paul
Young, the assistant director, who has line management respon-
sibility for this Program. He and his staff are most knowledgeable
in this area and have a great track record for running fair and ef-
fective review processes.
At this point I want to make just a few brief comments from my
statement that focus on just a couple of aspects of the program.
The Task Force heard from many knowledgeable individuals
from industry, academia, government, and government labora-
tories, NSF assistant directors, academic researchers, representa-
tives of other federal agencies, center directors, as well as members
of the centers allocation committees. We talked with them all.
Considerable input was received from many thoughtful people.
This input, as well as our survey to which there were 500 replies,
was essential. It extended the knowledge base of the individual
members of the Task Force, and informed us about the key issues
surrounding the Centers Program.
This input also gave us an excellent sense of the important role
that the Centers Program has played in making computational
science and engineering a significant contributor to the long-term
progress of fundamental research and, importantly, in providing
the infrastructure needed for education and training of future gen-
erations of world leaders in science and technology.
The recommendations of the Task Force follow from our vision of
the future of high-performance computing. This vision has several
key points. I want to mention just one of these — there are seven
in total — this afternoon. The others are listed in my prepared re-
marks.
A new element of our vision is a strong coupling of selected re-
search centers and university laboratories with the leading edge
sites that have the highest end computational systems.
We believe that such coupling has great potential for the future
in terms of an enhanced program flexibility, creativity, and effi-
ciency.
With advances in high-speed communications, this approach will
also provide the infrastructure needed to support creative experi-
ments with distributed computing.
In my prepared remarks I have commented on each of the Task
Force recommendations. Two of these recommendations are central
to the new Partnerships Program.
In our second recommendation we noted that NSF needs to sup-
port a few leading edge sites, sites that have a balanced set of
high-end hardware capabilities, coupled with appropriate staff and
software. Raw processor speed or compute- power is not sufficient
to achieve the balanced high-performance systems needed for para-
digm-shifting research.
In our third recommendation we focused on partnerships. Part-
nerships will be important for the future of the program. With stra-
28
tegic coupling between leading-edge sites and research centers and
university laboratories, it is a significant new change.
These partnership sites will provide better coupling to computer
scientists developing new tools and software. This coupling will be
particularly important when their research on mid-range systems
reaches the point when it needs to examine scalability issues that
become manifest only on the leading edge systems. But the impact
of this research will extend beyond computer science.
The partnership sites will also provide more cost-effective and di-
verse platforms for early design and testing of applications soft-
ware prior to the stage where access to a leading-edge system is
required.
Although we began with the work of earlier reports, we did not
accept the conclusions or the recommendations uncritically. We car-
ried out our own analysis particularly of the merit review process
for allocating resources at the current centers, an analysis of the
educational benefits of the program, and we also sought significant
input from the community, including several key NSF advisory
committees particularly on the benefits of the current program.
In preparing for this hearing I was able to talk to most of the
members of the Task Force to confirm their enthusiasm for the
manner in which NSF management is moving forward with the
program announcement and the receipt of proposals. All indicated
that they are pleased with the progress that has been made up to
this point.
Thank you, Mr. Chairman.
[The prepared statement and attachments of Dr. Hayes follow:]
Statement of Dr. Edward F. Hayes
Vice President for Research
The Ohio State University
AND
Chair, Task Force on the NSF Supercomputer Program
Mr. Chairman and Members of the Subcommittee:
I am very pleased to have this opportunity to talk with you about the New Part-
nerships for Advanced Computational Infrastructure program that was recently an-
nounced by the National Science Foundation.
One of the relevant reports that guided the National Science Foundation in the
design of this new program was the Report of the Task Force on the Future of the
NSF Supercomputer Program. It was my honor to be asked by the Director of the
National Science Foundation to chair this Task Force.
My objective today is to give you a context as well as a brief overview of the Task
Force Report. A brief overview is not an easy task because our report deals with
many significant questions and issues. A key feature of our report is the extent to
which we dealt with the tough issues and seriously considered alternatives.
A key message that I would like leave with the committee is how pleased the
Task Force has been with the process that NSF has followed in setting the direc-
tions for the future of its highly successful Supercomputer Centers Program.
The Task Force was appointed by the Director of the Foundation, Dr. Neal Lane,
and asked to provide him with recommendations on the future of the Supercomputer
Centers Program and to report our findings as soon as possible. The work of the
Task Force began in January of 1995 and culminated in our final report to the Di-
rector last September.
The members of the Task Force are listed below:
• Dr. Arden L. Bement, Jr., Basil S. Turner Distinguished Professor Of Engineering,
Purdue University
29
• Chairman — Dr. Edward F. Hayes, Vice President for Research, The Ohio State
University
• Dr. John Hennessy, Chair Computer Science, Stanford University
• Dr. John Ingram, Schlumberger Research Fellow, Schlumberger, Austin
• Dr. Peter A. KoUman, Professor of Chemistry and Pharmaceutical Chemistry, Uni-
versity of California — San Francisco
• Dr. Mary K. Vernon, Professor of Computer Science, University of Wisconsin
• Dr. Andrew B. White, Jr., Director of Advanced Computing Laboratory, Los Ala-
mos National Laboratory
• Dr. William A. Wulf, AT&T Professor of Engineering and Applied Science, Univer-
sity of Virginia
Ex-officio Non-Voting members:
• Dr. Robert G. Voigt, NSF, Acting HPCC Coordinator
• Dr. Paul R Young, NSF, Assistant Director, CISE
• Dr. Nathaniel G. Pitts, NSF, Director/OSTI
There is considerable talent and experience in this group. Some of these names
may be famihar. All are knowledgeable about one or more aspects of high perform-
ance computing. They are all independent minded and committed to doing their
work in a professional way. We did not start out with a common view of what rec-
ommendations we might make to the National Science Foundation. In fact, we had
many discussions in which one or more members of the group were observed in var-
ious stages of testiness. However, I believe that we were all pleased to be able to
report that after a great deal of effort on everyone's part we did come to a unani-
mous view of what steps should be recommended. In essence, we found that the
Supercomputer Centers Program has been a critical component of the strong U.S.
position in science and technology and we strongly recommended that it be contin-
ued, but with important changes.
After receiving our report, the Director of the Foundation initiated the second
stage of the process. During this stage he reviewed the report with key staff within
the agency and formulated an action plan to be discussed with the National Science
Board for their review and approval. The National Science Board provides a very
important perspective on major Foundation programs because they represent a very
broad cross section of expertise and considerable experience with policy issues and,
importantly, experience in setting priorities in the context of the whole NSF and
Federal Research Budgets. At the October 1995 meeting of the National Science
Board, Arden Bement, another member of the Task Force, and I were asked to pro-
vide members of the Board with an overview of the Task Force Report. At this meet-
ing the Board also heard from Fred Brooks, co-chair of the NRC report on the HPCC
Program, and Jerry Ostriker, an eminent computational scientist who has made
many significant contributions to our understanding of cosmology.
Following the October meeting of the National Science Board, the third stage of
the review process began as the NSF Director put together the proposed program
announcement for the new partnerships program. The Board was scheduled to dis-
cuss and act upon this proposal at its November meeting, but strange things were
happening with the FY 1996 Budget discussions during this period — unfortunately
the NSF was shut down during the period of the scheduled November Board meet-
ing. Ultimately, the NSF was able to stay open long enough for the Board to meet
in December — at which time the program announcement was given final approval
and was quickly put out on the streets just before the government shut down again.
The issuance of the Program Announcement began the fourth stage in the proc-
ess — the important proposal development stage is when various groups of univer-
sities and other potential partners put their best ideas together to see if they will
be able to convince their peers that they have the best plans to enable the future
of computational science and engineering. While our Task Force set the stage for
the final program announcement and the program announcement set the stage for
these proposal development efforts, there is still significant room for creativity on
the part of the partnership centers. The Task Force clearly understood the impor-
tance of not tying the hands of creative people at the proposal development stage.
From this perspective, we are pleased with the clarity of focus exemplified in the
program announcement as well as the appropriateness of the selection criteria. The
competitive responses at this stage often bring out important new ideas, insights
and commitments that cannot be accurately forecast in advance.
The final stage will begin after proposals are received and the merit review be-
gins. The Task Force has a very high level of confidence in Paul Young, the Assist-
ant Director, who has line-management responsibility for this program. He and his
staff are most knowledgeable in this area and have a great track record in running
30
fair and effective review processes. If we can just keep the agency open, they will
do a great job in picking the winners.
At this point, I want to return to the report of the Task Force.
The current NSF Supercomputer Centers Program was initiated in the 1985-86
time period with the formation of five supercomputer centers. Each one had a vector
supercomputer that was considered big for the time period. Over the first five years
of the program's existence these centers provided vector supercomputing services for
the research community and training for the many researchers who lacked experi-
ence with such systems. Prior to 1985, U.S. academic researchers had very limited
access to vector supercomputers.
At the end of the first five years, NSF held a major external review of the five
Centers. Four of the five Centers were approved for continuation for another five
years. The John von Neuman Center in New Jersey was not renewed.
In the second five years of the Center Program's existence several significant
changes were made:
• There was a distinct effort to expand the outreach efforts of the centers and to
forge closer ties with industryy.
• The four remaining centers — at Cornell University, the University of Illinois, the
Pittsburgh Center and the San Diego Center — formed themselves into a
MetaCenter with resources sharable on a national scale and coordinated plan-
ning to develop and not duplicate areas of special expertise within the program.
• Finally there was a move to change the computational paradigm as a new genera-
tion of parallel architecture machines started hitting the market.
So it was after ten years and many successes that our Task Force was called to-
gether to advise the National Science Foundation on the future of the Centers Pro-
gram.
In arriving at our conclusions, we benefited significantly from two earlier reports
that I want to comment on briefly because of their direct relevance to our own work.
From Desktop Teraflop: Exploiting the U.S. Lead in High Performance Computing,
NSF Blue Ribbon Panel on High Performance Computing, August 1993
Evolving the High Performance Computing and Communications Initiative to Sup-
port the Nation's Information Infrastructure, Computer Science and Telecommuni-
cations Board, National Research Council, 1995
Both of these reports speak to the importance of exploiting the U.S. lead in high
performance computing.
The Blue Ribbon Panel emphasized the importance of maintaining a balance in
the pyramid of computational capability. See Attachment 1.
The NRC/HPCC Committee made several important observations central to the
work of our Task Force.
I would like to quote a few sentences from the executive summary of the HPCC
report. These quotes capture for me, in a succinct way, the importance of high per-
formance computing for the nation.
In the section dealing with high performance we read:
" Tiigh performance' — which involves bringing more powerful computing and com-
munications technology to bear on a problem — has enabled advances on several
fronts. High performance system.s, for example, deliver answers sooner to complex
problems that need large amounts of computing. Timely and accurate forecasting of
weather, mapping of oil reservoirs, and imaging of tumors are among the benefits
encompassed by the goals of the HPCC initiative."
Then continuing on in the next paragraphh.
"Information technology evolves as new and valuable applications are found for
hardware that gets steadily more powerful and cheaper. To benefit, users need af-
fordable hardware, but they also need the software that implements the new appli-
cations. Yet learning how to build software takes many years of experimentation.
If this process starts only when the hardware has already become cheap, the bene-
fits to users will be delayed by years. Research needs to treat todays expensive
equipment as a time machine, learning how it will be used when it is cheap and
widely available, as surely it will be tomorrow."
In addition to the insights obtained from these two reports, the Task Force heard
from many knowledgeable individuals from industry, academia, government, and
government laboratories. NSF Assistant Directors, Academic Researchers, Rep-
resentatives of other Federal Agencies, Center Directors as well as members of the
centers allocation committees — we talked with them all. Considerable input was re-
ceived from many thoughtful people. This input as well as our survey, to which over
500 replied, was essential. It extended the knowledge base of the individual mem-
bers of the Task Force and informed us about the key issues surrounding the Cen-
ters Program. This input also gave us an excellent sense of the important role that
31
the Centers program has played in making computational science and engineering
a significant contributor to the long term progress of fundamental research, and, im-
portantly, in providing the infrastructure needed for the education and training of
future generations of world leaders in science and technology.
Today, much of the important computational science and engineering is being car-
ried out on workstations and mid-range machines. That is desirable. It is one of the
clear signs of the health of the U.S. high performance computing program. A signifi-
cant amount of the applications software that is being run on today's workstations
was developed during the early days of the NSF Supercomputer Centers Program.
The systems of that period had about the same power as today's workstations.
These earlier "time machines" worked just the way people predicted that they would
ten years ago.
The recommendations of the Task Force follow from our vision for the future of
high performance computing. This vision has seven key points that I summarize
below.
Future Vision
• Computational Science & Engineering is central to the long term development of
fundamental principles and understanding of complex systems
• Continued U.S. leadership in computational science and engineering requires ac-
cess to the highest end computational systems, systems that are balanced with
respect to processor speed, memory, storage and software
• Coupling of selected mid-range systems to the leading-edge systems has great po-
tential in the future in terms of enhanced program flexibility, creativity, and
efficiency
• Research on computing tools, algorithms and new models of physical systems pays
significant dividends, but such research takes time and commitment particu-
larly with changes in the underlying computing paradigm
• Unanticipated results are often the most important
• Research trains people for the future needs of the nation
• SjTiergy among industry, academia, and government
Computational science and engineering is central to the long term development
of fundamental principles and the understanding of complex systems. Computa-
tional science brings enhanced synthesis and analysis. It is the source of new in-
sights that facilitate a deeper understanding of fundamental issues and, impor-
tantly, computational science is a full partner with experimental and theoretical
science in extending the analysis of experimental results that cannot be fully syn-
thesized without the assistance offered by sophisticated computational models.
Continued leadership in computational science and engineering requires access to
the highest end computational systems. Here we mean systems that are balanced
with respect to processor speed, numbers of processors, memory, storage, software
and personnel. These systems act as time machines for academic researchers allow-
ing them to gain significant lead time in addressing computational problems and is-
sues that emerge whenever significant new levels of modeling capability emerge.
A new element in our vision is a strong coupling of selected research centers and
university laboratories with the leading-edge sites that have the highest end com-
putational systems. We believe that such coupling has great potential for the future
in terms of enhanced program flexibility, creativity, and efficiency. With advances
in high speed communications this approach will also provide the infrastructure
needed to support creative experiments with distributed computing.
The fourth element in our vision focuses on the nature and time scale for human
creativity. Research on computing tools, algorithms and new models of physical sys-
tems pay significant dividends but such research takes time and commitment par-
ticularly when associated with changes in the underlying computing paradigm.
This fourth element as well as the final three are themes that are also dealt with
in the HPCC report.
Our recommendations follow directly from this vision.
Recommendations
• Continuing Need for the Centers Program
• Specific infrastructure Characteristics for Leading-Edge Sites
• Partnering for a More effective National Infrastructure
• Competition and Evaluation
• Support of Research at the Centers
• Allocation Process for Computer Service Units
• NSF Leadership in Interagency Planning
The first recommendation is that NSF should continue to provide access to the
high-end. This infrastructure is central to maintaining U.S. researchers among the
32
leaders in long term fundamental research. The Centers Program is needed in our
view to do this effectively. As new advanced computer hardware becomes commer-
cially available, scalability will continue to be a challenge for the foreseeable future.
Each new generation of computing power brings with it vast new opportunities
for advances in fundamental understanding and synthesis of complex — often inter-
disciplinary — research aieas. The take-home message from our report is that lead-
ing edge computing has and will continue to make significant contributions to the
advance of science and engineering.
Our second recommendation is that NSF needs to support a few "Leading Edge"
sites. Sites that have a balanced set of high-end hardware capabilities coupled with
appropriate staff and software. Raw processor speed or compute power is not suffi-
cient to achieve the balanced high-performance systems needed for paradigm shift-
ing research.
In our third recommendation we focus on partnerships. Partnerships will be im-
portant for the future of the centers program. The strategic coupling between lead-
ing-edge sites and research centers and university laboratories is a significant new
change that we are recommending. These partnership sites will provide better cou-
pling to computer scientists developing new tools and software — this coupling will
be particularly important when their research on mid-range systems reaches the
point when it needs to examine scalability issues that become manifest only on the
leading-edge systems. But the impact of this research will extend beyond computer
science. The partnership sites will also provide more cost effective and diverse plat-
forms for early design and testing of applications software prior to the stage where
access to a leading-edge system is required.
It has long been recognized that some portion of the resources at the NSF Centers
was being used for jobs that did not require access to leading-edge systems. This
is particularly true during the startup, design and test stages of a project — even for
projects with significant overall computational requirements. With the development
of mid-range systems that are upwardly compatible with the leading-edge systems,
it should now be possible to make more effective use of the leading-edge systems
by reserving them, as much as possible, for computations that require that unique
level of capability. The resources on these mid-range systems should also be used
to obtain preliminary data on software performance that would guide the allocations
committees in their review of requests for access to leading-edge systems.
Recommendation four deals with the need for a new competition and for evalua-
tion of the overall program on a regular basis. We believe that a new competition
is the best way to proceed. A new competition can be fair to all and stimulate the
greatest levels of creativity in proposal preparation.
Given the rapid advance of computing hardware and communications technology,
there will be a continuing need to review the centers program. NSF will need to
carry out regular annual reviews, as in the past, and probably a full program review
in about five years.
Recommendation five deals with the support of research at the centers. Centers
need to have high levels of expertise. This requires significant staff involvement
with research, but not direct funding of independent research by the Centers Pro-
gram.
Recommendations six and seven are suggestions to NSF management on leader-
ship issues that the Task Force believes that management needs to focus on, but
I do not believe that they are central to these hearings.
Although we began with the work of earlier reports, we did not accept their con-
clusions or recommendations uncritically. We carried out our own analysis, particu-
larly of the merit review process for allocating resources at the current centers, an
analysis of the educational benefits of the program, and we also sought significant
input from the community, including several key NSF Advisoryy Committees, on the
benefits of the program.
In preparing for is hearing I have been able to talk with most of the members
of the Task Force to confirm their enthusiasm for the manner in which NSF man-
agement is moving forward with the program announcement and the receipt of pro-
posals. All indicated that they were pleased with the progress that has been made
up to this point.
Thank you for your attention. I will be happy to answer questions.
33
ATTACHMENT 1
a
Ph
O
U
O
a
cd
c2
Ah
bX)
H
CIh
Cd C^ c/2
.Jri 'U ^
Cd (D (D
:^ ^ u
/
^
>
*2
s
^
o
^
^^
1
1
cd
o
s
U(
C/3
r\
;-i
D
<1>
-4— >
^— »
cd
C
4->
(D
00
\
Cd
OX) cf^
^
cd
34
PARTNERSHIPS FOR
ADVANCED
COMPUTATIONAL
INFRASTRUCTURE
Program Solicitation
DIRECTORATE FOR COMPUTER AND INFORMATION SCIENCE
AND ENGINEERING
PREPROPOSAL DEADLINE: April 15, 1996
PROPOSAL DEADLINE: September 1, 1996
S?^ NATIONAL SCIENCE FOUNDATION
35
Program Solicitation
Partnerships for Advanced Computational Infrastructure
Overview
NSF support of high performance computing has played a
major role m advancing science and engineering research and in
enabhng U. S. world leadership in computational science and
engineering. The NSF Supercomputer Centers program has
served as a cornerstone of these advances by expanding the use
of high-end computing in new disciplines, facilitating the accep-
tance of computation as a full partner in scientific research, and
facilitating the education of a new generation of computational
scientists and engineers in support of that shift.
This program solicitation for the Partnerships for Advanced
Computational Infrastructure program, builds on and replaces
the current NSF Supercomputer Centers program established in
1985, and focuses on taking advantage of newly emerging op-
portunities in high performance computing and communica-
tions. This new program will provide flexibility, both to adapt to
rapidly evolving circumstances and to meet the need for high-end
compulation, in order to enable continued world leadership in
computational science and engineering. The program will pro-
vide access for researchers to high performance computing sys-
tems as its core, with associated highly trained staff and
researchers necessary to develop and optimize their use The
emergence of scalable parallel systems, high performance net-
working and high bandwidth, large capacity mass storage sys-
tems creates the opportunity for a national infrastructure
consisting of a number of geographically distributed sues
strongly coupled to high-end computational resources and to
each other via high-speed communication networks.
NSF envisions an Advanced Computational Infrastructure
consisting of one or more leading-edge sites together with coop-
erating partners. Leading-edge sites are expected to maintain
high-end hardware systems that are one to two orders of magni-
tude more capable than those typically available at a major
research university. These systems should be balanced in terms
of processor speed, memory, and storage systems, and should be
accompanied by appropriate staff, software and high speed com-
munications capability. The partners will, in the aggregate,
complete the overall infrastructure by, among other things,
(a) facilitating research and experimentation with new hardware
and software, including appropriate support technologies such as
visualization and mass storage, (b) providing scalable resources
for applications and applications development that can be best
done on mid-level systems, (c) providing access to unique ex-
perimental systems and facilities, and (d) promoting education
and training.
Background
Several reports analyzing the current Supercomputer Centers
Program and making recommendations for future computational
infrastructure have appeared recently. Most directly relevant to
this solicitation is the Report of the Task Force on the Future of
the NSF Supercomputer Centers Program, chaired by Dr Ed-
ward Hayes, September, 1995, which discusses the history, cur-
rent context, and rationale for NSF support of high performance
computational infrastructure for the science and engineering
research communities.
Two other reports. From Desktop to Teraflop: Exploiting the
U.S. Lead in High Performance Computing, chaired by Dr.
Lewis Branscomb, October, 1993, and Evolving the High Per-
formance Computing and Communications Initiative to Support
the Nation's Information Infrastructure, chaired by Drs.
Frederick Brooks and Ivan Sutherland, National Research Coun-
cil, February, 1995 provide additional background and discuss
the overall impact of these programs in the context of the larger,
federally supported High Performance Computing and Commu-
nications program.
These reports are available on the World Wide Web via URL
http://www.cise.nsf.gov/.
Hard copies of the first two reports may be obtained by
sending e-mail to bmelvin@nsf gov. The NRC report is available
from the National Research Council, 2101 Constitution Ave.
N.W., Washington. DC 20418.
Mission
As part of its strategic plan. NSF in a Changing World, a key
NSF goal IS to enable the United States to uphold a position of
world leadership in science and engineering research and educa-
tion. In order to maintain world leadership in computational
science and engineering, NSF intends to create an advanced
national computational infrastructure whose overall mission is
to:
• provide, facilitate, and enhance access to high performance
computational infrastructure for the U. S. academic, scien-
tific, and engineenng communities by partnering with
universities, states, and the private sector;
• promote vigorous early use of experimental and emerging
high performance computational and associated commu-
nications technologies that offer high potential for advanc-
ing science and engineering;
• enable the effective use of such infrastructure and tech-
nologies through education, training, consulting, and re-
lated support services, including appropriate software
development, experimentation, and support;
• foster interdisciplinary research in science and engineer-
mg;
• facilitate the development of the intellectual capital re-
quired to maintain world leadership in computational sci-
ence and engineering; and
36
• broaden the base for the nation's advanced computational
and communications mfraslructure.
Purpose of this Solicitation
This solicitation calls for Innovative proposals to provide a
national computational infrastructure that will address the mis-
sion stated above. Guidance for participating in this solicitation
IS provided below.
Since the Partnerships for Advanced Computational Infra-
structure Program will primarily support academic research, it is
expected that proposals will come from, and the partnership's
cooperative agreement will be with, a U. S. academic institution.
However, a successful proposal must involve multiple partners
who can be. but are not limited to:
• universities, including research groups within universities;
• NSF-funded Centers and facilities, such as. Science and
Technology Centers. Supercomputer Centers, Engineering
Research Centers, and Industry-University Cooperative
Research Centers;
• research and educational consortia, organizations, and
groups;
• regional and stale-supported high-performance computing
centers;
• private sector organizations; and
• national laboratories.
A full description of organizations and individuals who are
eligible to submit proposals to NSF and the conditions under
which they may compete, are given in the NSF Grant Proposal
Guide (GPG) (NSF 95-27). Single copies of this brochure are
available at no cost from the NSF Forms and Publications Unit,
(703) 306- 1 1 30, or via e-mail: pubs(a'nsf gov. Brochures are also
available electronically through NSF's Science and Technology
Information System (STIS) and on the World Wide Web at URL
htlp://www.nsf.gov/.
Review Process and Criteria
The review process will involve preproposals and full propos-
als, both of which will be reviewed by external panels using
criteria discussed below. Panelists will be chosen from a variety
of sectors including academia, the private sector, national labo-
ratories, and other government agencies. Site visits are antici-
pated for finalists in the competition in order to clarify issues
raised during the reviews, and to explore additional matters as
needed. A summary panel, consisting of the chairs of the site visit
teams and the chair of the final proposal review panel, will meet
after the site visits to formulate final recommendations for a
balanced program.
As with all proposals to NSF, these proposals will be evaluated
using the four standard review criteria: (I) the quality of the
proposed scientific effort. (2) competence of the investigators.
(3) relevance of the research, and (4) impact on the infrastructure
of science and engineering. However, for this particular solicita-
tion, impact on infrastructure and competence of investigators
will be especially important. Therefore preproposals and final
proposals for this program will be subject to four, more specific,
criteria dealing with the effectiveness of the overall partnership
in addressing the program's mission, the quality of the individual
partners, management, and financial leverage. While these more
specific criteria are listed in priority order (with the first being
the most important), all will be considered by the panels in
arriving at recommendations. Finally, the summary panel will be
asked to advise NSF on what each proposed partnership can
contribute to the technical diversity and balance of the program
as a whole. In assessing the degree to which each preproposal
and proposal satisfies a particular criterion, the reviewers will be
asked to consider each of the following points.
1 ) The degree to which the proposed partnership addresses the
overall mission statement and demonstrates that it will be able
to:
• provide the physical and human infrastructure needed to
fulfill a national leadership role with an intellectual envi-
ronment that can foster world leadership in computational
science and engineering research across all NSF disci-
plines;
• maintain balanced computational resources, including at
least one site with capabilities one to two orders of magni-
tude greater than that typically available at a major re-
search university;
• enable interdisciplinary partnerships among the academic
computer science, mathematics, and computational sci-
ence research communities;
• support and develop software enabling computational sci-
ence and engineering;
• effect outreach and technology transfer to a heterogeneous
user base;
• support cooperative relationships with hardware and soft-
ware vendors;
• foster the integration of research, education, and training
in computational science and engineering;
• promote the advancement of kindergarten through under-
graduate science and engineering education generally; and
• provide world-class leadership through a diverse and syn-
ergistic set of partners.
2) The degree to which the proposed individual partners demon-
strate expertise in at least one of the following:
• providing access to. and support for. high performance
computing, including appropriate supporting technolo-
gies;
• conducting world-class research in computational science
and engineering, computer science and engineering, or an
appropriate related field;
37
• promoting vigorous early use of experimental and emerg-
ing scalable computing, communication, or mass storage
technologies;
• providing training, education, and outreach to the research
and educational communities and to the private sector: and
• supporting cooperative efforts across multiple intellectual
and/or institutional sectors.
3) The quality of the proposed management of the partnership
including:
• clear plans for the close coordination and management of
the partnership, including the computational, communica-
tions, and intellectual resources of the partners; and
• plans for measuring the partnership's success in meeting
its goals.
4) The degree of financial leverage in the proposed program,
including cost sharing. This can include personnel provided
by institutions and vendors, state and local support, facilities,
vendor discounts beyond normal educational discounts, soft-
ware, etc.
In addition, the final selection by NSF will consider what each
partnership can contribute to the technical diversity and balance
of the program as a whole.
Funding Levels and Duration of Awards
NSF funding for the current NSF Supercomputer Centers
Program totals about $65 million per year. Funding levels for the
new program will depend on expected overall funding levels for
NSF at the time of the award(s). NSF funding will enable active
participation in the partnership and create a broad base support-
ing the national computational infrastructure for research in
science and engineering. The number and size of awards will be
based on the quality and potential impact of the proposed part-
nerships, and on the availability of funds. Each partnership will
be funded through a single cooperative agreement specifying the
level of support for each partner, as proposed by the partnership,
and negotiated with NSF before the award. The cooperative
agreement for the partnerships is expected to cover a five-year
period beginning in fiscal year 1998.
Phase in of a new partnership is expected to occur over a one
year period. Partnership plans and proposed budgets must be
prepared with enough flexibility to allow the phasing in of
partnership resources during the first year, as negotiated with
NSF. To provide flexibility in meeting special needs, funding can
be provided for partnerships having a membership that may
change over the duration of the cooperative agreement; thus,
proposals involving partners for a shorter period are encouraged,
where appropriate. Funding will be provided in yearly incre-
ments subject to successful annual review of program plans,
performance, and the availability of funds.
To provide a partnership with the flexibility to respond to an
unusual opportunity, completion of a partner's task, changes or
relocation of a critical person, etc., the partnership may propose
(at any time, subject to review), a change in the number or
composition of its partner sites. Normally such proposals will be
pan of the annual program plan review process and would be
considered along with other substantive issues.
The partnerships will be reviewed annually by an external
partnership review panel, with site visits in the second year. In
the fourth year of the program, NSF will conduct a full inde-
pendent review of the overall program. Subject to the needs of
the scientific and engineering community as determined by this
review; the acceptable annual progress and performance reviews
of the partnerships; and the availability of funds, NSF may invite
renewal proposals, with the intent of extending successful part-
nerships for an additional five-year period.
At the end of ten years, this program will be "sunset" . During
the eighth year there will be a full and independent review to
determine the anticipated future needs of the academic science
and engineering community
Program IVIanagement
Each partnership will offer a variety of resources. In addition
to computational resources, there might be visualization re-
sources, such as access to a virtual reality environment, mass
storage resources located at a single site, discipline-specific
knowledge repositones, software development resources, code
porting and optimization resources, discipline-specific subrou-
tine and program libraries, performance instrumentation tools
and libraries, generic subroutine libraries such as linear algebra
codes, and educational and training resources.
The individual partnerships are being asked to propose man-
agement plans, allocation mechanisms, and means for obtaining
external advice. However, NSF will require that the partnerships
develop a single mechanism for allocation of the use of the bulk
of the program's resources, independent of where they are lo-
cated
Budget
Funds may be requested to support scientific and technical
staff essential for systems programming, software development,
research and user services, coordination and conduct of educa-
tion, training and outreach activities, and participant support;
training materials and software; networking equipment and op-
erating costs; and computer systems, indirect costs and other
appropriate costs associated with the overall project. Proposers
should Itemize costs on the NSF Budget Form 1030 and include
additional supporting documentation Proposed cost sharing
should be recorded on Line M of the Budget Form 1030.
Cost Sharing
Cost sharing may be in cash or in-kind from any private or
non-federal source. The estimated value of any in-kind contribu-
tions should be included and an explanation of the source, nature,
amount, and availability of any proposed cost sharing should also
be provided. Cost sharing must occur during the award period
and must be consistent with OMB Circular A- 1 10, Section .23.
Cost sharing specified in the proposal will be referenced and
included as a condition of the award.
38
Preproposal Guidelines
In order to be eligible to submit a formal proposal, the institu-
tion must have submitted, by April 15, 1996, a preproposal that
was reviewed, or been named as a partner on a preproposal that
was reviewed. Preproposals will be reviewed by external panels
using the criteria listed above. The intent of the preproposal
round is to give the community an opportunity to put forth
diverse ideas, provide information that will assist in developing
a stronger proposal, and point out preproposals that are not
expected to be competitive. Review comments from the prepro-
posal round will be supplied about the middle of May 1996 to
the proposers in order to provide feedback for the final proposal
preparation. Multiple preproposals from the same proposing
organization involving different partners, funding levels, and
objectives are allowed Neither institutional approvals nor signa-
tures are required at this stage; however, proposals should follow
any applicable institutional guidelines for preproposals.
Preproposals should be mailed to;
Partnerships in Advanced Computational Infrastructure
NSF, Room 1 1 22
4201 Wilson Boulevard
Arlington, VA 22230
Preproposals must be received by NSF no later than April 15,
1996; or postmarked no later than five (5) days prior to the
deadline date; or sent via commercial overnight mail no later than
two (2) days prior to the deadline date, to be considered for
review.
Preproposals submitted in response to this solicitation must be
prepared and submitted in accordance with the guidelines pro-
vided in the NSF Gram Proposal Guide. (GPG ) (NSF 95-27); in
particular, page formatting requirements given in Chapter 2,
Section C, will be strictly enforced and preproposals not com-
plying will be returned without review. Preproposals should
allow reviewers to address each of the evaluation criteria listed
above.
Preproposals (15 copies) must include:
• A cover page listing the Pi's e-mail address and FAX
number, and all participating sites and organizations;
• An executive summary of 2-3 pages;
• A main body of 10 single-spaced pages, to include:
- goals and objectives;
- a description of the proposed partnership activities;
- an overview of the partnership, its participants, and
plans for closely coordinating the participants;
- a staffing description and summary of the key person-
nel at each site;
- a list of the expected accomplishments of the partner-
ship over the five-year period of the award;
- a proposed management plan for the partnership, in-
cluding mechanisms for making policy and planning
decisions, and obtaining advice from the community;
- a proposed method for allocating and closely coordi-
nating the computational and intellectual resources of
the partnership; and
- a plan for evaluating the success of the partnership.
• Adescription of proposed hardware, networking, and soft-
ware, including planned upgrades;
• Key planned software and algorithm developments, if
applicable;
• Cost sharing projections, both cash and in kind;
• A non-binding draft budget for each year (the proposed
funding level for each partner should be specified);
• An appendix containing one to two page vita including the
most relevant publications for the key personnel; and
• No additional appendices are allowed.
The preproposals will be reviewed by external panels. It is
anticipated that preproposal review will be completed and feed-
back provided to the proposers by mid-May 1996.
Proposal Guidelines
To be eligible to submit a formal proposal, the institution must
have submitted a preproposal that was reviewed, or have been a
partner on a preproposal that was reviewed.
Proposals should be mailed to:
Partnerships in Advanced Computational Infrastructure
NSF Room 1122
4201 Wilson Boulevard
Arlington, VA 22230
Proposals must be; received by NSF no later than September I ,
1996; or postmarked no later than five (5) days prior to the
deadline date; or sent via commercial overnight mail no later than
two (2) days prior to the deadline date, to be considered for
review.
Proposals submitted in response to this solicitation must be
prepared and submitted in accordance with the guidelines pro-
vided in the NSF Grant Proposal Guide (GPG) (NSF 95-27); in
particular, page formatting requirements given on page 3, Sec-
tion C, will be strictly enforced and proposals not complying will
be returned without review. Proposals should allow reviewers to
address each evaluation criterion listed above.
Proposals (15 copies) should include the following:
• A cover page listing the Pi's e-mail address and FAX
number, and all participating sites and organizations;
• An executive summary of 2-3 pages;
• A main body of 40 single-spaced pages, to include:
- goals and objectives;
- a description of the proposed partnership activities;
- an overview of the partnership, its participants, and
plans for closely coordinating the participants;
39
- a staffing description and summary of the key person-
nel at each site;
■ a hst of the expected accomplishments of the partner-
ship over the five-year period of the award;
- a proposed management plan for the partnership in-
cluding mechanisms for making policy and planning
decisions, and obtaining advice from the community;
a proposed method for allocating and closely coordi-
nating the computational, and intellectual resources of
the partnership; and
- a plan for evaluating the success of the partnership.
• A description of proposed hardware, networking, and soft-
ware, including planned upgrades;
• Key planned software and algorithm developments, if
applicable;
• Cost sharing proposed, both cash and in kind;
• Abudget for each year(the proposed funding level for each
partner should be specified);
• An appendix containing one to two page vita including the
most relevant publications for the key personnel;
• An appendix containing letters that commit actual re-
sources such as funds, hardware, people, space, etc. from
the participating institutions, vendors, states, etc. and
• No additional appendices are allowed.
Proposals will be reviewed by external panels. While there will
be some overlap between the panels reviewing the preproposals
and final proposals, the panelists will be instructed that any
recommendations based on the preproposal review were not
intended to be binding on the proposer Recommendations and
reviews from the preproposal round will not be available to the
final proposal panel, and should not be considered in reviewing
the final proposals. Site visits are anticipated for finalists in the
competition in order to clarify issues raised during the reviews,
and to explore additional matters as needed.
Panel reviews are expected to be completed by October, 1996,
and site visits will occur before the end of the 1 996 calendar year.
Awards should be announced in the Spring of 1997.
Inquiries related to the program should be sent via e-mail to
PACI(s)nsfgov
or they may be sent by US mail to:
Partnerships in Advanced Computational Infrastructure
NSF, Room 1122
4201 Wilson Boulevard
Arlington, VA 22230
GENERAL INFORMATION
The Foundation provides awards for research in the sciences
and engineering. The awardee is wholly responsible for the
conduct of such research and preparation of the results for
publication. The Foundation, therefore, does not assume respon-
sibility for the research findings or their interpretation.
The Foundation welcomes proposals from all qualified scien-
tists and engineers and strongly encourages women, minorities,
and persons with disabilities to compete fully in any of the
research related programs described here.
In accordance with federal statues, regulations, and NSF poli-
cies, no person on grounds of race, color, age, sex, national
origin, or disability shall be excluded from participation in. be
denied the benefits of, or be subject to discrimination under any
program or activity receiving financial assistance from the Na-
tional Science Foundation.
Facilitation Awards for Scientists and Engineers with Dis-
abilities (FASEDl provide funding for special assistance or
equipment to enable persons with disabilities (investigators and
other staff, including student research assistants) to work on NSF
projects. See the program announcement ( NSF 9 1 -54 ) or contact
the program coordinator at 306-1636.
Privacy Act and Public Burden. The information requested
on proposal forms is solicited under the authority of the National
Science Foundation Act of 1950, as amended. It will be used in
connection with the selection of qualified proposals and may be
disclosed to qualified reviewers and staff assistants as part of the
review process; to applicant institutions/grantees to provide or
obtain data regarding the application review process, award
decisions, or the administration of awards; to government con-
tractors experts, volunteers, and researchers as necessary to
complete assigned work; and to other government agencies in
order to coordinate programs. See Systems of Records. NSF 50,
Principal Investigators/Proposal File and Associated Re-
cords, and NSF-51. 60 Federal Regislei 4449 (January 23,
1995) Reviewer/Proposal File and Associated Records, 59
Federal Register 8031 (February 17, 1994). Submission of the
information is voluntary. Failure to provide full and complete
information, however, may reduce the possibility of your receiv-
ing an award.
Public reporting burden for this collection of information is
estimated to average 120 hours per response, including the time
for reviewing instructions. Send comments regarding this burden
estimate or any other aspect of this collection of information,
including suggestions for reducing this burden, to Herman G.
Fleming. Reports Clearance Officer. Contracts, Policy, and
Oversight, National Science Foundation, 4201 Wilson Boule-
vard, Arlington, VA 22230.
The National Science Foundation has TDD and FIRS capabil-
ity, which enables individuals with hearing impairment to com-
municate with the Foundation about NSF programs,
employment, or general information. To access TDD dial (703)
306-0090; for FIRS, 1-800-877-8339.
40
6ETTIN6 NSF INFORMATION AND PUBLICATIONS
The National Science Foundation (NSF) has several ways for the public to receive information
and publications. Electronic or printed copies of the NSF telephone directory, abstracts of
awards made since 1989, and many NSF publications are available as described below. To
access information electronically, there is no cost to you except for possible phone and Internet
access charges. Choose the method of access that matches your computer and network tools. For
general information about Internet access and Internet tools, please contact your local computer
support organization.
WORLD WIDE web:
NSF HOME PACE
The World Wide Web (WWW) syslem
makes it possible to view text material
as well as graphics, video, and sound.
You will need special software (a "web
browser") to access the NSF Home
Page. The URL (Uniform Resource
Locator) is http://www.n5f.g0v/.
INTERNET «OPHER
The Internet Gopher provides access to
information on NSF's Science and
Technology Information System
(STIS) through a series of menus. To
access the Gopher, you need Gopher
client software; the NSF Gopher server
Is on port 70 ofstis.nsf.gov.
ANONYMOUS FTP (FILE
TRANSFER PROCRAM)
Internet users who are familiar with
FTP can easily transfer NSF
documents to their local system for
browsing and printing. The best way
to access NSF information is to first
look at the index (file name:
index.txt). From the index, you can
select the files you need. FTP
instructions are:
■ FTPtostis.nsf.gov.
m Enter anonymous for the user name.
and your e-mail address for the
password.
• Retrieve the appropriate file (i.e.,
filename.ext)
E-MAIL (ELECTRONIC-MAIL)
To get documents via e-mail, send your
request to the Internet address
slisserve@nsf.gov The best way to
find NSF information is to request the
index. Your e-mail message should
read: get index.txt. An index with file
names will be sent to you. However if
you know the file name of the
document you want, your e-mail
message should read:
get <filenanie.ext>
E-MAIL MAILINC LISTS
NSF maintains several mailing lists to
keep you automatically informed of
new electronic publications. To get
descriptions of the mail lists and
instructions for subscribing, send your
request to: slisserve@nsf.gov. Your
message should read: get stisdirm.txt.
ON-LINE STIS
NSF's Science and Technology
Information System (STIS) is an
electronic publications dissemination
system available via the Internet (telnet
to slis.nsf.gov), you will need a VTIOO
emulator. The system features a full-
text search and retrieval software
(TOPIC) to help you locate the
documents. Login as public and follow
the instructions on the screen.
To get an electronic copy of the "STIS
USERS GUIDE." NSF 94-10. send an
e-mail request to: stisserve@nsf.gov.
Your message should read:
get NSF9410.txt. For a printed copy of
the "STIS USERS GUIDE," see
instructions "How To Request Printed
NSF Publications."
NON-INTERNET ACCESS
VIA MODEM
If you do not have an Internet
connection, you can use remote login
to access NSF publications on NSF's
on-line system, STIS You need a
VTIOO terminal emulator on your
computer and a modem.
• Dial 703-306-0212,
• choose 1 200, 2400, or 9600 baud,
• use settings 7-E- 1 , and
• login as public and follow the on-
screen instructions.
NSF 95-64 (Replaces NSF 94-4)
HOW TO REQUEST PRINTED
NSF PUBLICATIONS
You may request printed publications
in the following ways:
• send e-mail request to:
pubs@nsf.gov
m fax request to: 703-644-4278
■ for phone request, call: 703-306-
1130 or Telephonic Device for the
Deaf (TDD 703-306-0090)
■ send written request to:
NSF Forms and Publications Unit
4201 Wilson Boulevard
Room P- 15
Arlington, VA 22230
When making a request, please include
the following information:
• NSF publication number;
• number of copies; and
• your complete mailing address.
QUESTIONS ABOUT NSF
PUBLICATIONS. PROCRAMS/
ETC
Contact the NSF Information Center if
you have questions about publications,
including publication availability,
titles, and numbers. The NSF
Information Center maintains a supply
of many NSF publications for public
use. You may:
• visit the NSF Information Center,
located on the second fioor at 4201
Wilson Blvd., Arlington, Virginia.;
or
• call the NSF Information Center at
703-306-1234; or 703-306-0090 for
TDD; or
• send e-mail message to
info@nsf.gov
QUESTIONS ABOUT THE
ELECTRONIC SYSTEM
Send specific, system-related questions
about NSF electronic publication
services that are not answered in this
fiyer, to webmaster@nsf.gov or call
703-306-0214 (voice mail).
41
Report of the Task Force on the Future of the NSF Supercomputer Centers Program
Executive Summarv
The NSF supported Supercomputer Cen-
ters have played a major role in advanc-
ing science and engineering research.
They have enabled collaboration among
academic, industrial and government re-
searchers on the solution of problems re-
quiring demanding computational and
visualization tools. In the 10 years of
their existence, the Centers have fostered
fundamental advances in our under-
standing of science and engineering, ex-
panded the use of high-end computing in
new disciplines, facilitated the major
paradigm shift of the acceptance of com-
putational science as a full partner in the
.scientific method, and facilitated the edu-
cation of a new generation of computa-
tional scientists and engineers in support
of that shift. These statements are docu-
mented in the body of the report as well
as in its appendices.
Having accomplished so much in the la.st
decade, it is natural to ask what the future
role of the NSF should be in high-end
computing. In October 1994. the Na-
tional Science Board approved two-year
contmuation funding for the Supercom-
puter Centers. This provided time for the
director of NSF to appomt this Task
Force to analyze the alternatives. The
possibilities considered include continua-
tion, restructuring, or phasing-out of the
current program, as well as creation of
alternative models.
The Task Force believes that the future
for computational science and engineer-
ing can be as bright or even brighter than
in the past decade. If we seize the op-
portunity, over the next decade we can
make major progress on multiple fronts.
There will be
• opportunities for exciting applica-
tions of our nation's exponentially
increasing computational capacity,
for example:
- more complete models, and hence
deeper understanding of physical
systems by moving to three and
higher dimensions;
- progress in computational tools to
aid drug and protein design;
- computational predictions of sci-
entifically and commercially sig-
nificant materials;
- multidiscipiinary models of
physical systems (e.g., combining
fluid dynamics and electromag-
netic models of the heart);
- increased interconnectivity of su-
percomputers and high impact in-
.strumentation; and
- models of anatomical and
physiological proces.ses leading to
new insights of benefit to human
health.
• more quantitative computational
results in unanticipated areas.
• more explosive growth of com-
munications as a component of the
computational .science and engi-
neering paradigm; and. impor-
tantly.
• continued progress in the tools and
methods for developing code that
is both portable and yet takes ad-
vantage of unique parallel archi-
tectures.
These advances will not automatically be-
come available to American researchers.
To position the U.S. academic commu-
nity to participate in the exciting research
possibilities enabled by these develop-
ments, the Task Force has the following
recommendations leading to a restruc-
tured Centers program.
September 15, 1995
42
Report of the Task Force on the Future of the NSF Supercomputer Centers Program
Executive Summary
In order to maintain world leadership
in computational science and engi-
neering, NSF should continue to
maintain a strong, viable Advanced Sci-
entific Computing Centers program,
whose mission is:
• providing access to high-end
computing infrastructure for the
academic scientific and engi-
neering community;
• partnering with universities,
states, and industry to facilitate
and enhance that access:
• supporting the effective use of
such infrastructure through
training, consulting, and related
support services:
• being a vigorous early user of
experimental and emerging high
performance technologies that
offer high potential for advanc-
ing computational science and
engineering;
• facilitating the development of the
intellectual capital required to
maintain world leadership.
NSF should assure that the Centers
program provides national "Leading-
edge sites" that have a balanced set of
high-end hardware capabilities, coupled
with appropriate staff and software,
needed for continued rapid advance-
ment in computational science and en-
gineering.
NSF, through its Centers program,
should assure that each leading-edge
site is partnered with experimental fa-
cilities at universities, NSF research
centers, and/or national and regional
high performance computing centers.
Appropriate funding should be provided
for the partnership sites.
NSF should announce a new com-
petition of the High Performance Com-
puting Centers program that would
permit funding of selected sites for a
period of five years. If regular reviews
of the Program and the selected sites
are favorable, it should be possible to
extend initial awards for an additional
five years without a full competition.
The Centers program should continue
to support need-based research in sup-
port of the program's mission, but
should not provide direct support for
independent research.
NSF should increase the involvement of
NSF's directorates in the process of al-
locating service units at the Centers.
NSF should provide leadership in
working toward the development of in-
teragency plans for deploying balanced
systems at the apex of the computa-
tional pyramid and ensuring access to
these systems for academic researchers.
These recommendations are designed to
set the Centers program on a new course
that builds on its past successes, yet shifts
the focus to the present realities of high-
performance computing and communica-
tions, and provides flexibility to adapt to
changmg circumstances. It is our expec-
tation, that at current NSF budget levels
and absent new outside resources, there
will be a reduction in the number of lead-
ing-edge sites to effect the benefits of the
Task Force recommendations.
In developing these recommendations,
the Task Force obtained extensive input
from academic, government, and indus-
trial leaders: visited Centers and sought
written input from the community.
Some of this input is included as appen-
dices, and the complete input is available
on the Internet. The issues are complex
and there are many strongly held opin-
ions on the purpose, execution, and value
of the program. The Task Force has tried
to hear and understand all of the input.
September 15, 1995
43
Report of the Task Force on the Future of the NSF Supercomputer Centers Program
Executive Summarv
but in the end has, of necessity, formed
its own judgment of what is best for the
country. This report attempts to explain
that judgment.
The report begins with a history of the
Centers and how they fit into the nation's
high performance computing infrastruc-
ture.
The .second section attempts to identify
factors the Task Force thinks are impor-
tant in the evaluation process, including
staff involvement in research, size of the
u.ser ba.se, scientific discipline of the us-
ers, funding leverage, industrial partner-
ships, multidi.sciplinary activities, re-
source availability, and education.
The "hard issues" surrounding the Cen-
ters, particularly those not adequately dis-
cussed in previous reports, are discussed
in the third section. This .section exam-
ines such issues as: sunsetting the Cen-
ters: industrial u.se; effect of "free" com-
puter cycles on the market: the total need
for high-end computing; quality of the
science; role of other centers; technology
and computer industry trends; and the
role of the Centers in the larger federal
and international context.
Section four examines five options for a
Centers program, ranging from the cur-
rent system to termination of the pro-
gram. Other options include partnership
centers with stronger links between lead-
ership centers and university or state fa-
cilities; a single partnership center; and
disciplinary centers along the lines of the
National Center for Atmospheric Re-
.search. The pros and cons of each option
are discussed.
The fifth section of the report discusses
future directions and priorities for the
Centers program. The final section re-
states and explains each of the seven spe-
cific recommendations designed to sup-
port the Task Force vision for the future.
September 15, 1995
44
High Performance Computing Infrastructure and Accomplishments
October 1994
Table of Contents
INTRODUCTION 1
IMPORTANT TECHNOLOGY ACCOMPLISHMENTS 1
hi(;h performance computing i
Supcrcunipuler Usage at NSF Centers 2
Aicliuectuies and Vendors 2
Center Program Chronology 2
National Access to Vector Multiprocessors 4
Aclueving Production Parallelism 4
Early Migration to the UNIX Operating System 4
Early Access to Massively Parallel Computers 4
Superlincar speedup on heterogeneous processors 5
Workstation Clusters 5
PORTABLE PARALLEL PROGRAMMING TOOLS 6
Prototype Parallel Programming Environments 6
Extensions of PVM 6
Scalahle Libraries <i
STORAGE TECHNOLOGIES 6
AFS-Establishing a National File System.. 7
HDF-Creating a Standard File Formal 7
Migrating to a Standard Archiving Software 7
Development ol high-densily magnetic media 8
NET\VORKIN(; 8
Evolution ol NSFNET 9
High Performance LANs 4
Gigabit Testheds 10
Secure Networks H'
New Science Enabled by Networks— Teleniicroscopy 10
Nil Testbeds II
VISUALIZATION AND VIRTUAL REALITY 12
DcNclopincnt of Scientitic Visualization 12
Virtual Realit) Impacts Industrial Design 1 3
DcvelopnieiU of Immersive Science Projects 14
Virtual Reality over ATM networks 14
Alpha Shapes. Biomolecules. and Cosmology 14
DKilTAL LIBRARIES AND INFOSERVERS 14
Digital Libraries 15
Scalable Information Servers 15
The Rise ol the Mosaic/WWW Information Infrastructure 15
DESKTOP SOFTWARE 15
Cunneclivitv Tools 16
45
High Performance Computing Infrastructure and Accomplishments
Collaboration Tools 16
Graphics Tools 16
Scientist' s Workbench 17
ACCOMPLISHMENTS IN EDUCATION AND OUTREACH 17
EDUCATION 17
Researchers and Students 17
Supercomputer Centers Educational Activity Support Summary 17
Outreach to Educators 18
OUTREACH 19
Application ot Scientific Compulation and Visualization to Industrial Production 20
Impact on Vendors of High Performance Computini! Equipment 20
Stimulation of New, Computationally Dependent Ventures 21
Development of Nationally Valuable Reservoirs of Skill 22
Community Service 22
IMPORTANT SCIENCE AND ENGINEERING ACCOMPLISHMENTS 22
SUMMARIES OF COMPUTATIONALLY INTERESTING PROBLEMS IN THE NSF
CENTERS PROGRAM BY THE NATIONAL SCIENCE AND ENGINEERING COM-
MUNITIES: 22
QUANTUM PHYSICS AND MATERIALS 23
Phase Transition in QCD 23
Phase Transitions of Solid Hydrogen 23
Prediction of new Nanomalerials 23
Theory of High Temperature Superconductors 24
Magnetic Materials 24
Understanding Glass 24
BIOLOCY AND MEDICINE 25
Crystallography 25
Folding Proteins using Artificial InlelligcnLe 25
Protein Kinase solution 26
Molecular Neuroscience-Serotonin 26
Molecular Neuroscience-Aceiylcholinesierase 26
Kinking DNA ' 27
Antibody-Antigen Docking 27
Tuning Biomolecules to Fight Asthma 27
Virtual Spider and Artificial Silk 28
Heart Modeling 28
ENGINEERING 28
Ullra-high-strength Steels.. 28
Continuous Casting of Sleel 29
Beverage can design 29
Designing a Leakproof Diaper 29
Bone Transplant Bioengineering 29
Improving Performance with Riblels 29
Designing Belter Aircraft.. 30
Crash Testing Street Signs 30
46
High Performance Computing Infrastructure and Accomplishments
EARTH SCIENCES AND THE ENVIRONMENT 30
Deioxiticalion of Ground Water 30
Sage Grouse-Endangered Species and the US Army 31
Slorm modeling/forecasting 31
Los Angeles Smog 32
Upper Ocean Mixmg 32
Simulating Climate using Distributed Supercomputers 32
PLANETARY SCIENCES, ASTRONOMY, AND COSMOLOGY 32
Comet Collision with Jupiter 33
Discovery ol First Extrasolar System Planet 33
Building the BIMA Radio Telescope 33
Pulsar Searching and Discovery • 34
Accretion Disks Around Black Holes 34
Black Hole Collision Dynamics 34
Largest cosniological simulation 35
EVOLUTION OF THE METACENTER CONCEPT 35
RECOGNITION ACCORDED NSF SUPERCOMPUTER USERS AND PROJECTS 36
ACRONYMS 3 8
47
High Performance Computing Infrastructure and Accomplishments
Introduction
The NSF Centers Program was established
to provide access to high pert'ormance com-
puters (supercomputers) tor the broad Sci-
ence and Engmeermg Research Community.
The program has evolved from one com-
prising independent, competitive. ar>d similar
computer centers to one mcluding more co-
operative and diverse activities. Coordinating
the mission of tlie indi\iduai centers has in-
creased the diversity of computer architec-
tures available to the research community,
and has accelerated outreach to segments of
the community which had not before been
able to use the power of high performance
computers. At the same time, competition
between centers has been managed by NSF
and its advisory committees to the advantage
of the engineering and science communities
which the program was established to .serve.
Building on each center's tradition of pro-
viding a stable source of computer cycles for
a large community of .scientists and engi-
neers, the centers have evolved into a unique
resource to test which diverse computer ar-
chitectures best match the most demanding
problems posed by the community of uni-
versity researchers and to develop the neces-
sary supporting software and algorithms. For
example, this approach has enabled the cen-
ters to test which applications can be effi-
ciently served on newly developed systems
using clusters of the new generation of work-
stations that are now being introduced. Such
experiments are enabled b\ the open en\i-
ronment characteristic of the program.
During the first decade of the centers pro-
gram, major improvements in the delivery of
high performance computing have been de-
veloped, mainly by American computer re-
searchers and companies. But advances in
computing technology have been matched in
equal measure by improvements in computer
networking, and as a consequence the NSF
Centers have been a primary focus for accel-
erating the evolution of the Internet via
NSFnei. NREN. and the still evolving broad-
band width technology.
in this appendix, we provide detailed exam-
ples of the centers activities.
Important Technology Accomplishments
High Performance Computing
Originall> set up in 1985 to provide national
access to traditional supercomputers, the
NSF Centers have evolved to a much larger
mission. The Centers now offer a wide vari-
ety of high performance architectures from a
large array of American vendors. No longer
lUst adopting technology from the national
labs, the NSF Centers Program has become a
pioneering vanguard of technology — a
model for other agencies with a vested inter-
est in the high performance computing to
emulate. This today is dominated by research
efforts in software, with vital collaborations
with computer scientists, focusing on oper-
ating systems, compilers, network control,
mathematical libraries, and programming
languages and environments. The feedback to
the leading US vendors is increasing the use-
fulness of their product offerings to the sci-
entific and engineering communities, while
making them more competitive.
48
High Performance Computing Infrastructure and Accomplishments
Supercomputer Usage at NSF Centers
Fiscal Year
Active Users
Usage in CPU
Hours
1986
1,336
29,485
1987
3,299
95,751
1988
5,042
121,615
1989
5,967
165,960
1990
7,357
250,628
1991
7,723
361,073
1992
8,252
398,931
1993
7,735
910,088
1994
7,395
2,370,794
1995
6,601
4,590,606
(Usage IS 111 normalized CPU hours, based on comparative performance tests. The astoundmg leap m ca-
pacity in 1993 is mamly a result of the introduction of new computing architectures to solve the most
demanding of computational problems — the Gr2tnd Challenges. The slight decrease in the number of
users is the result of a concerted effort by the Centers to assist many of their users with small memory or
CPU-time requirements to meet their computational needs by the increasingly powerful workstations of
the mid-90's. The greatest benefactors of the increase m massively parallel cycles are the scientists and
engmeers addressmg the problems with the greatest computing demands)
Architectures and Vendors
The national community has been offered
access to a wide and frequently updated set of
high performance architectures since the begin-
ning of the NSF Supercomputer Centers Pro-
gram The current rate of change of the types of
architectures, and the number of vendors of-
i'enng them, is probably near an all time high.
We are in a period of ferment which the sci-
ence and engineermg communities sort out the
choices for finding an architecture that matches
theii various computational problems. A list of
architectures that the NSF Centers Program has
offered would include: single and clustered
high performance workstations or workstation
multiprocessors, minicomputers, graphics
supercomputers, mainframes with or without
attached processors or vector units, vector
supercomputers, and SIMD and MIMD mas-
sively parallel processors. Similarly, the list of
current vendors whose top machines have been
made available would include IBM, DEC,
Hewlett-Packard, Silicon Graphics, Sun Mi-
crosystems, Cray Research, Convex Com-
puter, Intel Supercomputer, Kendall Square.
Thinking Machines. nCUBE. Aliiant, Floating
Point Systems. ETA. Stellar, Ardent, and
Slardent.
49
High Performance Computing Infrastructure and Accomplishments
Center Program Chronology
f\
Milestone or Event
Description
1986
? NSF Superaimputer Centers become
Dperational
Cornell Theory Center
National Center tor Supercomputing Applications
Pittsburgh Supercomputing Center
San Diego Superconiputer Center
lohn von Neumann Center
lyss-iywy
Renewal Re\ iew
1989
Renew 4 NSF Supercomputer Centers
CTC, NCSA, PSC, SDSC
1990-1492
ASL Ad\isor\- Committee report com-
pleted
strong recommendations tor adding parallel systems to
accompan\ the stable, production vector svstems
1991
,s \ ector Supercomputers operatmt;
3 Scalable parallel systems operatmg
IBM 3090 h processors (2)
Cray YMP 8 processors (2)
Alh'ant FX80 S processors
Cray 2S 4 processors
Cray YMP 4 processors
Con\'ex C240 4 processors
Intel iPSC/860 32 processors'''
TMC, CM2 32,000 processors (2|'^
1992
7 Vector Supercomputers operatmg
9 Scalable parallel systems operating
lomt Plannmg Initiated
Cray s\'stems remain
IBM ES9000/900 (Upgrade 3090)
Alhant 2800 (upgrade FX80)
Con\'e\ C3880 8 processors (Upgrade C240)
Intel iPSC/860 upgrade 64 processors)!
nCUBE2 128 processorst
TMC, CM5 512 processorst
KSRl 64 processorst
DEC Workstation Cluster (2)t
IBM Workstation Cluster
IBM PVS 32 processors
Initial meeting at SDSC Fall 92
1993
Joint Activities Began
hVector Supercomputers operating
13 Scalable parallel systems operating
Meeting at PSC
First |omt proposals to PPRP
First of Joint projects
Cray C90 16 processors
(other Crays, IBM, Convex stay)
Intel Paragon 400 processors upgrade t
KSRl upgrade 160 processorst
Hewlett Packard Cluster^
MasPar 2 16,000 processors^
IBM SPl 64 processorst
Cra\ T3D 5]2 processorst
1994
lomt Activities
4 Vector Supercomputers operating
14 Scalable parallel systems operating
.Meeting at CTC
Metacenter Regional Alliances - Mar 94
Expansion of |oint proiects
One YMP changed to C-90, others the same
IBM SP2 upgrade 512 processors upgrade SPlt
Cray T3D 512 processors t
Convex Exemplar 8 Nodest
SGI Challenge 32 Nodes
^ Ma|ority of funding provided by other Federal agency (ARPA, NIH) or state.
i Donated m full or in part by the manufacturer for extended evaluation
50
High Performance Computing Infrastructure and Accomplishments
The Centers Prdgram provides a stable sup-
ply of vector computing cycles needed by the
research community while investmg in sig-
nificant capacity of scalable parallel systems,
capable of ultimately growing to a size neces-
sary for full scale grand challenge problems.
While the numbers of Vector Supercom-
puters has decreased, the computing power
represented by that group in fact increased
substantially. However, the increase in the
scalable parallel systems was much larger,
reflecting the growth potential of this type of
computing platform and a strong
NSF/ARPA partnership.
National Access to Vector Multiproc-
essors
The NSF Supercomputer Centers established
in the mid-1980s brought access to state-of-
the-art supercomputers for the first time in at
least 15 years. Indeed, in the 1960s, only a
few universities had such access. This open-
ing of universal access led to an unprece-
dented increase in the number of researchers
and universities involved in advancing the
frontiers of scientific and engineering research
by using high performance computing. By the
early 1990s, some 15,000 researchers in over
200 universities had u.sed one of the Cray
Research vector multiprocessors or the IBM
vector mainframe in one of the NSF centers.
This wide pool of computational researchers
made it possible for the center's program to
begin to respond to the demand for paral-
lelism that had been developed in the Com-
puter Science Community, and adopted by the
most adventurous user. The 90" s saw the
NSF center's program substantially widen its
range of architectural offerings.
Achieving Production Parallelism
The Cornell Theory Center (CTC) became the
first member Center of the NSF MetaCenter
to achieve production parallelism on a vector
supercomputer, with over 1/3 of its vector
supercomputer cycles used for parallelism in
1989. CTC integrated its two (ES/3090 600)
vector supercomputers using a special 200
mbyte/sec hardware interface, allowing paral-
lel jobs the potential of executing across 12
vector processors. Users u.sed a shared-
memory parallel FORTRAN developed by
IBM in a joint project with the Theory Center.
Early Migration to the UNIX Operat-
ing System
During the early and mid-1980's the UNIX
operating system was widely viewed as inap-
propriate for supercomputers for reasons of
performance, system management tools, ap-
plication development and measurement
mechanisms, and security. Cray supercom-
puters were run mostly with operating sys-
tems that were designed at national laborato-
ries (LLNL and LANL in particular) and this
required extensive local software support. In
1987, NCSA became the first major super-
computer center to migrate its Cray super-
computer from CTSS (Cray's proprietary
time-.sharing system) to UNICOS, a UNIX-
based operating system developed at Cray
Research for its supercomputers. This move
to UNIX was the beginning of a merger be-
tween computational science and computer
science, because most computer .science re-
search involved the UNIX operating system at
that time. Coincidentally, CTC was the first
site to run IBM's high performance UNIX
system on its ES/3090 and ES/9000 main-
frames in production.
Early Access to Massively Parallel
Computers
Beginning in 1985, CTC provided ex-
perimental scalable parallel machines, first
the EPS T-series and an iPSC/l parallel sys-
tem, to its user community. In 1988. with the
installation of its iPSC/2 with 32 processors,
CTC made early .scalable computing available
for production use by the national com-
munity. Massively parallel computing was
introduced to the re.search community begin-
ning with NCSA's CM-2 in 1989. Each
Center provided early access to new genera-
tion MPPs. The CTC was the first site to in-
stall an IBM SPI and SP2. PSC installed the
first Cray T3D, and early CMS (NCSA),
Paragon(SDSC), Ncube(SDSC), and Ken-
dall Square (CTC) machines were installed.
Early access to these machines, enabled by
-4-
51
High Performance Computing Infrastructure and Accomplishments
support from ARPA and NIH, allowed pio-
neering users to explore the benefits of fine-
grained parallelism. Each Center worked
with the user community and the vendors to
develop application codes, which could then
be ported to other platforms. In 1992, the
CM-5 was added to the program at NCSA as
the largest distributed memory parallel super-
computer available to the national academic
and industrial communities. From 1992 to
the present NCSA has worked closely with
national users and the computer .science
community to create a wide range of 512-
way parallel application codes that can in
1995 be moved to other large MPP architec-
tures such as the T3D at PSC. the Intel Para-
gon at SDSC. or the IBM SP-2 at CTC.
Superhnear speedup on heterogeneous
processors
In 1991. PSC was the first site to distribute
code between a massively parallel machine
(TMC-CM2) and a vector supercomputer
(Cray YMP). linked by a high speed channel
(HiPPI). Experiments on applications as di-
verse as molecular dynamics, medical imag-
ing, chemical flowsheeting and gene sequence
alignment showed superhnear speedup (the
applications on the linked system ran more
than twice as fast on each system separately).
This formed part of the motivation for het-
erogeneous computing, as later embodied in
the tightly coupled Cray T3D/C90 systems.
PSC's T3D was the first shipped anywhere.
PSC developed a set of codes for transferring
data between the Cray and CM2 which later
enabled them to communicate between the
T3D and C90 at speeds superior to what was
available from the vendor.
A similar superhnear speedup was obtained
on the CASA gigabit testbed set-up included
parts of two supercomputers (64 nodes of the
528-node Intel Delta svstem at Caltech and
one processor of the CRAY C-90 at SDSC.
150 miles away); two HiPPI-SONET gate-
ways; and a SONET wide-area link between
San Diego and Pasadena, which is itself a
prototype undergoing tests in a collaboration
between MCI and Pacific Bell. Usine an en-
vironment for distributed parallel computing
called Express (a product of ParaSoft Corpo-
ration). Aron Kuppermann and Mark Wu
(Dept. of Chemistry, CalTech) did a test cal-
culation of the reaction of atomic hydrogen
with molecular heavy hydrogen (deuterium)
at a total energy of 2.5 eV. This problem had
taken 100 hours to .solve on the SDSC
CRAY Y-MP a year earlier. On the new C-
90, It took 17 hours. On the Delta alone, it
took 16. But when the problem was distrib-
uted between the C-90 and the Delta, the
whole problem was solved by the two ma-
chines in just under five hours, a factor of 3.3
faster than it could be done on either machine
alone.
Workstation Clusters
Given historical trends showing much more
rapid improvement in microprocessor tech-
nology than in vector technology, many Cen-
ters began exploring workstation clusters, in
1990. NCSA examined the usage of the Cray
Y-MP and determined that a significant
amount of capacity could be gained by mov-
ing appropriate applications to scalar RISC
processors. Based on this study and on pre-
dictions that microprocessor technology
would surpass vector technology by the mid-
1990's. NCSA set up an IBM RS/6000 clus-
ter as a farm of processors used as stand-
alone compute servers. Beginning in 1991,
CTC did pioneering work with IBM on clus-
tered RS/6000 workstations with high-speed,
proprietary communications links, including
an experimental optical switch. The informa-
tion gained during this joint project was used
to guide the design of the SP systems. Also in
1991. PSC set up a cluster of DEC worksta-
tions, and working with Florida State Univer-
sity, significantly enhanced queuing, account-
ing and control software and also integrated
the AFS file system into this environment.
These clusters all served as computing re-
sources and al.so as platforms for the devel-
opment of distributed memory message
passing codes. These projects generated high
interest in industry, and NCSA trained several
dozen industrial sites on the integration, op-
-5'
52
High Performance Computing Infrastructure and Accomplishments
eration. and management of clusters ot mi-
croprocessors. NCSA is now moving to es-
tablish clusters of shared memory workstation
multiprocessors from Convex/HP and SGI,
while PSC will develop applications for
Cray's new offering, the J90. The CTC was
given supplemental funding by NSF to build
its cluster to 32 processors, providing a com-
patible path to Its (later) IBM SPl and SP2
systems. The system, including experimental
high-speed switch, was used for production
work by the community, which has now mi-
grated to the SPl and SP2 environments.
Portable Parallel Programming
Tools
Although the architectures of massively par-
allel systems differ greatly, the major time
and money investment of the research com-
munity (as contra-sted to the center's person-
nel) is in developing and converting codes
(porting) to operate in different envi-
ronments. The close cooperation between the
NSF Centers via the MetaCenter and infor-
mal contacts among its research users and
cooperating agencies such as ARPA and the
various National Labs have resulted in sub-
stantial progress ensuring that labor intensive
programming operations need not be dupli-
cated needlessly.
Prototype Parallel Programming En-
vironments
Working with the Parascope Group at the
Center for Research in Parallel Computing, an
NSF Science and Technology Center, CTC
developed extensions supporting new parallel
programming paradigms and extensions
making ports from one type of parallel pro-
gramming platform to another easier. ParaS-
cope is a prototype parallel programming en-
vironment. Both the BIMA and Cosmology
GCs at NCSA are working closely with Indi-
ana University Computer Scientist Dennis
Gannon to move application codes previously
written in FORTRAN to the portable pC-n-
which IS the model for HPC-t-i-, the equivalent
of HPF in the C-i-i- world.
Extensions of PVM
In the Dome project, Adam Beguelin, one of
the original developers of PVM now working
jointly at PSC and in Computer Science at
Carnegie Mellon University, is extending
PVM to improve load balancing and fault
tolerance. His work is guided by the experi-
ence of PSC's cluster users.
Scalable Libraries
The goal of the ARPA funded Scalable Paral-
lel Libraries project is to develop mathematical
software libraries for massively parallel proc-
es.sors that are roughly comparable in scope to
the math libraries typically available on con-
ventional supercomputers. Michael Heath and
his group at NCSA are one team in this multi-
institutional project and they have been devel-
oping parallel direct methods for solving
sparse linear systems. For this purpose, they
have developed a fully parallel sparse solver
for distributed memory parallel computers.
Unlike most other efforts, which have focused
only on factorization, this solver performs all
phases of the computation in parallel mode,
including the symbolic preprocessing neces-
sary to reorder the sparse matrix and distribute
it across processors to maintain data locality.
With funding from IBM, CTC staff devel-
oped scalable versions of key numerical li-
brary routines for its IBM cluster system:
these routines were included by IBM in its
ESSL library product.
Storage Technologies
With the vast increase in both simulation and
observational data, the MetaCenter has
worked a great deal on problems of storage
technologies. Here again, many of the biggest
areas of progress are in software. The crea-
tion of a universal file format standard, a na-
tional file system with a single name space,
and a multivendor archiving software are
some of the results of MetaCenter innova-
tion, collaboration with computer scientists,
and with other leading national laboratories.
There are even examples of the Program's
computational facilities being used to im-
-6-
53
High Performance Computing Infrastructure and Accomplishments
prove the basic storage capacity of the physi-
cal medium ot storage itself.
AFS-Estabiishing a National File Sys-
tem
PSC recognized that the Andrew File System
(AFS). developed at Carnegie Mellon Uni-
versity with IBM support for a workstation
environment, was particularly-well suited for
use in high performance computing, because
of its superior security, .scaling properties,
and manageability. PSC undertook a major
program of adapting AFS to the high per-
formance computing environment which has
led to a MetaCenter wide effort to develop a
shared national file system. PSC's AFS en-
hancements to Cray's UNICOS are installed
at a number of advanced computing centers
(SDSC. .NERSC. IPP. LRZ (Germany).
ETH (Switzerland) and the University of
Stuttgart). PSC has also extended AFS to
multi-resident AFS which enables AFS to be
a component of a hierarchical storage system.
permitting transparent and cost-effective stor-
age of large amounts of data. These en-
hancement are installed at other MetaCenter
sites. NERSC. Cray Research. Transarc Cor-
poration. Ma.x Planck Institute for Plasma
Physics, and the University of Cologne. With
AFS. distributed applications across the
MetaCenter are now possible. For instance.
Paul Dawson (Dept. of Mech. and Aerospace
Engineering. Cornell Univ.). simulating de-
formations of aluminum, uses AFS to build
a distributed application with modeling per-
formed at PSC and visualization at the CTC.
The SDSC is running an AFS cell supporting
the Computational Center for Macromolecu-
lar Structure (CCMS). an SDSC. UCSD. and
Scripps Research Institute collaboration. The
charier of the CCMS is the development and
distribution of portable, innovative sotfware
for the study of macromolecular structure.
AFS simplifies the cross-institution distribu-
tion and maintenance of software and textual
information.
CTC was the first Center to put AFS in pro-
duction on its HPC systems; in fact, all CTC
production systems, except KSR. and servers
are integrated through AFS. including its new
mass storage environment. CTC has lead in a
number of areas of file system integration,
including a project with the University of
Michigan integrating its AFS mainframe
port. IFS. into its HPC environment, and
joint efforts with TRANSARC in improving
AFS performance over high speed networks,
including an FDDI testbed. New efforts in-
volving TRANSARC include optimization
over ATM and within IBM's SP2 scalable
switch.
HDF-Creating a Standard File For-
mat
The NCSA Hierarchical Data Format (HDF).
created by Michael Folk and his group at
NCSA has become one of the leading self-
describing file formats in the world today.
Many scientific institutions, organizations and
programs have adopted HDF as a standard
file format for data exchange and/or archiving.
In 1992. NASA selected ^HDF as the basis
from which to develop an EOSDIS standard
data format (SDF). The goal of SDF is to
provide a single, self-describing format for
distributing data derived from approximately
1.2 terabytes of data daily that EOSDIS will
eventually produce. Other examples of HDF
adoption include the Institute of Applied
GeoScience (seismic data). Pacific Northwest
Laboratory (cancer research). Children's Hos-
pital in Boston (x-ray images), the UCLA
Scientific Visualization Lab. By working
closely with many different user communities
to support the harmonization of data models
and metadata conventions across as many
disciplines as possible. NCSA is helping to
create a software foundation for the Nil en-
abling It to reach its potential to support the
broadest constituency possible.
Migrating to a Standard Archiving
Software
In 1985. the NSF Centers' goal was simply
to establish national access for the academic
community to the type of advanced super-
computing and archiving systems found in
the Dept. of Energy national laboratories.
NCSA and SDSC "duplicated the Los Ala-
54
High Performance Computing Infrastructure and Accomplishments
mos computing environment in 1985, includ-
ing the Common File System (CFS) archive,
while PSC adopted Westinghouse's PDM
software that had additional data migration
facilities. As time went on, the Centers began
to develop innovation in storage software. It
is the strategy of the MetaCenter to explore
alternative technical approaches to storage at
the bits level, while maintaining interop-
erability through standard protocols. SDSC.
through the DISCOS (Distributed Computer
Solutions) division of General Atomics
(which is now owned by Open Vision), pio-
neered productization of distributed, hierar-
chical file and storage management ap-
plications software for networked, multi-
vendor and open sy.stems environments,
based on the IEEE storage model. Working
with the UIUC Computer Science De-
partment, NCSA was able to encode data
migration and caching strategies into CFS to
improve its ability to minimize disk cache
misses. In 1992, NCSA developed CFS-to-
UniTree data formatting and migration tools
as well as a suite of archive management
tools. PSC integrated Cray's proven com-
mercial archiving technology (DMF) with
more usable front-end software and with its
multi-resident AFS. SDSC collaborates with
OpenVision and the National Storage Labo-
ratory in the further development and stabili-
zation of UniTree as a robust production ar-
chival storage system. SDSC has developed
transaction journaling software that is criti-
cally important for guaranteeing integrity of
the file name.server. CTC and SDSC are in-
.stalling the first generation of a high-
performance variant of UNITREE which has
been developed by the National Storage
Laboratory, to provide enhanced L/O capabil-
ity to balance the increa.se in in.stalled com-
puting capacity. CTC will continue to work
closeFy with IBM in the testing and deploy-
ment of the next generation of mass storage
systems, the High Performance Storage
System (HPSSl. This will include the ca-
pability of utilizing parallel VO to speed data
transfer to the SP2.
Development of high-density magnetic
media
The Center for Magnetic Recording Research
(CMRR), located on the campus of UCSD.
has funding from NSF and 21 corporations
having enterprises connected with the storage
and retrieval of magnetically written infor-
mation. The chief technical problem CMRR
has been addressing has been magnetic noise
in the metallic thin-film media used to coat
high-density disks. Neal Bertram (Depi. of
Electrical and Computer Engineering,
UCSD) is a researcher at CMRR who tackles
the problem of magnetic noise computa-
tionally. Magnetic thin films are poly-
cry.stalline. rather than continuous or amor-
phous, and the grainy, particulate nature of
the medium is a fundamental source of noise.
Bertram's calculations have explored the ef-
fects of two primary types of interaction be-
tween grains that cause noise: magnetostatic
coupling and exchange coupling. The calcu-
lations resulted in recommendations for al-
loys and fabrication processes that would re-
duce noise from both sources. The research
enabled engineers at IBM's Almaden Re-
search Center to design a disk coating that
packs a gigabit (one billion bits) of in-
formation onto a square inch, which is 15-30
times current storage densities. Bertram has
now turned his attention to calculations ot
other effects, including giant magnetoresis-
tance, that can be important in designing
high-density disk media and disk recording
and playback heads. His codes model the
process of recording bits in intricate detail;
the process of laying down a single bit of in-
formation takes several minutes to calculate
on SDSC's Cray Research C90.
Networking
One of the great succes.ses of the NSF Meta-
Center has been in providing the "high-end
pull" that has led to the creation and expo-
nential evolution of the NSFnet. As a result,
the NSFnet backbone of 1995 has 3000
times the bandwidth of the backbone of
1986. The Centers have also prototyped the
high performance local area networks that are
55
High Performance Computing Infrastructure and Accomplishments
needed to feed into the national backbone as
well as the next generation of gigabit back-
bones. Security over networks is essential not
only for industrial usage, but more and more
for widespread citizen usage. Again, the
MetaCenter has created innovations such as
dealing with mdustrial firewalls. The exis-
tence of the MetaCenter network testbeds al-
lows for new kinds of science to be attacked,
perhaps best illustrated by the rise of telemi-
croscop), m which leading edge projects are
being carried out at each of the NSF Super-
computer Centers. With the national priority
on the .Ml, the Centers are moving rapidly to
expand their networking research to commu-
nity based Nil lestbeds. including local
healthcare, education, government, and small
business partners.
Evolution of NSFNET
The 56kbps connection between the NSF
Centers, established in 1986, was the begin-
ning of the NSFnet. Based on the successes of
ARPAnet and the TCP/IP protocol within the
computer science and Dept. of Defense com-
munities, the NSFnet rapidly grew to provide
remote access to the NSF Supercomputer
Centers by the creation of regional and cam-
pus connections to the backbone. Although
started by the pull from the high end, the
NSFnet soon began to provide ubiquitous
connectivity to the academic research commu-
nity for electronic mail, file transport, and
remote login, as well as supercomputer con-
nectivity. These daily uses soon became in-
di.spensable to the research community and
the sustained exponential growth of the In-
temet took off. The MetaCenter" s industrial
partner network was among the first in the
coiporate world to use NSFnet/lnternet tech-
nology to connect corporations to the Internet
for the purposes of computational science.
This was an important precursor for today's
rapid commercialization of the Internet.
By early 1995, the NSFnet will return to a
high speed backbone connecting the Meta-
Center and some of the newly selected Meta-
Center Regional Alliance members. However,
the bandwidth of the backbone will be 3000
times higher than that of the original backbone
(56 kbps). While .some tend to think of the
MetaCenter as focusing on high performance
computing only, it is useful to remember that
computing power of the fastest supercom-
puter /?n«c'.v.sr.<;- in the program has grown by
little more than 100 times during the same
period. Indeed, it is likely that the 155 mbps
vBNS will be upgraded to 622 mbps nirliin
two years. Even by the time the Centers re-
ceive the first teraflop machines in 1997-98,
realizing a factor of 1000 increase in speed
over 1985. the backbone will have grown by a
factor of at least 25,000 fold in bandwidth.
As part of an SDSC/UCSD collaboration,
Kimberly Claffy recently completed a Ph.D.
dissertation that outlined a methodology for
profiling Internet traffic flows at a variety of
granularities. The methodologies and models
developed as part of this traffic characteriza-
tion effort should prove very useful as the
Internet evolves to an even larger system in
which the traffic composition needs to be un-
derstood, particularly for planning future
technology and capacity.
High Performance LANs
The center's program has also pioneered sev-
eral transitions in local area and metropolitan
area networks both on site and on university
campuses, acting as a prototyping facility for
other campuses who needed to know how to
develop long range networking plans for their
campuses. In 19S8, NCSA in.stalled the first
Ultranet Gigabit LAN networks with multiple
supercomputers and demonstrated 480 Mbil/s
between the CRAY-2 and Cray Y-MP su-
percomputers. In 1989, NCSA replaced tradi-
tional HYPERchannel backbone networks
with the then-emerging 100 Mbit/s FDDI
standard. In 1991. PSC began its move to a
HIPPI-based interconnect between its major
systems.
In 1993, NCSA, several industrial partners,
and the UIUC Computer Science Department
established a local area ATM network testbed
to help corporations gain hands-on experience
with ATM switches and interfaces. Insight
from this ATM testbed has already been used
56
High Performance Computing Infrastructure and Accomplishments
to develop long-range corporate network
strategies for J. P. Morgan, Phillips Petroleum.
FMC Corporation, and United Technologies.
CTC was the first site to integrate ATM into a
parallel supercomputer environment on its
IBM SPl in April 1994 and its IBM SP2 in
July. ATM will be u.sed for AFS-based file
service and other high speed transport needs,
including distributed applications and image
transport.
Gigabit Testbeds
Since 1987. NCSA and the University of Illi-
nois Computer Science Department have
worked with AT&T on the XUNET research
network testbed with capacity that is one step
beyond what is available on the Internet.
While the NSFnet has moved from 56 Kbs
through 1.5 Mbps to 45 Mbs, XUNET has
moved from 1.5 Mbs through 45 Mbs to 622
Mbs. In July 1993, XUNET was upgraded to
622 Mbs. the first network testbed to intercon-
nect ATM switches using 622 Mbs transmis-
sion technology over long (>50 miles) dis-
tance using pure optical fibers with in-line
optical amplifiers.
PSC has worked with CMU's Computer Sci-
ence Department in the Nectar Metropolitan
area gigabit testbed to develop new network-
ing technology for very high-speed, low-
latency multi-machine interconnects and to
develop the applications base which can bene-
fit from such technology. This work is fully
collaborative with PSC's ground-breaking
systems and applications level work in het-
erogeneous systems. As a result of testbed
work, numerous applications can now run
routinely between advanced machines at
PSC's main hardware facility and tho.se on the
CMU campus, 15 miles distant, at speeds of
up to 1 Gbit/s.
In partnership with NYNEX, Syracu.se and
Rome Laboratory, now extended to Colum-
bia, SUNY Stonybrook and Polytechnic In-
stitute. CTC participated in building a produc-
tion-level ATM network focused on demon-
strating research and commercial applications.
This network was demonstrated to the Gover-
nor of New York in January 1994. NYNET
is also a Nil te.stbed. involving outreach,
medical applications, video on demand, as
described in a later section.
Various applications are being tested in the
CASA te.stbed. in which SDSC is a major
partner. Besides the chemical reaction dy-
namics, led by Aron Kuppermann of Caltech
and mentioned above in the section on super-
linear speedup, there is a coupled atmo-
sphere/ocean model developed by the group
led by Roberto Mechoso at UCLA. Another
is Calcrust, a project directed by JPL. which
has u.sed distributed heterogeneous comput-
ing over the CASA links to combine satellite
imaging, .seismic data, and surface topogra-
phy in visualizing the foci of aftershocks of
the 1992 Landers earthquake in Southern
California.
Secure Networks
In 1987 NCSA installed a 1.5 Mbit/s DS-I
connection to Eastman Kodak in Rochester,
New York, followed by another DS-1 con-
nection to Amoco laboratories in Chicago and
Tulsa. By 1990 NCSA had connected over a
dozen indu.strial laboratories to the Internet us-
ing a combination of innovative security pre-
cautions. The.se included various forms of
"firewalls" which have now become com-
monplace on the Internet.
New Science Enabled by Networks—
Telemicroscopy
The San Diego Microscopy and Imaging Re-
source (SDMIR), led by UCSD neurocy-
tologist Mark Ellisman. is an NIH-funded
Research Resource centered on a new, fully
computerized Intermediate Voltage Electron
Micro.scope (IVEM). The IVEM is used to
look at comparatively thick tissue sections (2-
10 microns) and it has been employed in
studies of cortical neurons with and without
symptoms of Alzheimer's, in .studies of an-
other type of brain cell, called a Purkinje neu-
ron, and studies of cell membranes. A long-
term collaboration between SDMIR and
SDSC has made the microscope usable inter-
10-
57
High Performance Computing Infrastructure and Accomplishments
actively, over the Internet, coupled to the
SDSC Cray Research C-90.
Computational analysis and simulation is al-
lowing biomedical researchers to study and
predict the activity of potential new drugs at
the molecular level. CTC is working jointly
v\ ith Steven Ealick. et al.. director of Mac-
CHESS. a group using the Cornell High En-
ergy Synchrotron Source for Macromolecular
Modeling. Using existing high-speed connec-
tions between CTC and MacCHESS. the
pioieci IS building the capability, for the first
time, for pharmaceutical companies and aca-
demic researchers to interact dynamically with
x-ray crystallographic analyses at the syn-
chrotron, rather than discovering long after the
beam run that the sample was defective or the
beam positioning non-optimal. While initially
the researchers are using processors on the
CTC SP systems, ultimat'ely a small IBM SP
may be installed at the synchrotron site used
for dynamic analysis, with longer-scale simu-
lation needs being met using the far larger
SP2 at the CTC.
PSC is working with the Center for Light
Microscopy and Biotechnology, an NSF Sci-
ence and Technology Center at Carnegie
Mellon University, to develop an Automated
Interactive Microscope. This microscope will
couple leading edge-microscopy and high per-
formance computing through high speed net-
works allowing the real-time tagging of
chemical reactants in the cell. It will open new
research horizons in biology by giving re-
searchers the ability to control the release of
chemically active agents at critical moments in
cell life, and to monitor the celFs subsequent
development.
The personal computer controlling a scanning
tunneling micro.scope (STM) in the Beckman
Institute at UIUC used software integrated
over a LAN with the NCSA Convex C3880.
TMC CM-5. and SGI graphics workstation to
enable realtime imaging and nanolithography
of silicon surfaces in order to create novel
quantum electronic devices. Working with the
laboratory's director, Joseph Lyding (Dept. of
Electrical and Computing Engineering, UIUC
and Beckman Institute) and iiuenioi of a
widely used STM, NCSA staff members
Rachael Brady and Clint Potter extended this
lelemicroscope to the Internet and demon-
strated the feasibility of using advanced im-
aging instrumentation linked with advanced
computing capabilities from anywhere in the
world. The project was featured in the special
issue of Research and Development Mai^a-
:ine (Oct 25. 1993) on Winning in the 21st
century.
Nil Testbeds
As a partner in Common Knowledge; Pitts-
burgh, an innovative project introducing net-
working and computing into the entire Pitts-
burgh School District. PSC is working with
numerous partners, including Digital Equip-
ment, Apple Computer and both telephone
(Bell of Pennsylvania) and cable TV (TCI)
companies, to create a prototypical, cost-
effective approach to widespread use of ad-
vanced technology in public education.
In collaboration with researchers and physi-
cians at the University of Pittsburgh Medical
Center, the PSC is developing an Nil-based
digital library of pathology images, and the
applications and software technology which
will enhance the practice, teaching and cost-
effective delivery of pathology.
NCSA in collaboration with UIUC and the
Champaign County Chamber of Commerce
have been building CCnet. an Nil testbed
during the last 18 months. Over 200 people
from over 70 community organizations have
been involved since April, 1993 in defining
six major applications experiments in small
business, health care, education, government
and community services, agribusiness, and
geographic information systems (GIS). The
first of 20 multimegabil/s links into the com-
munity was established with the Urbana Free
Library in August 1994. All the high .schools
and a number of small businesses are hooking
on in September. NCSA is establishing a
large GIS server which will be available over
CCnet to community projects. Partners in
CCnet include Time-Warner cable.
11-
58
High Performance Computing Infrastructure and Accomplishments
Ameritech. DEC, Motorola, and potentially
MCI and AT&T.
NYNET, one of the gigabit testbeds, is also
designated as an Nil testbed providing out-
reach, medical appHcations, video on demand
to CTC and New York academic and indus-
trial partners.
InterNIC is the latest in an evolutionary line of
support from the NSF for the use of the In-
ternet by the science, research, and education
communities. The InterNIC provides three
types of services: Information Services
(provided through General Atomics and the
SDSC), Directory and Database Services, and
Registration Services. Information Services
provides procedures for connecting to the
Internet, pointers to resources and tools avail-
able over the network, training seminars for
new and experienced users and up-to-date
reports on new resources and activities on the
Internet. Several innovative approaches to
distributed .services have been implemented,
including the InfoGuide, an on-line Intemet
information service. The Scout Report is a
weekly summary of Internet highlights which
combines in one place the highlights of new
resource announcements and other news that
occurred on the Internet during the previous
week. The InterNIC Reference Desk acts as
the "NIC of first and last resort." The desk
supports a variety of users answering
"starter" questions from novice users who are
unfamiliar with the Internet as well as spe-
cialized questions from intermediate and ad-
vanced users.
Visualization and Virtual Reality
The NSF center's were instrumental in
bringing the notion and tools of scientific vi-
.sualization to the research community in the
1980s. By combining advanced visualization
resources with simulation data.sets created by
remote users on the centers program's high
performance computers, new visualization
paradigms for interpreting numerical data
were developed. This led scientists to consider
visualization as an intimate part of their com-
putational toolkit. In addition, the centers
worked closely with the pre-existing computer
graphics community to get them creating new
tools for scientists as well as for entertain-
ment. Already by 1 987. the staffs of the cen-
ters, working with national users, were creat-
ing scientific visualizations so compelling that
they became regularly chosen to be part of the
SIGGRAPH Film and Video Show, the
"academy awards" of the visualization indus-
try. Today the centers visualization staff and
their allied visualization Centers are at the
forefront of research into how to turn virtual
reality technologies into useful tools for scien-
tific and engineering research.
Development of Scientific Visual-
ization
From its inception, NCSA has worked with
computer artists like Donna Cox (UIUC Dept.
of Art) and Dan Sandin (UIC School of Art
and Design) and computer scientists like Tom
DeFanti (Dept. of Electrical Engineering and
Computer Science, UIC) to create cross disci-
plinary teams with end users in order to create
new levels of scientific visualizations. NCSA
also hired a number of staff from leading
companies in the California entertainment
industry to bring the software tools of special
effects in movies or TV commercials to the
use of the scientific and engineering commu-
nities. Initially, the NCSA visualization envi-
ronment was built on Alliant shared-memory
multiprocessors using the Wavefronl visu-
alization software. As specialized graphics
hardware became available on Silicon Graph-
ics systems, the NCSA visualization envi-
ronment migrated from the Alliant to Silicon
Graphics systems. The scientific visualiza-
tions created by NCSA staff have not only
broken new ground for scientists viewing
their data, they have also won awards world-
wide for aesthetic quality. NCSA's Renais-
sance Experimental Laboratory (REL), cre-
ated by Donna Cox with a major donation
from Jim Clark (founder of Silicon Graphics)
was the first advanced visualization training
facility in the centers and continues to support
university courses in geology, mathematics,
graphics design, computer .science, and other
disciplines.
12-
59
High Performance Computing Infrastructure and Accomplishments
The SDSC has developed a variety of soft-
ware tools that can be used to access and con-
nect existing visualization resources automati-
cally. The goal has been to provide researchers
with training and access to tools that will sup-
port their research needs. The tools include: 1 )
The SDSC Image Library, a collection of im-
age manipulation and conversion utility rou-
tines that can be embedded in existing soft-
ware. An interesting example of the use of
this library are the image conversion modules
developed by the International AVS Center,
which quickly became the top 2 user modules
in their distribution; 2) the SDSC Image
Tools, a collection of utilities based on the
Image Library, are software tools for reading,
writing, and manipulating raster images. This
toolset also allows researchers to convert file
formats among over thirty widely used
graphics formats (e.g., from HDF to
PICT). Now in its .second release, it runs on
Cray Research. DEC. HP. IBM. SGI, and
Sun Micro.systems platforms. Over 4,000
sites worldwide have uploaded the Image
Tools from SDSC's anonymous FTP area; 3)
vpr, a client-server visualization hardcopy
utility, vpr takes advantage of the Internet by
allowing remote users to send images to local
hardcopy devices. Popular hardcopy devices
that are connected to vpr include a variety of
film recording devices, a color paper plotter,
and video recording; and 4) the SDSC Color
Tutorial, a SuperCard-based hypertext explo-
ration in color theory for computer graphics.
Examples show the different points that are
being made, while hyperlinks allow the u.ser
to jump to the most interesting references.
CTC Visualization staff developed Visual
Programming Language for Animation
(VPLA). a program that easily integrates
sound and image .sequences into scientific
animations. VPLA can use rendered images
from several standard visualization packages
including DataExplorer. As the national re-
pository for Data Explorer software and lead
training site, the CTC works with faculty,
industrial users and students across the
country in developing state-of-the art anima-
tions and images. In addition, using DX, the
CTC has spearheaded using visual program-
ming languages not only for visualizing, but
for managing distributed applications running
across the centers program. For example, the
CTC built a Data Explorer module allowing
its researchers to access the CMS at NCSA.
The CTC was instrumental in IBM's agree-
ing to support DX across all major vendor
platforms, including SGI.
PSC has concentrated its visualization efforts
on the development of tools for remote users.
Its GPLOT software is in use at over 300
sites. Its automated animation facility has en-
abled researchers to produce hundreds of
videotapes without physically visiting the any
specific center. It is now turning its efforts to
develop such tools embodying virtual reality.
Virtual Reality Impacts Industrial De-
sign
In 1992 NCSA began a transition from the
now-traditional workstation visualization ac-
tivities to virtual environments, with leader-
ship provided by Caterpillar Inc., an NCSA
Industrial Partner. Traditionally, translating
electronic CAD blueprints into full scale
wooden models of new heavy earth moving
equipment in order to evaluate design
changes required 6-9 months. Working with
NCSA staff. Caterpillar built up a VR labo-
ratory in the UIUC Beckman Institute and
networked the SGI graphics workstations
which create the VR images to their Peoria
headquarters facilities. Using a variety of VR
viewing technologies, a number of design
options already have been tested for new
models of Caterpillar wheel loaders and
backhoe loaders that will be introduced by
1996. Design changes can now be made in
less than one month. Caterpillar design engi-
neers Dave Stevenson and John Bettner re-
ceived the 1993 NCSA Industrial Grand
Challenge Award for their innovative work.
Media coverage of this award reached over
200 media outlets.
-13-
60
High Performance Computing Infrastructure and Accomplishments
Development of Immersive Science
Projects
The transition from stand-along workstation
visualization to Nil distributed visualization
was emphasized at SIGGRAPH"92 in Chi-
cago when NCSA collaborated with the UI-
Chicago's Electronic Visualization Labora-
tory and dozens of science teams to demon-
strate wide area interactive visualization at
Showcase. A new level of realism in virtual
reality was debuted there as well with the
public showmg of EVL's Cave Automated
Virtual Environment (CAVE), which pro-
vided complete immersion in complex 3-D
data sets at workstation levels of resolution.
In 1993-1994, EVL, NCSA, and Argonne
organized a national call for proposals which
resulted in over 60 EVL/NCSA computer
science and visualization staff and graduate
students helping researchers from over 30
institutions in porting their applications into
the CAVE environment. For the first time,
this included realtime coupling to parallel su-
percomputers so that dynamic 3-D evolu-
tions could be viewed immersively and
steered interactively. At SIGGRAPH 94,
8,000 attendees were able to directly experi-
ence these science projects. CTC developed a
specific Cave Visualization on Macromolecu-
lar Modeling: the Structure of Acetylcholine
Esterase; this visualization was integral to the
researcher's understanding of the molecule's
activity. This application runs not only on the
CAVE'S SGI workstations, but on CTC's
IBM SP2 system as well.
Virtual Reality over ATM networks
In a project IN 1994 with Rome Laboratory,
demonstrated virtual reality techniques over an
ATM network between CTC SGI computers
and Rome Laboratory. Researching ATM
technologies for real time applications and
demonstrating software tools for application
steering important to molecular modeling,
telemedicine and command and control.
The Sequoia 2000 Visualization Group at
SDSC developed a prototype data visualiza-
tion system "Tecate" using virtual reality
technology to address many of the issues in-
volved in exploring the informational content
of networked data .servers. Tecate enables the
browsing for data that resides in repositories
managed by a database management system
via user-interaction with graphical renditions
of objects that represent data features.
Alpha Shapes, Biomolecules, and
Cosmology
Alpha shapes, a form of geometric modeling
developed by the 1993 Waterman Award
winner Herbert Edelsbruner (Dept. of Com-
puter Science, UIUC) and NCSA staff mem-
ber Ping Fu, focuses on the formal definition,
construction, and measurement of shapes for
any given point set in space. The discrete na-
ture of the alpha shape complex has computa-
tional advantages over any other known
method which can be exploited in computing
surface area and volume of a space filling dia-
gram and in localizing and measuring voids.
The latter is useful in studying water mole-
cules residing inside a protein. NCSA users
have discovered other related applications of
alpha shapes by applying them to such diverse
fields as adaptive grid generation, medical
image analysis, visualizing the structure of
earthquake data, and the large-scale structure
of the universe.
Digital Libraries and Infoservers
The National Information Infrastructure re-
quires many software, computer, and com-
munications resources that were not tradi-
tionally thought to be part of high perform-
ance computing. In particular, knowledge
organization, location, and navigating tools
needed to be developed. The NSF Su-
percomputer Center staffs and their as-
sociated universities have proven to be fertile
ground for developing their new tools. Per-
haps the most spectacular success has been
NCSA Mosaic, which in less than 18 months
has become the Internet knowledge browser
of choice by over a million users. The Mo-
saicAVorld Wide Web infrastructure has set
off an exponential growth in the number of
decentralized authoring of information serv-
ers.
-14.
61
High Performance Computing Infrastructure and Accomplishments
Digital Libraries
In 1989, NCSA as part of its XUNET appli-
cation testbed proposed that multimedia
digital libraries would require gigabit net-
works in order to fully support high defini-
tion imagery and the coupling of large data
sets with computing resources and geo-
graphically dispersed researchers. This re-
sulted in research developments like DICE
(Distributed Collaboration Environment).
Parallel efforts in providing researchers with
global infonnation retrieval and display capa-
bilities over existing environments combined
collaboration with Bruce Schatz (then at U.
Arizona) and his Worm Community Sys-
tem, with internet-based designs of compo-
nent client/server architectures, like the World
Wide Web. These approaches influenced the
design of the current grand challenge digital
library prototype for accessing radio astron-
omy images and data sets. NCSA built on
the success of Internet access tools such as
NCSA Telnet, adapted this modified digital
library paradigm to the Internet with NCSA
Mosaic. Today, the grand challenge image
library uses NCSA Mosaic as it's user inter-
face, and Schatz has joined NCSA to head
the recently awarded NSF/ARPA Digital Li-
braries project, which combines a testbed
based on the component architecture with
experiments in object-based designs.
Scalable Information Servers
The enormous success of NCSA Mosaic and
CERN's WorldWideWeb has resulted in
explosive growth in the use of NCSA's
WWW server. By the end of 1993, NCSA's
server load had grown beyond the capabilities
of any single server. This resulted in the de-
sign of an innovative distributed scalable
.server architecture that involved a modifica-
tion of the Internet's Domain Name System
software. By Sept. 1994, the NCSA WWW
server was handling over 2 million connec-
tions per week. NCSA's Hewlett-Packard
workstation cluster based distributed infor-
mation server has now been duplicated at
many WWW and PTP sites on the Internet
and within coiporations. A number of corpo-
rations are presently working with NCSA on
the next generation of this distributed ar-
chitecture.
The Rise of the MosaicAVWW In-
formation Infrastructure
NCSA developed the Mosaic user interface
software which provides point-and-click ac-
cess to the diverse information storage proto-
cols of the Internet, such as World Wide Web
(WWW), Gopher, FTP, and WAIS. NCSA
Mosaic establishes the necessary connections,
file transmissions, decompression, launch of
viewer programs, and screen display of text,
images, animations, or audio, in response to a
single mouse click from the u.ser. NCSA Mo-
saic is available for Mac, Windows, and Unix
computers for free to individual users, for
government and educational use, and for in-
ternal use within companies. Monthly down-
load rates from the NCSA site alone are con-
sistently over 30,000. Although accurate esti-
mates are difficult, it is widely felt that over a
million copies of NCSA Mosaic are in use.
Further, commercial versions of NCSA Mo-
saic are available. The principal licensee. Spy-
glass, Inc., has announced orders for over five
million copies of their enhanced version, with
projections to twenty million copies within a
year. Use of NCSA Mosaic has increased
WWW traffic on the NSF backbone by over
10,000 fold since Jan. 1993. Overall WWW
traffic in August hit 1.3 Terabytes or 8 % of
the total NSFnet backbone traffic, higher than
SMTP. Because of this, NCSA has become
the second biggest Internet site in the world in
terms of traffic from its site. The NCSA Mo-
saicAVWW information infrastructure is al-
lowing for an enormous growth in decentral-
ized authoring of infoservers throughout the
world. In 1994, NCSA was given Infoworld's
Publisher's Industry Achievement Award.
DESKTOP Software
From the beginning, the NSF Supercomputer
Centers provided focal points for pulling to-
gether teams of computer scientists and
software developers. Since the history of the
centers has greatly overlapped with the
worldwide rise of the personal computer and
15-
62
High Performance Computing Infrastructure and Accomplishments
workstation, it is not surprising that the soft-
ware developers focused on creating easy-to-
use software tools for the desktop machines
themselves. These tools have had a major
influence on the usefulness of the supercom-
puter facilities to the remote science and en-
gineering community. The collaboration tools
will have a great impact on tying together the
newly emerging electronic teams of .scientists
made possible by the growth of the Internet.
Connectivity Tools
NCSA Telnet was a ground-breaking desktop
application that provided access to the emerg-
ing NSF Supercomputing Centers in the late
■80s. Developed by the new Workstation
Tools Group (later the SDG) at NCSA, this
brought full TCP connectivity to researchers
using IBM and Macintosh sy.stems, signifi-
cantly broadening the participation base be-
yond Unix users, thereby introducing thou-
sands to both the internet and the NSF Centers
Program. Continuously supported up to the
present time, these tools have also led to a
spin-off company Intercon, headed by one of
NCSA Telnet developers.
Collaboration Tools
NCSA has supported a program of research
and development on collaboration technology
for science and engineering researchers for
over 3 years. NCSA Collage, a tool that runs
across MSWindows, Mac, and XWindows
sy.stems, provides the capability to carry on
remote digital conferencing sessions between
researchers. The first live MetaCenter collabo-
rative session using NCSA Collage was held
in 1992 . Collage combines many of the fea-
tures of NCSA's communications and
graphic data analysis tools. NCSA also con-
tinues to innovate in asynchronous col-
laboration tools such as asynchronous col-
laboration tools, hence the interest in annota-
tion and workgroup support capabilities in
NCSA Mosaic. Current work focuses on
choosing and combining the best of these syn-
chronous and asynchronous capabilities in
usable next-generation global collaboration
tools for the scientific and educational com-
munities.
Cornell University's CUSEEME video tele-
conferencing .software, aimed at providing
video teleconferencing on low-end worksta-
tions, is in use at the NSF Centers and as part
of NYNET, NYSERNET and other organi-
zations for routine use, including medical
projects between the CTC and the Cornell
University Medical College in NYC. This
work, funded by NSF and Cornell itself, is
freely distributed and runs on Mac and PC
platforms using inexpensive video equip-
ment. Several national and international col-
laborations have successfully utilized this
software. The centers have gained experience
with traditional video teleconferencing sys-
tems, through its NSF-funded system. It is
now looking at packet video systems using
the vBNS and other network facilities. These
systems will have the capability of moving
the videoconference from a set of specially
equipped rooms to the desktop. An investi-
gation is now underway to develop the opti-
mal system for CTC's requirements.
Graphics Tools
NCSA Image was the first scientific vi-
sualization tool developed for the desktop
viewing of supercomputing output in the pro-
gram. It provided the research community
Mac and Unix based visualization methods
for analysis of huge data sets, as well as cre-
ating some of the first client/server tools
which integrated remote desktop workstations
and personal computers with the center pro-
grams high performance engines.
The SDSC Image Tools are software tools for
reading, writing, and manipulating raster im-
ages. This toolset also allows researchers to
convert file formats among over thirty widely
u.sed graphics formats (e.g., from HDF to
PICT) and includes extensive C library func-
tionality for creating custom image-
manipulation applications. Now in its second
release, it runs on Cray Research, DEC, HP,
IBM, SGI, and Sun Microsy.stems platforms.
Binaries and sample source code are available
in the public doinain by accessing SDSC's
anonymous ftp area (ftp.sdsc.edu).
16-
63
High Performance Computing Infrastructure and Accomplishments
Scientist's Workbench
The Scientist's Workbench is an X and Motif-
based software package developed at the CTC.
The main functions of the Scientist's Work-
bench are to bring together the tools and soft-
ware required by scientific researchers in a
distributed computing environment, to pro-
vide a graphical interface to access those tools.
and to provide the software necessary to allow
researchers to easily build their own graphical
interfaces. This tool has been used at most of
the CTC's Smart Nodes (affiliates) by users
and as part of teaching environments for high
performance computing, as well as at the
other centers and by companies and national
labs developing "custom" programming in-
terfaces for their communities.
Accomplishments in Education and Outreach
Education
Familiarity with the tools of computation and
visualization is quickly becoming a sine qua
lion for both researchers and the public. The
spread of access to these tools, like access to
the telephone and television before them, is a
democratizing force in itself: the world of the
shut-in is opened up, the disadvantages of
distance are minimized, the exchange of
techniques and knowledge is enhanced. Edu-
cation, training, and outreach are thus fun-
damental to the programs of the MetaCenter.
Each member of the centers program has
developed educational programs targeted to a
variety of constituencies: university research-
ers, graduate students, undergraduates, edu-
cators at all levels, and K-12 students.
Researchers and Students
One- or two-day workshops are offered by
centers program staff to researchers on site
and at associated institutions, covering intro-
ductions to the computational environments,
scientific visualization, and the optimization
and parallelization of scientific code. In ad-
dition, special workshops have been offered
throughout the centers program on the use
and extension of computational and visuali-
zation techniques specific to various disci-
plines (from biochemistry to lattice gauge
theory). On the campuses of centers program
institutions and on other campuses, centers
program scientists and engineers are active
teachers, either through regular academic ap-
pointments or as adjuncts, lecturers, seminar
leaders, or teachers in extension divisions.
Graduate students often receive fellowships
or similar appointments at centers program
institutions, as their contributions may benefit
a large academic research community or the
computational community generally. As an
example, Ph.D. student Kimberly Claffy
(Computer Science, UCSD) recently com-
pleted her dissertation on a flow-based meas-
ure of Internet traffic that she developed as a
Junior Fellow at SDSC. Dr. Claffy's tech-
nique is the first to permit traffic characteri-
zation on the basis of a temporally and spa-
tially flexible unit, and it is thus an enabling
technology for further advanced network re-
search at SDSC and elsewhere.
The centers program has contributed to the
research projects of hundreds of graduate
students through stipends, access to re-
sources, and relations with centers program
researchers. Each Center fosters collaborative
research by multidisciplinary, multi-
institutional teams of computer scientists,
research scientists, and engineers; postdoc-
toral research associates; and graduate stu-
dents from the national and international
community. These teams forge new ap-
proaches to previously insoluble research
problems, develop community codes, and
host workshops and seminars to transfer
technology.
-17-
64
High Performance Computing Infrastructure and Accomplishments
Supercomputer Centers Educational Activity Support Summary
Educational Activities
FY91
FY92
FY93
High School
Institutes
7
4
5
Attendees
128
131
121
Other K- 12 Events
15
8
17
Attendees
715
1 .370
1 .985
Research Institutes
13
1 1
6
Attendees
262
377
390
Training CoursesAVorkshops |
On Site Events
18
102
134
Attendees
1.414
1.773
1.929
Off Site Events
17
23
17
Attendees
295
622
104
Seminars/Colloquia
Events
138
114
132
Attendees
2.251
2.788
3.085
Academic Course Accounts
64
63
79
Monthly Newsletter Circulation
234.986
247.692
165.176
Visitors
13.506
16.380
16.392
For undergraduates, the Research Ex-
periences for Undergraduates programs,
funded by NSF, bring in undergraduates to
work for a summer or a school semester or
quarter on specific projects devised by cen-
ters program researchers and/or faculty advi-
sors. The projects are significant in their
scope of computational science and in many
instances have resulted in presentations at
meetmgs and publications. A special project
is CTC's Supercomputing Programs for Un-
dergraduate Research (SPUR), in which stu-
dents apply to work on one of a selection of
projects developed by Cornell faculty in col-
laboration with CTC. One REU student at
SDSC went on to win the top prize in the
Westinghouse Science Talent Search in 1991.
Another developed a program to teach the
use of the Braillewriter to blind students,
which was presented to the Commission on
Equal Opportunity in Science and En-
gineering at NSF (this student, herself blind,
is now a successful computer scientist in
Silicon Valley). Undergraduate assistantships
and internships are also available in the cen-
ters program. Undergraduate student pro-
grammers have worked on many research
problems including numerical weather pre-
diction, the visualization of numerical
spacetimes, and social network analysis.
They have developed numerous applications
and utilities to improve the computational en-
vironment for MetaCenter researchers. Stu-
dents have also worked on library, visualiza-
tion, and educational projects. The REU pro-
grams have been ongoing in various forms
for more than five years.
Outreach to Educators
One particularly effective approach to edu-
cating the next computational generation is
the training of teachers, and many centers
program efforts have been devoted to teacher
training and curriculum development.
Common Knowledge: Pittsburgh is a na-
tional pilot program developed by PSC, the
University of Pittsburgh, and the Pittsburgh
Public Schools to institutionalize educational
18-
65
High Performance Computing Infrastructure and Accomplishments
technologies within the Pittsburgh PubHc
School District by having PSC implement
the network infrastructure and develop spe-
cific curriculum-based network and computer
applications. PSC's High School Initiative
(1992-1994) involves studentyteacher teams
using PSC facilities to develop computational
tools for inclusion in their schools' science or
mathematics curriculum, with an emphasis
on integrating high-performance computing
into the curriculum and thus bridging the gap
between textbook instruction and real world
applications of science.
SuperQuest is a program involving centers
program sites that brings teams of teachers
and students from selected high schools to
summer institutes to develop computational
and visualization projects that they work on
throughout the following year. In addition to
educational workshop programs associated
with SuperQuest. NCSA has developed five
interactive simulation programs now being
tested in classrooms across the country and
around the world. These include GalaxSee,
an N-body simulator of galaxy formation and
interaction; the Fractal Microscope, which
enables the exploration of self-similar pat-
terns; SimSurface and SimElevator, simu-
lated annealing programs; and LaplaceSeein',
an electrostatic potential solver. Students can
change initial conditions and watch the
simulation evolve as the parameter space is
explored.
SDSC's computational .sciences curriculum
coordinator, Kris Stewart (who is a professor
of mathematics at San Diego State Univer-
sity) has conducted summer workshops,
funded by NSF and Cray Research, with fac-
ulty from primarily undergraduate institu-
tions to develop ways of incorporating high-
perfoiTnance computing into the curriculum.
Stewart uses the workshop materials in her
own SDSU classes, Supercomputing for the
Sciences and an Introduction to Compu-
tational Analysis. SDSC is now halfway
through a three-year, NSF-funded Super-
computer Teacher Enhancement Program
targeted to high-school teachers whose
classes contain underrepresented minorities.
Dr. Bruce Land, of the CTC, has developed
an undergraduate course in scientific vi-
•sualization and computer graphics, using the
data flow block diagram capabilities of
IBM's Data Explorer software. This cur-
riculum, lab exercises and the resulting stu-
dent projects have been shared with the larger
educational community through the CTC's
Education and Training home page. Addi-
tionally, Prof. Steve Vavasis has developed
an interdisciplinary course in .scientific com-
puting using high performance computing
for graduate and undergraduate students.
Over the past year, the CTC had established
the Data Explorer repository, a full .set of tu-
torials for parallel computing on diverse plat-
forms, a complete .set of lecture notes for use
by educators as well as researchers, and a
gateway to materials on the network for sec-
ondary school .science and mathematics edu-
cation.
The educational outreach programs of the
centers program enable students to expe-
rience the advantages of connectivity and
training in all aspects of modern compu-
tational practice. The challenge to effectively
deliver centers program resources to all class-
rooms is being met mainly by the distance-
defeating and multiplicative effects of high-
performance computation itself. The dis-
semination of training and curriculum mate-
rials over the National Information Infra-
structure is a major way in which the suc-
cessful pilot programs can be turned into a
new class of educational resources.
Outreach
Because advances in high-performance com-
puting and communications (HPCC) are
driven by the needs of the practitioners with
the most advanced problems, the centers pro-
gram's .scientific mission includes the con-
struction of an extensive web of relationships
with research and development efforts in
American industry and commerce. Collec-
tively, the centers program's outreach pro-
19-
66
High Performance Computing Infrastructure and Accomplishments
grams represent a long record of sustained
collaboration among scientists, HPCC de-
velopers, and industrial researchers. Another
aspect of outreach is the effort to find and
serve local and regional needs of govern-
ment, schools, and communities. Some as-
pects of these activities are discussed below.
Application of Scientific Computation
and Visualization to Industrial Pro-
duction
Half of the partnerships between the individ-
ual centers and industry are collaborations
with major industrial firms. These include
American Cyanamid, Amoco, Alcoa,
AT&T. Caterpillar, Corning, Dow Chemical,
Eastman Kodak, Eli Lilly, FMC, Gencorp.
General Dynamics. Hughes Aircraft, IBM,
JP Morgan, Martin-Marietta, McDonnell-
Douglas," Merck Research Labs. Motorola.
Parke-Davis. Philips Petroleum. Schlumber-
ger, USX, and Xerox.
In their original form, the partnerships repre-
sented the first introduction of large-scale
computation and visualization into the store
of resources possessed by even the largest ot
these Fortune 500 companies. While the
companies are for the most part fully com-
puterized now, the majority of these partner-
ships continue today because centers pro-
gram expertise has been essential to the in-
troduction of new ways of employing the
resources of supercomputing: the algorithms,
visualization routines, and engineering codes
are being combined in ways that result in
such advances as high-end rapid prototyping
of new products. As a result, for example.
Eli Lilly maintains its partnership although
the company has purchased its own super-
computer — the useful interactions with cen-
ters program scientists, consultants, and
visualizers continue. In many of these ar-
rangements, the industrial partner's re-
searchers are frequent visitors to the NSF
centers, and centers program researchers also
visit the partner's installations.
Thus, while it is extremely important to Al-
coa that it was able to produce an optimum
aluminum can. to Gencorp that it was able to
design a better injection molding process, to
McDonnell-Douglas that it could perform
rapid airfoil analyses, to American Cyanamid
that it could reformulate soil enhancers, the
sum of these long-term relationships is im-
portant in another dimension as well. The-
Center outreach efforts are helping to revital-
ize American industry, making it more com-
petitive in an increasingly competitive world
market.
Impact on Vendors of High Per-
formance Computing Equipment
The centers program has had a major impact
on the vendors of major high performance
computing equipment. All Centers have
taken early prototypes of machines, have
provided national access to the largest scale
version of such machines, and provided
critical feedback to the vendors. Several have
entered into strategic development efforts
with the vendors. For example, PSC is a
formal partner with Cray Research in the de-
velopment of applications for its massively
parallel T3D. Cray has also internalized
some of PSC's file sy.stem developments.
CTC played an integral role in IBM's re-
entry into the High Performance Computing
arena, as the first customer for its IBM
ES/3090 vector supercomputers and as a
partner in the design and development of its
parallel FORTRAN products. CTC, through
Its director Malvin H. Kalos, was a key in-
fluence in IBM's decision in 1991 to build
the IBM SP systems and ensured that IBM
adopted a strategy that was scalable ulti-
mately up to the teraflops and down to the
desktop.
SDSC has established a clo.se collaboration
with the Supercomputer Systems Division
(SSD) of Intel Corporation to develop sys-
tems software to support multi-user systems,
to serve as a test site for new operating sys-
tem releases, and to improve the .stability of
the Paragon system. SDSC staff have de-
veloped MACS, the Multi-user Accounting
and Control Sy.stem, which Intel offers as
part of the Paragon's standard operating sys-
tem software. This system includes a dy-
20
67
High Performance Computing Infrastructure and Accomplishments
namic job mix scheduling algorithm, a port
of the Network Queuing System batch job
submission software, and CPU quota and
accounting systems to control resources used
by separate projects.
SDSC has also collaborated with Cray Re-
search to develop support for multiple-user
systems on Cray systems. SDSC staff have
developed a resource management system
that controls access to various resources on
the system (CPU. memory, and disk) and a
dynamic job mix scheduler (DJMS) to dy-
namically adjust the workload for optimal
performance. SDSC has recently run a T3D
emulator on the Cray C90 and is providing
feedback on its performance. SDSC now
plans to install and evaluate Cray's new
FDDI card, a fiber-optic high-speed network
interface.
Digital Equipment Corporation recently
awarded UCLA and SDSC an external re-
search grant to acquire nine Alpha 3000
model 400 workstations. The cluster, which
has a peak speed of 1 .2 Gflops and is con-
nected at 100 Mbps via a Gigaswitch, will be
used primarily for climate studies led by Dr.
Roberto Mechoso of UCLA. It will also be
available for scientific use and performance
testing by the SDSC user community. SDSC
staff are collaborating with DEC to port the
global climate model to the Alpha cluster
using DEC's High Performance Fortran
compiler.
DEC is a major supporter of Project Sequoia
2000, a collaboration of scientists, computer
and information experts, government agen-
cies, and industrial sponsors to develop an in-
formation-management system for studying
global climate change. The Sequoia vi-
sualization group, centered at SDSC, has
been developing a system that will build on
the strengths of existing hardware and soft-
ware to support next-generation visualiza-
tions. Recently, the group collaborated with
Kubota-Pacific Computer Corporation to
combine Kubota's Denali system with DEC
Alpha machines for advanced 3D color
graphics capabilities. Such collaborations
have benefited DEC. Kubota-Pacific. and the
Sequoia 2000 project.
The National Storage Laboratory (NSL). a
consortium that is developing next-generation
high-speed storage devices, has selected the
UniTree system as the production archival
storage system for all centers program sites.
UniTree, which was originally developed at
the Lawrence Livermore National Labora-
tory, was commercialized by DISCOS, a
spin-off of SDSC and a former division of
SDSC's parent company. General Atomics.
DISCOS was. in turn, sold to OpenVision.
which continues to market the product.
SDSC staff are collaborating with NSL.
IBM. and OpenVision to implement the
NSL's base version of the UniTree archival
storage system on the center's IBM RS/6000
model 980 workstation. They are adding a
new. more robust name server developed by
Lawrence Livermore National Laboratory for
OpenVision, transaction journaling (which
allows reconstructing the database in case of
catastrophic failure), and adding and enhanc-
ing system administrator tools.
The integration of PSC's Multi-resident AFS
into the NSL UniTree environment is
planned to provide user-friendly access for
the centers program members into SDSC'S
NSL archival storage system. The future in-
tegration of HIPPI-attached peripherals, in-
cluding high-speed, high-density tape and
high-speed RAID disk arrays, using third
party data transfer is planned as a way to
substantially increase both archival system
storage capacity and data transfer speed.
A prototype HPSS parallel I/O archival stor-
age system is also planned for evaluation as
the follow-on to the NSL UniTree system.
This system will support striping across
multiple high-speed peripherals to even fur-
ther increase the speed of file transfers.
Stimulation of New, Computationally
Dependent Ventures
About a fourth of the industrial partnerships
are with smaller and newer firms, many of
them leaders in biotechnology. Some are
-21
68
High Performance Computing Infrastructure and Accomplishments
firms designing new pharmaceuticals (e.g.,
Agouron Pharmaceuticals, Genentech), oth-
ers develop and market the software pack-
ages required for these enterprises. Biosym
Technologies, for example, is working with
both CTC and SDSC to develop parallel ver-
sions of its popular Discover and Insight
packages.
Outreach efforts of the centers program have
resulted in actual spinoff ventures as well.
The commercialization of the software devel-
oped at individual centers is being undertaken
by a number of companies. For example,
NCSA Telnet has been commercialized by
Intercon, and Spyglass will relea.se a package
containing upgraded versions of image tools
and Mosaic. Some 20 companies have now
licen.sed NCSA Mosaic. CERFnet, a Califor-
nia wide-area network for Internet access has
pioneered in supplying access to library and
other large databases; and DISCOSAJniTree,
a mass storage system, is in use at more than
twenty major computer sites. A new mo-
lecular modeling system, called Sculpt, de-
veloped at SDSC, is being commercialized
by a new company. Interactive Simulations.
Sculpt enables drag-and-drop molecular
modeling in real time while preserving
minimum-energy constraints; its output was
featured on the cover of Science last May.
Development of Nationally Valuable
Reservoirs of Skill
About a fourth of the partnerships between
the centers and industry are collaborations
with manufacturers of high-performance
computational, networking, telecommunica-
tions, and visualization equipment. Of par-
ticular interest here are the several partner-
ships funded by the NSF and ARPA through
CNRI to construct and test the "gigabit test-
beds," prototypes of the connectivity that will
be required for the future International In-
formation Infrastructure. Both academic and
industrial research groups are developing the
codes to test the connections, even as manu-
facturers develop the connections themselves,
and the experts assembled in the centers pro-
gram supply links in the form of .specifica-
tions and software.
Community Service
Local and regional outreach efforts range
from the tours given at all centers program
installations through the hosting of visits by
national, regional, and local officials and
commissions, to the kinds of full-scale part-
nerships mentioned above. The NCSA rela-
tionship with the Champaign County Cham-
ber of Commerce has resulted in the forma-
tion of a nonprofit public network, CCnet,
which is already benefiting the Chamber it-
self as well as local schools. Plans are in the
works with Time-Warner to start pilot tests
of the use of public-access cable for a data
highway.
SDSC is working with the City of San Diego
on plans to connect all units of city gov-
ernment, including a high-technology re-
source center to be developed with De-
partment of Commerce funding that will
connect local industry (with a lot of defense
reconversion efforts) to business and com-
putational resources, including SDSC itself.
PSC is exploring extension of the technology
it developed for its Common Knowl-
edge;Pittsburgh K-12 project to embrace
major units of city government.
Outreach is also represented by the publica-
tioi programs of the centers program, the
production of scientific videos and/or multi-
media CD-ROMs, and a collaborative pro-
gram for maintaining a lively and informative
presence on World-Wide Web servers,
which will make information on the pro-
grams easily accessible over the NIL
Important Science and Engineering Accomplishments
Summaries of computationally interesting problems in the NSF Centers Program by the
national science and engineering communities:
-22-
69
High Performance Computing Infrastructure and Accomplishments
Quantum Physics and Materials
The great disparity between nuclear, atomic,
or molecular scales and macroscopic material
scales, implies that vast computing resources
are needed to attempt to predict the char-
acteristics of bulk matter from fundamental
laws of physics. Therefore, it is not surpris-
ing that since the beginning of the NSF Cen-
ters program this area of science has brought
us some of the largest users of supercom-
puters. Materials .scientists have often been
among the first group of researchers to try
out new architectures that promise even
higher computational speeds.
Below are outlined some outstanding exam-
ples of studymg properties of bulk matter
from extreme conditions, such as occur in
nuclear collisions, the early universe, or in the
core of Jupiter; new materials such as nan-
otubes and high temperature super-
conductors; and more practical materials used
today such as magnetic material and glass.
Phase Transition in QCD
The MIMD Lattice Calculations Col-
laboration (MILC) is attacking the Grand
Challenge problem of "the origins of mass."
Their objective is to use the theory of the
forces governing what are called the "strong
interactions" of elementary particles (quarks
and gluons) to calculate the observed masses
and interactions of the particles that are made
out of them: the hadrons, which include the
familiar proton and neutron. The theory is
called quantum chromodynamics (QCD),
and its numerical incarnation is called "lattice
gauge theory," because the quarks and glu-
ons are represented on a four-dimensional
space-time lattice. They have published nu-
merous studies of the mass spectrum of the
hadrons; the transition between ordinary
matter and the quark-gluon plasma, which is
important in the study of the conditions of the
early universe; and the decays of hadrons via
weak interactions. A number of investigators,
coordinated by Robert Sugar (Dept. of
Physics, UCSB), are engaged in this project
including: Claude Bernard (Washington
Univ.), Thomas A. DeGrand (Univ. of Colo-
rado), Carleton DeTar (Univ. of Utah), Ste-
ven Gottlieb and Alexander Krasnitz (Indiana
Univ.). Douglas Toussaint (Univ. of Ari-
zona). Julius Kuti (UCSD). The consortium
has used large allocations of time on a wide
range of MetaCenter computational facilities
including: Intel Paragon (SDSC), TMC CM-
5 (NCSA), clustered IBM RS/6000s under
PVM (CTC. NCSA), Crav Research C-90
(SDSC and PSC)
Phase Transitions of Solid Hydrogen
Calculations by Natalie, Martin and Ceperley
(Dept. of Physics UIUC, NCSA), carried out
on the CRAY Y-MP at NCSA have estab-
lished the series of crystalline phase transi-
tions of hydrogen as it is compressed to sev-
eral million atmospheres of pressure, such as
found in the interior of the giant planets.
Since Wigner and Huntington in 1935
pointed out that a transformation from a mo-
lecular to atomic state is inevitable at high
pressure, there have been extensive specula-
tions on when and how this transformation
would take place. The recent development of
the diamond anvil technique have allowed
experiments to be performed at pressure
slightly lower than the atomic transition.
Those experiments confirmed the existence
of an molecular orientation transition which
had been earlier computationally predicted by
Ceperley at 1.5 Mbar. Extensive and highly
accurate quantum Monte Carlo calculations
on a variety of crystal structures now predict
that the metallic transition will take place
from a distorted molecular hexagonal struc-
ture into an atomic diamond lattice. For this
and other pioneering science, David Ceperley
was awarded the fifth Eugene Feenberg
Memorial Silver Medal in 1994. David's
PhD advi.sor and the third Feenberg awardee
is Mai Kalos. Director of the CTC. himself a
major user of several MetaCenter supercom-
puters.
Prediction of new Nanomaterials
Marvin L. Cohen (NAS) and Steven G.
Louie (Dept. of Physics, UC Berkeley) have
23-
70
High Performance Computing Infrastructure and Accomplishments
used MetaCenter computational resources
(SDSC. PSC, NCSA Cray Research Y-MP
and C90) to make numerous advances in
computational materials science. Most re-
cently, they have used both first-pnnciples
and tight-bindmg codes to examine the prop-
erties of carbon nanotubes and nanotubes
composed of boron, carbon, and nitrogen.
Carbon nanotube.s-essentially rolled mi-
crosheets of graphite-are well known, thanks
to the work of Sumio lijima and colleagues at
NEC. They have diameters on the order of 1-
20 nm and are producible in the same carbon
arc chambers used to produce fullerenes (also
called "buckyballs"- assemblages of 60 or
more carbon atoms in cage-like structures).
They have interesting capillarity and elec-
tronic properties. Cohen. Louie and their col-
leagues have predicted the structure and
properties of nanotubes made of boron ni-
tride, which appear to be more stable and
controllable in terms of their electronic prop-
erties. They have also predicted nanotubes of
BC_2N. a boron-carbon-nitrogen compound,
whose electronic properties are even more in-
teresting: they should behave like nanoscale
induction coils. Most exciting, the structures
that were predicted computationally are now
being produced experimentally in the lab of
Alex ZettI at Berkeley, where their electronic
properties can be confirmed.
Theory of High Temperature Su-
perconductors
The Nobel Prize in Physics in 1987 was for
the discovery of a new class of high tem-
perature superconductors. Thousands of re-
search papers have been written about these
unique materials, but the battle is .still raging
over the fundamental mechanism that causes
the ^uperconducting transition at Tc ~ 90K
for the cuprate oxides such as YBA2CU3O7.
David Pines (NAS and first Feenberg
Medalist) and Philippe Monthoux
(Department of Physics, UIUC) used the
NCSA Cray Research Y-MP to carry out a
strong coupling (Eliashberg) calculation of
the normal state properties and Tc for the
model experiment-based magnetic interaction
between quasiparticles. They found that when
the full structure of the quasiparticle interac-
tion is taken into account, a superconducting
transition into a d-wave planar pairing siaie
occurs at Tc ~ 90K for comparatively modest
values of the coupling constant. Although
still an area of active research, this computa-
tion lends credibility to the model that it is the
coupling of planar quasiparticles to the ex-
perimentally measured planar electronic spin
fluctuation excitations which determines the
normal state properties (which they show
acts like a nearly antifcrromagnetic Fermi
liquid) and makes possible high temperature
superconductivity.
Magnetic Materials
James Sethna (Laboratory of Atomic and
Solid State Physics. Cornell Univ.) uses CTC
parallel supercomputers to study the dynam-
ics of disorder-driven first-order phase trans-
formations, including 3-D numerical simula-
tions of hysteresis loops. He is developing
scalable parallel algorithms for systems of
size N = 2000^. Sethna's work enables the
prediction of phase transitions with critical
tluctuations, the simulation of orders-of-
magnitude larger systems to explore critical
phenomena, and detailed computational
studies in materials science as applied to
magnetic storage media, metallurgical phase
transformations, and gases adsorbed on sur-
faces.
Ujiderstanding Glass
For ab initio dynamical calculations to be
useful for real materials in an industrial set-
ting, they must be able to deal with ensem-
bles of thou.sands of atoms for dynamical
effects modeled over microseconds. Signifi-
cant algorithmic developments made jointly
at Corning. Inc. and the CTC, coupled with
the much increased capability of the CTC's
IBM SP2 system, allow this threshold to be
crossed for the first time. Postdoctoral Fel-
low Stefan Goedecker, hired jointly by
Corning, Inc. and CTC, has developed new
extremely fast ways of doing tight-binding
which he can parameterize with the ab initio
24-
71
High Performance Computing Infrastructure and Accomplishments
codes. This is the only approach currently
known that will handle thousands of atoms
for millions of time steps, bringing the re-
searchers close to observing many of the
mysteries involved in glass chemistry which
have been not well understood for over 2000
years.
Biology and Medicine
Living creatures exhibit some of the greatest
complexity found in nature. Therefore, su-
percomputers have made possible unprece-
dented opportunities to explore these com-
plexities ba.sed on the fundamental advances
made in biological research of the last fifty
years. These activities include: inverting the
data from x-ray crystallography experiments
to obtain the molecular structure of macro-
molecules: learning how to use artificial in-
telligence to fold polypeptide chains, deter-
mined from genetic sequencing, into the
three-dimensional proteins; and determining
the function of proteins by studying their dy-
namic properties, as well as how they interact
with each other or with the DNA backbone
from whence they were created.
These insights are beginning to make signifi-
cant impacts on medicine and plant and ani-
mal biology. New fields of computational
science, such as molecular neurosciences, are
being enabled by academic access to Meta-
Center computing and visualization resources
and staff. Corporations are using supercom-
puters and advanced visualization techniques
in collaboration with the NSF MetaCenter to
create new drugs to fight human disea.ses
such as asthma. New insights into economi-
cally valuable bioproducts are being gained,
for instance, by combining molecular and
medical imaging techniques to create "virtual
spiders'" which can be digitally dissected to
understand the production of silk. Finally,
high performance computers are just be-
coming powerful enough that some dedicated
researchers are able to program mathematical
models of realistic organ dynamics, such as
the human heart.
Crystallography
Herbert Hauptman (Medical Foundation of
Buffalo. Inc.) won a Nobel Prize in 1985 for
development of the "direct method" of pro-
tein structure determination from X-ray
crystallographic data. In a collaboration with
Russ Miller (State University of New York
at Buffalo), these researchers have developed
a numerical approach that extends the "direct
method" of determining molecular structure
from X-ray crystallographic data to larger
molecules, beyond its present limit of about
100 atoms. The algorithm they have devel-
oped, called "Shake-and-Bake," runs on a
number of computing platforms (PSC CM-2
and Cray T3D, NCSA and PSC CM-5) and
has proven it.self effective in more than 20
cases at accurately determining the structure
of proteins that have taken as long as 10
years by existing methods, reducing the time
to a matter of hours.
Folding Proteins using Artificial In-
telligence
One of the most pressing problems in mo-
lecular biology is how to determine the fold-
ing and 3-D structure of a protein, given its
sequence. Peter Wolynes (NAS), Zan
Schulten. and coworkers (Dept. of Chemis-
try. UIUC) have developed a novel approach
to this classic problem using elements from
the theory of spin glasses, associative mem-
ory models, and neural networks. Spin glass
theory provides a framework for under-
standing the cooperative nature of the folding
transition and the qualitative nature of the
phase diagram describing the thermo-
dynamics of proteins. Wolynes et al. devel-
oped simulation codes, based on associative
memory Hamiltonians, and characterized
their phase diagrams .semi-quantitatively.
These Hamiltonians are ba.sed on an energy
function which correlates the sequence of the
protein to be folded with those of proteins
whose structure is known. They were intro-
duced several years ago by Wolynes and co-
workers as polymer analogues of the Hop-
field neural nets. Their work, carried out on
NCSA's Cray-2 and CRAY Y-MP, shows
25-
72
High Performance Computing Infrastructure and Accomplishments
that even primitive associative memory
Hamiitonians can recognize protein structures
from sequences that are only moderately re-
lated to those already existing in the database.
These procedures are somewhat similar in
effectiveness to the rule-based homology
modeling.
Protein Kinase solution
The Computational Center for Macro-
molecular Structure (CCMS). founded in
1990, is an NSF-funded joint project of
UCSD. SDSC. and The Scripps Research
Institute, with collaborators from ail over the
country. The center made headlines in 1991
when a group led by Susan Taylor (Dept. of
Chemistry. UCSB), one of the principal in-
vestigators of CCMS published the three-
dimensional structure of the catalytic unit of
cyclic-AMP-dependent protein kinase, or
cAPK. This was the first kinase structure to
be solved. The solution was achieved by a
combination of computational methods, in-
cluding refinement on the SDSC CRAY Y-
MP using the program XPLOR, developed
by Axel Bruenger of Yale University. Most
important to the solution was the ability of
the scientists to study .stereo visualizations of
the structure on a large screen at SDSC, so
scientists from every discipline within the
group could contribute their insight to a col-
lective determination of the structure. Kinases
play important messenger roles in cell me-
tabolism, and hundreds of such compounds
have been identified and sequenced. Becau.se
sequences are homologous in long stretches,
the solution for cAPK is proving extremely
valuable as a template for modeling and de-
riving the structure of other kinases. Taylor
and her group have collaborated with several
other groups since in modeling proposed
solutions for other kinases, including those
known to have carcinogenic properties or to
be involved in other disease processes. In all
of these studies, computation and visualiza-
tion have played an important role. Solutions
for various kinases can lead to the design of
inhibitors to prevent the enzymes from acting
to produce diseases. The work won the Fore-
fronts of Large-Scale Computation Award
presented at Supercomputing '93.
Molecular Neuroscience-Serotonin
A number of cardiovascular and psychiatric
diseases are currently treated with drugs that
act on the neurotransmitter serotonin and its
receptors. The cellular receptor for serotonin
is a gatekeeper molecule that recognizes and
binds the serotonin and then transmits the
signal to the cell by binding to a special class
of transducers: the G-proteins. Using the
CTC's ES/9000, Dr. Harel Weinstein,
chairman of the Department of Physiology
and Biophysics at Mount Sinai Medical Cen-
ter, made a breakthrough in modeling the se-
rotonin receptor. His breakthrough came
from modeling the structural changes that
occur in the serotonin receptor when it binds
to a ligand and to the G-protein. causing it to
carry out its function. This research shows
how G-proteins can be switched on by
structural changes in specific regions of the
receptor molecule and is expected to lead to
the development of more effective drugs,
specific ligands aimed at the regions where
the structural change takes place. Weinstein
believes that his work may be applied more
broadly to other receptor molecules, includ-
ing all neurotransmitters, that communicate
with cells via G-proteins. If Weinstein can
demonstrate a common mechanism of re-
spon.se in these receptors, he will have a new
type of molecular approach to treating a vast
range of diseases. Weinstein has also u.sed
the Cray Research C-90 and Intel Paragon at
SDSC.
Molecular Neuroscience-Acetyl-
cholinesterase
A collaboration between Michael Gilson,
T.P. Straatsma, and Andrew McCammon
(Dept. of Chemistry. University of Houston),
Daniel Ripoll (Research Associate. CTC),
Carlos Faerman (Dept. of Molecular and Cell
Biology, Cornell Univ.). Paul Axelsen (Dept.
of Pharmacology, University of Pennsylva-
nia School of Medicine), and Israel Silman
and Joel Sussman (Weizmann Institute of
Science, Israel) has used molecular dynamics
26-
73
High Performance Computing Infrastructure and Accomplishments
algorithms to investigate the rapid activity of
the enzyme acetylchoHnesterase (AChE). The
enzyme breaks down the neurotransmitter
acetylcholine diffused across nerve cell syn-
aptic gaps. Its three-dimensional crystal
structure was solved by Joel Sussman and
colleagues at the Weizmann Institute in Re-
hovot, Israel, several years ago. That struc-
ture showed the active site to be a long, nar-
row channel — too narrow to deal rapidly with
the job of dissociating acetylcholine into
choline and an acetate ion. Yet it is known
that AChE acts very rapidly, no doubt be-
cau.se of aeons of evolutionary pressure to
optimize the response of the nervous system
in all organisms. Molecular dynamics calcu-
lations performed on the MetaCenter's Cray
Research C-90 (SDSC and PSC), TMC CM-
5 (NCSA), and Kendall Square KSR-1
(CTC) showed that there was also a "back
door" to the active site, that might open to
facilitate the exit of the acetate ion from the
site. A study of the electromagnetic fields of
acetylcholinestera.se with the back door
closed and then open supplied confirming
evidence. Since inhibitors of AChE are im-
portant medications for myasthenia gravis,
glaucoma, and Alzheimers's disease, this
new insight may lead to more effective
pharmaceutical agents to fight these diseases.
This work was the cover story of the March
4, 1994 issue of Science magazine.
Kinking DNA
John M. Rosenberg (University of Pitts-
burgh) used the PSC Cray Research C-90
vector supercomputer to determine how a
protein identifies and interacts with specific
sites of DNA — a fundamental biological
process called "protein-DNA recognition,"
which is related to many disease processes
and is also a vital tool in the biotechnology
industry. Rosenberg's molecular dynamics
simulations have refined the structure of an
important protein, Eco Rl endonuclease,
u.sed in DNA cloning, and they have resulted
in a clear understanding of a "kink" in the
DNA backbone that results when Eco Rl
endonuclease binds with DNA. Rosenberg
won the 1991 Forefronts of Large-Scale
Computation award for this research, and his
work was cited in the 1993 Compute rwnrld
Smithsonian award for .science given to the
PSC.
Antibody-Antigen Docking
A collaboration among computer scientists
Michael Hoist and Faisal Saied (Dept. of
Computer Science, UIUC) and two biolo-
gists Richard Kozack and Shankar Subra-
maniam (Dept. of Physiology and Biophys-
ics. UIUC and Beckman Institute/NCSA)
has been able to solve for the first time the
complete nonlinear Poisson-Boltzmann
equation, which is the fundamental equation
of macromolecular electrostatics. A method
based on multigrid-inexact Newton algo-
rithms has been developed and large memory
applications run in parallel on the NCSA
Convex C3 show that this has profound con-
sequences for protein structure, enzyme
mechanisms and protein design. Coupling
this new approach with a Brownian dynam-
ics method, the largest simulation ever of an
encounter between two proteins, an antibody
and an antigen, has been carried out using the
NCSA CM-5 and SGI Challenge. This
simulation for the first time is able to give
rate constants for association of proteins that
is comparable to experimental measure-
ments. The results of the electrostatics work
was the cover story of the March 1994 issue
of Proteins: Structure, Function, and Ge-
netics.
Tuning Biomolecules to Fight Asthma
Over the last 20 years, the number of asthma
cases has almost tripled in the U.S. David
Herron, senior research scientist at Eli Lilly
and Company, is searching for new drugs
that will inhibit the action of leukotrienes, in-
flammatory agents released by several types
of cells in the lungs, which cause the lungs to
stiffen and become imtated. Several gi-
gabytes of data from molecular dynamics of
three key leukotrienes, run on NCSA's and
Lilly's Cray-2 supercomputer were analyzed
in a lengthy scientific visualization created
with NCSA staff. Using the animations as a
27-
74
High Performance Computing Infrastructure and Accomplishments
guide. Lilly asthma researchers synthesized
highly active antagonists against the leuko-
trienes. Some have been tested in asthma
sufferers and found to be effective medically.
For this work, Herron was a co-recipient of
the first NCSA Industrial Grand Challenge
Award.
Virtual Spider and Artificial Silk
Biophysicist Lynn Jelinkski, director of
Cornell University's Biotechnology Center
for Advanced Technology (CAT) is com-
bining medical imagmg techniques with the
state-of-the-art computer visualization re-
sources of the CTC to study the molecular
structure of the strongest silk of the golden
orb weaver spider and its transformation
from a viscous fluid into the extremely
strong cry.stalline fiber which has the poten-
tial to replace manmade fibers, such as nylon,
manufactured from petrochemicals. Jelinski
has devised a way to create a 3-D "computer
spider" by compiling stacks of the 2-D MRI
images using the IBM POWER Visualiza-
tion System, one of the high-performance
computing resources of the CTC. Each im-
age contains over 100,000 pixels. Hundreds
of images are combined to construct the 3-D
simulated .spider. Once in hand, this virtual
spider can be dissected by computer to de-
scribe the anatomy of the glandular system
and to provide the physical processing in-
formation Jelinski seeks. This understand-
ing, coupled with molecular-level studies of
the amirio acids that make up the web silk
polymer, may aid in genetically engineering
plants to produce fibers as strong as those
produced by the spider. Jelinski's work
blazes a path toward the development of a
new class textiles with superior strength at
the same time that it promises fundamental
insight into the mystery of the spider's web.
Heart Modeling
Charles S. Peskin and David M. McQueen
(Courant Institute, New York University)
have developed over the last decade a fully
functionmg three-dimensional model of the
heart, its valves and nearby major vessels.
This computational model will make it pos-
sible to study questions about normal and
diseased heart function that are difficult or
impossible to address through animal and
clinical studies. The complexity of the heart
model is so great that a single heartbeat re-
quires a 150 hour run on the PSC Cray Re-
search C90 and could not have been run
without the very large memory of the C90.
This research won the 1994 Computerworld
Smithsonian award for Breakthrough Com-
putational Science. Peskin was awarded a
Mac Arthur Prize Fellowship in 1983.
Engineering
Man-made devices have become so complex
that researchers in both academia and indus-
try have turned to supercomputers in order to
be able to analyze and modify accurate mod-
els in ways which complement the traditional
experimental methods. Such easily accessible
high performance computers enable aca-
demic engineers to study the brittleness of
new types of steel, to improve bone trans-
plants, or to reduce drag of flows over sur-
faces using riblets. Industrial partners of the
individual supercomputer centers within the
MetaCenter are using computational facilities
more advanced than they have access to in-
temally to improve indu.strial proces.ses such
as in metal forming. Better consumer prod-
ucts such as leakproof diapers, or more effi-
cient airplanes are being designed. Even
State agencies are able to use the MetaCenter
facilities to improve traffic safety or find bet-
ter ways to use recycled materials. Some 70
corporations have taken advantage of the
MetaCenter industrial programs to improve
their competitiveness.
Ultra-high-strength Steels
Gregory B. Olson and Arthur J. Freeman
(Northwestern University) use computer
modeling to design ultra-high strength steel
for weight-critical applications such as naval
aircraft landing gear, high-performance race
cars, and bearings in the main engine turbo
pumps of the space shuttle. In recent super-
computer modeling on the PSC Cray C90,
applying quantum mechanical calculations to
the structure of steel, they have explained the
28-
75
High Performance Computing Infrastructure and Accomplishments
molecular mechanisms that give rise to im-
purity-mduced embrittlement in steel, work
which IS expected to lead to steel that will not
shatter in frigid conditions. This work was
reported in the July 15, 1994 Science. Free-
man and his group have been users of NSF
supercomputers since the founding of the
program on a wide range of problems in
materials sciences. In recognition of his pi-
oneering work in computational materials
research. Freeman received the first Materials
Research Society Medal and the first Award
in Magnetism from the lUPAP.
Continuous Casting of Steel
Achilles Vassilicos (U.S. Steel Technical
Center) models the flow of molten steel in a
continuous-casting "tundish" on the PSC
Cray Research C90, resulting in improved
process control over the quality of steel. By
more accurately predicting the precise metal-
lurgical composition of the continuous-
casting output "slabs," U.S. Steel reduces
waste steel and the amount of inventory it
must keep on-hand, resulting in substantial
cost savings.
Beverage can design
Three-dimensional stress modeling of alumi-
num beverage cans on the PSC C90 by Rob-
ert E. Dick and Andrew B. Trageser
(ALCOA Laboratories) has greatly reduced
the expense of developing a new can design
that will meet customer specifications for
strength and appearance. By relying less on
costly, time-consuming prototype testing,
ALCOA engineers estimate a co.st savings of
SI 00,000 or more per can design. This re-
search has been described in articles in D/,?-
crnrr (March 1991), Business Week (Oct. 8,
1990) and in Science (June 23. 1989).
Designing a Leakproof Diaper
Designing effective and comfortable dispos-
able infant diapers requires greater un-
derstanding of the function of the diaper
components, such as the cellulosic fluff and
the superabsorbent polymer particles-and the
effect of variations of parameters related to
these components. Dow Chemical Company.
has done extensive experimental testing,
evaluation, and computer modeling that has
contributed to a faster developmental process
and shortening the time for new product in-
troduction. The innovative Dow design was
evaluated using a computer model run on
NCSA's CRAY-2 system. Three separate
time-dependent processes are modeled. The
first, a fast spreading process, involves the
insult on the pad by a quantity of liquid
(urine) which is transported through the pad
by wicking. During this process the fluff pad
collapses as it becomes wet. The second,
called the imbibition process, models the
swelling of the superabsorbent polymer par-
ticles and their uptake of liquid from the cel-
lulose fluff. During this process, the fluff ex-
pands again. The slowest-and final-process
tracks the redistribution of liquid in the fluff
pad as the saturation of the fluff adjacent to
the superabsorbents changes. The overall
model was compared to a magnetic reso-
nance imaging experiment, which provides a
three-dimensional image of the water distri-
bution in a diaper, and was shown to give
comparable results to the final steady-state
values. Optimization of these processes is
leading to an improved, quality diaper.
Bone Transplant Bioengineering
Dean Taylor and Donald Bartel (Dept. of
Mechanical and Aerospace Engineering,
Cornell University) have been able to investi-
gate bone-implant systems across a wide
range of design parameters by using high
performance parallel computing (including
the IBM SPI) and visualization resources at
CTC. Their long-term research has produced
models of the stresses placed on normal
bones and the artificial components of a hip
joint — these models have led to custom-
designed prostheses and reduced prosthesis
replacement surgery.
Improving Performance with Riblets
George Em Karniadakis' and his group at
Brown University are using the SDSC Intel
Uncidentally, Prof. Karniadakis is the current
chair of the joint NCSA /PSC National Peer
-29-
76
High Performance Computing Infrastructure and Accomplishments
Paragon to explain drag reduction in turbulent
flow by means of "riblets," parallel grooves
on the surface of an object moving through
the flow. Such grooves are used on aircraft,
in pipelines, and on racing cars and sleds, to
improve performance. The group also mod-
els flows in micro-electromechanical systems
used in surgery and other complex applica-
tions, where the molecules of the fluid are not
much smaller than the channels in which they
flow. Both projects are resulting in new ways
to optimize the performance of broad classes
of machinery.
Designing Better Aircraft
Dino Roman, John Vassberg, and Tom
Gruschus of McDonnell Douglas are using
SDSC's Cray C-90 for a computational fluid
dynamics simulation of an aircraft in flight
and to visualize the results using FAST
(Flow Analysis Software Toolkit) on a Sili-
con Graphics IRIS workstation. Tracer parti-
cles are released into the flow field in front of
the aircraft and allowed to follow the stream-
lines around the vehicle. A cutting plane
through the 3D volume of data is placed to
intersect the aircraft fuselage and wings. The
aerospace industry relies on computational
fluid dynamics-the simulation of air or fluid
flow-to design, develop, and test new aero-
nautical configurations. This process enables
companies to test new models quickly to .se-
lect candidates for wind-tunnel testing. Such
methods used in aircraft design and manufac-
ture can give American companies a techno-
logical edge in the global market.
Crash Testing Street Signs
California State Department of Trans-
portation (Caltrans) engineer Payam Row-
hani and SDSC engineer Chuck Charman
generated a computerized model of a crash-
test vehicle, called a "bogie," on the SDSC
Cray Research Y-MP for simulations of test
crashes with sign and lighting supports. They
fine-tuned the bogie front-end design with the
Review Board, another example of the tight
luiks between members of the MetaCenter and
the scientific community.
computer model to minimize the number of
validation tests necessary at the Federal Out-
door Impact Laboratory, operated by the
Federal Highway Administration. In a sec-
ond application, Charman is working with
Caltrans engineers William Nokes and Dario
Perdomo, who are designing pavement using
structural modeling techniques. They are re-
searching the use of new and recycled mate-
rials and these materials' response to differ-
ent axle and tire configurations. They are us-
ing the supercomputer and the visualization
facilities to explore new truck suspension
systems, and new tires and heavier loadings,
innovative pavement structures with recycled
materials and rubber and polymer-modified
binders. The results of this research are ex-
pected to lead to significant cost savings in
the design, construction, maintenance, and
rehabilitation of pavement structures.
Earth Sciences and the En-
vironment
From understanding the motions of the
Earth's convective mantle to daily compu-
tation of air pollution levels in southern Cali-
fornia, the resources of the NSF MetaCenter
are being used to compute and visualize the
complexity of the natural world around us.
The US Army is working with academics to
determine how they can practice tank maneu-
vers without endangering the breeding habits
of the sage grou.se. Pollution, whether under-
ground or in the air, is a difficult coupling of
chemical reactions and flow dynamics which
must be understood in detail if corrective
measures are to be efficacious. High per-
formance computers also act as time ma-
chines, allowing for faster-than-realtime
computation of severe storms. Finally, to
improve global weather or climate forecasts,
supercomputers allow researchers to zero in
on the critical coupling physics of such proc-
esses as mixing at the air/ocean interface.
Detoxiflcation of Ground Water
Christine Shoemaker (Dept. of Civil and En-
vironmental Engineering, Cornell University)
has been a pioneer user of the scalable IBM
SP machines at the CTC for the development
30-
77
High Performance Computing Infrastructure and Accomplishments
of numerically efficient supercomputer algo-
rithms for optimal control of dynamical sys-
tems and the application of these techniques
to detoxification of contaminated groundwa-
ter. Her efforts are leading to methods of
determining the most cost-effective way to
clean up the groundwater by computing time-
varying rates of pumping. Shoemaker's
group has also developed an animation, using
the visualization resources (both hardware
and personnel) of the CTC, that represents
the effects of different policies and natural
chemical and biological processes on
groundwater cleanup. Such animations are
crucial for conveying the results of basic re-
search to the mixed audience involved in set-
ting environmental policy.
Sage Grouse-Endangered Species and
the US Army
Working with the U. S. Army, Bruce Han-
non (Dept. of Geography, UIUC) and Jim
Westervelt, of the U. S. Army Construction
Engineering Research Laboratory, have de-
veloped an ecological model for the sage
grouse, an endangered species, population on
an army training base in Washington State.
Using the Macintosh software STELLA, the
CM-5 and the GRASS geographical infor-
mation systems, these researchers were able
to optimize the scheduling of training exer-
cises to maximize grouse reproduction and
longevity. For each geographic cell, a very
large STELLA model was constructed, rep-
resenting the grouse at various life stages,
different kinds of plants and predators, soil
type and moisture, weather-all the many
physical variables-and also introduce the nec-
essary human activities like tank and troop
maneuvers on the army range. Each cell, of
which there are over a hundred thousand in
the GIS covering the army base, could con-
tain 1 00 to 200 variables. This work demon-
strated the efficacy of coupling GIS datasets
to ecological models and running them in a
client-server fashion between a Macintosh
and the CM-5.
Storm modehng/forecasting
Robert Wilhelmson (Dept. of Atmospheric
Sciences. UIUC and NCSA) and his col-
leagues have been able to simulate the devel-
opment of tornadoes embedded within larger
storms called supercells (producing the larg-
est tornadoes) and along low level con-
vergence boundaries (e.g. along a thunder-
storm cold air boundary) using both tradi-
tional vector supercomputers (NCSA and
PSC Cray Research Y-MP and C90 and
NCSA TMC CM-2 and CM-5). Study of
these results is leading to a better under-
standing of when tornadoes will develop and
to more accurate tornado warnings. The visu-
alization of the intemal dynamics of a .severe
thunderstorm, created by the NCSA visuali-
zation team in 1989, is perhaps the most
widely viewed visualization of a supercom-
puter simulation ever made. It had a major
worldwide impact on the adoption of scien-
tific visualization as a working tool of com-
putational science.
Kelvin Droegemeier, a former student of
Wilhelmson's, and his colleagues associated
with the Center for Analysis and Prediction
of Storms (CAPS), an NSF S & T Center,
have used the NCSA and PSC Cray Re-
search supercomputers to develop the Ad-
vanced Regional Prediction System, a com-
putational model for forecasting severe
storms. As of Spring 1994, this model has
been used, with data augmented by the sin-
gle-Doppler radar network now being de-
ployed by NOAA, in daily weather reporting
on an experimental basis. Because of their
use of parallel supercomputers they have
shown that regional storm forecasts based on
very high resolution models are possible with
the advent of teraflop computing capabilities
in the next few years. The long-term objec-
tive is to improve the prediction of hazardous
weather on scales ranging from a few kilo-
meters (an individual storm) and tens of
minutes to hundreds of kilometers (a squall
line or other mesoscale system) and several
hours.
31
78
High Performance Computing Infrastructure and Accomplishments
Los Angeles Smog
Gregory J. McRae (Massachusetts Institute
of Technology) and Armistead Russell
(Carnegie Mellon University) have developed
the most comprehensive model of smog
formation available. Their modeling of smog
in Los Angeles on the PSC C90 showed that,
contrary to EPA policy at the time, it is nec-
essary to control nitrogen oxide emissions as
well as hydrocarbons to control smog. This
work formed the scientific underpinning for
the Air Quality Management Plan adopted in
1988-89 for the Los Angeles air basin, the
most stringent such plan in the United States.
Their modeling also showed that alternative
vehicle fuels, methanol in particular, repre-
sent a worthwhile strategy for improving ur-
ban air quality, which influenced inclusion of
this policy in the 1990 revisions to the Fed-
eral Clean Air Act. This work is being ex-
tended using the combination of the PSC
Cray Research C90 and T3D as an NSF
Grand Challenge. The first Forefronts of
Large-Scale Computation award, given in
1989, recognized McRae for this work.
These pioneering computations are leading to
practical tools for states to predict air pollu-
tion levels. The Modeling and Meteorology
Branch of the California Air Resources
Board joined the SDSC Industrial Partners
program in 1991. They are running, on the
SDSC Cray Research C90, the Urban Air-
shed model which estimates hourly pollutant
concentrations. They results are u.sed to es-
timate maximum pollutant concentrations or
population exposure statistics for different
emissions controls.
Upper Ocean Mixing
Sidney Leibovich (Dept. of Mechanical and
Aerospace Engineering, Cornell University)
has developed a mathematical model on the
CTC IBM ES/9000 and SPl that uses a .se-
ries of equations to simulate what happens
when wind blows, waves form, and Lang-
muir circulations begin to mix the upper
ocean. This mixing has a marked influence
on the density of the water; the ocean's den-
sity structure alters its current patterns, and
the combined phenomena influence the ex-
change of heat between the ocean and the at-
mosphere. All of this has environmental and
ecological consequences, and finding ways to
predict this chain of events is of interest to
ecologists. oil companies, meteorologists,
climatologists, undersea communications
experts, and government policy makers.
Simulating Climate using Distributed
Supercomputers
C. Roberto Mecho.so, University of Cal-
ifornia, Los Angeles; Larry Bergman, Jet
Propulsion Laboratory; Carl Scarbnick, Gary
Hanyzewski, and Bilal Chinoy, SDSC; Paul
Messina, California Institute of Technology;
and Robert Malone and Rick Smith, Los
Alamos National Laboratory are developing a
coupled general circulation model distributed
over a heterogeneous network. The atmos-
pheric component, which runs on JPL's
CRAY, is a finite-difference model with
state-of-the-art parameterizations of convec-
tion, planetary boundary layer processes, and
radiation. It provides wind stress and heat
flux information to the finite-difference oce-
anic component, which runs on Caltech's
Delta or SDSC's Paragon. The oceanic com-
ponent then returns the sea surface tempera-
ture to the atmospheric component. Results
of the coupled system show a very realistic
seasonal cycle. This is a significant step in the
development of a coupled system distributed
across heterogeneous computer environ-
ments, a mam goal of the CASA gigabit test-
bed project.
Planetary Sciences, astronomy,
AND Cosmology
The sciences of the world beyond the Earth
have always been an interaction of ob-
servation of unexpected events and theory
creating a model built on the laws of physics
and chemistry which explain the observa-
tions. Therefore, it is proper to see high per-
formance computing and communications
making major impacts on both observation
and theory. Indeed, as we saw in the recent
impact of Comet Shoemaker-Levy 9 with
Jupiter, the observatories on earth and in
32-
79
High Performance Computing Infrastructure and Accomplishments
space have become intimately linked with
each other and with the theoretical simula-
tions. Supercomputers are becoming inte-
grated into observational facilities, like the
Grand Challenge BIMA millimeter observa-
tory, and with observational programs like
the ones which have led to discovery of new
millisecond pulsars or the first extrasolar
system planets, themselves orbiting a pulsar.
The ability of numerical methods to solve
even the most complex of fundamental
physical laws, such as Einstein's equations
of General Relativity, is leading to a very
rapid understanding of the dynamics of
strong field events, such as the collision of
black holes. In perhaps the grandest .scale
challenge possible, the universe itself is a
subject of investigation by a several Grand
Challenge teams using resources of the
MetaCenter to discover how the large scale
structures in the universe evolved from
nearly perfect homogeneity at the time of the
formation of the microwave background.
Comet Collision with Jupiter
Mordecai-Mark Mac Low (Dept. of As-
tronomy & Astrophysics, Univ. of Chicago
and Dept. of Astronomy, UIUC) and Kevin
Zahnle (Space Science Division, NASA
.Ames Research Center) simulated the impact
of Comet Shoemaker-Levy 9 on Jupiter us-
ing the astrophysical gas dynamics code
ZEUS-3D, developed at the NCSA, running
on the PSC Cray Research C90. Their simu-
lations, made months before the actual im-
pact, were widely used by planetary as-
tronomers to prepare for their observing
runs. Their results showed that the comet
would penetrate only shallowly into the
planet's atmosphere before exploding and
that the explosion would break out of the at-
mosphere. Their animations were available
using the global MosaicAVWW infrastruc-
ture and received worldwide coverage in both
the popular and scientific press.
Discovery of First Extrasolar System
Planet
A statistical analysis by Alexander Wol-
szczan (Dept. of Astronomy and Astro-
physics, Penn State Univ.) of data received
from the radio telescope at Arecibo Ob-
servatory in Puerto Rico led him to theorize
the existence of planets orbiting around a pul-
sar. Searching millions of bits of informa-
tion using the supercomputer facilities at the
CTC, he detected an unusual complexity in
the pattern of the pulses' arrival time. The
behavior suggested that the pulsar's motion
is affected by the presence of other orbiting
objects. Applying a perturbation model pro-
posed by Frederic Rasio at Princeton Univer-
sity, along with Philip Nicholson, Stuart
Shapiro, and Saul Teukolsky at Cornell Univ.
to a new set of observations, Wolszczan
found that it predicted the effect that two or-
biting planets would have upon each other as
they moved through space. The model fits,
and Wolszczan can be credited with identify-
ing the first planetary system beyond our
own.
The April 22 1994 issue of Science featured
this work, with front page coverage of news
also in the New York Times that week as well
as several other major metropolitan newspa-
pers, television, and popular magazines.
Wolszczan continues to explore this new
world using parallel capabilities of the IBM
SP machines at the CTC.
Building the BIMA Radio Telescope
This HPCC Grand Challenge project at
NCSA is implementing a prototype of the
next generation of astronomical tele.scope
systems - remotely located telescopes con-
nected by high-speed networks to very high
performance, scalable architecture computers
and on-line data archives, which are accessed
by astronomers over Gbit/sec networks. Data
taken by the BIMA telescope in northern
California are now transferred in real time to
NCSA (using a combination of high-speed
leased line and NSFnet), entered into a data-
base program, and archived on the NCSA
mass storage system. Astronomers locate
33-
80
High Performance Computing Infrastructure and Accomplishments
data in the archive and load data onto NCSA
or remote computers for processing using the
NCSA Mosaic software. Also, a digital li-
brary of fully processed radio images, which
will mclude an all-sky radio survey created
by the NRAO's VLA with a size of 1
terabyte, is being developed. One of the ma-
jor activities that BIMA and NCSA staff are
involved in is the creation of a new standard
software package AIPS-i-i-, written in C++
for extensibility and maintainability, to re-
place the older standard packages AIPS,
which was written in FORTRAN. The
AlPS-t-t- software is being developed by an
international collaboration including BIMA
member institutions, the National Radio As-
tronomy Ob.servatory. and scientists from
Australia. Canada. England, the Netherlands
and India. With AIPS-)-+, an astronomer can
learn the system once and then process im-
ages on nearly any computer using data from
almost any radio telescope. NCSA will im-
plement computationally intensive routines
on the SGI Power Challenge. Finally, discus-
sions are underway to have the BIMA tele-
scope array be part of the development of a
very large national millimeter array radio
telescope, in cooperation with the Caltech and
the National Radio Astronomy Observatory.
FuLsar Searching and Discovery
Joseph Taylor (Dept. of Astrophysical Sci-
ences, Princeton Univ. and 1993 Nobel prize
winner in Physics) and his colleagues used
the PSC C90 to aid in the search for pulsars
by analyzing radio-frequency data gathered at
the Arecibo Observatory. Since using su-
percomputing to sift through the enormous
amount of data obtained, this approach has
resulted in the discovery of 20 previously
unknown pulsars, including five millisecond
pulsars.
Accretion Disks Around Black Holes
Most galaxies .seem to harbor massive black
holes (between one million and ten billion
solar masses) at their centers. When gas ac-
cretes onto these holes enormous energies
can be released, turning the most luminous of
these objects into quasars. John Hawley
(Dept. of Astronomy. Univ. of Virginia) has
been using supercomputers for over a decade
to investigate the accretion disks which form
around black holes. Using the Cray Research
Y-MP and C90 at NCSA and PSC. Hawley
developed a new magnetohydrodynamic
(MHD) code which enabled him and Steven
Balbus to explore the nature of a powerful
MHD instability in accretion disks. They
discovered that this almost universal instabil-
ity is the underlying physical mechanism
which gives rise to the effective turbulent vis-
cosity in the disk, which previously theorists
had only modeled parametrically. For this
discovery. Hawley received the 1993 Helen
B. Warner Prize from the American As-
tronomical Society.
Black Hole Collision Dynamics
The two-body problem in General Relativity
is represented by the motion of two black
holes. In contra.st to the two-body problem
in Newtonian gravity which was solved in
closed form over 300 years ago. the orbiting
motion of two black holes is an outstanding
research frontier, recently being funded by
the NSF as a Grand Challenge. Two re-
search teams in this Grand Challenge. Ed
Seidel, NCSA Director Larry Smarr. and
coworkers (Dept. of Physics and NCSA,
UIUC) and Stu Shapiro and Saul Teukolsky
(Dept. of Astronomy and Physics, Cornell
University) have used a variety of supercom-
puters at NCSA. PSC. and CTC to compute,
for the first time, the evolution of the event
horizon representing the head-on collision of
two black holes. The spacetime diagram of
the collision shows a picture like the pre-
dicted "pair of pants", analyzed qualitatively
by Stephen Hawking and others in the early
1970s, but the supercomputer computation
gives for the first time quantitative details
about the area increase and the caustics
formed as the null geodesies, which generate
the horizon, intersect. Seidel and coworkers
show that in general such violent deforma-
tions of black holes cause the horizon to os-
cillate with the quasinormal modes previ-
ously studied only in linear perturbatioiT the-
■34
81
High Performance Computing Infrastructure and Accomplishments
ory. The Grand Challenge team is now mov-
ing on to the full spiraiing-in situation, which
is exjjected to be one of the first events ob-
served by the LIGO gravitational wave ob-
servatory late in this decade.
Largest cosmological simulation
This simulation, carried out by Michael
Norman and Greg Bryan (NCSA and Dept.
of Astronomy), on the TMC CM-5 is the
first which is sufficiently accurate to allow
direct comparison with x-ray telescopic ob-
servations, thereby allowing a critical test of
this popular cosmological model. A block of
the universe half a billion light years on a side
was evolved from the era of the formation of
the microwave background (with tiny inho-
mogeneties as quantified by the COBE satel-
lite) until the present. A single evolution re-
quired 30 hours on the 512-node CM-5 and
over 14GB of memory to hold the model.
The results were announced at the May 1994
meeting of the American Astronomical Soci-
ety and covered in the popular press by CBS-
TV and the Washington Post. The computa-
tion is one of a number now being made by
members of the Grand Challenge Cosmol-
ogy Consortium using resources of the
MetaCenter. Jerry ©.striker (NAS), Chair of
the Department of Astrophysical Sciences at
Princeton University is The largest simula-
tion of the formation of x-ray clusters in a
universe dominated by a mixture of cold and
hot dark matter was carried out using
NCSA's 512 node CM-5. PI of the GC-"*.
Evolution of the MetaCenter Concept
As the decade of the 1990s opened, it became
clear that we were witnessing fundamental
changes in the way researchers use high-
performance computing and communications
systems, driven by technological advances in
computers and networking. Anticipating
these changes, the four NSF supercomputing
centers formed a collaboration based on the
concept of a national MetaCenter for compu-
tational science and engineering: a collective
of intellectual and physical resources unlim-
ited by geographical or technological con-
.straint.
The primary objective was to realize ways in
which researchers could be best served and
their problems directed to that architecture or
combination of architectures best suited to
their solution, on a nationwide basis. Thus
the MetaCenter. begun in 1992, was con-
ceived as a "garden of architectures" that
could demonstrate ways of providing na-
tionwide resources to support scientific and
technological advances and supply a frame-
work within which the NSF centers program
could move into the next era of high-
performance computing. In 1994. NCAR's
Scientific Computing Division joined the
MetaCenter. In addition, the NSF established
a program of MetaCenter Regional Affiliates
(MRA) under which other institutions could
pursue projects of interest in collaboration
with MetaCenter institutions; MetaCenter
sites were involved with several MRA pro-
posals.
MetaCenter projects involve the use of the
resources and expertise available at more than
one MetaCenter site. Often these projects are
grand challenges or national challenges-
many have been highlighted el.sewhere in this
report.
In addition to direct support of researchers,
the MetaCenter sites have joined to work on
specific projects of benefit to the .scientific
community. Projects begun by the Meta-
Center during the first two years include;
• National File System; Separate cells
have been established at each site based on
the Andrew File System (AFS); a national
cell that will serve as the basis for a pro-
duction system is being implemented
With such a system in place, users will be
able to move transparently among the
MetaCenter computational systems.
-35-
82
High Performance Computing Infrastructure and Accomplishments
National Archival Storage System;
Work has begun on a proposed archival
storage system based on manufacturer-
supported and MetaCenter-enhanced prod-
ucts, including versions of UniTree and
the Data Migration Facility from Cray Re-
search. With such a system in place, files
on any center machine can be migrated
transparently among other center ma-
chines, the national file system, and the na-
tional archival .storage system.
Video Collaborator/: Using video
teleconferencing equipment, the re-
searchers and staff of MetaCenter in-
stitutions meet regularly, often with par-
ticipation from the NSF. The equipment
has been used to broadcast joint training
workshops like the MetaCenter Computa-
tional Science Institutes in Parallel Com-
puting conducted at SDSC and NCSA.
Plans" are to install additional hardware to
strengthen this system.
Visualization Collaboration: Meta-
Center and NSF Science and Technology
Centers researchers participated in a
DASC-funded MetaCenter Visualization
Workshop held at SDSC in 1993, and ex-
periments have begun in passing visual-
izations between the centers in real time to
determine bottlenecks and potential prob-
lems.
Online Information: An online in-
formation system using the gopher proto-
cols has been implemented; it is now be-
ing converted to be accessible via World-
Wide Web browsers. Plans are to broaden
the information available and incoiporate
new technologies. A joint Science High-
lights Repository will soon include
WWW links to MetaCenter-supported re-
search projects worldwide.
• Networking: Plans were developed to
upgrade the high-speed, wide-area net-
work connecting the centers into a very
high speed backbone.
New projects for the MetaCenter include ex-
ploring virtual reality for use in the scientific
discovery process, u.se of the basis developed
in the Online Information project to develop a
national training program to promote under-
standing and expertise in the use of parallel
computing, and a joint evaluation of the
scalable parallel computers acquired by the
NSF centers. A project has also been initiated
to establish a security research and response
team to monitor and coordinate responses to
incidents of .security violations.
Recognition Accorded NSF supercomputer Users and P r oj ects
1994 [in part for this work performed at NSF
centers]. David's PhD advisor and the third
Feenberg awardee is Mai Kalos, Director of
the Cornell Center.
The previous sections on Accomplishments
of the NSF Supercomputer Program cite
several major ways in which the research of
the centers has been recognized. Specific no-
table citations and users are summarized
here, with the complete description available
in the appropriate sections. In addition, scien-
tific visualizations produced at the centers
have provided popular recognition as covers
of scientific magazines and technical pro-
grams, as evidenced by numerous covers of
Science, and NSF's own publication Mosaic.
Pha.se Transitions of Solid Hydrogen ...
David Ceperley was awarded the fifth Eu-
gene Feenberg Memorial Silver Medal in
Protein Kina.se solution ... (Susan Taylor et
al) won the Forefronts of Large-Scale Com-
putation Award presented at Supercomputing
'93 for this work...
Kinking DNA ... John Rosenberg won the
1991 Forefronts of Large-Scale Computation
award for this research, and his work was
cited in the 1993 Computerworld Smith-
sonian award for science which was given to
the Pittsburgh Supercomputer Center (PSC).
-36-
83
High Performance Computing Infrastructure and Accomplishments
Heart Modeling ... This research by Charles
Peskin won the 1994 Computerworld Smith-
sonian award tor Breakthrough Computa-
tional Science. Peskin was previously
awarded a Mac Arthur Prize Fellowship in
1983, for earlier studies in heart modelmg.
Ultra-high-strength Steels Gregory B. Olson
and Arthur J. Freeman (Northwestern Uni-
versity) use computer modeling to design
ultra-high strength steel for weight-critical
applications ... This work was reported in the
July 15. 1994 Science. Freeman and his
group have been users of NSF su-
percomputers since the foundmg of the pro-
gram on a wide range of problems in materi-
als sciences. Arthur Freeman received the
first Materials Research Society Medal and
the first Award in Magnetism from the Inter-
national Union of Pure and Applied Physics.
Los Angeles Smog ... The first Forefronts of
Large-Scale Computation award, given in
1989, recognized Gregory McRae for this
work.
Theory of High Temperature Super-
conductors ...The Nobel Prize in Physics in
1 987 was for the discovery of a new class of
high temperature superconductors. Thou-
sands of research papers have been written
about the.se unique materials... David Pines
(NAS and first Feenberg 'Medalist) and
Philippe Monthoux (Department of Physics,
UIUC) used the NCSA Cray Research Y-
MP to carry out a strong coupling
(Eliashberg) calculation of the normal state
properties and Tc for the model experiment-
based magnetic interaction between quasipar-
ticles...
Crystallography... Herbert Hauptman
(Medical Foundation of Buffalo. Inc.) won a
Nobel Prize in 1985 for development of the
"direct method" of protein structure de-
termination from X-ray crystallographic data.
In a collaboration with Russ Miller (State
University of New York at Buffalo). Haupt-
man and Miller have developed a numerical
approach that extends the "direct method" of
determining molecular structure from X-ray
crystallographic data to larger molecules, be-
yond its present limit of about 100 atoms...
Pulsar Searching and Discovery ... Joseph
Taylor (Dept. of Astrophysical Sciences.
Princeton and a 1993 Nobel prize winner in
Physics) and his colleagues used the PSC
C90 to aid in the search for pulsars by ana-
lyzing radio-frequency data gathered at the
Arecibo Observatory. Using supercomputing
to sift through the enormous amount of data
obtained, this approach has resulted in the
discovery of 20 previously unknown pulsars,
including five millisecond pulsars.
Accretion Disks Around Black Holes ... John
Hawley (Dept. of Astronomy. Univ. of Vir-
ginia) has been using supercomputers for
over a decade to investigate the accretion
disks which form around black holes. Using
the Cray Research Y-MP and C90 at NCSA
and PSC, Hawley developed a new magne-
tohydrodynamic (MHD)code which enabled
him and Steven Balbus to explore the nature
of a powerful MHD instability in accretion
disks. ... For this discovery, Hawley re-
ceived the 1993 Helen B. Warner Prize from
the American Astronomical Society.
Alpha Shapes, Biomolecules, and Cosmol-
ogy Alpha shapes, a form of geometric mod-
eling developed by the 1993 Waterman
Award winner Herbert Edelsbruner (Dept. of
Computer Science, UIUC) and NCSA staff
member Ping Fu. focuses on the formal def-
inition, construction, and measurement of
shapes for any given point set in space, [They
are...] useful in studying water molecules re-
siding inside a protein. NCSA u.sers have
discovered other related applications of alpha
shapes by applying them to such diverse
fields as adaptive grid generation, medical
image analysis, visualizing the structure of
earthquake data, and the large-scale structure
of the universe.
The Rise of the MosaicAVWW Information
Infrastruture NCSA developed the Mosaic
user interface software which provides point-
and-click access to the diverse information
storage protocols of the Internet, such as
World Wide Web (WWW), Gopher, FTP.
37-
84
High Performance Computing Infrastructure and Accomplishments
and WAIS.... In 1994. NCSA was given In-
joworld's Publisher's Industry Achievement
Award.
Acronyms
AFS
Andrew File System
(developed at Carnegie-Melon
CD
University and named for its
CFS
benefactors, Andrew Carnegie
CMRR
and Andrew Melon)
ARPA
Advanced Research Project
Agency of the Department of
CNRI
Defense (previously called
CPU
DARPA)
CTC
ASC
Division of Advanced Scien-
CTSS
tific Computing, Computer
DASC
and Information Sciences and
DEC
Engineering Directorate
(CISE). NSF
DICE
ATM
Asynchronous Transmission
Mode - high bandwidth
DJMS
transmission technology for
DMF
WAN use
DX
BIMA
Berkeley, Illinois, MAryland
telescope
EOSDIS
BLANCA
NSF Gigabit testbed - see
CASA
EPA
C
A programming language de-
veloped at Bell Labs
FAST
(probably the 3rd in a series)
and used as an easily portable
FDDI
systems development lan-
guage.
C++
An Object Oriented extension
ofC
FORTRAN
CAD
Computer Aided Design
CAPS
Center for Analysis and Pre-
diction of Storms
CASA
NSF Gigabit testbed - see
FTP
BLANCA
GC
CAT
Cornell University's Biotech-
Giga
GIS
nology Center for Advanced
Technology
CAVE
Cave Automated Virtual En-
vironment
Gopher
CCMS
Computational Center for
Macromolecular Structure —
HDF
an SDSC, UCSD, and
Scripps Research Institute
collaboration
Compact Disk
Common File System
Center for Magnetic Record-
ing Research
Corporation for National Re-
search Interests
Central Processor Unit
Cornell Theory Center
Cray Time Sharing System
see ASC
Digital Equipment Corpo-
ration
Distributed Collaboration En-
vironment)
Dynamic Job Mix Scheduler
Data Migration Facility (Cray)
Data eXplorer
NASA Earth Orbitting Satel-
lite Distributed Information
System
Environmental Protection
Agency
Flow Analysis Software
Toolkit
Fiber Data Distribution Inter-
face (a 100 Mps network me-
dium)
FORmula TRANslator - Ear-
liest high level programming
language — still in common
use on high performance
computers
File Transfer Protocol
Grand Challenge
One Billion- 10^
Geographic Information Sys-
tem
Internet Browser developed at
the University of Minnesota
(Golden Gophers)
NCSA Hierarchical Data
Format
38-
85
High Performance Computing Infrastructure and Accomplishments
HiPPI
High Speed Peripheral Paral-
lel Interface
NCAR
HP
Hewlett Packard
NCSA
HPC++
An object oriented varient of
the C language for High Per-
NIH
formance Computers
Nil
HPCC
High Performance Com-
puting and Communications
NOAA
HPF
High Performance FOR-
TRAN
NRAO
HPSS
High Performance Storage
System
NREN
HTTP
Hyper-Text Transfer Protocol
for the Internet
NSF
IBM
International Business Ma-
NSL
chines
PICT
IFS
AFS port by Univ. of Michi-
gan
PSC
InterNIC
Internet Network Information
Center
PVM
lUPAP
International Union of Pure
QCD
and Applied Physics
RAID
IVEM
Intermediate Voltage Electron
Microscope
REL
LANL
Los Alamos National Labo-
latory
REU
LAN
Local Area Network
LIGO
Laser Interferometry Gravita-
tional Observatory
RISC
LLNL
Lawrence Livermore National
ROM
Laboratory
SDF
MacCHESS
Cornell High Energy Syn-
chrotron Source for Macro-
SDMIR
molecular Modeling
SDSC
MHD
Magneto HydroDynamics
MILC
MIMD Lattice Calculations
SGI
Collaboration
SIGGRAPH
MIMD
Multiple Instruction, Multiple
Data
Mosaic
Internet Hypertext Browser
— WWW client developed at
SIMD
NCSA
MPP
Massively Parallel Processors
STM
MRA
MetaCenter Regional Affiliate
MRI
Magnetic Resonance In-
terferometry
TCP/IP
NAS
National Academy of Sci-
ences
Telnet
Tera
National Center for At-
mospheric Research
National Center for Su-
percomputer Applications
National Institutes of Health
National Information In-
frastructure
National Oceanic and Atmos-
pheric Administration
National Radio Astronomical
Observatory
National Research and Edu-
cation Network
National Science Foundation
National Storage Lab
A popular image format for
computers
Pittsburgh Supercomputer
Center
Parallel Virtual Machine
Quantum ChromoDynamics
Redundant Array of In-
expensive Disks
Renaissance Experimental
Laboratory
Research Experience for Un-
dergraduates (NSF program)
Reduced Instaiction Set
Computer
Read Only Memory
Standard Data Format
San Diego Microscopy and
Imaging Resource
San Diego Supercomputer
Center
Silicon Graphics, Inc.
Special Interest Group -
Graphics — an influential and
popular visualization organi-
zation and annual trade show
Single Instruction, Multiple
Data
Scanning Tunneling Mi-
croscope
Tranmission Control Pro-
tocolAnternet Protocol
Internet Computer logon
Services
One trillion- 10'2
39-
86
High Performance Computing Infrastructure and Accomplishments
TMC
UNICOS
UNIX
vBNS
VPLA
Thinking Machines. Corp.
Cray'.s Version of Unix
Popular portable computer
operation system developed at
Bell Labs - reportedly does
not stand for anything - al-
though some believe that it
was a pun on Multics, a time
sharing systems then in use at
MIT
very high speed Backbone
Network Service
Visual Programming Lan-
guage for Animation
VR
Virtual Reality
WAIS
Wide Area Information Sys
tem
WAN
Wide Area Networks
WWW
World Wide Web - a set of
commications protocols for
Hypertext established at
CERN.
XMP & YMP - versions of Cray computer
systems — vector multi-
processors— used as refer-
ences for computer perform-
ance
40-
87
Mr. SCHIFF. Thank you, Dr. Hayes.
I would like to recognize Members for questioning. I would ap-
preciate it if Members would try to stay at least loosely to five min-
utes just because we have two more panels.
I am going to recognize myself first. I have one question.
Dr. Hayes, I would like to be clear, myself, and be clear for the
record. As you well know of course there are right now four
supercomputing centers funded in the NSF program.
What I would like to know is. Does your Task Force's report spe-
cifically recommend more such centers, less such centers? Numeri-
cally speaking. I know you made several recommendations on how
to network and so forth, but in terms of the number of centers, is
there anything in the report that addresses that?
Dr. Hayes. As the Task Force looked at the overall program, the
advice that we had had fi-om the Foundation and our own sense of
what the budget environment was going to be for the next five to
ten years, suggested that in order to have a balanced program, that
it may be necessary to downsize the total number of partnership
sites.
I would also mention that the Task Force was also aware that
an important part of a growing component of the program is the
Metacenter Regional Affiliates.
In our recommendations about the partnership centers, we recog-
nized that these Metacenter Regional Affiliates could and should be
brought into the program as a natural evolution of the centers pro-
gram.
So within the Task Force there was an expectation that in order
to fund the balanced program that was envisioned, that it may re-
grettably be necessary to reduce the number of centers.
Mr. SCHIFF. But at the same time, you are proposing the origina-
tion or inclusion of other centers — I'm sorry? use the term? —
Dr. Hayes. Metacenter Regional Affiliates.
Mr. SCHIFF. Right.
Dr. Hayes. Yes, that presently are a part of the program, and
they are loosely coupled to the existing four centers.
Mr. SCHIFF. Right.
Dr. Hayes. I think one of the important things to appreciate is
that between the time of the Branscomb Committee Report and
when the Task Force was called together, there were three impor-
tant changes.
One had to do with the expectations with respect to the budget.
The Branscomb Committee, which I was a member of, was working
in an environment in which there was some hope and expectation
there would be significant growth in the budget for high-perform-
ance computing.
So the possibility that we would have to downsize the number of
centers was really not addressed in a formal way as being driven
by constraints on the overall budget.
The other is a full appreciation of the importance of having a bal-
ance within the p3rramid; that the mid-range systems, because of
important developments in the technology, were becoming increas-
ingly important.
So the possibility of buying from major commercial vendors mid-
range systems that were upwardly compatible by increasing the
88
total number of nodes on the systems really became a reality in
that very short period of time. And the importance of building on
that development in the technology was very much before us as the
Task Force was formed.
Our sense is that it was important to recognize that there were
a number of these mid-range centers that could be coupled more
effectively into the program, and there will be efficiency and cre-
ativity that would come from that formal coupling rather than
loose coupling as exists today.
Mr. SCHIFF. Thank you, Dr. Hayes.
Congressman Cramer?
Mr. Cramer. Thank you. I apologize for having to be out of the
room for a few minutes.
Welcome, both of you, and thank you for your testimony.
Dr. Hayes, are there any significant aspects of the recommenda-
tions of your committee that are not being followed by NSF in re-
structuring the supercomputer centers program?
Dr. Hayes. I am not aware of any major elements that are not
reflected in the program announcement.
There are other issues with respect to the high-performance com-
puting that we expect that the Foundation will follow through on,
such as taking the leadership with other agencies in high-perform-
ance computing and continuing to look at options to fine-tune the
review process for allocation at the centers, but those are not mani-
fest specifically in the program announcement.
Mr. Cramer. Dr. Young, the solicitation for the new centers pro-
gram requires submission of a preproposal
Dr. Young. Yes.
Mr. Cramer. So you are going to have a preproposal, if I can say
it right, and a final proposal as well. Will there be overlap between
reviewers of those two sets of proposals?
Dr. Young. I would certainly expect overlap. They may not be
exactly the same group, but I would expect very significant overlap,
yes.
Mr. Cramer. So you would expect that that would be monitored
so that we don't have problems as a consequence of trying to form
what you do around one, and then there not being continuity be-
tween the final proposal and the preproposal reviewers?
Dr. Young. The intent of the preproposal stage is to allow
preproposers to come in and get advice about the quality of their
proposals; to get input; and to not use the preproposal rankings as
a determinant factor in what the final will be, but rather as a
mechanism for giving advice to people coming in.
Mr. Cramer. And I want to say, I like that idea. That is what
I had hoped I would hear, because I think that is a good way to
explore it through that process that work with and try to cooperate
and encourage that.
Thank you, Mr. Chairman. That's all I have.
Mr. SCHIFF. Mr. Ehlers, you are recognized for five minutes.
Mr. Ehlers. Thank you, Mr. Chairman.
My question is somewhat tangential and I hope you do not think
this reflects on your report, but I think your testimony and the re-
port speak for themselves.
89
We have a particular problem in the Federal Government in
terms of the purchase and procurement of computers. The procure-
ment process itself is extremely complex because of the paranoia of
the public over the years that someone might be given favored con-
tracts, or some such, and it has reached the point where, when the
Federal Government wishes to purchase a new computer using the
process that exists there is an absolute guarantee that the com-
puter will be obsolete by the date it is delivered.
I have two questions based on that.
Number one. Is that a problem in any of the centers? Are you
in any way bound by any of the procurement requirements, either
directly or indirectly, that might be imposed through NSF?
And secondly. Do you have any advice to give us, particularly
since I am looking at ways of restructuring the federal procurement
policies vis-a-vis computers, do you have any advice on how I could
proceed or how we could proceed in order to eliminate the problem
we have now?
I would welcome comments from either one of you.
Dr. Young. Well I will take that first, and I may turn to Bob
Borchers if he has backup comments he wants to maike.
First of all, we do cooperative agreements with vendors. We are
on the leading edge, so we are not bound by the federal procure-
ment process in acquiring the leading edge computers here.
What we typically do is partner with vendors to bring in very
early versions of machines that are often very interested in coop-
eratively working with us on software development for scientific
applications because that is not the primary drive that is driving
them for their mass market.
So we have a mutual interest in doing software development and
early development on machines that enables us to get very com-
petitive rates in what we do in leasing.
So the flat answer to your question is: we simply are not bound
by the procurement process, and it works very well to partner with
industry to bring these machines in for early testing and develop-
ment.
With regard to the second question. No, I do not have advice on
what to do about the procurement problems. I recognize them.
Mr. Ehlers. Thank you.
Dr. Hayes?
Dr. Hayes. Yes. I would echo the comment about the efficiency
of the system. I think that many center directors that are in the
Federal Government just marvel at what the center directors — at
what our NSF Supercomputing Centers have been able to do in
terms of getting leading edge equipment on site and doing it in a
very short time frame.
The key point that Paul mentioned that I think it is important
to recognize is the extent of partnerships between the centers and
the vendors. I think that has been a key element in the success of
the program right from the beginning.
I think one of the new elements in all of this that is important
to keep in mind is that the recapitalization cycle for the new hard-
ware is getting shorter. One of the factors that went into our con-
sideration as to how much budget would be required to support a
center was part of the experience from the history of the program
90
that if you run with long recapitahzation cycles on leading edge
technology, that it leads to problems within the program.
So that the center directors themselves may have some addi-
tional insights on this particular question, but I think as long as
the resources are there in a timely way and importantly they do
not make these very long-term commitments that prevent them
from recapitalization on an appropriate time scale, which possibly
in this environment is three or four years, why you can see the
kind of turnover in a timely way that is very important to the suc-
cess of this program.
Mr. Ehlers. And do you have any advice for us?
Dr. Hayes. No, I would not hazard advice.
[Laughter.]
Mr. Ehlers. We may need a Hayes' test for a federal computer
system.
Dr. Young, did you have anjrthing to add?
Dr. Young. No.
Mr. Ehlers. Thank you, very much.
Mr. SCHIFF. Mr. Doyle?
Mr. Doyle. Thank you, Mr. Chairman.
Let me see if I have got this right. Now the principal role of the
new centers program is to provide access to high-end computing in-
frastructure for the academic, science and engineering community?
Is that correct?
Dr. Hayes. That is correct.
Mr. Doyle. And, Mr. Hayes, your report indicated that the
supercomputers at the existing center are oversubscribed by a fac-
tor of at least two, and the demand is growing.
Most of us here understand that little or no funds are available
for the restructured program which must fund many regional cen-
ters and research consortia in addition to each leading edge
supercomputer site, however many there are, one or two; and, fi-
nally, ARPA is not likely to provide support to the new centers for
hardware upgrades as has been the case for the current centers.
Therefore, since the new program will be increased significantly
in scope without additional funding, why do you believe the new
program will adequately support high-end computer users?
Dr. Young. I don't believe it will adequately support high-end
computer users. If you look across the Foundation as a whole, per-
haps one in four of the proposals for research that come into the
Foundation that are good proposals are in fact funded. So these are
tight times and Chairman Schiff reminded us of that at the begin-
ning. We are making tough choices.
Dr. Hayes. I would say that part of the perspective that the Task
Force brought to the table was the realization that there is a very
large amount of important computational research that can be done
on mid-range systems and work stations.
The view of the Centers Program is not to provide support for
the vast majority of computational studies. The NRC report on
the — the Brook-Sutherland Report talked about the Centers Pro-
gram and the highest end computing in the context of a time ma-
chine; that the work that is being done on these leading edge sys-
tems that are very expensive, that kind of work is basically trail
blazing work developing algorithms and methodology, testing new
91
physics and chemistry and mathematics, that will be used widely
on mid-range machines in the next five to ten years.
Why? Because the technology capability, the underlying tech-
nology capability is doubling every 18 to 24 months. So today's
work stations basically are equivalent to the supercomputers that
were used in the early stage of the program.
So from a national perspective, I think the most important aspect
of this program is to provide a cadre of scientists in many fields
with access to the highest end. Not that it is going to be the total
effort in the Nation, facilitating the total effort, in high-perform-
ance computing, but it is going to be providing the cadre of the best
and brightest students that are going to be doing trailblazing re-
search, but there is still going to be the vast majority of the work
that is going to be done on mid-range machines and on work sta-
tions.
That was one of the important points that was made in the
Branscomb Committee Report, and I think that is being borne out
by what is happening in many fields today.
Mr. Doyle. I have one more question.
I am just curious, and a lot of us are just trying to understand,
you know, what the driving forces are. We hear all the reports on
the four centers. They have all been glowing reports, and a lot of
us wonder why can we not just make modifications at the four ex-
isting centers driving this movement to make these significant
changes?
Would it be fair to say that, as you look at the budget — you
know, we all understand the budget climate that we are in right
now. I guess what I am trjdng to understand is. Is it dollars that
are driving these decisions to look at maybe going from these four
centers to maybe one, maybe two?
Why could we not do this by modifying what goes on at the four
existing centers and spare all the expense of a recompete?
What is driving this?
Dr. Young. I will take that, or attempt to take it.
It is certainly not being just driven by dollars, though that cer-
tainly drives part of it. If you have a constant budget and you allow
for inflation, it gets hard to do as much as you have done in the
past on a fixed budget.
But as Ed pointed out, the technology is also driving the struc-
ture of the new program. There are two parts to that technology.
One is the increasing viability of mid-level parallel machines
which scale up to the high end, as he mentioned. That enables new
regional and state centers, for example, or groups at universities to
partner with centers in a sjmergistic fashion to make use of the
technology in a more effective fashion.
We think that this new program will use that technology and a
corresponding increase in high-band width connections which will
enable these centers to partner together in a more effective way to
restructure the program so that, although we will have fewer high-
end leading centers, we will have the totality of more full partners
in the program and we think we will get greater efficiency with
that.
Mr. Doyle. Dr. Hayes?
92
Dr. Hayes. I think the concept that we are recommending —
namely, the partnership centers — is right, irrespective of whether
the number is two, three, four, or five. So let me be clear about
that.
Just two amplifying insights here:
One is the realization that the very best people in computer
science and in computational mathematics and in computational
engineering generally are not all located at one place. So that the
partnership centers, as we have seen with the Metacenter Regional
Affiliates, play a very important role in the training, and develop-
ment, and the inclusion of a much broader cross-section of talented
people within the concept of the program.
So the model facilitated by the technology we believe to be the
correct model; and that the effort to coordinate the development
and the acquisitions and the allocation of times is well worth the
effort from a national perspective.
The other point that is of course present here is that the tech-
nology is changing quite rapidly. And that if you do not keep these
leading edge sites one to two orders of magnitude ahead of what
you can do with a work station, the attractiveness and the con-
tributions that will be forthcoming from these centers will be sig-
nificantly reduced.
So you have these two things that are tugging at each other and
are important changes in the computing environment from what it
was at the time of the initiation of the Centers Program.
Mr. Doyle. Thank you, Mr. Chairman.
Mr. Ehlers [presiding] . Thank you.
I think Congressman Gutknecht left — oh, no, I am sorry. There
he is leaning back.
Mr. Gutknecht. I am here, but I would yield to somebody else
who has questions.
Mr. Ehlers. Congressman Gutknecht yields to Representative
Morella who has recently joined us.
Mrs. Morella?
Mrs. Morella. I thank the gentleman for yielding to me, and I
thank you, Mr. Chairman.
With the budget released today, what is NSF's budget for the
Supercomputer Centers for Fiscal Year 1997?
Dr. Young. For Fiscal Year 1997 it will be essentially the same
as Fiscal Year 1996, which is roughly — and I may be wrong by a
little here — $67 million.
Mrs. Morella. Sort of level funding
Dr. Young. Level funding.
Mrs. Morella. (continuing) — for the 18 months. And I guess
that ties into the question that was asked of the four directors that
will be succeeding you on this whole concept of is it collaboration,
or is it competition? And does that then set out turfdoms? And
what do you do about that?
Is there anything further you would like to add?
Dr. Young. When they testify, the directors themselves may
want to testify about that.
I think one of the big successes of the Supercomputer Center
Program over the past 10 years has been the degree to which com-
peting programs have learned to work in a cooperative fashion.
93
I think there has been a great deal of collaboration and very
fruitful interaction among the existing Supercomputer Centers.
I think one model of the developing program is that we will have
partnerships that extend beyond the current leading edge sites. To
some extent that is modeled on cooperation that already exists.
It is unquestionably true that, in the context of a recompetition
that, it does encourage each of the current centers that may be
coming in in the recompetition to put forward the best proposals.
At the same time, it is NSF's management's intent to continue
cooperation among leading edge sites and partnerships as they
come in.
But you are right. It presents new challenges for our manage-
ment, and we intend to follow through on that, and we intend to
monitor cooperation and put in place procedures which will encour-
age cooperation in the future.
Mrs. MORELLA. So with the procedures you have in mind and the
concept, you think this is going to work out well.
Dr. Hayes, do you want to add to that in any way?
Dr. Hayes. Yes. I would add two perspectives which expand on
the points that Paul has made.
One is that competition, if it is done in an appropriate environ-
ment, can bring out creative new ideas and it has many of the posi-
tive aspects. So the challenge — and I think, adding to Paul's
point — the challenge for NSF is to figure out how you keep the
playing field level and permit an appropriate level of competition
to bring out the creative juices, if you will, as part of this program.
And I think there are sort of two levels of partnerships that they
have to pay attention to.
One is the partnerships within the partnership centers. Then,
once the competition is over — and the Task Force was very strong
in recommending that two centers is infinitely better than one;
okay? so our expectation is that when there are two or more win-
ners, there is still an important issue of partnerships among the
centers that win out.
NSF has a good record with the present centers in encouraging
and facilitating that kind of collaboration and partnership, as well.
Dr. Young. May I come back to that
Mr. Ehlers. Yes, you may.
Dr. Young, (continuing) — in addressing partly that question and
the earlier question about why take a successful program and
recompete it, which I think was implicit.
The current program has been very successful, but part of NSF's
standard management is to take peer review of the very best ideas
and always put things out for recompetition after a certain amount
of time. We think that is very important.
We think that with the current solicitation we are going to get
a lot of new ideas, and very creative ideas, on how to structure this
program.
Ed mentioned earlier in his testimony that our solicitation allows
for a great deal of creativity in how these programs come together,
and I think a little competition and a chance for people to come in
with new ideas about the appropriate structure of the program is
going to lead to a very exciting new program, and I think that is
going to work well.
26-018 0-96
94
Mrs. MORELLA. I just wondered, if I have another moment, sim-
ply because from what I have read — and I am not expert on this —
that there have been high marks given by at least two review pan-
els of the current system and why modifications can't be made
without the additional expense that would be incurred by the
recompetition.
Do you have any comments? Do you wonder, too?
Dr. Young. I think I actually tried to answer that in the earlier
response. I believe that we will get a significant restructuring and
better use of existing technologies through the competition.
There is a price to doing that, but I think one of the things that
is true about Foundation policies and procedures generally is that
we find open competition — peer-reviewed — a useful way to bring
out new ideas, and that is a process we are going through. It is
often painful and it is not without cost, but we do believe that that
is the best way to bring new ideas forward, and I think this new
solicitation will do that.
Mrs. MORELLA. Thank you. Dr. Young.
Mr. Ehlers. The gentlelad/s time has expired, but if you have
a brief comment you can add to that.
Dr. Hayes. Just to follow up on the competing visions for the fu-
ture in terms of how one is going to deliver the highest perform-
ance computing, I think it is important to recognize that there have
been significant developments in the networking capability, and
they are projected for the future.
People have different and legitimate differences of opinion about
what the future holds. So one of the things that is going to be
worked out as part of this competition is just what the balance is
between how people are going to be doing the highest end comput-
ing and the extent to which they will be doing it in a distributed
mode.
Mrs. MoRELLA. Thank you, very much.
Mr. Ehlers. We have been joined by Congressman Volkmer. Do
you have any questions, Mr. Volkmer?
Mr. Volkmer. Yes.
I want to first apologize for not being here for the full testimony,
but I reviewed some of your statements and I was just discussing
it with the gentleman from Pennsylvania and I would like to con-
tinue along a similar line because I just do not quite understand
the reasoning for the recompetition. But from what he has told me,
perhaps there is a need for it if you want to try something dif-
ferent.
But do you, when you envision what is going to come from the
new competition, do you have any ideas of what you are looking for
to come forward?
Dr. Young. Yes.
Mr. Volkmer. What are those ideas?
Dr. Young. We believe that networking technology and the ad-
vent of scalable mid-level parallel machines that scale up to the
high end enable us to restructure a program in which we can get
a broader distribution of the technology across the country, more
participating centers, and better integration of mid-level systems
and better use of high-speed networking connections into a pro-
95
gram that is less centralized and broader than the current pro-
gram.
Mr. VOLKMER. All right. So what you envision is, you would still
have two, three, four, or five main centers?
Is that correct?
Dr. Young. That is correct.
Mr. VoLKMER. But they would be friendly users?
Dr. Young. Yes.
Mr. VOLKMER. So that other universities and other research cen-
ters would be able to utilize the information that they have devel-
oped, or what?
Dr. Young. Not only utilize the information they have developed,
but partner in the development of new software to partner in the
development of new algorithms and work in a fashion that is coop-
erative with them using the fact that they have similar models of
machines and similar software developments.
Ed spoke earlier of the fact that the expertise in computing in
the country is not located at just a few institutions. We believe that
we can get a better partnering of the expertise that is around the
country by a program that makes use of these new technologies.
Mr. VOLKMER. And the ultimate aim within four to five to six
years is what?
Dr. Young. The ultimate aim is to have the ability to do com-
putational science and engineering distributed even more broadly
across the country and have a more powerful and better program
for the country as a whole.
Mr. VOLKMER. Would we end up then with a perhaps new and
different software
Dr. Young. We will certainly have new and better software, and
we think that the restructured program will actually aid in that de-
velopment of software both for the underlying technology and for
the specific technology for computational science and engineering.
Mr. VOLKMER. And this could be transferred to private industry
and private business?
Dr. Young. We believe that much of the work that goes on in the
Foundation in this area ultimately finds application in business, for
sure. The
Mr. VOLKMER. All right. Fine.
Now do you envision that those who will be entering the competi-
tion — or is the competition still open?
Dr. Young. Pardon?
Mr. VoLKMER. Is the recompetition still open?
Dr. Young. Yes, it is. The deadline for preproposals is April
15th.
Mr. VOLKMER. All right.
Do you envision — about how many will be in that competition?
Dr. Young. I would guess on the order of four to half a dozen,
maybe more.
Mr. VOLKMER. Four, five, six, seven?
Dr. Young. Yes.
Mr. VOLKMER. Mostly the same centers that are out there right
now?
Dr. Young. That is the current indication, I believe.
96
Mr. VoLKMER. So basically we have a recompetition with, to me,
a goal of changing what is being done right now at those same cen-
ters, it looks like.
Dr. Young. It will change what is going on at those centers, and
it will change what is going on in other centers around the country.
Mr. VOLKMER. Well the other centers will change as a result of
what these centers do. Is that correct?
Dr. Young. I think both will change.
Ed?
Dr. Hayes. I think I need to draw a little cartoon of what this
Partnership Center looks like.
Mr. VoLKMER. All right. That might help.
Dr. Hayes. A Partnership Center has a leading edge site
Mr. VOLKMER. All right.
Dr. Hayes, (continuing) — which has a high-end balanced system.
But as part of the partnership there may be four to five — the exact
numbers will depend on the proposals of course — but of partner-
ships where there is human resource capability; there are existing
networks; and there will be hardware and software capability.
The center is the combination of the leading edge sites, as well
as the partnership sites.
Mr. VoLKMER. In other words
Dr. Hayes. It has been recognized since the beginning of high-
performance computing centers that a good deal of the use of these
high-end systems in fact is used by people who have major research
problems, but on a job-by-job basis, particularly in the development
of the software, may not require access to the highest end of the
performance spectrum.
So within this new Partnership Center there will be the possibil-
ity, particularly in the early development stages, to allocate time,
resources, and training on the mid-range machine.
Why?
Because now it is technically possible today to — for instance Ohio
State is affiliated with the Ohio Supercomputing Center, which
serves all the universities within Ohio, which is a Metacenter Re-
gional Affiliate, which is affiliated with the Pittsburgh Center, but
it is very loosely coupled.
It is not possible for the Allocations Committee to say yes to a
researcher, we think you have a good idea but we want more quan-
titative data. We want you to go to a Partnership Center like the
Ohio Supercomputing Center and demonstrate to us that you are
technically ready to make effective use of the high-end resource.
When you are ready, then you will be able to compete with oth-
ers to have that access.
I think in a sense it is akin to getting preliminary data for a
large experiment. This is viewed as a more cost-effective method of
doing that because now it is the whole center that is being man-
aged.
The other point has to do with software productivity. I think es-
sentially everyone who has been looking at the performance of the
centers program over the years has recognized the important
science that has been done. But the advances in terms of software
productivity have not been in parallel — have not made progress in
parallel with the underlying technology that is being used.
97
One of the hopes is that by bringing in a broader set of partners,
computer scientists, mathematicians, that can work on early ver-
sions of machines which may at some point be the leading edge
systems, that we will get significant advances upon which everyone
will benefit in terms of software productivity and, importantly, in
learning how to use distributed systems over the network.
I think the issue is on what sort of time scale are we going to
find that the highest performance computers in fact are assembled
by using computers that are distributed across the network.
This is a very important issue for industry, as well as for re-
search in this country.
Mr. VOLKMER. Mr. Chairman
Mr. Ehlers. The gentleman's time has expired.
Mr. VOLKMER. May I ask one more question?
Mr. Ehlers. If it is a very, very brief one.
Mr. VOLKMER. It is.
You have the primary center, what I call the primary center, and
an ancillary leading edge.
Dr. Hayes. Yes.
Mr. VoLKMER. Now those that join with that leading center,
would that be only university and academia? Or will that include
industry and private business at the research centers?
Dr. Hayes. We include industry, national laboratories-I think
there
Mr. VOLKMER. Whoever they-
Dr. Hayes. It is specifically stated in the program
Mr. VOLKMER. Whoever they can get to sign up, in other words?
Dr. Young. That is right. I could read from the program solicita-
tion. You probably do not want that, but just a subset of those list-
ed. It is not an exclusive list, but it includes universities, and in-
cluding research groups within universities, centers of various
forms, research and educational consortia organizations and
groups, regional and state supported high performance computing
centers, private-sector organizations, and national labs.
That list is not intended to be exclusive.
Mr. VOLKMER. Thank you.
Mr. Ehlers. Thank you.
We have been joined by Congressman Boehlert. Congressman, I
would like to ask you to take the Chair, and then you may have
your turn to ask questions, as well, because I will have
Mr. Boehlert. What a deal that is.
[Laughter.]
Mr. Ehlers. Because I have to leave for the Floor to defend a
bill.
Mr. Boehlert [presiding]. What a deal this is. I get the Chair
and the opportunity to question.
I am not quite certain I am clear in my own mind why we are
going in the direction we are going, because what I see is ten years
of success and we have got four outstanding centers doing a lot of
things you are talking about — their extensive outreach program,
their partnering, bringing in the private sector — I mean, you do not
have just four little islands unto themselves scattered around the
country.
98
It seems to me you are talking about reducing the number of cen-
ters at a time when I would think we would be expanding what we
are doing at the centers and encouraging them to keep up the good
work.
I am not quite sure I am clear why there seems to be the drive
toward a reduction in the number of centers after a decade of suc-
cess.
And once again, I seem to think — or that is my impression at
least — that the centers are all individually doing exactly what you
are talking about what you envision as a result of this competition,
the extensive outreach; the partnering; the bringing in the private
sector.
Enlighten me a little bit, if you will. Doctor?
Dr. Young. Shall I try that first and then let you come?
Dr. Hayes. Yes.
Dr. Young. I think your point is well taken. It may be a case
of success breeding success.
But I think it is also true that the technologies permit an accel-
eration of that process and a deepening of that process.
So I think that you will see more partnering and more outreach
with the program as it is structured.
As Ed mentioned earlier, and I think before you came in, the
structure that is envisioned here is a structure that makes sense
independently of the actual number of centers and partnerships. So
we think that this intent to make better use of networking and
scalable parallel computing is something that the centers, as you
point out, have been doing, and we think it is a natural evolution
of the program in that direction.
At the same time, we do have constant budgets, tight budgets,
and we still have inflation. There is a limited amount we can do
with tight budgets.
Ed mentioned that one of the things we need to do at the leading
edge sites is to keep them one or two orders of magnitude, ten to
one hundred times faster, ten to one hundred times more storage
power, just the technology needs to be a couple of places ahead.
And that is expensive and there are a limited number of places we
can do that.
We think we are going to get a more efficient program as we ex-
pand some of the things the centers have been doing very well.
Mr. BOEHLERT. Dr. Hayes, do you wish to add anything to that?
Dr. Hayes. Yes. I think one of the futures that the Task Force
was very concerned about is the scenario where we get four Part-
nership Centers, where the Metacenter Regional Affiliates that we
currently have, that that part of the program is folded into the
broader program, so you get four Partnership Centers.
Mr. Boehlert. Which is in effect what we have right now.
Dr. Hayes. Well I will come back to that as a detail. That is not
quite right. We have four leading edge sites right now, and we
have loosely coupled Metacenter Regional Affiliates. That is what
we presently have.
The concern is that, with the changing philosophy at ARPA,
DARPA now, that if these centers came into being and the NSF
budget did not grow at a rate that was significantly above inflation
for this program, that you would not be able to keep up with the
99
recapitalization cycle that would be necessary to keep the leading
edge sites at a level that would be sufficiently interesting to draw
the very best researchers to the centers.
You may want to explore this with the center directors, but the
impression that many of us have is that without the DARPA
money to help NSF move into the massively parallel arena over the
last five years of the program, why many of the centers would have
been stuck at the vector computing stage, because many had long-
term leases or payments on existing centers. And without an infu-
sion or a recapitalization, you could find that what appeared to be
a good investment and positioned the center for a period of two or
three years basically put a Partnership Center in the position that
they were not really capable of supporting the most interesting
science, and that that would basically put you on the kind of spiral
where the scientific community, the people who actually do the
science, would say, well, by buying a mid-range system at signifi-
cantly lower cost, they can do research which is competitive with
what you could do at a Partnership Center with six- or seven-year-
old technology.
So the concern about "sizing" the program that was very much
on our mind is that the program needs to be sized so that it can
be recapitalized on a reasonable time scale.
People debate about whether that is three or four years, but I
think there are a few that would argue that five to seven years is
the right recapitalization schedule.
So when you think about within the kind of budget environment
they are in, and the kind of funding that is available for the Cen-
ters Program, recapitalization on a shorter time scale, and also
bringing in the partnership aspect and being able to do realistic
and interesting experiments with distributed computing, the
present centers do not have that to the same extent that is envi-
sioned in our recommendations.
The current centers are doing many very important things, and
I do not want to in any way be critical of what they are doing
under the present program — and I think the Task Force wasn't.
But underlying all of this is the fact that, you know, we are
Blessed with the fact that technology is changing very rapidly, and
in thinking about what the program may look like over a five- to
ten-year period we need to have the flexibility within the program
so that it can recapitalize and keep up with the technology.
Mr. BOEHLERT. It seems to me you are making a persuasive case,
or fighting like hell for more bucks for the program, rather than
downsizing the program or restructuring the program.
Is it your interpretation of budget realities that is forcing this
new line of thinking?
Dr. Hayes. Let me be clear about two points.
There is nothing about the partnership concept that is related to
the budget. I think the Task Force would be unanimous that the
partnership aspect, where you have a leading edge site that is
tightly coupled, not loosely coupled but tightly coupled to partner-
ship sites, is the right model for the future.
Mr. BoEHLERT. But is that not what is evolving right now under
the present configuration?
100
Dr. Hayes. There — I would — you will have to ask the center di-
rectors their view on that. I happen to be familiar with one
Metacenter Regional Affiliate. There are a number of aspects that
we would expect to be present in a Partnership Center in terms of
tight coupling and management and planning, which at least do
not appear to be present in that particular Metacenter Regional Af-
filiate.
The coupling is much looser. The planning is looser. And the
ability to — in fact there is no ability for an Allocations Committee
to allocate resources at the Metacenter Regional Affiliate that I am
familiar with.
So if a proposal comes into NSF from New York State, for in-
stance, and says, look, they want to carry out a particular project,
and the Allocations Committee says, well, what you need is a mid-
range kind of machine, and that machine is available at the Ohio
Supercomputing Center, there is no way that they can allocate
time on that particular resource.
So what happens is, either the particular project does not get any
allocation at all because there is uncertainty about whether it is
technically ready, you see, or the decision is made, sometimes pre-
maturely, to allocate resources on a very high-end system that is
costly to iDasically demonstrate that they are technically ready.
It seems to me that leads to inefficiencies in the program.
Mr. BOEHLERT. Congressman Doyle?
Mr. Doyle. Thank you.
I just want to get one thing — ^We have four centers right now.
And, Mr. Boehlert, when you were talking about what is going on,
I think the reality is here, though, that under what is being pro-
posed here, what we are going to end up with is one, maybe two
leading edge centers.
Is anybody disputing that?
Do you see more than two leading edge centers, given the budget
constraints we are under right now?
Dr. Young. I think it is plausible that we will end up with be-
tween one and three.
Mr. Doyle. Three now?
Mr. Brown. Between one and three.
[Laughter.]
Mr. Doyle. Yes. "Between one and three" is two, yes.
[Laughter.]
Mr. Doyle. And the other observation is, I am just sitting here
looking and thinking that a lot of the changes you are talking
about, while they are going to open up, you know, more to mid-
range users, it seems to me it is going to be at the expense of the
high-end users; and the high- end users are oversubscribed as we
speak.
I am just wondering, you know, it gets back to what Mr. Boehlert
was sajdng, too. Maybe we need some people to sit up here and talk
about the need for some additional funding for what has been a
very successful program at these four regional centers.
It just seems that what is going to happen here is we are going
to open up the process for some at the expense of high-end users
that are already over-subscribed and end up with one, maybe two.
101
leading edge facilities here instead of our four regional centers.
That is just what it appears to me to be, anyway.
I would be interested to hear your comments.
Dr. Hayes. I think the point I made in my prepared statement
about the process that NSF has is that I think the Task Force felt
comfortable with the concept of the Partnership Centers, as I said.
But the subsequent steps that took place within the Foundation
were that basically the Director had to take a plan to the National
Science Board, and the Director and the National Science Board in
the context of the whole NSF program, the whole budget, had to
make a judgment on this as to what was a reasonable balance be-
tween high-performance computing and the rest of the NSF pro-
gram.
The Task Force certainly heard loud and clear that many of the
people that are using the existing centers in fact depend upon sup-
port from NSF through disciplinary programs and other programs
for their support.
So if you, again, look at things in that kind of perspective, some-
one has to look at the total NSF portfolio within what is a reason-
able expectation of the budget. I think this Committee looking at
this is very important can make a judgment that we want to keep
the whole organic whole healthy.
So if you have somebody like Jerry Ostriker, who Paul men-
tioned, or another one of the major users basically that loses fund-
ing for their overall program which is not, you know, exclusively
a high- performance computation program, if you visit any one of
these laboratories, including Ostriker's, he has a lot of mid-range
computing capability in his laboratory that is coupled appro-
priately.
Not everyone is so fortunate. I use Jerry Ostriker as an example
because I think it is quite unlikely that in the budget reduction he
loses his support, but I think that that is one of the issues as to
what gives us the best overall plan.
The sense of the Task Force was that you really need to have a
minimum of two Partnership Centers. And if the budget can afford
additional centers or, importantly, if there is additional significant
cost sharing that comes from vendors and from the partners and
the leading edge sites and from the states, that in fact it may be
possible to do more than two. And if the NSF budget would sup-
port, with the recapitalization that I mentioned earlier, more than
the minimum of two that we were strongly pushing for, then within
the concept of the partnership I think there will be quite a comfort
level and an enthusiasm for doing that. But only if it meets the re-
capitalization and the balance aspects that I was talking about.
Mr. BOEHLERT. Thank you very much, and thank both of you
very much.
We will now move to the next panel consisting of Dr. Malvin
Kalos, Director of the Cornell Theory Center; Dr. Larry Smarr, Di-
rector of the National Center for Supercomputing Applications; Dr.
Douglas Pewitt, Acting Director, San Diego Supercomputing Cen-
ter; and Dr. Ralph Roskies, a Scientific Director for the Pittsburgh
Supercomputing Center.
Gentlemen, your statements will appear in the record in their en-
tirety. I would ask in the interests of time you give us as much as
102
we need to know so that we can ask intelligent questions and un-
derstand the issue, but please do not be too lengthy.
I would ask also that we follow the order of introduction with the
privilege of leading off going to a fellow New Yorker, Dr. Kalos, as
soon as you are comfortably seated.
Dr. Kalos. Thank you
Mr. BOEHLERT. And would you pull the mike a little bit closer?
STATEMENT OF DR. MALVIN H. KALOS, DIRECTOR, CORNELL
THEORY CENTER, AND PROFESSOR OF PHYSICS, CORNELL
UNIVERSITY, ITHACA, NEW YORK
Dr. Kalos. Mr. Chairman, it is my privilege to be invited to com-
ment on the NSF's new program of Partnerships for Advanced
Computational Infrastructure.
I am Malvin Kalos, the Director of the Cornell Theory Center,
one of the four current NSF Supercomputer Centers. I am a Profes-
sor of Physics at Cornell University. I have been a computational
scientist for more than 40 years, and at present am the Vice Chair-
man of the Division of Computational Physics of the American
Physical Society.
Our Center was founded with the premise that computation has
a profound effect upon all the sciences and engineering.
From the beginning, our Center took a leadership role in scalably
parallel computing. Now computational science is undergoing a rev-
olution with the advent of practical scalable computing.
The 512-processor SP at Cornell is one of the most powerful com-
puting environments available today. We have seen leading re-
searchers in our community cross new thresholds of capability to
do qualitatively new science.
In the end, a program like this and our Center can only be
judged by the quantity of important scientific knowledge and un-
derstanding which could not have been accomplished without it.
I would like to call attention to a few special examples that illus-
trate the scope and influence of this program.
My first example is one referred to by Dr. Young in the impor-
tance of computation for molecular biology:
An international team of scientists investigated the rapid activity
of acetylcholinesterase which breaks down the neurotransmitter ac-
etylcholine.
Its three-dimensional crystal structure was solved a few years
ago. It seemed inconsistent with the rapid djmamics of the enzyme.
Using computing resources at all four of the NSF Centers, the re-
searchers discovered that there was a back door to the molecule
that explains the rate.
Since inhibitors of acetylcholinesterase are important medica-
tions for diseases such as myasthenia gravis, glaucoma, and Alz-
heimer's disease, this new insight may lead to more effective drugs
to fight these diseases.
A second example is in astrophysics:
Using both the NSF-supported Arecibo radiotelescope for obser-
vation and the Cornell Theory Center's computational facility for
intensive analysis and modeling, Alexander Wolszczan of Perm
State University succeeded in identifying the first planets outside
of our solar system. Wolszczan was recently awarded the pres-
103
tigious Beatrice M. Tinsley Prize of the American Astronomical So-
ciety for this discovery.
John Dawson and his research group at the University of Cahfor-
nia at Los Angeles studies turbulence in complex plasmas. The
power of the Cornell SP permit his group to support simulations
that are an order of magnitude improvement over previous ma-
chines in spatial resolution, which has allowed them for the first
time to simulate successfully the parameters of a large fusion de-
vice.
Here, as elsewhere, computation as an aid to understanding and
as an aid to predesign of costly experiments is an important way
to use limited budgets in an optimum way.
At the Massachusetts Institute of technology, Edmund
Bertschinger conducts Grand Challenge research aimed at under-
standing the formation of galaxies in the universe. The power and
memory of computers such as the SP permit him and the rest of
the Grand Challenge Team to study models large enough to cap-
ture these large-scale dynamical effects. This, too, is work that
spans the NSF Centers Program.
I would like to echo Dr. Hayes' comments about the relationship
with industry.
We believe that we have also made a singularly important con-
tribution to the national scientific effort by the depth and quality
of our partnership with IBM.
Through this partnership, we rekindled IBM's interest in high-
end technical computing and thereby helped to bring their superb
technology to the service of science and engineering.
As is often the case, the technical computing world has led the
way in bringing out new technologies for the broader commercial
arena.
We believe that IBM's success will help assure a future for com-
putational science.
Computation is a unique tool that permits quantitative connec-
tions among different disciplines. Every one of the large problems
that confront our society, and to whose solutions we expect science
to contribute, is in face a multi-disciplinary problem.
Issues of the environment and medicine, to cite only two, involve
many sciences, chemistry, physics, engineering, fluid flow, biology,
materials. Bringing the knowledge from these fields together to
make quantitative predictions — for example, about the effects of
some technological proposal — would be utterly impossible without
the use of high-performance computational modeling.
It is the indispensable natural language of quantitative multi-
disciplinary research.
Computational science is now an essential tool in experimental
science. The most advanced scientific instruments — optical and ra-
dial telescopes, particle accelerators, and computers themselves —
are designed, studied, optimized, and verified with computer sim-
ulation.
Data collection is usually automated and the reduction to com-
prehensible data sets or conceptual models may involve enormous
computations as in the case of Alexander Wolszczan.
104
Now the entire scientific and engineering community of the coun-
try has the opportunity to exploit the new tools of computational
science.
We share NSF's vision that education of tomorrow's young sci-
entists includes bringing them into the process of exploiting the
Nation's most advanced tools, including the computational re-
sources of the NSF centers. They will carry the message to the rest
of our society and to the future.
The supercomputing community now finds itself at a major cross-
roads. To exploit these new machines, the scalably parallel ma-
chines, a major retooling of software and algorithms must take
place.
In response to the new NSF program, but also to assure that re-
cent advances are sustained and extended, and that new challenges
can be met. The Cornell Center is creating a National Alliance for
Advanced Computational Infrastructure. This will comprise a new
network of cooperating organizations around the Nation that will
bring their varied talents and their resources to the Alliance, along
with the world-class supercomputing infrastructure of the Theory
Center and its strong and experienced professional staff.
Our partners are of world-class stature in scalable computing, in
related technologies such as virtual reality and computer graphics,
in all areas of computer science that relate to parallel computing,
in computational mathematics, in forefront applications.
Mr. Chairman, you have asked how the new competition has af-
fected our Center. It has certainly diminished the strong and fruit-
ful collaboration with the others that went by the rubric of
"Metacenter."
It has created a sense of uncertainty about the future in the
minds of some of our professional staff. Naturally the realization
that this is a very complex and serious competition with very high
stakes has generated an intense concentration on winning, distract-
ing us somewhat from our normal mission.
In the long term, I am sure that the intercenter cooperation will
re-establish itself. Nevertheless, we share a commitment to the
idea that a partnership of the kind that we envision is a logical
next step for the Cornell Theory Center. It will transform our pro-
gram into something quite new, something that will serve the best
interests of the community devoted to computational science, and
the broader community of research in basic science.
[The prepared statement of Dr. Kalos follows:]
Testimony of Malvin H. Kalos on the NSF Partnership for Advanced
Computational Infrastructure
before the
Subcommittee on Basic Research of the Committee on Science
March 19, 1996
Mr. Chairman, it is a privilege to be invited to comment on the NSFs new pro-
gram of Partnerships for Advanced Computational Infrastructure.
I am Malvin Kalos, the Director of the Cornell Theory Center, one of the four cur-
rent NSF supercomputer centers, and a Professor of Physics at Cornell University.
I have been a computational scientist for more than forty years, and at present am
105
the Vice Chairman of the Division of Computational Physics of the American Phys-
ical Society.
Our Center was founded with the premise that computation can have a profound
impact upon all the sciences and engineering. From the beginning, the Center took
a leadership role in scalably parallel computing as the ultimate source of the most
processing power and memory. We understood that orders of magnitude increases
in computational power could only come from harnessing large numbers of commod-
ity microprocessors, rather than smaller numbers of specially fabricated computer
chips. Now computational science is undergoing a revolution with the advent of
practical scalable computing. Computing resources have increased by three orders
of magnitude in speecf and memory during the past decade. The 512-processor SP
at Cornell is one of the most powerful computing environments available today. We
have seen leading researchers in our community cross new thresholds of capability
to do qualitatively new science — that is, to attack new problems which could not be
considered before and old problems in ways that provide new insights.
In the end, a program like this and our Center can only be judged by the quantity
of important scientific knowledge and understanding which could not have been ac-
complished without it and by the impact of this new knowledge upon our society.
An international team of scientists collaborated to investigate the rapid activity
of the enzyme acetylchoinesterase. The enzyme breaks down the neurotransmitter
acetylcholine diffused across nerve cell synaptic gaps. Its three-dimensional crystal
structure was solved several years ago, and showed the active site of the enzyme
to be a long, narrow channel — too narrow to deal rapidly with the job of breaking
down acetylcholine. Yet it is known that acetylcholinesterase acts very rapidly.
Using computing resources at all four of the national centers, the researchers dis-
covered that there was a "back door" to the active site of the enzyme that explained
its ability to work so quickly. Since inhibitors of acetylcholinesterase are important
medications for diseases such as myasthenia gravis, glaucoma, and Alzheimer's dis-
ease, this new insight may lead to more effective drugs to fight these diseases.
Using both the Arecibo radiotelescope for observation and the Cornell Theory Cen-
ter's computational facility for intensive analysis and modeling, Alexander
Wolszczan of Penn State University succeeded in identifying the first planets out-
side of our solar system. Wolszczan was recently awarded the prestigious Beatrice
M. Tinsley Prize of the American Astronomical Society for his discovery.
John Dawson and his research group at the University of California at Los Ange-
les use particle-in-cell methods to study turbulence in complex plasmas. The mem-
ory and speed of the Cornell SP permit his group to perform simulations that are
an order of magnitude improvement over previous machines in spatial resolution,
and so allow them to see interesting and important new effects. They have been
able, for the first time, to simulate successfully the parameters of major Tokomaks,
such as the TFTR at Princeton, which is the largest fusion device that exists in the
United States.
A number of cardiovascular and psychiatric diseases are currently treated with
drugs that act on the neurotransmitter serotonin and its receptors. The cellular re-
ceptor for serotonin is a gatekeeper molecule that recognizes and binds the serotonin
and then transmits the signal to the cell by binding to a special class of trans-
ducers — the G-proteins. Researchers at Mount Sinai Medical Center made a break-
through in modeling the serotonin receptor using CTC resources. The breakthrough
came from modeling the structural changes that occur in the serotonin receptor
when it binds to a ligand and to the G-protein, causing it to carry out its function.
This research shows how G-proteins can be switched on by structural changes in
specific regions of the receptor molecule and is expected to lead to the development
of more effective drugs for a vast range of diseases.
Edmund Bertschinger, Massachusetts Institute of Technology, conducts Grand
Challenge research aimed at the understanding of the formation of galaxies in the
universe. The memory and power of computers such as the SP permit him and oth-
ers to study model systems large enough to capture these large-scale dynamical ef-
fects.
George Karniadakis, Brown University, is studying turbulent flow through direct
numerical simulation in order to resolve flow fields at all scales from first principles.
A fundamental goal is to provide insight into a number of open questions in fluid
dynamics, including large-scale instabilities of turbulent wakes and associated drag
and lift variation in flows past bluff bodies; symmetry-breaking bifurcations and
low-dimensionality in flow-structure interactions; and shear stress control and vor-
ticity transport in wall-bounded flows using both passive and active means of con-
trol such as surface modifications and electromagnetic techniques.
Toichiro Kinoshita, Cornell University, studies the anomalous magnetic moment
of light elementary electrons, whose interactions are primarily electromagnetic and
106
which therefore provide very stringent tests of quantum electrodynamics. A power
series expansion of the magnetic moment gives successive terms that are integrals
over more and more dimensions. They are done by a very sophisticated Monte Carlo
but require much more resources at higher orders. Moving to the SP has permitted
Kinoshita to go to third and fourth order.
We believe that we have also made a singularly important contribution to the na-
tional scientific effort by the depth and quality of our partnership with IBM.
Through this partnership, we rekindled IBM's interest in high-end technical com-
puting and thereby helped to bring their superb technology to the service of science
and engineering. As is often the case, the technical computing world has led the way
in bringing about new technologies for the broader commercial arena; we believe
that IBM's success will help assure a future for computational science.
The essence of pure science is to connect our scientific knowledge in a seamless
web of quantitative understanding. This has become harder as we try to probe into
more and more complex phenomena that cannot be analyzed by the mathematical
tools at our disposal. Computational modeling is essential to fill this need.
Many areas of science involve this kind of systematic connection among different
phenomena at different scales of length or energy. All aspects of chemistry, biology
and medicine, the physics of materials, astrophysics and many others require this
approach.
Computational science is now also an essential tool in experimental science. The
most advanced scientific instruments, optical and radio telescopes, particle accelera-
tors, and computers themselves are studied, designed, optimized, and verified with
computer simulation. Data collection is usually automated, and the reduction to
comprehensible data sets may involve enormous computations. Exchange of large
data sets will require very heavy use of future high capacity data networks.
Now the entire scientific and engineering community of the country has the op-
portunity to exploit the new tools of computational science. Students and young sci-
entists, who are always the very heart of any important scientific change, are in-
volved. We share NSFs vision that education of tomorrow's young scientists includes
bringing them into the process of exploiting the nation's most advanced tools, in-
cluding the computational resources of the National Centers. They will carry the
message to the rest of our society and to the future.
Another vital role of computational science is that of permitting quantitative con-
nections among different disciplines, that is, in supporting multidisciplinary re-
search. Every one of the large problems that confront our society, and to whose solu-
tions we expect science to contribute, is in some sense a multidisciplinary problem.
Issues of the environment and medicine, to cite only two, involve many sciences —
chemistry, physics, engineering, fluid flow, biology, and materials.
Bringing the knowledge from these fields together to make quantitative pre-
dictions about the effects of some technological or regulatory proposal would be ut-
terly impossible without the use of high-performance computational modeling, the
indispensable natural language of quantitative multidisciplinary research.
The supercomputing community now finds itself at a major crossroads. To exploit
these new machines, a major re-tooling of software and algorithms will have to take
place.
Science and its application to societal problems involve the national community.
Bringing to bear the transformation made possible by computational science in the
most complete and positive way requires that its techniques and strategies be
learned, used, and shared by the widest possible group of researchers and educators.
All of these are necessary, and the appropriate level and balance among them is es-
sential.
In response to the new NSF program, but also to assure that these advances are
sustained and extended, the Cornell Theory Center is increasing its collaborations
with computer science and computational mathematics to meet the challenge of this
re-tooling. We are therefore creating a National Alliance for Advanced Computa-
tional Infrastructure. This will comprise a new network of collaborating organiza-
tions that will bring their varied talents and resources to the Alliance along with
the world-class supercomputing infrastructure of the Theory Center and its strong
and experienced professional staff. Our partners are of world-class stature in scal-
able computing; in related technologies, such as virtual reality and computer graph-
ics; in all areas of computer science that relate to parallel computing; in computa-
tional mathematics; in forefront applications.
You have asked how the new competition has affected our Center. It has certainly
diminished the strong sense of collaboration with the others that went by the rubric
of "MetaCenter." It has created a sense of uncertainty about the future in the minds
of some of our professional staff. Naturally the realization that this is a very serious
107
competition with very high stakes has generated an intense concentration on win-
ning, distracting us from our normal mission.
Nevertheless, we share a commitment to the idea that a Partnership of the kind
that we envision is a logical next step for the Cornell Theory Center, one that will
serve the best interests of the community devoted to computational science and the
broader community of research in basic science.
Mr. BOEHLERT. Thank you, very much.
Dr. Smarr?
STATEMENT OF DR. LARRY L. SMARR, DIRECTOR, NATIONAL
CENTER FOR SUPERCOMPUTING APPLICATIONS, AND PRO-
FESSOR OF PHYSICS AND ASTRONOMY, UNIVERSITY OF ILLI-
NOIS AT URBANA-CHAMPAIGN, ILLINOIS
Dr. Smarr. Thank you very much.
I am Larry Smarr. I am the Director of the National Center for
Supercomputing AppHcations at the University of Ilhnois at Ur-
bana-Champaign. I am also Professor of Physics and Astronomy.
I would like to make a few points that have been brought out by
the questions today. I will leave my written testimony to the
record.
I think one of the interesting things as I look back over the last
ten years, since I was very involved in getting this program started
in the first place — I was one of those folks who was mentioned ear-
lier that had to go to foreign shores to get access to American-built
supercomputers — is that in those days really it was just the na-
tional laboratories that had these capabilities.
There was a lot of uncertainty among folks like you in similar
Congressional hearings about whether universities would in fact be
able to rise to the occasion, and whether the NSF itself would be
able to handle something that traditionally had been dealt with my
the DOE or by the military.
I think that the fact that, say at our center alone which is en-
tirely run by the University of Illinois, the University staff, grad-
uate students, post-docs, faculty, and the fact that we have had
over 9000 users during that last ten years fi-om 500 different insti-
tutions, the fact that we have corporate partners from all sectors
of the economy speaks to the fact that the National Science Foun-
dation was in fact able to rise to the occasion.
The entire national network, the NSF Net that we take for
granted today and that is all commercialized even, arose completely
from the backbone that was used to tie those centers together at
56 kilobits. That speed is less than what you get into your home
today.
Remember the — I mean, ten years ago we were talking about
this national backbone being, today, of lower speed than ISD into
the home. And in fact our first supercomputer had the speed that
is about equal of what Nintendo is going to bring out next month
in their new Ultra-64. The 8-year-olds are going to have
supercomputers in their basement playing videogames.
That is what this dramatic technology does. There is no other
technology like it. At our center alone we have been able to sustain,
at essentially the same amount of dollars per year spent on hard-
ware, a 75 percent compounded annual growth rate in computer ca-
pacity.
108
We have gone through three complete generations, architectures,
from the original shared Vector machines to the massively parallel,
to the new microprocessor based systems that are going to combine
shared memory and ease of programming with the scalability of
massively parallel machines.
One thing I would like to read into the record is the appendix
from the Hayes Report — and I think all four directors want that to
happen — that give the major accomplishments of the entire NSF
Centers Program both in technology development, outreach, and in
the scientific research enablement of remote users throughout the
university community. There is a lot of work that went into prepa-
ration of that document that I think adequately will answer some
of the questions about the successes of the program.
I would like to move next to this notion that it is all about
supercomputing. That certainly was the case ten years ago. It is
not the case today.
The centers have evolved and matured and their role within the
context of the national program has changed dramatically.
For instance, the NSF center dollars, the Cooperative Agreement
we are talking about, the $65 million available each year from the
Division of Advanced Scientific Computing, at our Center is
matched dollar for dollar by other sources that grew up as a result
of having that money there. That came from the State. It comes
from corporate partners. It comes from vendors. It comes from
other agencies.
So that there has grown up around the core program a whole set
of partners. So the partnering idea that we are seeing in the solici-
tation is nothing new. It has organically grown out of the success
and maturation of the original center concept.
The network is now accelerating the need for that.
Furthermore, from the very beginning it was clear that visualiza-
tion and now Virtual Reality techniques are needed to be able to
understand the massive amount of numbers that are generated
from the supercomputers.
So Virtual Reality development itself has become a major aspect
of ours and other centers, as well as software development.
Many of you are I am sure aware that one software product,
NCSA MOSIAC, that came from our Center has completely trans-
formed the world in a matter of just a couple of years.
Before this, the Worldwide Web was something I do not think
any of us had ever heard about. With the ease of use of this one
software product, millions of users worldwide in probably just 18
months started using this to lay the basis for one of what Wall
Street considers the most successful technology transfer program in
probably the history of federal funding, which is the development
of companies like NetScape's SPYGLAS, the licensing of over 40
companies of this product including MicroSoft, and now this battle
between NetScape and MicroSoft.
All of that came from one set of mainly students working at our
Center.
So the ability to keep America on the cutting edge of this new
world of networks, and software, and the Web, the Centers have
played a decisive role in all of that.
109
The same goes — and I will not go into detail — for the K through
12, the outreach to communities, the work with industrial partners,
and indeed the Federal Government. There are 16 Federal agencies
that are going together to learn how to help in the streamlining of
the Federal Government system using these techniques at NCSA.
So what is wrong with the current picture?
That is what I am hearing.
I think the point is that, with networking you are having a phase
change in the way this country is put together, whether you are in
business, or whether you are in the government, or whether you
are in health care, or whether you are in education, or whether you
happen to be in high-performance computing.
That is, that distance is rapidly disappearing.
So the notion that a Center being in one geographical spot is be-
coming an anachronism. In fact, what is going to happen is in the
way you see all the modem management books for industry emerg-
ing is that of Virtual Teams that come together, go apart, are sup-
ported by the Network technologies, and the fact that we have now
these Lego-like units of microprocessors that can go from the
desktop machine to mid-level machines to the very highest leading
edge machines, that are distributed across and almost like a sea
of processors across this network.
That is the world. It has nothing to do with this solicitation. The
world has changed, and largely because of the success of the first
ten years of the program.
The way I interpret the NSF's solicitation is that they are now
saying. Given this new world, given that we have a fixed amount
of dollars each year in the Division of Advanced Scientific Comput-
ing, how can we create Virtual Centers that take advantage of this
new world and all of the ideas, all of the smart people that are not
in Champaign-Urbana?
How do they become part of the Center?
Right now the fundamental problem is. If I wanted to fund an
effort that was at some other university directly, rather than spend
it in Champaign- Urbana, I really cannot do that within the cur-
rent structure of the program.
The new program would envision that a portion of the coopera-
tive agreement that would come to me as PI at my center would
in fact be spent out in the field at these other partnering institu-
tions and not at the center.
I think it is only when you can actually get resources, dollar re-
sources, out of the partnering centers that the kind of tight cou-
pling that Ed Hayes referred to actually comes into being, because
once you are doing that, you also have the people at those other
centers that are involved in the management of your own central
center.
I think that that shared management of shared brains around
the table from a number of the very best places in the country is
a very different vision than the current one in which there is sim-
ply one cooperative agreement that has been in one place, and then
there are other programs.
So I think that is one of the key advantages.
The other is that there is this tendency to say, well, we are going
from four centers to two centers. I do not see that at all.
110
Technically, yes, to get within the same budget money to put out
in the Partnership Centers you may well have to reduce the num-
ber of leading edge centers, but you are gaining other centers in
the process — namely, the partnering centers.
They are becoming an integral part in a virtual teaming ap-
proach of this new PACI leading edge center. So we could end up
with Centers of Excellence in Virtual Reality, and Mid-Level Per-
formance, and Reaching Out in K Through 12, that are actually
more than we currently have.
The expense to do that within a fixed budget is to reduce the
number of leading edge centers.
All I can say is that the solicitation has generated more new
ideas, more good feeling among the community at large, than any-
thing I have seen in ten years. I think that the kinds of concerns
you have that we do not break the well-functioning current system
are well placed, and v/e have to be very careful that that is taken
care of during the solicitation.
The best protection for that is a strong merit review process. I
think and hope that the committee today will reinforce the appro-
priateness that merit review through the NSF system is the best
way to make sure that there is a smooth transition and that none
of the services to the national community are less, but in fact
greatly increased.
[The prepared statement of Dr. Smarr follows:]
Ill
Congressional Testimony
Prepared for
Chairman Steven H. Schiff
Subcommittee on Basic Research
Committee on Science
U.S. House of Representatives
On the National Science Foundation's
Partnership eor Advanced Computational Science
Solicitation
NCSA
March 19, 1996
Dr. Larry L. Smarr
Director
National Center for
Supercomputing Applications
University of Illinois at Urbana-Champaign
605 East Springfield Ave.
Champaign, Illinois
112
L'ni\'ersit\' of Illinois
at Lrbana -Champaign
National Center for
Supercomputing Applications
Larry L, Smarr
Director
5365 Beckman Institute
for Advanced Science
and Technology
Drawer 25
405 North Mathews Avenue
Urbana, IL 61801
FAX
217 244-8195
Inteniet
lsmarr(Q)ncsa. uiuc.edu
217 244-0078
March 19, 1996
The Honorable Steven H. Schiff
Chairman, Basic Research Subcommittee
U.S. House of Representatives
Committee on Science
Suite 2320 Raybum House Office Building
Washington, DC 20515-6301
Mr. Chairman:
Mr. Chairman, it is an honor to be here this afternoon. I am going to talk with you
today about the National Center for Supercomputing Applications at the University of
Illinois at Urbana-Champaign.
Specifically, I would like to talk with you about how NCSA is participating in the
National Science Foundation's Partnerships for Advanced Computational Infrastructure
solicitation.
In Section 1 of my written testimony I have provided you with an overview of NCSA's
programs. This section contains information on NCSA's history, funding, research
tools, technology and knowledge transfer, our Fortune 500 industrial partners, and the
Federal Consortium.
Section 2 provides a response to two of the questions you mentioned in your letter of
invitation.
Attachment 1 highlights major scientific accomplishments of the NSF Supercomputer
Centers Program.
Again, thank you Chairman Schiff for the opportunity to participate in today's hearing.
Eh-. LafTyL- Smarr
113
Table of Contents
Section 1 — Overview of the National Center for Supercomputing Applications 2
Background 2
Funding Sources 2
New Tools for Computational Science and Engineering 3
Technology Transfer 3
Knowledge Transfer 3
Fortune 500 Industrial Partners 3
NSF/NCSA WWW Federal Consortium 4
Section 2— Impact of the NSF Solicitation on NCSA
Attachment 1 — NSF Centers' Accomplishments
Dr. Lairy L Smarr. NCSA
Congressional Testimony
114
Section 1 — Overview of the National Center for Supercomputing Applications
Background
Under the leadership and guidance of the National Science Foundation the National Center for Supercomputing
Applications (NCSA) at the University of Illinois at Urbana-Champaign has evolved into a scientific research center
built around a national services facility. All aspects of NCSA are carried out within a major research university. At the
founding of the NSF Centers program 10 years ago, there were those who questioned whether a university could
successfully manage such a critical, national-scale program. History has shown that the integration of the program into
the university environment has resulted in much of the uniqueness and synergy leading to rapid advances in scientific
knowledge, application software development and technology transfer.
In its leadership role, NCSA is developing and implementing a national strategy to create, use, and transfer advanced
computing and communication tools and information technologies. These advances serve the center's diverse set of
constituencies in the areas of science, engineering, education, and industry.
When the center opened to the research community in 1986, researchers using the supercomputers were from the
traditional disciplines of physics, chemistry, materials, and astrophysics. But researchers in other disciplines quickly
realized they could also transform their fields by using supercomputing. Today's growth areas include biology, medicine,
environment, computer science, commercial and fmancial databases, and networked World Wide Web multimedia asset
management
Academic and industrial users from across the nation have utilized NCSA's leading edge supercomputers to enable
advances in their research. As a result of rapid changes in computing technology, NCSA has been able to continually
increase supercomputer capacity and capability while keeping dollar expenditures relatively constant. Specifically,
NCSA capacity of supercomputing time made available to national users has increased at 75 percent per year, resulting
in a 1,000 fold increase in capacity between the center's opening in 1986 and the end of the current cooperative
agreement in FY97. To utilize this rapid growth in computing power, NCSA has worked with users to develop and
migrate application codes through three distinct phases of supercomputer architectures: shared memory vector
processors; massively parallel processors; and scalable shared memory RISC microprocessors. During this period, nearly
9,000 users from more than 500 academic institutions and corporate laboratories in all 50 states have made use of
NCSA's computational resources. In addition, hundreds of university level courses in computational science and
engineering have used NCSA supercomputers. See Attachment 1 (Major Accomphshments of the NSF Supercomputer
Centers Program), detailing scientific highhghis and user accomplishments. The attachment is being jointly submitted by
all four NSF centers.
NCSA is dedicated to improving the competitive position of American industry by partnering with leading corporations
in a variety of sectors in the US economy to partner with NCSA. These Fortune 500 companies have selected NCSA lo
learn about all aspects of cutting-edge hardware and software, virtual prototyping, visualization, networking, and
databases.
NCSA works closely with computer vendors and national leaders of academic computer science to bring users the most
advanced methods in high-performance scalable computing. The center currently maintains these supercomputers:
Hewlett-Packard/CONVEX Exemplar, Silicon Graphics POWERCHALLENGEarray, and Thinking Machines CM-5.
NCSA has allowed its vendor partners to gain access to national leaders in science and engineering and, as a result, the
vendors have produced machines that are more useful to the American research community.
Funding Sources
NCSA relies on its cooperative agreement from NSF as the foundation of the center's efforts. However, NCSA has
aggressively leveraged the core NSF funding providing a dollar to dollar match from other sources. This has resulted in
an additional $132.8 million over 10 years being added to NCSA's program beyond the NSF core funding. Sources of
these leveraged funds are other federal agencies, corporate partners, the University of Illinois, and the State of Illinois.
This additional funding has been critical in expanding NCSA's offerings beyond a central supercomputing facility and
for developing world recognized staff in software development, scientific applications, and virtual environments.
Dr. Lany L. Smart, NCSA
Congressional Teslimony
115
New Tools for Computational Science and Engineering
To support the large number of remote users NCSA has developed a broadly based scientific research center of
application researchers housed in the Beckman Institute at the University of Illinois at Urbana-Champaign. These staff
members act as a bridge to their disciplines and have been a constant source of innovation in the use of new technology
10 empower new applications. NCSA has application scientists in the fields of astrophysics, biomedical science,
chemistry, chemical engineering, computational biology, computational fluid dynamics, condensed matter physics, earth
sciences, gravitational sciences, humanities/fine arts, information systems, mathematics, computer science, radio
astronomical imaging, and structural engineering. For example, one of NCSA's application scientists is PI on the joint
agency (NSF, ARPA, NASA) Digital Library Initiative at the University of Illinois at Urbana-Champaign.
NCSA's Virtual Environments Laboratory is one of the world's most advanced virtual reality (VR) research laboratories
available to academic and industrial researchers. The lab consists primarily of three projection-based modes of VR — the
CAVE™ and the ImmersaDesk™ (both developed by the Electronic Visuahzation Laboratory [EVL] at the University of
Illinois at Chicago) and the NlIAVall (developed by EVL in collaboration with NCSA and the Laboratory for
Computational Science and Engineering at the University of Minnesota).
Technology Transfer
In addition to developing new software tools and making them freely available to the academic community, NCSA has
had a significant impact on commercial software development and products. NCSA is proud of its proven record of
catalyzing new products and companies. NCSA software developers who joined InterCon Systems Corp. in 1987 started
a long-term product line based on NCSA Telnet™ . Similarly, former NCSA developers founded Spyglass Inc. in 1990
with software based on NCSA Image™. The 1992 release of the World Wide Web network browser NCSA Mosaic
created millions of Web users and both laid the technology groundwork and created a market for subsequent
commercialization. This commercialization was accomplished via two routes. First, NCSA software developers joined
companies such as Microsoft, Spry, Spyglass, and Netscape. Second, UlUC/NCSA intellectual property is licensed
through Spyglass to 40 companies, and incorporated into more than 100 products. The economic results are represented
by the example of just two of these companies. Spyglass and Netscape, which in the past twelve months have attained a
market valuation as high as $5 billion.
Knowledge Transfer
NCSA understands that transferring necessary technical skills and information is vital for prolonged growth and success
of any research activity. This transfer is accomplished by providing computational science, networking technologies, and
information technology training to fellow researchers, students, other educators, lifelong learners, government and
community organizations, and members of the business community.
NCSA addresses science and curriculum issues through programs designed not only to give a technical jump-start to a
school but also to provide continuing support to educators and their administrators. Current major programs include: the
Education Affiliates (EA) program, the Resource for Science Education (RSE) program, the Networking Infrastructure
for Education (NIE) project, EduLinK-12 (IlUnois Education Link), Tech Corps Illinois, and Illinois Learning Mosaic
(ILM, a World Wide Web-based educational information resource).
NCSA's community networking and technology transfer program works with various communities to deploy computing
and communications technologies and applications at the local, state, regional, and national levels. The center's CCNet
(Champaign County Network) collaboration with its host community has received national recognition as an innovative
application of the National Information Infrastructure and as a model for other communities. Chicago Mosaic, a
collaborative project with the City of Chicago, is investigating the use of World Wide Web technologies lo communicate
among city government, the citizens of that community, and the rest of the world.
Dr. Lany L. Sman, NCSA
Congressional Testimony
116
Fortune 500 Industrial Partners
Current NCSA industrial partners include Allstate, AT&T, AMR, Caterpillar, Dow Chemical, Eastman Kodak Co., Eli
Lilly & Co., FMC, J. P. Morgan, Motorola Inc., Phillips Petroleum Co., Schlumberger, Sears, Roebuck and Co., Tribune
Co., and United Technologies. Since 1986 sixteen partners have added 540,672,913 to NCSA's programs in order to
learn about the evolving high performance computing technologies, to apply them to solving breakthrough industrial
applications, and to support technology research at NCSA. Their investment in this program has enabled each company
to enhance technical and management skills and to explore the leading edge technologies which are needed for them to
help stimulate this country's economic viability. As a result of their successes in this program, NCSA and the University
of Illinois at Urbana-Champaign have gained a reputation for excellence among our nation's industrial leadership.
NSF/NCSA WWW Federal Consortium
A group of 16 US Federal agencies, assembled through the efforts of the National Science Foundation, came to NCSA to
learn how to apply new technologies in their efforts to stfeamline internal, cross agency, and multi-level government
processes and communication. Member agencies include the following: Nuclear Regulatory Commission; National
Library of Medicine; Cenual Intelligence Agency; Intelligence Services (COSPO); National Science Foundation;
National Institute of Standards and Technology; Department of Education; Defense Technical Information Center;
National Oceanic and Atmospheric AdminisU'ation; National Security Agency; National Cancer Institute; National
Institutes of Health; National Aeronautics and Space Administration; US Geological Survey; National Biological Service
and Bureau of the Census.
Dr. Larry U Smarr. NCSA
Congressional Testimony
117
Section 2— Impact of the NSF Solicitation on NCSA
Q: "What impact is the new solicilation having on the current centers?"
A: NCSA has partnered with a number of groups during the past 10 years including US industry and other R&D
facilities. In response to the solicitation NCSA is expanding the scope of its partnerships and subsequent projects.
Partnering with other institutions is a key component of the NSF solicitation. Therefore, NCSA is aggressively pursuing
exciting new working relationships with computational scientists, computer scientists, academic institutions, R&D
laboratories and regional centers to prototype the US computational and information infrastructure of the 21st Century.
Also, NCSA is developing new and innovative ways to manage a larger number of partners and highly complex projects.
The end result will be a stronger, national-scale laboratory for the creation and management of "virtual teams," which are
becoming essential to the emerging knowledge worker communities in American industry, government, universities,
health care, and education.
The solicitation has not led to a negative impact on the current daily operations at NCSA. The PACI mission has
heightened vendor interest in increasing their commitment and partnerships with NCSA. In addition, the solicitation has
led to a closer working relationship with NCSA's end users.
In conclusion, the solicitation has challenged NCSA to raise its sights to see what the center can further accomplish to
accelerate national objectives.
Q: "How are the current centers working to participate in the new solicitation proposals?"
A: As requested by NSF, NCSA's management is writing a preproposal based on the PACI mission statement and NSF
guidelines. NCSA is competing to become one of NSF's leading edge sites in the PACI program.
NCSA has taken the following steps to seek out partnerships:
• Discussed NCSA's new vision with potential parmers (including DOE laboratories, universities, consortia, research
groups, other NSF research centers, and state and regional centers)
• Solicited innovative ideas from potential partners
• Matched NCSA's vision with potential partners and their ideas
• Aligned all goals and objectives with PACI mission statement
NCSA has found the challenge of the PACI mission exciting. The center has also witnessed many innovative ideas
emerge from the broader community stimulated by the solicitation. As long as this process remains under the firm
control of NSF's merit review system, NCSA believes the result will be a stronger, more broadly based national
program.
In conclusion, NCSA believes that the PACI program is critical to the future of America's research system. The PACI
program will allow for critical early experience with the advanced computational infrastrucnire that will support 21st
Century scientific and engineering research.
Dr. Lany L Smair, NCSA
Congressional Testimony
118
Attachment 1
Major Accomplishments of the NSF Supercomputer Centers Program
From the Report of the Task Force on the
Future of the NSF Supercomputer Centers Program
September 15, 1995
High Performance Computing Infrastructure and Accomplishments
Introduction
Important Technology Accomplishments
HIGH PERFORMANCE COMPUTING
Supercomputer Usage ai NSF Centers
Architectures and Vendors
Center Program Chronology
National Access to Vector Multiprocessors
Achieving Production Parallelism
Early Migration to the UNIX Operating System
Early Access to Massively Parallel Computers
Superlinear speedup on heterogeneous processors
Workstation Clusters
PORTABLE PARALLEL PROGRAMMING TOOLS
Prototype Parallel Programming Enviroiunents
Extensions of PVM
Scalable Libraries
STORAGE TECHNOLOGIES
AFS-Establishing a National File System
HDF-Creating a Standard File Format
Migrating to a Standard Archiving Software
Development of high-density magnetic media
NETWORKING
Evolution of NSFNET
High Performance LANs
Gigabit Testbeds
Secure Networks
New Science Enabled by Networks— Telemicroscopy
Nil Testbeds
VISUAU7ATION AND VIRTUAL REAUTY
Development of Scientific Visualization
Virtual Reahty Impacts Industrial Design
Development of Inunersive Science Projects
Virtual Reality over ATM networks
Alpha Shapes, Biomolecules, and Cosmology
DIGITAL LIBRARIES AND INFOSERVERS
Digital Libraries
Scalable Information Servers
The Rise of the MosaicAVWW Information Infrastructure
DESKTOP SOFTWARE
Connectivity Tools
Collaboration Tools
2
2
2
3
4
4
4
5
5
6
6
7
7
7
7
7
8
9
9
10
10
11
11
12
12
13
14
14
16
16
16
17
17
17
18
18
18
19
19
119
Graphics Tools 19
Scientist's Workbench 20
Accomplishments in Education and Outreach 2 1
EDUCATION 21
Researchers and S tudents 2 1
Supercomputer Centers Educational Activity Support Sununary 21
Outreach to Educators 23
OUTREACH 24
Apphcation of Scientific Computation and VisualiTation to Industrial Production 24
Impact on Vendors of High Performance Computing Equipment 25
Stimulation of New, Computationally Dependent Ventures 26
Development of Nationally Valuable Reservoirs of Skill 27
Community Service 27
Important Science and Engineering Accomplishments 2 8
Summaries of computationally interesting problems in the NSF Centers Program by the
national science and engineering communities: 2 8
QUANTUM PHYSICS AND MATERIALS 28
Phase Transition in QCD 28
Phase Transitions of Solid Hydrogen 28
Prediction of new Nanomalerials 29
Theory of High Temperature Superconductors 29
Magnetic Materials 30
Understanding Glass 30
BIOLOGY AND MEDICINE 30
Crystallography 31
Folding Proteins using Artificial Intelligence 31
Protein Kinase solution 31
Molecular Neuroscience-Serotonin 32
Molecular Neuroscience-Acetylchohnesterase 32
Kinking DNA 33
Antibody-Antigen Docking 33
Tuning Biomolecules to Fight Asthma 34
Virtual Spider and Artificial Silk 34
Heart Modeling 34
ENGINEERING J5
Ultra-high-strengih Steels 35
Continuous Casting of Steel 35
Beverage can design 35
Designing a Leakproof Diaper 36
Bone Transplant Bioengineering 36
Improving Performance with Ribleis 36
Designing Better Aircraft 37
Crash Testing Street Signs 37
EARTH SCIENCES AND THE ENVIRONMENT 37
Detoxification of Ground Water 38
Sage Grouse-Endangered Species and the US Army 38
Storm modeling/forecastmg 38
Los Angeles Smog 39
Upper Ocean Mixing 39
Simulating Climate using Distributed Supercomputers 40
120
PLANETARY SCIENCES. ASTRONOMY, AND COSMOLOGY 40
Comet Collision with Jupiter 40
Discovery of First Extrasolar System Planet 41
Building the BIM A Radio Telescope 4 1
Pulsar Searching and Discovery 42
Accretion Disks Around Black Holes 42
Black Hole Colhsion Dynamics 42
Largest cosmological simulation 43
EVOLUTION OF THE METACENTER CONCEPT 43
Recognition Accorded NSF supercomputer Users and Projects 44
Acronyms 46
121
High Performance Computing Infrastructure and Accomplishments
Introduction
The NSF Centers Program was established to provide access to high performance
computers (supercomputers) for the broad Science and Engineering Research Com-
munity. The program has evolved from one comprising independent, competitive,
and similar computer centers to one including more cooperative and diverse activi-
ties. Coordinating the mission of the individual centers has increased the diversity
of computer architectures available to the research community, and has accelerated
outreach to segments of the community which had not before been able to use the
power of high performance computers. At the same time, competition between cen-
ters has been managed by NSF and its advisory committees to the advantage of the
engineering and science communities which the program was established to serve.
Building on each center's tradition of providing a stable source of computer cycles
for a large community of scientists and engineers, the centers have evolved into a
unique resource to test which diverse computer architectures best match the most
demanding problems posed by the community of university researchers and to de-
velop the necessary supporting software and algorithms. For example, this approach
has enabled the centers to test which applications can be efficiently served on newly
developed systems using clusters of the new generation of workstations that are
now being introduced. Such experiments are enabled by the open environment
characteristic of the program.
During the first decade of the centers program, major improvements in the delivery
of high performance computing have been developed, mairUy by American com-
puter researchers and companies. But advances in computing technology have been
matched in equal measure by improvements in computer networking, and as a con-
sequence the NSF Centers have been a primary focus for accelerating the evolution
of the Internet via NSFnet, NREN, and the still evolving broad-band width tech-
nology.
In this appendix, we provide detailed examples of the centers activities.
-1
122
High Performance Computing Infrastructure and Accomplishments
Important Technology Accomplishments
High performance computing
Originally set up in 1985 to provide national access to traditional supercomputers,
the NSF Centers have evolved to a much larger mission. The Centers now offer a
wide variety of high performance architectures from a large array of American ven-
dors. No longer just adopting technology from the national labs, the NSF Centers
Program has become a pioneering vanguard of technology - a model for other agen-
cies with a vested interest in the high performance computing to emulate. This to-
day is dominated by research efforts in software, with vital collaborations with com-
puter scientists, focusing on operating systems, compilers, network control, mathe-
matical libraries, and programming languages and environments. The feedback to
the leading US vendors is increasing the usefulness of their product offerings to the
scientific and engineering communities, while making them more competitive.
Supercomputer Usage at NSF Centers
Fiscal Year
Active Users
Usage in CPU
Hours
1986
1336
29,485
1987
3,299
95,751
1988
5,042
121,615
1989
5,967
165,960
1990
7,357
250,628
1991
7,723
361,073
1992
8,252
398,931
1993
7,735
910,088
(Usage is in normalized CPU hours, based on comparative performcmce tests. The astounding leap in ca-
pacity in 1993 is mainly a result of the introduction of new computing architectures to solve the most
demanding of computational problems — the Grand Challenges. The slight decrease in the number of
users is the result of a concerted effort by the Centers to assist many of their users with small memory or
CPU-time requirements to meet their computational needs by the increasingly powerful workstations of
the mid-90's. The greatest benefactors of the increase in massively parallel cycles are the scientists and
engineers addressing the problems with the greatest computing demands)
Architectures and Vendors
The national community has been offered access to a wide and frequently updated set
of high performance architectures since the beginning of the NSF Supercomputer
Centers Program. The current rate of change of the types of architectures, and the
number of vendors offering them, is probably near an all time high. We are in a pe-
riod of ferment which the science and engineering communities sort out the choices
for finding an architecture that matches their various computational problems. A list
of architectures that the NSF Centers Program has offered would include: single and
clustered high performance workstations or workstation multiprocessors, minicom-
puters, graphics supercomputers, mainframes with or without attached processors or
vector units, vector supercomputers, and SEvID and MIMD massively parallel proces-
•2-
123
High Performance Computing Infrastructure and Accomplishments
sors. Similarly, the list of current vendors whose top machines have been made
available would include IBM, DEC, Hewlett-Packard, Silicon Graphics, Sun Microsys-
tems, Cray Research, Convex Computer, Intel Supercomputer, Kendall Square,
Thinking Machines, nCUBE, Alliant, Floating Point Systems, ETA, Stellar, Ardent,
and Stardent.
Center Program Chronology
FY
Milestone or Event
Description
1986
5 NSF Supercomputer Centers become
operational
Cornell 1 heory Center
National Center for Supercomputing Applications
Pittsburgh Supercomputing Center
San Diego Supercomputer Center
John von Neumann Center
1988-1989
Renewal Review
1989
Renew 4 NSF Supercomputer Centers
(5yrs.)
CTC, NCSA, l^C, SDSC
1990-1992
ASC Advisory Committee report
completed
strong recommendations for adding parallel systems to
accompany the stable, production vector systems
1991
8 Vector Supercomputers operating
3 Scalable parallel systems operating
IBM 3090 6 processors (2)
Cray YMP 8 processors (2)
Alliant FX80 8 processors
Cray 2S 4 processors
Cray YMP 4 processors
Convex C240 4 processors
Intel iPSC/860 32 processors^
TMC, CM2 32,000 processors (2)'''
1992
7 Vector Supercomputers operating
9 Scalable parallel systems operating
Joint Planning Initiated
Cray systems remain
IBM ES9000/900 (Upgrade 3090)
Alliant 2800 (upgracie FX80)
Convex C3880 8 processors (Upgrade C240)
Intel iPSC/860 upgrade 64 processors)t
nCUBE2 128 processors!
TMC, CMS 512 processorsT
KSRl 64 processors!
DEC Workstation Ouster (2)t
IBM Workstation Cluster
IBM PVS 32 processors
Initial meeting at SDSC Fall 92
1993
Joint Activities Began
6Vector Supercomputers operating
13 Scalable parallel systems operating
Meeting at PSC
First joint proposals to PPRP
First of Joint projects
Cray C90 16 processors
(other Crays, IBM, Convex stay)
Intel Paragon 400 processors upgrade t
KSRl upgrade 160 processors!
Hewlett Packard Cluster^
MasPar 2 16,000 processors!
IBM SPl 64 processors!
Cray T3D 512 processors!
1994
Joint Activities
4 Vector Supercomputers operating
14 Scalable parallel systems operating
Meeting at CTC
Metacenter Regional Alliances - Mar 94
Expansion of joint projects
One YMP changed to C-90, others the same
IBM SP2 upgrade 512 processors upgrade SPl!
Cray T3D 512 processors !
Convex Exemplar 8 Nodes!
SGI Challenge 32 Nodes
^ Majority of funding provided by other Federal agency (ARPA, NIH) or state,
t Donated in full or in part by the manufacturer for extended evaluation
124
High Performance Computing Infrastructure and Accomplishments
The Centers Program provides a stable supply of vector computing cycles needed by
the research community while investing in significant capacity of scalable parallel
systems, capable of ultimately growing to a size necessary for full scale grand chal-
lenge problems. While the numbers of Vector Supercomputers has decreased, the
computing power represented by that group in fact increased substantially. How-
ever, the increase in the scalable parallel systems was much larger, reflecting the
growth potential of this type of computing platform and a strong NSF/ARPA part-
nership.
National Access to Vector Multiprocessors
The NSF Supercomputer Centers established in the mid-1980s brought access to state-
of-the-art supercomputers for the first time in at least 15 years. Indeed, in the 1960s,
only a few universities had such access. This opening of universal access led to an
unprecedented increase in the number of researchers and universities involved in
advancing the frontiers of scientific and engineering research by using high perfor-
mance computing. By the early 1990s, some 15,000 researchers in over 200 universi-
ties had used one of the Cray Research vector multiprocessors or the IBM vector
mainframe in one of the NSF centers. This wide pool of computational researchers
made it possible for the center's program to begin to respond to the demand for paral-
lelism that had been developed in the Computer Science Community, and adopted
by the most adventurous user. The 90's saw the NSF center's program substantially
widen its range of architectural offerings.
Achieving Production Parallelism
The Cornell Theory Center (CTC) became the first member Center of the NSF Meta-
Center to achieve production parallelism on a vector supercomputer, with over 1/3
of its vector supercomputer cycles used for parallelism in 1989. CTC integrated its two
(ES/3090 600) vector supercomputers using a special 200 mbyte/sec hardware inter-
face, allowing parallel jobs the potential of executing across 12 vector processors.
Users used a shared-memory parallel FORTRAN developed by IBM in a joint project
with the Theory Center.
Early Migration to the UNIX Operating System
During the early and mid-1980's the UNIX operating system was widely viewed as
inappropriate for supercomputers for reasons of performance, system management
tools, application development and measurement mechanisms, and security. Cray
supercomputers were run mostly with operating systems that were designed at na-
tional laboratories (LLNL and LANL in particular) and this required extensive local
software support. In 1987, NCSA became the first major supercomputer center to mi-
grate its Cray supercomputer from CTSS (Cray's proprietary time-sharing system) to
UNICOS, a UNIX-based operating system developed at Cray Research for its super-
computers. This move to UNIX was the beginning of a merger between computa-
tional science and computer science, because most computer science research in-
volved the UNIX operating system at that time. Coincidentally, CTC was the first site
to run IBM's high performance UNIX system on its ES/3090 and ES/9000 main-
frames in production.
-4-
125
High Performance Computing Infrastructure and Accomplishments
Early Access to Massively Parallel Computers
Beginning in 1985, CTC provided experimental scalable parallel machines, first the
FPS T-series and an iPSC/1 parallel system, to its user community. In 1988, with the
installation of its iPSC/2 with 32 processors, CTC made early scalable computing
available for production use by the national community. Massively parallel comput-
ing was introduced to the research community beginning with NCSA's CM-2 in
1989. Each Center provided early access to new generation MPPs. The CTC was the
first site to install an IBM SPl and SP2, PSC installed the first Cray T3D, and early
CMS (NCSA), Paragon(SDSC), Ncube(SDSC), and Kendall Square (CTC) machines
were installed. Early access to these machines, enabled by support from ARPA and
NIH, allowed pioneering users to explore the benefits of fine-grained parallelism.
Each Center worked with the user community and the vendors to develop
application codes, which could then be ported to other platforms. In 1992, the CM-5
was added to the program at NCSA as the largest distributed memory parallel super-
computer available to the national academic and industrial communities. From
1992 to the present NCSA has worked closely with national users and the computer
science community to create a wide range of 512-way parallel application codes that
can in 1995 be moved to other large MPP architectures such as the T3D at PSC, the
Intel Paragon at SDSC, or the IBM SP-2 at CTC.
Superlinear speedup on heterogeneous processors
In 1991, PSC was the first site to distribute code between a massively parallel machine
(TMC-CM2) and a vector supercomputer (Cray YMP), linked by a high speed channel
(HiPPI). Experiments on applications as diverse as molecular dynamics, medical
imaging, chemical flowsheeting and gene sequence alignment showed superlinear
speedup (the applications on the linked system ran more than twice as fast on each
system separately). This formed part of the motivation for heterogeneous comput-
ing, as later embodied in the tightly coupled Cray T3D/C90 systems. PSC's T3D was
the first shipped anywhere. PSC developed a set of codes for transferring data be-
tween the Cray and CM2 which later enabled them to coitununicate between the T3D
and C90 at speeds superior to what was available from the vendor.
A similar superlinear speedup was obtained on the CASA gigabit testbed set-up in-
cluded parts of two supercomputers (64 nodes of the 528-node Intel Delta system at
Caltech and one processor of the CRAY C-90 at SDSC, 150 miles away); two HiPPI-
SONET gateways; and a SONET wide-area link between San Diego and Pasadena,
which is itself a prototype undergoing tests in a collaboration between MCI and Pa-
cific Bell. Using an environment for distributed parallel computing called Express (a
product of ParaSoft Corporation), Aron Kuppermarm and Mark Wu (Dept. of
Chemistry, CalTech) did a test calculation of the reaction of atomic hydrogen with
molecular heavy hydrogen (deuterium) at a total energy of 2.5 eV. This problem had
taken 100 hours to solve on the SDSC CRAY Y-MP a year earlier. On the new C-90, it
took 17 hours. On the Delta alone, it took 16. But when the problem was distributed
between the C-90 and the Delta, the whole problem was solved by the two machines
in just under five hours, a factor of 3.3 faster than it could be done on either ma-
chine alone.
126
High Performance Computing Infrastructure and Accomplishments
Workstation Clusters
Given historical trends showing much more rapid improvement in microprocessor
technology than in vector technology, many Centers began exploring workstation
clusters. In 1990, NCSA examined the usage of the Cray Y-MP and determined that a
significant amount of capacity could be gained by moving appropriate applications to
scalar RISC processors. Based on this study and on predictions that microprocessor
technology would surpass vector technology by the mid-1990's, NCSA set up an IBM
RS/6000 cluster as a farm of processors used as stand-alone compute servers. Begin-
ning in 1991, CTC did pioneering work with IBM on clustered RS/6000 workstations
with high-speed, proprietary communications links, including an experimental opti-
cal switch. The information gained during this joint project was used to guide the de-
sign of the SP systems. Also in 1991, PSC set up a cluster of DEC workstations, and
working with Florida State University, significantly enhanced queuing, accounting
and control software and also integrated the AFS file system into this environment.
These clusters all served as computing resources and also as platforms for the devel-
opment of distributed memory message passing codes. These projects generated high
interest in industry, and NCSA trained several dozen industrial sites on the integra-
tion, operation, and management of clusters of microprocessors. NCSA is now mov-
ing to establish dusters of shared memory workstation multiprocessors from Con-
vex/HP and SGI, while PSC will develop applications for Cray's new offering, the J90.
The CTC was given supplemental funding by NSF to build its cluster to 32 proces-
sors, providing a compatible path to its (later) IBM SPl and SP2 systems. The system,
including experimental high-speed switch, was used for production work by the
commuruty, which has now migrated to the SPl and SP2 environments.PORTABLE
Parallel programming Tools
Although the architectures of massively parallel systems differ greatly, the major
time and money investment of the research community (as contrasted to the cen-
ter's personnel) is in developing and converting codes (porting) to operate in differ-
ent environments. The dose cooperation between the NSF Centers via the Meta-
Center and informal contacts among its research users and cooperating agencies
such as ARPA and the various National Labs have resulted in substantial progress
ensuring that labor intensive programming operations need not be duplicated need-
lessly.
Prototype Parallel Programming Environments
Working with the Parascope Group at the Center for Research in Parallel Comput-
ing, an NSF Sdence and Technology Center, CTC developed extensions supporting
new parallel programming paradigms and extensions making ports from one type of
parallel programming platform to another easier. ParaScope is a prototype parallel
programming environment. Both the BIMA and Cosmology GCs at NCSA are work-
ing closely with Indiana University Computer Scientist Dennis Gannon to move ap-
plication codes previously written in FORTRAN to the portable pC++ which is the
model for HPC++, the equivalent of HPF in the C++ world.
127
High Performance Computing Infrastructure and Accomplishments
Extensions of PVM
In the Dome project, Adam Beguelin, one of the original developers of PVM now
working jointly at PSC and in Computer Science at Carnegie Mellon University, is
extending PVM to improve load balancing and fault tolerance. His work is guided by
the experience of PSC's cluster users.
Scalable Libraries
The goal of the ARPA funded Scalable Parallel Libraries project is to develop math-
ematical software libraries for massively parallel processors that are roughly compa-
rable in scope to the math libraries typically available on conventional supercomput-
ers. Michael Heath and his group at NCSA are one team in this multi-institutional
project and they have been developing parallel direct methods for solving sparse lin-
ear systems. For this purpose, they have developed a fully parallel sparse solver for
distributed memory parallel computers. Unlike most other efforts, which have fo-
cused only on factorization, this solver performs all phases of the computation in
parallel mode, including the symbolic preprocessing necessary to reorder the sparse
matrix and distribute it across processors to maintain data locality. With funding
from IBM, CTC staff developed scalable versions of key numerical library routines
for its IBM cluster system; these routines were included by IBM in its ESSL library
product.
Storage Technologies
With the vast increase in both simulation and observational data, the MetaCenter
has worked a great deal on problems of storage technologies. Here again, many of
the biggest areas of progress are in software. The creation of a universal file format
standard, a national file system with a single name space, and a multivendor archiv-
ing software are some of the results of MetaCenter innovation, collaboration with
computer scientists, and with other leading national laboratories. There are even
examples of the Program's computational facilities being used to improve the basic
storage capacity of the physical medium of storage itself.
AFS-Establishing a National File System
PSC recognized that the Andrew File System (AFS), developed at Carnegie Mellon
University with IBM support for a workstation environment, was particularly-well
suited for use in high performance computing, because of its superior security, scal-
ing properties, and manageability. PSC undertook a major program of adapting AFS
to the high performance computing environment which has led to a MetaCenter
wide effort to develop a shared national file system. PSC's AFS enhancements to
Cray's UNICOS are installed at a number of advanced computing centers (SDSC,
NERSC, rPP, LRZ (Germany), ETH (Switzerland) and the University of Stuttgart).
PSC has also extended AFS to multi-resident AFS which enables AFS to be a com-
ponent of a hierarchical storage system, permitting transparent and cost-effective
storage of large amounts of data. These enhancement are installed at other Meta-
Center sites, NERSC, Cray Research, Transarc Corporation, Max Planck Institute for
Plasma Physics, and the University of Cologne. With AFS, distributed applications
across the MetaCenter are now possible. For instance, Paul Dawson (Dept. of Mech.
■7-
128
High Performance Computing Infrastructure and Accomplishments
and Aerospace Engineering, Cornell Univ.), simulating deformations of aluminum,
uses AFS to build a distributed application with modeling performed at PSC and vi-
sualization at the CTC. The SDSC is running an AFS cell supporting the Computa-
tional Center for Macromolecular Structure (CCMS), an SDSC, UCSD, and Scripps
Research Institute collaboration. The charter of the CCMS is the development and
distribution of portable, innovative software for the study of macromolecular struc-
ture. AFS simplifies the cross-institution distribution and maintenance of software
and textual information.
CTC was the first Center to put AFS in production on its HPC systems; in fact, all
CTC production systems, except KSR, and servers are integrated through AFS, in-
cluding its new mass storage environment. CTC has lead in a number of areas of file
system integration, including a project with the University of Michigan integrating
its AFS mainframe port, IFS, into its HPC environment, and joint efforts with
TRANSARC in improving AFS performance over high speed networks, including
an FDDI testbed. New efforts involving TRANSARC include optimization over
ATM and within IBM's SP2 scalable switch.
HDF-Creating a Standard File Format
The NCSA Hierarchical Data Format (HDF), created by Michael Folk and his group at
NCSA has become one of the leading self-describing file formats in the world today.
Many scientific institutions, organizations and programs have adopted HDF as a
standard file format for data exchange and/or archiving. In 1992, NASA selected HDF
as the basis from which to develop an EOSDIS standard data format (SDF). The goal
of SDF is to provide a single, self-describing format for distributing data derived from
approximately 1.2 terabytes of data daily that EOSDIS will eventually produce. Other
examples of HDF adoption include the Institute of Applied GeoScience (seismic
data). Pacific Northwest Laboratory (cancer research). Children's Hospital in Boston
(x-ray images), the UCLA Scientific Visualization Lab. By working closely with many
different user communities to support the harmonization of data models and meta-
data conventions across as many disciplines as possible, NCSA is helping to create a
software foundation for the Nil enabling it to reach its potential to support the broad-
est constituency possible.
Migrating to a Standard Archiving Software
In 1985, the NSF Centers' goal was simply to establish national access for the aca-
demic community to the type of advanced supercomputing and archiving systems
found in the Dept. of Energy national laboratories. NCSA and SDSC duplicated the
Los Alamos computing environment in 1985, including the Common File System
(CFS) archive, while PSC adopted Westinghouse's PDM software that had additional
data migration facilities. As time went on, the Centers began to develop innovation
in storage software. It is the strategy of the MetaCenter to explore alternative techni-
cal approaches to storage at the bits level, while maintaining interoperability
through standard protocols. SDSC, through the DISCOS (Distributed Computer So-
lutions) division of General Atomics (which is now owned by OpenVision), pio-
neered productization of distributed, hierarchical file and storage management ap-
plications software for networked, multi-vendor and open systems environments,
129
High Performance Computing Infrastructure and Accomplishments
based on the IEEE storage model. Working with the UIUC Computer Science De-
partment, NCSA was able to encode data migration and caching strategies into CFS
to improve its ability to minimize disk cache misses. In 1992, NCSA developed CFS-
to-UniTree data formatting and migration tools as well as a suite of archive man-
agement tools. PSC integrated Cray's proven commercial archiving technology
(DMF) with more usable front-end software and with its multi-resident APS. SDSC
collaborates with OpenVision and the National Storage Laboratory in the further
development and stabilization of UniTree as a robust production archival storage
system. SDSC has developed transaction journaUng software that is critically impor-
tant for guaranteeing integrity of the file nameserver. CTC and SDSC are installing
the first generation of a high-performance variant of UNITREE which has been de-
veloped by the National Storage Laboratory, to provide enhanced I/O capability to
balance the increase in installed computing capacity. CTC will continue to work
closely with IBM in the testing and deployment of the next generation of mass stor-
age systems, the High Performance Storage System (HPSS). This will include the ca-
pability of utilizing parallel I/O to speed data transfer to the SP2.
Development of high-density magnetic media
The Center for Magnetic Recording Research (CMRR), located on the campus of
UCSD, has funding from NSF and 21 corporations having enterprises connected
with the storage and retrieval of magnetically written information. The chief tech-
nical problem CMRR has been addressing has been magnetic noise in the metallic
thin-film media used to coat high-density disks. Neal Bertram (Dept. of Electrical
and Computer Engineering, UCSD) is a researcher at CMRR who tackles the prob-
lem of magnetic noise computationally. Magnetic thin films are polycrystalline,
rather than continuous or amorphous, and the grainy, particulate nature of the
medium is a fundamental source of noise. Bertram's calculations have explored the
effects of two primary types of interaction between grains that cause noise: magneto-
static coupling and exchange coupling. The calculations resulted in recommenda-
tions for alloys and fabrication processes that would reduce noise from both sources.
The research enabled engineers at IBM's Almaden Research Center to design a disk
coating that packs a gigabit (one billion bits) of information onto a square inch,
which is 15-30 times current storage densities. Bertram has now turned his atten-
tion to calculations of other effects, including giant magnetoresistance, that can be
important in designing high-density disk media and disk recording and playback
heads. His codes model the process of recording bits in intricate detail; the process of
laying down a single bit of information takes several minutes to calculate on SDSC's
Cray Research C90.
Networking
One of the great successes of the NSF MetaCenter has been in providing the "high-
end pull" that has led to the creation and exponential evolution of the NSFnet. As a
result, the NSFnet backbone of 1995 has 3000 times the bandwidth of the backbone of
1986. The Centers have also prototyped the high performance local area networks
that are needed to feed into the national backbone as well as the next generation of
gigabit backbones. Security over networks is essential not only for industrial usage.
-9-
130
High Performance Computing Infrastructure and Accomplishments
but more and more for widespread citizen usage. Again, the MetaCenter has created
innovations such as dealing with industrial firewalls. The existence of the MetaCen-
ter network testbeds allows for new kinds of science to be attacked, perhaps best il-
lustrated by the rise of telemicroscopy, in which leading edge projects are being car-
ried out at each of the NSF Supercomputer Centers. With the national priority on
the Nn, the Centers are moving rapidly to expand their networking research to
community based Nil testbeds, including local healthcare, education, government,
and small business partners.
Evolution of NSFNET
The 56kbps connection between the NSF Centers, established in 1986, was the begin-
ning of the NSFnet. Based on the successes of ARPAnet and the TCP/IP protocol
within the computer science and Dept. of Defense communities, the NSFnet rapidly
grew to provide remote access to the NSF Supercomputer Centers by the creation of
regional and campus connections to the backbone. Although started by the pull from
the high end, the NSFnet soon began to provide ubiquitous connectivity to the aca-
demic research community for electronic mail, file transport, and remote login, as
well as supercomputer connectivity. These daily uses soon became indispensable to
the research community and the sustained exponential growth of the Internet took
off. The MetaCenter's industrial partner network was among the first in the corpo-
rate world to use NSFnet /Internet technology to connect corporations to the Internet
for the purposes of computational science. This was an important precursor for to-
day's rapid commercialization of the Internet.
By early 1995, the NSFnet will return to a high speed backbone connecting the Meta-
Center and some of the newly selected MetaCenter Regional Alliance members.
However, the bandwidth of the backbone will be 3000 times higher than that of the
original backbone (56 kbps). While some tend to think of the MetaCenter as focusing
on high performance computing only, it is useful to remember that computing
power of the fastest supercomputer processor in the program has grown by little
more than 100 times during the same period. Indeed, it is likely that the 155 mbps
vBNS will be upgraded to 622 mbps within two years. Even by the time the Centers
receive the first teraflop machines in 1997-98, realizing a factor of 1000 increase in
speed over 1985, the backbone will have grown by a factor of at least 25,000 fold in
bandwidth.
As part of an SDSC/UCSD collaboration, Kimberly Claffy recently completed a Ph.D.
dissertation that outlined a methodology for profiling Internet traffic flows at a vari-
ety of granularities. The methodologies and models developed as part of this traffic
characterization effort should prove very useful as the Internet evolves to an even
larger system in which the traffic composition needs to be understood, particularly
for planning future technology and capacity.
High Performance LANs
The center's program has also pioneered several transitions in local area and
metropolitan area networks both on site and on university campuses, acting as a pro-
totyping facility for other campuses who needed to know how to develop long range
10-
131
High Performance Computing Infrastructure and Accomplishments
networking plans for their campuses. In 1988, NCSA installed the first Ultranet Giga-
bit LAN networks with multiple supercomputers and demonstrated 480 Mbit/s be-
tween the CRAY-2 and Cray Y-MP supercomputers. In 1989, NCSA replaced tradi-
tional HYPERchannel backbone networks with the then-emerging 100 Mbit/s FDDI
standard. In 1991, PSC began its move to a HIPPI-based interconnect between its ma-
jor systems.
In 1993, NCSA, several industrial partners, and the UIUC Computer Science De-
partment established a local area ATM network testbed to help corporations gain
hands-on experience with ATM switches and interfaces. Insight from this ATM
testbed has already been used to develop long-range corporate network strategies for
J.P. Morgan, Phillips Petroleum, FMC Corporation, and United Technologies.
CTC was the first site to integrate ATM into a parallel supercomputer environment
on its IBM SPl in April 1994 and its IBM SP2 in July. ATM will be used for AFS-based
file service and other high speed transport needs, including distributed applications
and image transport.
Gigabit Testbeds
Since 1987, NCSA and the Uruversity of Illinois Computer Science Department have
worked with AT&T on the XUNET research network testbed with capacity that is one
step beyond what is available on the Internet. While the NSFnet has moved from 56
Kbs through 1.5 Mbps to 45 Mbs, XUNET has moved from 1.5 Mbs through 45 Mbs to
622 Mbs. In July 1993, XUNET was upgraded to 622 Mbs, the first network testbed to
interconnect ATM switches using 622 Mbs transmission technology over long (>50
miles) distance using pure optical fibers with in-line optical amplifiers.
PSC has worked with CMU's Computer Science Department in the Nectar Metropoli-
tan area gigabit testbed to develop new networking technology for very high-speed,
low-latency multi-machine interconnects and to develop the applications base which
can benefit from such technology. This work is fully collaborative with PSC's
ground-breaking systems and applications level work in heterogeneous systems. As a
result of testbed work, numerous applications can now run routinely between ad-
vanced machines at PSC's main hardware facility and those on the CMU campus, 15
miles distant, at speeds of up to 1 Gbit/s.
In partnership with NYNEX, Syracuse and Rome Laboratory, now extended to
Columbia, SUNY Stonybrook and Polytechnic Institute, CTC participated in building
a production-level ATM network focused on demonstrating research and commer-
cial applications. This network was demonstrated to the Governor of New York in
January 1994. NYNET is also a Nil testbed, involving outreach, medical applications,
video on demand, as described in a later section.
Various applications are being tested in the CASA testbed, in which SDSC is a major
partner. Besides the chemical reaction dynamics, led by Aron Kuppermarm of Cal-
tech and mentioned above in the section on superlinear speedup, there is a coupled
atmosphere /ocean model developed by the group led by Roberto Mechoso at UCLA.
Another is Calcrust, a project directed by JPL, which has used distributed heteroge-
neous computing over the CASA links to combine satellite imaging, seismic data.
11
132
High Performance Computing Infrastructure and Accomplishments
and surface topography in visualizing the foci of aftershocks of the 1992 Landers
earthquake in Southern California.
Secure Networks
In 1987 NCSA installed a 1.5 Mbit/s DS-1 connection to Eastman Kodak in Rochester,
New York, followed by another DS-1 connection to Amoco laboratories in Chicago
and Tulsa. By 1990 NCSA had connected over a dozen industrial laboratories to the
Internet using a combination of innovative security precautions. These included
various forms of "firewalls" which have now become commonplace on the Internet.
New Science Enabled by Networks— Telemicroscopy
The San Diego Microscopy and Imaging Resource (SDMIR), led by UCSD neurocy-
tologist Mark Ellisman, is an NIH-funded Research Resource centered on a new,
fully computerized Intermediate Voltage Electron Microscope (IVEM). The FVEM is
used to look at comparatively thick tissue sections (2-10 microns) and it has been
employed in studies of cortical neurons with and without symptoms of
Alzheimer's, in studies of another type of brain cell, called a Purkinje neuron, and
studies of cell membranes. A long-term collaboration between SDMIR and SDSC has
made the microscope usable interactively, over the Internet, coupled to the SDSC
Cray Research C-90.
Computational analysis and simulation is allowing biomedical researchers to study
and predict the activity of potential new drugs at the molecular level. CTC is working
jointly with Steven Ealick, et al., director of MacCHESS, a group using the Cornell
High Energy Synchrotron Source for Macromolecular Modeling. Using existing high-
speed connections between CTC and MacCHESS, the project is building the capability,
for the first time, for pharmaceutical companies and academic researchers to interact
dynamically with x-ray crystallographic analyses at the synchrotron, rather than dis-
covering long after the beam run that the sample was defective or the beam position-
ing non-optimal. While initially the researchers are using processors on the CTC SP
systems, ultimately a small IBM SP may be installed at the synchrotron site used for
dynamic analysis, with longer-scale simulation needs being met using the far larger
SP2 at tfie CTC.
PSC is working with the Center for Light Microscopy and Biotechnology, an NSF Sci-
ence and Technology Center at Carnegie Mellon University, to develop an Auto-
mated Interactive Microscope. This microscope will couple leading edge-microscopy
and high performance computing through high speed networks allowing the real-
time tagging of chemical reactants in the cell. It will open new research horizons in
biology by giving researchers the ability to control the release of chemically active
agents at critical moments in cell life, and to monitor the cell's subsequent develop-
ment.
The personal computer controlling a scanning tunneling microscope (STM) in the
Beckman Institute at UIUC used software integrated over a LAN with the NCSA
Convex C3880, TMC CM-5, and SGI graphics workstation to enable realtime imaging
and nanolithography of silicon surfaces in order to create novel quantum electronic
devices. Working with the laboratory's director, Joseph Lyding (Dept. of Electrical and
-12-
133
High Performance Computing Infrastructure and Accomplishments
Computing Engineering, UIUC and Beckman Institute) and inventor of a widely
used STM, NCSA staff members Rachael Brady and Clint Potter extended this
telemicroscope to the Internet and demonstrated the feasibility of using advanced
imaging instrumentation linked with advanced computing capabilities from any-
where in the world. The project was featured in the special issue of Research and
Development Magazine (Oct 25, 1993) on Winning in the 21st century.
Nil Testbeds
As a partner in Common Knowledge: Pittsburgh, an innovative project introducing
networking and computing into the entire Pittsburgh School District, PSC is working
with numerous partners, including Digital Equipment, Apple Computer and both
telephone (Bell of Pennsylvarua) and cable TV (TCI) companies, to create a prototypi-
cal, cost-effective approach to widespread use of advanced technology in public edu-
cation.
In collaboration with researchers and physicians at the Uruversity of Pittsburgh Med-
ical Center, the PSC is developing an Nil-based digital library of pathology images,
and the applications and software technology which wiU enhance the practice, teach-
ing and cost-effective delivery of pathology.
NCSA in collaboration with UIUC and the Champaign County Chamber of Com-
merce have been building CCnet, an Nil testbed during the last 18 months. Over 200
people from over 70 community organizations have been involved since April, 1993
in defining six major applications experiments in small business, health care, educa-
tion, government and community services, agribusiness, and geographic informa-
tion systems (GIS). The first of 20 multimegabit/s links into the community was es-
tablished with the Urbana Free Library in August 1994. All the high schools and a
number of small businesses are hooking on in September. NCSA is establishing a
large GIS server which will be available over CCnet to community projects. Partners
in CCnet include Time-Warner cable, Ameritech, DEC, Motorola, and potentially
MCI and AT&T.
NYNET, one of the gigabit testbeds, is also designated as an NE testbed providing
outreach, medical applications, video on demand to CTC and New York academic
and industrial partners.
InterNIC is the latest in an evolutionary line of support from the NSF for the use of
the Internet by the science, research, and education communities. The InterNIC pro-
vides three types of services: Information Services (provided through General Atom-
ics and the SDSC), Directory and Database Services, and Registration Services. Infor-
mation Services provides procedures for connecting to the Internet, pointers to re-
sources and tools available over the network, training seminars for new and experi-
enced users and up-to-date reports on new resources and activities on the Internet.
Several irmovative approaches to distributed services have been implemented, in-
cluding the InfoGuide, an on-line Internet information service. The Scout Report is
a weekly summary of Internet highlights which combines in one place the highlights
of new resource announcements and other news that occurred on the Internet dur-
ing the previous week. The InterNIC Reference Desk acts as the "NIC of first and last
-13-
134
High Performance Computing Infrastructure and Accomplishments
resort." The desk supports a variety of users answering "starter" questions from
novice users who are unfamiliar with the Internet as well as specialized questions
from intermediate and advanced users.
VISUALIZATION AND VIRTUAL REALITY
The NSF center's were instrumental in bringing the notion and tools of scientific vi-
sualization to the research commuruty in the 1980s. By combining advanced visual-
ization resources with simulation datasets created by remote users on the centers
program's high performance computers, new visualization paradigms for interpret-
ing numerical data were developed. This led scientists to consider visualization as an
intimate part of their computational toolkit. In addition, the centers worked closely
with the pre-existing computer graphics community to get them creating new tools
for scientists as well as for entertainment. Already by 1987, the staffs of the centers,
working with national users, were creating scientific visualizations so compelling
that they became regularly chosen to be part of the SIGGRAPH Film and Video
Show, the "academy awards" of the visualization industry. Today the centers visual-
ization staff and their allied visualization Centers are at the forefront of research into
how to turn virtual reality technologies into useful tools for scientific and engineer-
ing research.
Development of Scientific Visualization
From its inception, NCSA has worked with computer artists like Donna Cox (UIUC
Dept. of Art) and Dan Sandin (UIC School of Art and Design) and computer scientists
like Tom DeFanti (Dept. of Electrical Engineering and Computer Science, UIC) to cre-
ate cross disciplinary teams with end users in order to create new levels of scientific
visualizations. NCSA also hired a number of staff from leading companies in the
California entertainment industry to bring the software tools of special effects in
movies or TV commercials to the use of the scientific and engineering communities.
Initially, the NCSA visualization environment was built on Alliant shared-memory
multiprocessors using the Wavefront visualization software. As specialized graphics
hardware became available on Silicon Graphics systems, the NCSA visualization
environment migrated from the Alliant to Silicon Graphics systems. The scientific
visualizations created by NCSA s*^'ff have not only broken new ground for scientists
viewing their data, they have also won awards worldwide for aesthetic quality. NC-
SA's Renaissance Experimental Laboratory (REL), created by Donna Cox with a major
donation from Jim Clark (founder of Silicon Graphics) was the first advanced visual-
ization training facility in the centers and continues to support university courses in
geology, mathematics, graphics design, computer science, and other disciplines.
The SDSC has developed a variety of software tools that can be used to access and
connect existing visualization resources automatically. The goal has been to provide
researchers with training and access to tools that will support their research needs.
The tools include: 1) The SDSC Image Library, a collection of image manipulation
and conversion utility routines that can be embedded in existing software. An inter-
esting example of the use of this library are the image conversion modules devel-
oped by the International AVS Center, which quickly became the top 2 user modules
in their distribution; 2) the SDSC Image Tools, a collection of utilities based on the
-14
135
High Performance Computing Infrastructure and Accomplishments
Image Library, are software tools for reading, writing, and manipulating raster im-
ages. This toolset also allows researchers to convert file formats among over thirty
widely used graphics formats (e.g., from HDF to PICT). Now in its second release, it
runs on Cray Research, DEC, HP, IBM, SGI, and Sun Microsystems platforms. Over
4,000 sites worldwide have uploaded the Image Tools from SDSC's anonymous FTP
area; 3) vpr, a client-server visualization hardcopy utility, vpr takes advantage of the
Internet by allowing remote users to send images to local hardcopy devices. Popular
hardcopy devices that are connected to vpr include a variety of film recording de-
vices, a color paper plotter, and video recording; and 4) the SDSC Color Tutorial, a
SuperCard-based hypertext exploration in color theory for computer graphics. Exam-
ples show the different points that are being made, while hyperlinks allow the user
to jump to the most interesting references.
CTC Visualization staff developed Visual Programming Language for Animation
(VPLA), a program that easily integrates sound and image sequences into scientific
animations. VPLA can use rendered images from several standard visualization
packages including DataExplorer. As the national repository for Data Explorer soft-
ware and lead training site, the CTC works with faculty, industrial users and stu-
dents across the country in developing state-of-the art animations and images. In
addition, using DX, the CTC has spearheaded using visual programming languages
not only for visualizing, but for managing distributed applications running across
the centers program. For example, the CTC built a Data Explorer module allowing its
researchers to access the CMS at NCSA. The CTC was instrumental in IBM's agree-
ing to support DX across all major vendor platforms, including SGI.
PSC has concentrated its visualization efforts on the development of tools for re-
mote users. Its GPLOT software is in use at over 300 sites. Its automated animation
facility has enabled researchers to produce hundreds of videotapes without physi-
cally visiting the any specific center. It is now turning its efforts to develop such
tools embodying virtual reality.
Virtual Reality Impacts Industrial Design
In 1992 NCSA began a transition from the now-traditional workstation visualiza-
tion activities to virtual environments, with leadership provided by Caterpillar Inc.,
an NCSA Industrial Partner. Traditionally, translating electronic CAD blueprints
into full scale wooden models of new heavy earth moving equipment in order to
evaluate design changes required 6-9 months. Working with NCSA staff. Caterpillar
built up a VR laboratory in the UIUC Beckman Institute and networked the SGI
graphics workstations which create the VR images to their Peoria headquarters facil-
ities. Using a variety of VR viewing technologies, a number of design options al-
ready have been tested for new models of Caterpillar wheel loaders and backhoe
loaders that will be introduced by 1996. Design changes can now be made in less than
one month. Caterpillar design engineers Dave Stevenson and John Bettner received
the 1993 NCSA Industrial Grand Challenge Award for their innovative work. Media
coverage of this award reached over 200 media outlets.
-15-
136
High Performance Computing Infrastructure and Accomplishments
Development of Immersive Science Projects
The transition from stand-along workstation visualization to Nil distributed visual-
ization was emphasized at SIGGRAPH'92 in Chicago when NCSA collaborated with
the Ul-Chicago's Electronic Visualization Laboratory and dozens of science teams to
demonstrate wide area interactive visualization at Showcase. A new level of real-
ism in virtual reality was debuted there as well with the public showing of EVL's
Cave Automated Virtual Environment (CAVE), which provided complete immer-
sion in complex 3-D data sets at workstation levels of resolution. In 1993-1994, EVL,
NCSA, and Argonne organized a national call for proposals which resulted in over
60 EVL/NCSA computer science and visualization staff and graduate students help-
ing researchers from over 30 institutions in porting their applications into the
CAVE environment. For the first time, this included realtime coupling to parallel
supercomputers so that dynamic 3-D evolutions could be viewed immersively and
steered interactively. At SIGGRAPH 94, 8,000 attendees were able to directly experi-
ence these science projects. CTC developed a specific Cave Visualization on Macro-
molecular Modeling: the Structure of Acetylcholine Esterase; this visualization was
integral to the researcher's understanding of the molecule's activity. This applica-
tion runs not only on the CAVE's SGI workstations, but on CTC's IBM SP2 system as
well.
Virtual Reality over ATM networks
In a project IN 1994 with Rome Laboratory, demonstrated virtual reality techniques
over an ATM network between CTC SGI computers and Rome Laboratory. Research-
ing ATM technologies for real time applications and demonstrating software tools
for application steering important to molecular modeling, telemedicine and com-
mand and control.
The Sequoia 2000 Visualization Group at SDSC developed a prototype data visualiza-
tion system "Tecate" using virtual reality technology to address many of the issues
involved in exploring the informational content of networked data servers. Tecate
enables the browsing for data that resides in repositories managed by a database man-
agement system via user-interaction with graphical renditions of objects that repre-
sent data features.
Alpha Shapes, Biomolecules, and Cosmology
Alpha shapes, a form of geometric modeling developed by the 1993 Waterman
Award winner Herbert Edelsbruner (Dept. of Computer Science, UIUC) and NCSA
staff member Ping Fu, focuses on the formal definition, construction, and measure-
ment of shapes for any given point set in space. The discrete nature of the alpha
shape complex has computational advantages over any other known method which
can be exploited in computing surface area and volume of a space filling diagram and
in localizing and measuring voids. The latter is useful in studying water molecules
residing inside a protein. NCSA users have discovered other related applications of
alpha shapes by applying them to such diverse fields as adaptive grid generation,
medical image analysis, visualizing the structure of earthquake data, and the large-
scale structure of the universe.
-16-
137
High Performance Computing Infrastructure and Accomplishments
DIGITAL LIBRARIES AND INFOSERVERS
The National Information Infrastructure requires many software, computer, and
communications resources that were not traditionally thought to be part of high
performance computing. In particular, knowledge organization, location, and navi-
gating tools needed to be developed. The NSF Supercomputer Center staffs and their
associated universities have proven to be fertile ground for developing their new
tools. Perhaps the most spectacular success has been NCSA Mosaic, which in less
than 18 months has become the Internet knowledge browser of choice by over a mil-
lion users. The Mosaic/World Wide Web infrastructure has set off an exponential
growth in the number of decentralized authoring of information servers.
Digital Libraries
In 1989, NCSA as part of its XUNET application testbed proposed that multimedia
digital libraries would require gigabit networks in order to fully support high defini-
tion imagery and the coupling of large data sets with computing resources and geo-
graphically dispersed researchers. This resulted in research developments like DICE
(Distributed Collaboration Environment). Parallel efforts in providing researchers
with global information retrieval and display capabilities over existing environ-
ments combined collaboration with Bruce Schatz (then at U. Arizona) and his
Worm Community System, with internet-based designs of component client/server
architectures, like the World Wide Web. These approaches influenced the design of
the current grand challenge digital library prototype for accessing radio astronomy
images and data sets. NCSA built on the success of Internet access tools such as
NCSA Telnet, adapted this modified digital library paradigm to the Internet with
NCSA Mosaic. Today, the grand challenge image library uses NCSA Mosaic as it's
user interface, and Schatz has joined NCSA to head the recently awarded
NSF/ARPA Digital Libraries project, which combines a testbed based on the compo-
nent architecture with experiments in object-based designs.
Scalable Information Servers
The enormous success of NCSA Mosaic and CERN's WorldWideWeb has resulted
in explosive growth in the use of NCSA's WWW server. By the end of 1993, NC-
SA's server load had grown beyond the capabilities of any single server. This re-
sulted in the design of an innovative distributed scalable server architecture that in-
volved a modification of the Internet's Domain Name System software. By Sept.
1994, the NCSA WWW server was handling over 2 million connections per week.
NCSA's Hewlett-Packard workstation cluster based distributed information server
has now been duplicated at many WWW and FTP sites on the Internet and within
corporations. A number of corporations are presently working with NCSA on the
next generation of this distributed architecture.
The Rise of the MosaicAVWW Information Infrastructure
NCSA developed the Mosaic user interface software which provides point-and-click
access to the diverse information storage protocols of the Internet, such as World
Wide Web (WWW), Gopher, FTP, and WAIS. NCSA Mosaic establishes the neces-
sary connections, file transmissior^, decompression, launch of viewer programs, and
17-
138
High Performance Computing Infrastructure and Accomplishments
screen display of text, images, animations, or audio, in response to a single mouse
click from the user. NCSA Mosaic is available for Mac, Windows, and Unix comput-
ers for free to individual users, for government and educational use, and for internal
use within companies. Monthly download rates from the NCSA site alone are con-
sistently over 30,000. Although accurate estimates are difficult, it is widely felt that
over a million copies of NCSA Mosaic are in use. Further, commercial versions of
NCSA Mosaic are available. The principal licensee. Spyglass, Inc., has announced or-
ders for over five million copies of their enhanced version, with projections to
twenty million copies within a year. Use of NCSA Mosaic has increased WWW traf-
fic on the NSF backbone by over 10,000 fold since Jan. 1993. Overall WWW traffic in
August hit 1.3 Terabytes or 8 % of the total NSFnet backbone traffic, higher than
SMTP. Because of this, NCSA has become the second biggest Internet site in the
world in terms of traffic from its site. The NCSA Mosaic/WWW information infras-
tructure is allowing for an enormous growth in decentralized authoring of infos-
ervers throughout the world. In 1994, NCSA was given Infoworld's Publisher's In-
dustry Achievement Award.
DESKTOP Software
From the begirming, the NSF Supercomputer Centers provided focal points for
pulling together teams of computer scientists and software developers. Since the
history of the centers has greatly overlapped with the worldwide rise of the personal
computer and workstation, it is not surprising that the software developers focused
on creating easy-to-use software tools for the desktop machines themselves. These
tools have had a major influence on the usefulness of the supercomputer facilities
to the remote science and engineering community. The collaboration tools will
have a great impact on tying together the newly emerging electronic teams of scien-
tists made possible by the growth of the Internet.
Connectivity Tools
NCSA Telnet was a ground-breaking desktop application that provided access to the
emerging NSF Supercomputing Centers in the late '80s. Developed by the new
Workstation Tools Group (later the SDG) at NCSA, this brought full TCP connectiv-
ity to researchers using IBM and Macintosh systems, sigruficantly broadening the par-
ticipation base beyond Unix users, thereby introducing thousands to both the inter-
net and the NSF Centers Program. Continuously supported up to the present time,
these tools have also led to a spin-off company Intercon, headed by one of NCSA
Telnet developers.
Collaboration Tools
NCSA has supported a program of research and development on collaboration tech-
nology for science and engineering researchers for over 3 years. NCSA Collage, a tool
that runs across MSWindows, Mac, and XWindows systems, provides the capability
to carry on remote digital conferencing sessions between researchers. The first live
MetaCenter collaborative session using NCSA Collage was held in 1992 . Collage
combines many of the features of NCSA's communications and graphic data analysis
tools. NCSA also continues to innovate in asynchronous collaboration tools such as
-18
139
High Performance Computing Infrastructure and Accomplishments
asynchronous collaboration tools, hence the interest in annotation and workgroup
support capabilities in NCSA Mosaic. Current work focuses on choosing and combin-
ing the best of these synchronous and asynchronous capabilities in usable next-gen-
eration global collaboration tools for the scientific and educational communities.
Cornell University's CUSEEME video teleconferencing software, aimed at providing
video teleconferencing on low-end workstations, is in use at the NSF Centers and as
part of NYNET, NYSERNET and other organizations for routine use, including
medical projects between the CTC and the Cornell University Medical College in
NYC. This work, funded by NSF and Cornell itself, is freely distributed and runs on
Mac and PC platforms using inexpensive video equipment. Several national and in-
ternational collaborations have successfully utilized this software. The centers have
gained experience with traditional video teleconferencing systems, through its NSF-
funded system. It is now looking at packet video systems using the vBNS and other
network facilities. These systems will have the capability of moving the videocon-
ference from a set of specially equipped rooms to the desktop. An investigation is
now underway to develop the optimal system for CTC's requirements.
Graphics Tools
NCSA Image was the first scientific visualization tool developed for the desktop
viewing of supercomputing output in the program. It provided the research com-
munity Mac and Unix based visualization methods for analysis of huge data sets, as
well as creating some of the first client/ server tools which integrated remote desktop
workstations and personal computers with the center programs high performance
engines.
The SDSC Image Tools are software tools for reading, writing, and manipulating
raster images. This toolset also allows researchers to convert file formats among over
thirty widely used graphics formats (e.g., from HDF to PICT) and includes extensive C
library functionality for creating custom image-manipulation applications. Now in
its second release, it runs on Cray Research, DEC, HP, IBM, SGI, and Sun Microsys-
tems platforms. Binaries and sample source code are available in the public domain
by accessing SDSC's anonymous ftp area (ftp.sdsc.edu).
Scientist's Workbench
The Scientist's Workbench is an X and Motif-based software package developed at the
CTC. The main functions of the Scientist's Workbench are to bring together the tools
and software required by scientific researchers in a distributed computing environ-
ment, to provide a graphical interface to access those tools, and to provide the soft-
ware necessary to allow researchers to easily build their own graphical interfaces.
This tool has been used at most of the CTC's Smart Nodes (affiliates) by users and as
part of teaching environments for high performance computing, as well as at the
other centers and by companies and national labs developing "custom" program-
ming interfaces for their communities.
19
140
High Performance Computing Infrastructure and Accomplishments
Accon^plishments in Education and Outreach
Education
Familiarity with the tools of computation and visualization is quickly becoming a
sine qua non for both researchers and the public. The spread of access to these tools,
like access to the telephone and television before them, is a democratizing force in
itself: the world of the shut-in is opened up, the disadvantages of distance are mini-
mized, the exchange of techniques and knowledge is enhanced. Education, training,
and outreach are thus fundamental to the programs of the MetaCenter. Each mem-
ber of the centers program has developed educational programs targeted to a variety
of constituencies: university researchers, graduate students, undergraduates, educa-
tors at all levels, and K-12 students.
Researchers and Students
One- or two-day workshops are offered by centers program staff to researchers on site
and at associated institutions, covering introductions to the computational envi-
ronments, scientific visualization, and the optimization and parallelization of scien-
tific code. In addition, special workshops have been offered throughout the centers
program on the use and extension of computational and visualization techniques
specific to various disciplines (from biochemistry to lattice gauge theory). On the
campuses of centers program institutions and on other campuses, centers program
scientists and engineers are active teachers, either through regular academic ap-
pointments or as adjuncts, lecturers, seminar leaders, or teachers in extension divi-
sions.
Graduate students often receive fellowships or similar appointments at centers pro-
gram institutions, as their contributions may benefit a large academic research
community or the computational community generally. As an example, Ph.D. stu-
dent Kimberly Claffy (Computer Science, UCSD) recently completed her dissertation
on a flow-based measure of Internet traffic that she developed as a Junior Fellow at
SDSC. Dr. Claffy's technique is the first to permit traffic characterization on the basis
of a temporally and spatially flexible unit, and it is thus an enabling technology for
further advanced network research at SDSC and elsewhere.
The centers program has contributed to the research projects of hundreds of gradu-
ate students through stipends, access to resources, and relations with centers pro-
gram researchers. Each Center fosters collaborative research by muitidisciplinary,
multi-institutional teams of computer scientists, research scientists, and engineers;
postdoctoral research associates; and graduate students from the national and inter-
national community. These teams forge new approaches to previously insoluble re-
search problems, develop community codes, and host workshops and seminars to
transfer technology.
Supercomputer Centers Educational Activity Support Summary
Educational Activities
FY91
FY92
FY93
High School
20-
141
High Performance Computing Infrastructure and Accomplishments
Institutes
7
4
5
Attendees
128
131
121
Other K-12 Events
15
8
17
Attendees
715
1,370
1,985
Research Institutes
13
11
6
Attendees
262
377
390
Training Courses/ Workshops |
On Site Events
18
102
134
Attendees
1,414
1,773
1,929
Off Site Events
17
23
17
Attendees
295
622
104
Seminars/Colloquia
Events
138
114
132
Attendees
2,251
2,788
3,085
Academic Course Accounts
64
63
79
Monthly Newsletter Circu-
lation
234,986
247,692
165,176
Visitors
13,506
16,380
16392
For undergraduates, the Research Experiences for Undergraduates programs, funded
by NSF, bring in undergraduates to work for a summer or a school semester or quar-
ter on specific projects devised by centers program researchers and/or faculty advi-
sors. The projects are significant in their scope of computational science and in
many instances have resulted in presentations at meetings and publications. A spe-
cial project is CTC's Supercomputing Programs for Undergraduate Research (SPUR),
in which students apply to work on one of a selection of projects developed by Cor-
nell faculty in collaboration with CTC. One REU student at SDSC went on to win
the top prize in the Westinghouse Science Talent Search in 1991. Another devel-
oped a program to teach the use of the Braillewriter to blind students, which was
presented to the Commission on Equal Opportunity in Science and Engineering at
NSF (this student, herself blind, is now a successful computer scientist in Silicon
Valley). Undergraduate assistantships and internships are also available in the cen-
ters program. Undergraduate student programmers have worked on many research
problems including numerical weather prediction, the visualization of numerical
spacetimes, and social network analysis. They have developed numerous applica-
tions and utilities to improve the computational environment for MetaCenter re-
searchers. Students have also worked on library, visualization, and educational pro-
jects. The REU programs have been ongoing in various forms for more than five
years.
Outreach to Educators
One particularly effective approach to educating the next computational generation
is the training of teachers, and many centers program efforts have been devoted to
teacher training and curriculum development.
21
142
High Performance Computing Infrastructure and Accomplishments
Common Knowledge: Pittsburgh is a national pilot program developed by PSC, the
University of Pittsburgh, and the Pittsburgh Public Schools to institutionalize educa-
tional technologies within the Pittsburgh Public School District by having PSC im-
plement the network infrastructure and develop specific curriculum-based network
and computer applications. PSC's High School Initiative (1992-1994) involves stu-
dent/teacher teams using PSC facilities to develop computational tools for inclusion
in their schools' science or mathematics curriculum, with an emphasis on integrat-
ing high-performance computing into the curriculum and thus bridging the gap be-
tween textbook instruction and real world applications of scienre.
SuperQuest is a program involving centers program sites that brings teams of teach-
ers and students from selected high schools to summer institutes to develop compu-
tational and visualization projects that they work on throughout the following year.
In addition to educational workshop programs associated with SuperQuest, NCSA
has developed five interactive simulation programs now being tested in classrooms
across the country and around the world. These include GalaxSee, an N-body simu-
lator of galaxy formation and interaction; the Fractal Microscope, which enables the
exploration of self-similar patterns; SimSurface and SimElevator, simulated anneal-
ing programs; and LaplaceSeein', an electrostatic potential solver. Students can
change initial conditions and watch the simulation evolve as the parameter space is
explored.
SDSC's computational sciences curriculum coordinator, Kris Stewart (who is a pro-
fessor of mathematics at San Diego State University) has conducted summer work-
shops, funded by NSF and Cray Research, with faculty from primarily undergradu-
ate institutions to develop ways of incorporating high-performance computing into
the curriculum. Stewart uses the workshop materials in her own SDSU classes, Su-
percomputing for the Sciences and an Introduction to Computational Analysis.
SDSC is now halfway through a three-year, NSF-funded Supercomputer Teacher
Enhancement Program targeted to high-school teachers whose classes contain un-
derrepresented minorities.
Dr. Bruce Land, of the CTC, has developed an undergraduate course in scientific vi-
sualization and computer graphics, using the data flow block diagram capabilities of
IBM's Data Explorer software. This curriculum, lab exercises and the resulting stu-
dent projects have been shared with the larger educational community through the
CTC's Education and Training home page. Additionally, Prof. Steve Vavasis has de-
veloped an interdisciplinary course in scientific computing using high performance
computing for graduate and undergraduate students.
Over the past year, the CTC had established the Data Explorer repository, a full set of
tutorials for parallel computing on diverse platforms, a complete set of lecture notes
for use by educators as well as researchers, and a gateway to materials on the net-
work for secondary school science and mathematics education.
The educational outreach programs of the centers program enable students to expe-
rience the advantages of connectivity and training in all aspects of modern compu-
tational practice. The challenge to effectively deliver centers program resources to all
classrooms is being met mainly by the distance-defeating and multiplicative effects
-22-
143
High Performance Computing Infrastructure and Accomplishments
of high-performance computation itself. The dissemination of trairung and curricu-
lum materials over the National Information Infrastructure is a major way in
which the successful pilot programs can be turned into a new class of educational re-
sources.
Outreach
Because advances in high-performance computing and communications (HPCC) are
driven by the needs of the practitioners with the most advanced problems, the cen-
ters program's scientific mission includes the construction of an extensive web of re-
lationships with research and development efforts in American industry and com-
merce. Collectively, the centers program's outreach programs represent a long
record of sustained collaboration among scientists, HPCC developers, and industrial
researchers. Another aspect of outreach is the effort to find and serve local and re-
gional needs of government, schools, and communities. Some aspects of these activ-
ities are discussed below.
Application of Scientific Computation and Visualization to Industrial Produc-
tion
Half of the partnerships between the individual centers and industry are collabora-
tions with major industrial firms. These include American Cyanamid, Amoco, Al-
coa, AT&T, Caterpillar, Corning, Dow Chemical, Eastman Kodak, Eli Lilly, FMC,
Gencorp, General Dynamics, Hughes Aircraft, IBM, JP Morgan, Martin-Marietta,
McDormell-Douglas, Merck Research Labs, Motorola, Parke-Davis, Philips
Petroleum, Schlumberger, USX, and Xerox.
In their original form, the partnerships represented the first introduction of large-
scale computation and visualization into the store of resources possessed by even
the largest of these Fortune 500 comparues. While the companies are for the most
part fully computerized now, the majority of these partnerships continue today be-
cause centers program expertise has been essential to the introduction of new ways
of employing the resources of supercomputing: the algorithms, visualization rou-
tines, and engineering codes are being combined in ways that result in such ad-
vances as high-end rapid prototyping of new products. As a result, for example, Eli
Lilly maintains its partnership although the company has purchased its own super-
computer — the useful interactions with centers program scientists, consultants, and
visualizers continue. In many of these arrangements, the industrial partner's re-
searchers are frequent visitors to the NSF centers, and centers program researchers
also visit the partner's installations.
Thus, while it is extremely important to Alcoa that it was able to produce an opti-
mum aluminum can, to Gencorp that it was able to design a better injection mold-
ing process, to McDonnell-Douglas that it could perform rapid airfoil analyses, to
American Cyanamid that it could reformulate soil enhancers, the sum of these
long-term relationships is important in another dimension as well. TheCenter out-
reach efforts are helping to revitalize American industry, making it more competi-
tive in an increasingly competitive world market.
■23-
144
High Performance Computing Infrastructure and Accomplishments
Impact on Vendors of High Performance Computing Equipment
The centers program has had a major impact on the vendors of major high perfor-
mance computing equipment. All Centers have taken early prototypes of machines,
have provided national access to the largest scale version of such machines, and
provided critical feedback to the vendors. Several have entered into strategic devel-
opment efforts v^ith the vendors. For example, PSC is a formal partner with Cray
Research in the development of applications for its massively parallel T3D. Cray
has also internalized some of PSC's file system developments. CTC played an inte-
gral role in IBM's re-entry into the High Performance Computing arena, as the first
customer for its IBM ES/3090 vector supercomputers and as a partner in the design
and development of its parallel FORTRAN products. CTC, through its director
Malvin H. Kalos, was a key influence in IBM's decision in 1991 to build the IBM SP
systems and ensured that IBM adopted a strategy that was scalable ultimately up to
the teraflops and down to the desktop.
SDSC has established a close collaboration with the Supercomputer Systems Divi-
sion (SSD) of Intel Corporation to develop systems software to support multi-user
systems, to serve as a test site for new operating system releases, and to improve the
stability of the Paragon system. SDSC staff have developed MACS, the Multi-user
Accounting and Control System, which Intel offers as part of the Paragon's standard
operating system software. This system includes a dynamic job mix scheduling algo-
rithm, a port of the Network Queuing System batch job submission software, and
CPU quota and accounting systems to control resources used by separate projects.
SDSC has also collaborated with Cray Research to develop support for multiple-user
systems on Cray systems. SDSC staff have developed a resource management system
that controls access to various resources on the system (CPU, memory, and disk) and
a dynamic job mix scheduler (DJMS) to dynamically adjust the workload for optimal
performance. SDSC has recently run a T3D emulator on the Cray C90 and is provid-
ing feedback on its performance. SDSC now plans to install and evaluate Cray's new
FDDI card, a fiber-optic high-speed network interface.
Digital Equipment Corporation recently awarded UCLA and SDSC an external re-
search grant to acquire nine Alpha 3000 model 400 workstations. The duster, which
has a peak speed of 1.2 Gflops and is connected at 100 Mbps via a Gigaswitch, will be
used primarily for climate studies led by Dr. Roberto Mechoso of UCLA. It will also
be available for scientific use and performance testing by the SDSC user community.
SDSC staff are collaborating with DEC to port the global climate model to the Alpha
cluster using DEC's High Performance Fortran compiler.
DEC is a major supporter of Project Sequoia 2000, a collaboration of scientists, com-
puter and information experts, government agencies, and industrial sponsors to de-
velop an information-management system for studying global climate change. The
Sequoia visualization group, centered at SDSC, has been developing a system that
will build on the strengths of existing hardware and software to support next-gener-
ation visualizations. Recently, the group collaborated with Kubota-Padfic Computer
Corporation to combine Kubota's Denali system with DEC Alpha machines for ad-
24-
145
High Performance Computing Infrastructure and Accomplishments
vanced 3D color graphics capabilities. Such collaborations have benefited DEC, Kub-
ota-Pacific, and the Sequoia 2000 project.
The National Storage Laboratory (NSL), a consortium that is developing next-gener-
ation high-speed storage devices, has selected the UniTree system as the production
archival storage system for all centers program sites. UniTree, which was originally
developed at the Lawrence Livermore National Laboratory, was commercialized l^
DISCOS, a spin-off of SDSC and a former division of SDSC's parent company. Gen-
eral Atomics. DISCOS was, in turn, sold to Open Vision, which continues to market
the product.
SDSC staff are collaborating with NSL, IBM, and Open Vision to implement the
NSL's base version of the UniTree archival storage system on the center's IBM
RS/6000 model 980 workstation. They are adding a new, more robust name server
developed by Lawrence Livermore National Laboratory for Open Vision, transaction
journaling (which allows reconstructing the database in case of catastrophic failure),
and adding and enhancing system administrator tools.
The integration of PSC's Multi-resident AFS into the NSL UniTree environment is
planned to provide user-friendly access for the centers program members into SD-
SC'S NSL archival storage system. The future integration of HIPPI-attached periph-
erals, including high-speed, high-density tape and high-speed RAID disk arrays, us-
ing third party data transfer is planned as a way to substantially increase both
archival system storage capacity and data transfer speed.
A prototype HPSS parallel I/O archival storage system is also planned for evaluation
as the follow-on to the NSL UruTree system. This system will support striping
across multiple high-speed peripherals to even further increase the speed of file
transfers.
Stimulation of New, Computationally Dependent Ventures
About a fourth of the industrial partnerships are with smaller and newer firms,
many of them leaders in biotechnology. Some are firms designing new pharmaceu-
ticals (e.g., Agouron Pharmaceuticals, Genentech), others develop and market the
software packages required for these enterprises. Biosym Technologies, for example,
is working with both CTC and SDSC to develop parallel versions of its popular Dis-
cover and Insight packages.
Outreach efforts of the centers program have resulted in actual spinoff ventures as
well. The commercialization of the software developed at individual centers is be-
ing undertaken by a number of companies. For example, NCSA Telnet has been
commercialized by Intercon, and Spyglass will release a package containing up-
graded versions of image tools and Mosaic. Some 20 companies have now licensed
NCSA Mosaic. CERFnet, a California wide-area network for Internet access has
pioneered in supplying access to library and other large databases; and
DISCOS/UniTree, a mass storage system, is in use at more than twenty major
computer sites. A new molecular modeling system, called Sculpt, developed at
SDSC, is being commercialized by a new company. Interactive Simulations. Sculpt
enables drag-and-drop molecular modeling in real time while preserving
-25
146
High Performance Computing Infrastructure and Accomplishments
minimum-energy constraints; its output was featured on the cover of Science last
May.
Development of Nationally Valuable Reservoirs of Skill
About a fourth of the partnerships between the centers and industry are collabora-
tions with manufacturers of high-performance computational, networking,
telecommunications, and visualization equipment. Of particular interest here are
the several partnerships funded by the NSF and ARPA through CNRI to construct
and test the "gigabit testbeds," prototypes of the connectivity that will be required for
the future International Information Infrastructiire. Both academic and industrial
research groups are developing the codes to test the connections, even as manufac-
turers develop the connections themselves, and the experts assembled in the centers
program supply links in the form of specifications and software.
Community Service
Local and regional outreach efforts range from the tours given at all centers program
installations through the hosting of visits by national, regional, and local officials
and commissions, to the kinds of full-scale partnerships mentioned above. The
NCSA relationship with the Champaign County Chamber of Commerce has re-
sulted in the formation of a nonprofit public network, CCnet, which is already bene-
fiting the Chamber itself as well as local schools. Plans are in the works with Time-
Warner to start pilot tests of the use of public-access cable for a data highway.
SDSC is working with the City of San Diego on plans to connect all units of dty gov-
ernment, including a high-technology resource center to be developed with De-
partment of Commerce funding that will connect local industry (with a lot of de-
fense reconversion efforts) to business and computational resources, including
SDSC itself.
PSC is exploring extension of the technology it developed for its Common Knowl-
edge:Pittsburgh K-12 project to embrace major units of city government.
Outreach is also represented by the publications programs of the centers program,
the production of scientific videos and /or multimedia CD-ROMs, and a collabora-
tive program for maintaining a lively and informative presence on World-Wide
Web servers, which will make information on the programs easily accessible over
the Nn.
26-
147
High Performance Computing Infrastructure and Accomplishments
Important Science and Engineering Accomplishments
Summaries of computationally interesting problems in the NSF Centers Program by
the national science and engineering communities:
QUANTUM PHYSICS AND MATERIALS
The great disparity between nuclear, atomic, or molecular scales and macroscopic
material scales, implies that vast computing resources are needed to attempt to pre-
dict the characteristics of bulk matter from fundamental laws of physics. Therefore,
it is not surprising that since the begirming of the NSF Centers program this area of
science has brought us some of the largest users of supercomputers. Materials scien-
tists have often been among the first group of researchers to try out new architec-
tures that promise even higher computational speeds.
Below are outlined some outstanding examples of studying properties of bulk mat-
ter from extreme conditions, such as occur in nuclear collisions, the early universe,
or in the core of Jupiter; new materials such as nanotubes and high temperature su-
perconductors; and more practical materials used today such as magnetic material
and glass.
Phase Transition in QCD
The MIMD Lattice Calculations Collaboration (MILC) is attacking the Grand Chal-
lenge problem of "the origins of mass." Their objective is to use the theory of the
forces governing what are called the "strong interactions" of elementary particles
(quarks and gluons) to calculate the observed masses and interactions of the particles
that are made out of them: the hadrons, which include the familiar proton and neu-
tron. The theory is called quantum chromodynamics (QCD), and its numerical in-
carnation is called "lattice gauge theory," because the quarks and gluons are repre-
sented on a four-dimensional space-time lattice. They have published numerous
studies of the mass spectrum of the hadrons; the transition between ordinary matter
and the quark-gluon plasma, which is important in the study of the conditions of
the early uruverse; and the decays of hadrons via weak interactions. A number of
investigators, coordinated by Robert Sugar (Dept. of Physics, UCSB), are engaged in
this project including: Claude Bernard (Washington Uruv.), Thomas A. DeGrand
(Univ. of Colorado), Carleton DeTar (Univ. of Utah), Steven Gottlieb and Alexander
Krasnitz (Indiana Univ.), Douglas Toussaint (Univ. of Arizona), Julius Kuti (UCSD).
The consortium has used large allocations of time on a wide range of MetaCenter
computational facilities including: Intel Paragon (SDSC), TMC CM-5 (NCSA), clus-
tered IBM RS/6000S under PVM (CTC, NCSA), Cray Research C-90 (SDSC and PSC)
Phase Transitions of Solid Hydrogen
Calculations by Natalie, Martin and Ceperley (Dept. of Physics UIUC, NCSA), carried
out on the CRAY Y-MP at NCSA have established the series of crystalline phase
transitions of hydrogen as it is compressed to several million atmospheres of pres-
sure, such as found in the interior of the giant planets. Since Wigner and Hunting-
ton in 1935 pointed out that a transformation from a molecular to atomic state is in-
evitable at high pressure, there have been extensive speculations on when and how
27-
148
High Performance Computing Infrastructure and Accomplishments
this transformation would take place. The recent development of the diamond
anvil technique have allowed experiments to be performed at pressure slightly
lower than the atomic transition. Those experiments confirmed the existence of an
molecular orientation transition which had been earlier computationally predicted
by Ceperley at l.SMbar. Extensive and highly accurate quantum Monte Carlo calcu-
lations on a variety of crystal structures now predict that the metallic transition will
take place from a distorted molecular hexagonal structirre into an atomic diamond
lattice. For this and other pioneering science, David Ceperley was awarded the fifth
Eugene Feenberg Memorial Silver Medal in 1994. David's PhD advisor and the third
Feenberg awardee is Mai Kalos, Director of the CTC, himself a major user of several
MetaCenter supercomputers.
Prediction of new Nanomaterials
Marvin L. Cohen (NAS) and Steven G. Louie (Dept. of Physics, UC Berkeley) have
used MetaCenter computational resources (SDSC, PSC, NCSA Cray Research Y-MP
and C90) to make numerous advances in computational materials science. Most re-
cently, they have used both first-principles and tight-binding codes to examine the
properties of carbon nanotubes and nanotubes composed of boron, carbon, and ni-
trogen. Carbon nanotubes-essentially rolled microsheets of graphite-are well
known, thanks to the work of Sumio lijima and colleagues at NEC. They have di-
ameters on the order of 1-20 nm and are producible in the same carbon arc chambers
used to produce fullerenes (also called "buckyballs"- assemblages of 60 or more car-
bon atoms in cage-like structures). They have interesting capillarity and electronic
properties. Cohen, Louie and their colleagues have predicted the structure and
properties of nanotubes made of boron nitride, which appear to be more stable and
controllable in terms of their electronic properties. They have also predicted nan-
otubes of BC_2N, a boron-carbon-nitrogen compound, whose electroruc properties
are even more interesting: they should behave like nanoscale induction coils. Most
exciting, the structures that were predicted computationally are now being produced
experimentally in the lab of Alex Zettl at Berkeley, where their electronic properties
can be confirmed.
Theory of High Temperature Superconductors
The Nobel Prize in Physics in 1987 was for the discovery of a new class of high tem-
perature superconductors. Thousands of research papers have been written about
these unique materials, but the battle is still raging over the fundamental mecha-
nism that causes the superconducting transition at Tc ~ 90K for the cuprate oxides
such as YBA2CU3O7. David Pines (NAS and first Feenberg Medalist) and Philippe
Monthoux (Department of Physics, UIUC) used the NCSA Cray Research Y-MP to
carry out a strong coupling (Eliashberg) calculation of the normal state properties
and Tc for the model experiment-based magnetic interaction between quasiparticles.
They found that when the full structure of the quasiparticle interaction is taken into
account, a superconducting transition into a d-wave planar pairing state occurs at Tc
~ 90K for comparatively modest values of the coupling constant. Although still an
area of active research, this computation lends credibility to the model that it is the
coupling of planar quasiparticles to the experimentally measured planar electronic
-28-
149
High Performance Computing Infrastructure and Accomplishments
spin fluctuation excitations which determines the normal state properties (which
they show acts like a nearly antiferromagnetic Fermi liquid) and makes possible
high temperature superconductivity.
Magnetic Materials
James Sethna (Laboratory of Atomic and Solid State Physics, Cornell Univ.) uses
CTC parallel supercomputers to study the dynamics of disorder-driven first-order
phase transformations, including 3-D numerical simulations of hysteresis loops. He
is developing scalable parallel algorithms for systems of size N = 20003. Sethna's
work enables the prediction of phase transitions with critical fluctuations, the simu-
lation of orders-of-magnitude larger systems to explore critical phenomena, and de-
tailed computational studies in materials science as applied to magnetic storage me-
dia, metallurgical phase transformations, and gases adsorbed on surfaces.
Understanding Glass
For ab initio dynamical calculations to be useful for real materials in an industrial
setting, they must be able to deal with ensembles of thousands of atoms for dynami-
cal effects modeled over microseconds. Significant algorithmic developments made
jointly at Corning, Inc. and the CTC, coupled with the much increased capability of
the CTC's IBM SP2 system, allow this threshold to be crossed for the first time. Post-
doctoral Fellow Stefan Goedecker, hired jointly by Corning, Inc. and CTC, has devel-
oped new extremely fast ways of doing tight-binding which he can parameterize
with the ab initio codes. This is the only approach currently known that will handle
thousands of atoms for millions of time steps, bringing the researchers close to ob-
serving many of the mysteries involved in glass chemistry which have been not
well understood for over 2000 years.
Biology AND Medicine
Living creatures exhibit some of the greatest complexity found in nature. Therefore,
supercomputers have made possible unprecedented opportunities to explore these
complexities based on the fundamental advances made in biological research of the
last fifty years. These activities include: inverting the data from x-ray crystallography
experiments to obtain the molecular structure of macromolecules; learning how to
use artificial intelligence to fold polypeptide chains, determined from genetic se-
quencing, into the three-dimensional proteins; and determining the function of
proteins by studying their dynamic properties, as well as how they interact with each
other or with the DNA backbone from whence they were created.
These insights are beginning to make significant impacts on medicine and plant and
animal biology. New fields of computational science, such as molecular neuro-
sciences, are being enabled by academic access to MetaCenter computing and visual-
ization resources and staff. Corporations are using supercomputers and advanced
visualization techniques in collaboration with the NSF MetaCenter to create new
drugs to fight human diseases such as asthma. New ir\sights into economically
valuable bioproducts are being gained, for instance, by combining molecular and
medical imaging techniques to create "virtual spiders" which can be digitally dis-
sected to understand the production of silk. Finally, high performance computers
-29-
oc—m D r^
150
High Performance Computing Infrastructure and Accomplishments
are just becoming powerful enough that some dedicated researchers are able to pro-
gram mathematical models of realistic organ dynamics, such as the human heart.
Crystallography
Herbert Hauptman (Medical Foundation of Buffalo, Inc.) won a Nobel Prize in 1985
for development of the "direct method" of protein structure determination from X-
ray crystallographic data. In a collaboration with Russ Miller (State University of
New York at Buffalo), these researchers have developed a numerical approach that
extends the "direct method" of determining molecular structure from X-ray crystal-
lographic data to larger molecules, beyond its present limit of about 100 atoms. The
algorithm they have developed, called "Shake-and-Bake," runs on a number of
computing platforms (PSC CM-2 and Cray T3D, NCSA and PSC CM-5) and has
proven itself effective in more than 20 cases at accurately determining the structure
of proteins that have taken as long as 10 years by existing methods, reducing the
time to a matter of hours.
Folding Proteins using Artificial Intelligence
One of the most pressing problems in molecular biology is how to determine the
folding and 3-D structure of a protein, given its sequence. Peter Wolynes (NAS), Zan
Schulten, and coworkers (Dept. of Chemistry, UIUC) have developed a novel ap-
proach to this classic problem using elements from the theory of spin glasses, asso-
ciative memory models, and neural networks. Spin glass theory provides a frame-
work for understanding the cooperative nature of the folding transition and the
qualitative nature of the phase diagram describing the thermodynamics of proteins.
Wolynes et al. developed simulation codes, based on associative memory Hamilto-
nians, and characterized their phase diagrams semi-quantitatively. These Hamilto-
nians are based on an energy function which correlates the sequence of the protein
to be folded with those of proteins whose structure is known. They were introduced
several years ago by Wolynes and coworkers as polymer analogues of the Hopfield
neural nets. Their work, carried out on NCSA's Cray-2 and CRAY Y-MP, shows that
even primitive associative memory Hamiltonians can recognize protein structures
from sequences that are only moderately related to those already existing in the
database. These procedures are somewhat similar in effectiveness to the rule-based
homology modeling.
Protein Kinase solution
The Computational Center for Macromolecular Structure (CCMS), founded in 1990,
is an NSF-fundcd joint project of UCSD, SDSC, and The Scripps Research Institute,
with collaborators from all over the country. The center made headlines in 1991
when a group led by Susan Taylor (Dept. of Chemistry, UCSB), one of the principal
investigators of CCMS published the three-dimensional structure of the catalytic
unit of cyclic-AMP-dependent protein kinase, or cAPK. This was the first kinase
structure to be solved. The solution was achieved by a combination of computa-
tional methods, including refinement on the SDSC CRAY Y-MP using the program
XPLOR, developed by Axel Bruenger of Yale University. Most important to the solu-
tion was the ability of the scientists to study stereo visualizations of the structure on
■30
151
High Performance Computing Infrastructure and Accomplishments
a large screen at SDSC, so scientists from every discipline within the group could
contribute their insight to a collective determination of the structure. Kinases play
important messenger roles in cell metabolism, and hundreds of such compounds
have been identified and sequenced. Because sequences are homologous in long
stretches, the solution for cAPK is proving extremely valuable as a template for
modeling and deriving the structure of other kinases. Taylor and her group have
collaborated with several other groups since in modeling proposed solutions for
other kinases, including those known to have carcinogenic properties or to be in-
volved in other disease processes. In all of these studies, computation and visualiza-
tion have played an important role. Solutions for various kinases can lead to the de-
sign of inhibitors to prevent the enzymes from acting to produce diseases. The work
won the Forefronts of Large-Scale Computation Award presented at Supercomput-
ing '93.
Molecular Neuroscience-Serotonin
A number of cardiovascular and psychiatric diseases are currently treated with drugs
that act on the neurotransmitter serotonin and its receptors. The cellular receptor
for serotonin is a gatekeeper molecule that recognizes and binds the serotonin and
then transmits the signal to the cell by binding to a special class of transducers: the
G-proteins. Using the CTC's ES/9000, Dr. Harel Weinstein, chairman of the Depart-
ment of Physiology and Biophysics at Mount Sinai Medical Center, made a break-
through in modeling the serotonin receptor. His breakthrough came from modeling
the structural changes that occur in the serotonin receptor when it binds to a ligand
and to the G-protein, causing it to carry out its function. This research shows how
G-proteins can be switched on by structural changes in specific regions of the recep-
tor molecule and is expected to lead to the development of more effective drugs,
specific ligands aimed at the regions where the structural change takes place. Wein-
stein believes that his work may be applied more broadly to other receptor
molecules, including all neurotransmitters, that communicate with cells via G-pro-
teins. If Weinstein can demonstrate a common mechanism of response in these re-
ceptors, he will have a new typ)e of molecular approach to treating a vast range of
diseases. Weinstein has also used the Cray Research C-90 and Intel Paragon at SDSC.
Molecular Neuroscience-Acetylcholinesterase
A collaboration between Michael Gilson, T.P. Straatsma, and Andrew McCammon
(Dept. of Chemistry, University of Houston), Daniel RipoU (Research Associate,
CTC), Carlos Faerman (Dept. of Molecular and Cell Biology, Cornell Univ.), Paul
Axelsen (Dept. of Pharmacology, University of Pennsylvania School of Medicine),
and Israel Silman and Joel Sussman (Weizmann Institute of Science, Israel) has
used molecular dynamics algorithms to investigate the rapid activity of the enzyme
acetylcholinesterase (AChE). The enzyme breaks down the neurotransmitter acetyl-
choline diffused across nerve cell synaptic gaps. Its three-dimensional crystal struc-
ture was solved by Joel Sussman and colleagues at the Weizmann Institute in Re-
hovot, Israel, several years ago. That structure showed the active site to be a long,
narrow channel — too narrow to deal rapidly with the job of dissociating acetyl-
choline into choline and an acetate ion. Yet it is known that AChE acts very rapidly.
•31-
152
High Performance Computing Infrastructure and Accomplishments
no doubt because of aeons of evolutionary pressure to optimize the response of the
nervous system in all organisms. Molecular dynamics calculations performed on
the MetaCenter's Cray Research C-90 (SDSC and PSC), TMC CM-5 (NCSA), and
Kendall Square KSR-1 (CTC) showed that there was also a "back door" to the active
site, that might open to facilitate the exit of the acetate ion from the site. A study of
the electromagnetic fields of acetylcholinesterase with the back door closed and then
open supplied confirming evidence. Since inhibitors of AChE are important medi-
cations for myasthenia gravis, glaucoma, and Alzheimers's disease, this new insight
may lead to more effective pharmaceutical agents to fight these diseases. This work
was the cover story of the March 4, 1994 issue of Science magazine.
Kinking DNA
John M. Rosenberg (University of Pittsburgh) used the PSC Cray Research C-90 vec-
tor supercomputer to determine how a protein identifies and interacts with specific
sites of DNA — a fundamental biological process called "protein-DNA recognition,"
which is related to many disease processes and is also a vital tool in the biotechnol-
ogy industry. Rosenberg's molecular dynamics simulations have refined the struc-
ture of an important protein, Eco Rl endonuclease, used in DNA cloning, and they
have resulted in a clear understanding of a "kink" in the DNA backbone that results
when Eco Rl endonuclease binds with DNA. Rosenberg won the 1991 Forefronts of
Large-Scale Computation award for this research, and his work was cited in the 1993
Computerworld Smithsonian award for science given to the PSC.
Antibody-Antigen Docking
A collaboration among computer scientists Michael Hoist and Faisal Saied (Dept. of
Computer Science, UIUC) and two biologists Richard Kozack and Shankar Subra-
maniam (Dept. of Physiology and Biophysics, UIUC and Beckman Institute/NCSA)
has been able to solve for the first time the complete nonlinear Poisson-Boltzmann
equation, which is the fundamental equation of macromolecular electrostatics. A
method based on multigrid-inexact Newton algorithms has been developed and
large memory applications run in parallel on the NCSA Convex C3 show that this
has profound consequences for protein structure, enzyme mechanisms and protein
design. Coupling this new approach with a Brownian dynamics method, the largest
simulation ever of an encounter between two proteins, an antibody and an antigen,
has been carried out using the NCSA CM-5 and SGI Challenge. This simulation for
the first time is able to give rate constants for association of proteins that is compa-
rable to experimental measurements. The results of the electrostatics work was the
cover story of the March 1994 issue of Proteins: Structure, Function, and Genetics.
Tuning Biomolecules to Fight Asthma
Over the last 20 years, the number of asthma cases has almost tripled in the U.S.
David Herron, senior research scientist at Eli Lilly and Company, is searching for
new drugs that will inhibit the action of leukotrienes, inflammatory agents released
by several types of cells in the lungs, which cause the lungs to stiffen and become ir-
ritated. Several gigabytes of data from molecular dynamics of three key
leukotrienes, run on NCSA's and Lilly's Cray-2 supercomputer were analyzed in a
-32-
153
High Performance Computing Infrastructure and Accomplishments
lengthy scientific visualization created with NCSA staff. Using the animations as a
guide, Lilly asthma researchers synthesized highly active antagonists against the
leukotrienes. Some have been tested in asthma sufferers and found to be effective
medically. For this work, Herron was a co-recipient of the first NCSA Industrial
Grand Challenge Award.
Virtual Spider and Artificial Silk
Biophysicist Lynn Jelinkski, director of Cornell University's Biotechnology Center
for Advanced Technology (CAT) is combining medical imaging techniques with the
state-of-the-art computer visualization resources of the CTC to study the molecular
structiire of the strongest silk of the golden orb weaver spider and its transformation
from a viscous fluid into the extremely strong crystalline fiber which has the poten-
tial to replace manmade fibers, such as nylon, manufactured from petrochemicals.
Jelinski has devised a way to create a 3-D "computer spider" by compiling stacks of
the 2-D MRI images using the IBM POWER Visualization System, one of the high-
performance computing resources of the CTC. Each image contains over 100,000
pixels. Hundreds of images are combined to construct the 3-D simulated spider.
Once in hand, this virtual spider can be dissected by computer to describe the
anatomy of the glandular system and to provide the physical processing informa-
tion Jelinski seeks. This understanding, coupled with molecular-level studies of the
amino acids that make up the web silk polymer, may aid in geneticaUy engineering
plants to produce fibers as strong as those produced by the spider. Jelinski's work
blazes a path toward the development of a new class textiles with superior strength
at the same time that it promises fundamental insight into the mystery of the spi-
der's web.
Heart Modeling
Charles S. Peskin and David M. McQueen (Courant Institute, New York University)
have developed over the last decade a fully functioning three-dimensional model of
the heart, its valves and nearby major vessels. This computational model will make
it possible to study questions about normal and diseased heart function that are dif-
ficult or impossible to address through animal and clinical studies. The complexity
of the heart model is so great that a single heartbeat requires a 150 hour run on the
PSC Cray Research C90 and could not have been run without the very large memory
of the C90. This research won the 1994 Computerworld Smithsonian award for
Breakthrough Computational Science. Peskin was awarded a MacArthur Prize Fel-
lowship in 1983.
Engineering
Man-made devices have become so complex that researchers in both academia and
industry have tvirned to supercomputers in order to be able to analyze and modify
accurate models in ways which complement the traditional experimental methods.
Such easily accessible high performance computers enable academic engineers to
study the brittleness of new types of steel, to improve bone transplants, or to reduce
drag of flows over surfaces using riblets. Industrial partners of the individual super-
computer centers within the MetaCenter are using computational facilities more
•33-
154
High Performance Computing Infrastructure and Accomplishments
advanced than they have access to internally to improve industrial processes such as
in metal forming. Better consumer products such as leakproof diapers, or more effi-
cient airplanes are being designed. Even State agencies are able to use the MetaCen-
ter facilities to improve traffic safety or find better ways to use recycled materials.
Some 70 corporations have taken advantage of the MetaCenter industrial programs
to improve their competitiveness.
Ultra-high-strength Steels
Gregory B. Olson and Arthur J. Freeman (Northwestern University) use computer
modeling to design ultra-high strength steel for weight-critical applications such as
naval aircraft landing gear, high-performance race cars, and bearings in the main
engine turbo pumps of the space shuttle. In recent supercomputer modeling on the
PSC Cray C90, applying quantum mechanical calculations to the structure of steel,
they have explained the molecular mechanisms that give rise to impurity-induced
embrittlement in steel, work which is expected to lead to steel that will not shatter
in frigid conditions. This work was reported in the July 15, 1994 Science. Freeman
and his group have been users of NSF supercomputers since the founding of the
program on a wide range of problems in materials sciences. In recognition of his pi-
oneering work in computational materials research. Freeman received the first Ma-
terials Research Society Medal and the first Award in Magnetism from the lUPAP.
Continuous Casting of Steel
Achilles Vassilicos (U.S. Steel Technical Center) models the flow of molten steel in a
continuous-casting "tundish" on the PSC Cray Research C90, resulting in improved
process control over the quality of steel. By more acciirately predicting the precise
metallurgical composition of the continuous-casting output "slabs," U.S. Steel re-
duces waste steel and the amount of inventory it must keep on-hand, resulting in
substantial cost savings.
Beverage can design
Three-dimensional stress modeling of aluminum beverage cans on the PSC C90 by
Robert E. Dick and Andrew B. Trageser (ALCOA Laboratories) has greatly reduced
the expense of developing a new can design that will meet customer specifications
for strength and appearance. By relying less on costly, time-consuming prototype
testing, ALCOA engineers estimate a cost savings of $100,000 or more per can design.
This research has been described in articles in Discover (March 1991), Business Week
(Oct. 8, 1990) and in Science Qune 23, 1989).
Designing a Leakproof Diaper
Designing effective and comfortable disposable infant diapers requires greater un-
derstanding of the function of the diaper components, such as the cellulosic fluff
and the superabsorbent polymer particles-and the effect of variations of parameters
related to these components. Dow Chemical Company, has done extensive experi-
mental testing, evaluation, and computer modeling that has contributed to a faster
developmental process and shortening the time for new product introduction. The
innovative Dow design was evaluated using a computer model run on NCSA's
-34-
155
High Performance Computing Infrastructure and Accomplishments
CRAY-2 system. Three separate time-dependent processes are modeled. The first, a
fast spreading process, involves the insult on the pad by a quantity of liquid (urine)
which is transported through the pad by wicking. During this process the fluff pad
collapses as it becomes wet. The second, called the imbibition process, models the
swelling of the superabsorbent polymer particles and their uptake of liquid from the
cellulose fluff. During this process, the fluff expands again. The slowest-and final-
process tracks the redistribution of liquid in the fluff pad as the saturation of the
fluff adjacent to the superabsorbents changes. The overall model was compared to a
magnetic resonance imaging experiment, which provides a three-dimensional im-
age of the water distribution in a diaper, and was shown to give comparable results
to the final steady-state values. Optimization of these processes is leading to an im-
proved, quality diaper.
Bone Transplant Bioengineering
Dean Taylor and Donald Bartel (Dept. of Mechanical and Aerospace Engineering,
Cornell University) have been able to investigate bone-implant systems across a
wide range of design parameters by using high performance parallel computing
(including the IBM SPl) and visualization resources at CTC. Their long-term re-
search has produced models of the stresses placed on normal bones and the artificial
components of a hip joint — these models have led to custom-designed prostheses
and reduced prosthesis replacement surgery.
Improving Performance with Riblets
George Em Karniadakis^ and his group at Brown University are using the SDSC In-
tel Paragon to explain drag reduction in turbulent flow by means of "riblets," paral-
lel grooves on the surface of an object moving through the flow. Such grooves are
used on aircraft, in pipelines, and on radng cars and sleds, to improve performance.
The group also models flows in micro-electromechanical systems used in surgery
and other complex applications, where the molecules of the fluid are not much
smaller than the channels in which they flow. Both projects are resulting in new
ways to optimize the performance of broad classes of machinery.
Designing Better Aircraft
Dino Roman, John Vassberg, and Tom Gruschus of McDonnell Douglas are using
SDSC's Cray C-90 for a computational fluid dynamics simulation of an aircraft in
flight and to visualize the results using FAST (Flow Analysis Software Toolkit) on a
Silicon Graphics IRIS workstation. Tracer particles are released into the flow field in
front of the aircraft and allowed to follow the streamlines around the vehicle. A cut-
ting plane through the 3D volume of data is placed to intersect the aircraft fuselage
and wings. The aerospace industry relies on computational fluid dynamics-the
simulation of air or fluid flow-to design, develop, and test new aeronautical config-
urations. This process enables companies to test new models quickly to select candi-
' Incidentally, Prof. Kamiadakis is the current chair of the joint NCSA/PSC National Peer Review
Board, another example of the tight links between members of the MetaCenter and the scientific
community.
-35-
156
High Performance Computing Infrastructure and Accomplishments
dates for wind-tunnel testing. Such methods used in aircraft design and manufac-
ture can give American companies a technological edge in the global market.
Crash Testing Street Signs
California State Department of Transportation (Caltrans) engineer Payam Rowhani
and SDSC engineer Chuck Charman generated a computerized model of a crash-test
vehicle, called a "bogie," on the SDSC Cray Research Y-MP for simulations of test
crashes with sign and lighting supports. They fine-tuned the bogie front-end design
with the computer model to minimize the number of validation tests necessary at
the Federal Outdoor Impact Laboratory, operated by the Federal Highway Adminis-
tration. In a second application, Charman is working with Caltrans engineers
William Nokes and Dario Perdomo, who are designing pavement using structural
modeling techniques. They are researching the use of new and recycled materials
and these materials' response to different axle and tire configurations. They are us-
ing the supercomputer and the visualization facilities to explore new truck suspen-
sion systems, and new tires and heavier loadings, innovative pavement structures
with recycled materials and rubber and polymer-modified binders. The results of
this research are expected to lead to significant cost savings in the design, construc-
tion, maintenance, and rehabilitation of pavement structures.
Earth Sciences and the environment
From understanding the motions of the Earth's convective mantle to daily compu-
tation of air pollution levels in southern California, the resources of the NSF Meta-
Center are being used to compute and visualize the complexity of the natural world
around us. The US Army is working with academics to determine how they can
practice tank maneuvers without endangering the breeding habits of the sage
grouse. Pollution, whether underground or in the air, is a difficult coupling of
chemical reactions and flow dynamics which must be understood in detail if correc-
tive measures are to be efficacious. High performance computers also act as time
machines, allowing for faster-than-realtime computation of severe storms. Finally,
to improve global weather or climate forecasts, supercomputers allow researchers to
zero in on the critical coupling physics of such processes as mixing at the air/ocean
interface.
Detoxification of Ground Water
Christine Shoemaker (Dept. of Civil and Environmental Engineering, Cornell Uni-
versity) has been a pioneer user of the scalable IBM SP machines at the CTC for the
development of numerically efficient supercomputer algorithms for optimal con-
trol of dynamical systems and the application of these techniques to detoxification of
contaminated groundwater. Her efforts are leading to methods of determining the
most cost-effective way to clean up the groundwater by computing time-varying
rates of pumping. Shoemaker's group has also developed an animation, using the
visualization resources (both hardware and personnel) of the CTC, that represents
the effects of different policies and natural chemical and biological processes on
groundwater cleanup. Such animations are crucial for conveying the results of basic
research to the mixed audience involved in setting environmental policy.
-36
157
High Performance Computing Infrastructure and Accomplishments
Sage Grouse-Endangered Species and the US Army
Working with the U. S. Army, Bruce Hannon (Dept. of Geography, UIUC) and Jim
Westervelt, of the U. S. Army Construction Engineering Research Laboratory, have
developed an ecological model for the sage grouse, an endangered species, popula-
tion on an army training base in Washington State. Using the Macintosh software
STELLA, the CM-5 and the GRASS geographical information systems, these re-
searchers were able to optimize the scheduling of trairung exercises to maximize
grouse reproduction and longevity. For each geographic cell, a very large STELLA
model was constructed, representing the grouse at various life stages, different kinds
of plants and predators, soil type and moisture, weather-all the many physical vari-
ables-and also introduce the necessary human activities like tank and troop maneu-
vers on the army range. Each cell, of which there are over a hundred thousand i n
the GIS covering the army base, could contain 100 to 200 variables. This work
demonstrated the efficacy of coupling GIS datasets to ecological models and running
them in a client-server fashion between a Macintosh and the CM-5.
Storm modeling/forecasting
Robert Wilhelmson (Dept. of Atmospheric Sciences, UTUC and NCSA) and his col-
leagues have been able to simulate the development of tornadoes embedded within
larger storms called supercells (producing the largest tornadoes) and along low level
convergence boundaries (e.g. along a thunderstorm cold air boundary) using both
traditional vector supercomputers (NCSA and PSC Cray Research Y-MP and C90 and
NCSA TMC CM-2 and CM-5). Study of these results is leading to a better under-
standing of when tornadoes will develop and to more accurate tornado warnings.
The visualization of the internal dynamics of a severe thunderstorm, created by the
NCSA visualization team in 1989, is perhaps the most widely viewed visualization
of a supercomputer simulation ever made. It had a major worldwide impact on the
adoption of scientific visualization as a working tool of computational science.
Kelvin Droegemeier, a former student of Wilhelmson's, and his colleagues associ-
ated with the Center for Analysis and Prediction of Storms (CAPS), an NSF S & T
Center, have used the NCSA and PSC Cray Research supercomputers to develop the
Advanced Regional Prediction System, a computational model for forecasting se-
vere storms. As of Spring 1994, this model has been used, with data augmented by
the single-Doppler radar network now being deployed by NOAA, in daily weather
reporting on an experimental basis. Because of their use of parallel supercomputers
they have shown that regional storm forecasts based on very high resolution mod-
els are possible with the advent of teraflop computing capabilities in the next few
years. The long-term objective is to improve the prediction of hazardous weather on
scales ranging from a few kilometers (an individual storm) and tens of minutes to
hundreds of kilometers (a squall line or other mesoscale system) and several hours.
Los Angeles Smog
Gregory J. McRae (Massachusetts Institute of Technology) and Armistead Russell
(Carnegie Mellon University) have developed the most comprehensive model of
smog formation available. Their modeling of smog in Los Angeles on the PSC C90
-37-
158
High Performance Computing Infrastructure and Accomplishments
showed that, contrary to EPA policy at the time, it is necessary to control nitrogen
oxide emissions as well as hydrocarbons to control smog. This work formed the sci-
entific underpinning for the Air Quality Management Plan adopted in 1988-89 for
the Los Angeles air basin, the most stringent such plan in the United States. Their
modeling also showed that alternative vehicle fuels, methanol in particular, repre-
sent a worthwhile strategy for improving urban air quality, which influenced inclu-
sion of this policy in the 1990 revisions to the Federal Clean Air Act. This work is be-
ing extended using the combination of the PSC Cray Research C90 and T3D as an
NSF Grand Challenge. The first Forefronts of Large-Scale Computation award, given
in 1989, recognized McRae for this work.
These pioneering computations are leading to practical tools for states to predict air
pollution levels. The Modeling and Meteorology Branch of the California Air Re-
sources Board joined the SDSC Industrial Partners program in 1991. They are run-
ning, on the SDSC Cray Research C90, the Urban Airshed model which estimates
hourly pollutant concentrations. They results are used to estimate maximum pollu-
tant concentrations or population exposure statistics for different emissions con-
trols.
Upper Ocean Mixing
Sidney Leibovich (Dept. of Mechaiucal and Aerospace Engineering, Cornell Univer-
sity) has developed a mathematical model on the CTC IB