Linux - 32 bit p3 or p4; 64 bit opteron
Compilers:
Pathscale 2.0
Intel ifort 8.1.024 (use EM64 version for opteron)
Portland Group 5.2-4 (earlier versions have a bad bug)
MPI Interconnects:
MPICH 1.2.6 (backward compatibility to at least 1.2.5.2)
LAM MPI 7.1.1 (backward compatibility to at least 6.5.9)
MPICH2 1.0 (this stuff is faster than mpich!)
Myrinet MPICH-GM 1.2.6..13 (GM 2.0.17) - earlier versions probably work fine
but this is what I tested.
There is a new mechanism for both pathscale and ifort that uses LD_LIBRARY_PATH to set rpath, which then should make it reasonably likely that your processes will all find the compiler shared libraries. I was prompted to go this route by changes in recent releases of ifort that made things even worse than they were. Read the readme for details. Thanks to Ross Walker, Scott Brozell and Dave Case for providing a little impetus in this area, as well as for suggesting that ifort em64 will work on opterons (until they came out with em64, I didn't even worry about this).
This should end at lot of the headaches for configuring pmemd on Linux systems!
(Thanks for your patience; it is my intention to keep updating the web site tar ball as appropriate to support new stuff; it just happens that we released amber 8 at about the worse time imaginable in regard to stability of linux hardware and software).
Okay, I basically completed my testing and benchmarking on the opterons late last week, adding effort in the area of the pgf90 and ifort compilers. These compilers were tested to the extent possible with existing interconnects at U of Utah. All the pathscale stuff was done earlier in the week, and I was able to test virtually everything there (the pathscale compiler passes all (21) regression tests for uniprocessor and mpich-gm builds, and it also produces the fastest code of all the compilers, though the margin is not large - roughly 5%).
Testing:
For Portland Group PGI pgf90, v 5.2-4:
For Intel ifort em64, v 8.1.024
For bigcp (90,906 atoms, const pressure) on the 1.4 GHz opterons at utah I get:
1) Running on 1 processor:
Pathscale pathf90 v2.0 = 60.2 psec/day
PGI pgf90 v 5.2-4 = 57.1 psec/day
ifort em64 v 8.1.024 = 55.9 psec/day
2) Running on 8 processors, mpich 1.2.6:
Pathscale pathf90 v2.0 = 324 psec/day
PGI pgf90 v 5.2-4 = 312 psec/day
ifort em64 v 8.1.024 = not done (mpich libraries for ifort not
available)
I hope to have the more extensive pathscale benchmarks I sent to the reflector posted on the amber web site for reference; they show the excellent performance and scaling of pathscale-built pmemd on an opteron-mpich-gm cluster and also show tha mpich2 outperforms mpich if you have gigabit ethernet.
CAVEATS:
Okay, one thing to keep in mind with the faster interconnects like Myrinet is that they can be touchy, and there can be problems with all the asynchrony inherent in a fast interconnect. There have been problems with mpich-gm in the past that required a software patch. Clearly, there are current problems with a pgi-built mpich-gm build. SO, if you use these interconnects, someone who is competent in systems setup and administration needs to do the installation AND clearly test the basic functionality, and then you need to test pmemd. PMEMD basically works very well on a lot of high end hardware, but if there are any problems with threading or asynchrony in the interconnect itself, pmemd may well expose the problem. I am aware of problems with an Infiniband installation using pgi-built pmemd at the moment, and am hoping that we will be able to report that using pathscale instead fixes the problem (and / or get a fix for pgi).
Much thanks to Tom Cheatham and the folks at U of Utah for access to their delicatearches opteron cluster; this work would not have otherwise been possible.
Regards - Bob Duke