This is a slightly modified version of the standard OSCAR cluster installation toolkit to include the Scalable System Software (SSS) components. The SSS-OSCAR releases choose many defaults to simplify the installation via OSCAR. Please see the included documentation and 'RESOURCES' section for further details. Enjoy! INSTALLATION: - For further details please see the "Installation Guide" included in the doc/ directory. - Quick summary root# tar -zxf sss-oscar-VERSION.tar.gz -C /tmp # where VERSION is the software release version root# cd /tmp/sss-oscar-VERSION root# ./configure root# make install root# cd /opt/oscar root# ./install_cluster eth0 RELEASE NOTES: - The version string indicates both the SSS version as well as the OSCAR version used for the release. For example, "sss-oscar-1.2-v4.1" is sss v1.2 and oscar stable v4.1. - This release is limited to Fedora Core 2 on x86. However, the OSCAR version it is based upon supported several distributions that are not supported in this SSS specific release. (see also: "Installation Guide" and "SSS-OSCAR Overview") - Some tests have stalled/hung during 'Step3: Install OSCAR Server Packages' when trying to start NFS. Starting/restarting the 'portmap' service fixes the problem, e.g., [root@headnode]# service portmap restart - Due to some differences with standard PBS and the Bamboo & friends tools used with SSS-OSCAR, some of the test scripts are SKIPPED. More specifically, any OSCAR Package test that uses a 'test_user' script, which makes use of the 'pbs_test' helper script, will be flagged as SKIPPED. This will be fixed in a future release. (This issue is known to effect LAM/MPI, MPICH & PVM.) - The following packages were removed from the stock OSCAR package set: maui pbs lam apitest This was due to either an alternate version supplied with SSS-OSCAR or because of conflicts/errors. In the case of APItest, a newer version than was included in OSCAR-4.1 is included with this release. - During "Step 7: 'Complete Cluster Setup'" some services that are restarted print usage errors when stopping the service. This is generally not a problem and can be ignored. Example, Stopping Event Manager: cat: /var/run/sss_em.pid: No such file or directory kill: usage: kill [-s sigspec | -n signum | -sigspec] [pid | job]... or kill -l [sigspec] done - Warehouse: due to an ordering issue and limitations with this release of OSCAR the top-level 'post_install' script has been modified to run Warehouse's "post_install" API script again after all other scripts in this phase of the install. - Warehouse: To manually restarting Warehouse's client and server services by typing the following commands as shown (in this order): [root@headnode]# /etc/init.d/warehouse_SysMon stop \ && cexec -p /etc/init.d/warehouse_node stop [root@headnode]# cexec -p /etc/init.d/warehouse_node start \ && /etc/init.d/warehouse_SysMon start - If trying to work directly from CVS, the 'make_dist.pl' script should be helpful in pooling together the necessary files. It creates a tarball that can be used for testing. RESOURCES: - Scalable System Software (SSS) Project http://www.scidac.org/ScalableSystems - OSCAR Homepage http://www.OpenClusterGroup.org/OSCAR - SSS-OSCAR project page http://sss-oscar.sourceforge.net # $Id: README,v 1.14.2.1 2005/10/12 20:53:21 naughtont Exp $