tests package#
Submodules#
tests.basic_test module#
tests.cartpole_test module#
- tests.cartpole_test.test_cartpole_container()[source]#
Perform the following steps:
Allocate CPU nodes from SLURM.
Copy container to SLURM nodes.
Start Ray Cluster from containers.
Run the Cartpole experiment.
tests.cli_test module#
- tests.cli_test.test_run_bare_metal()[source]#
This test will use the CLI to create a Ray Cluster on SLURM bare-metal. Runs the following steps:
Cleanup any files from previous runs.
Use the CLI command to setup a head node. (bare-metal)
Use the CLI command to setup some CPU nodes. (bare-metal)
Use the CLI command to setup some GPU nodes. (bare-metal)
Use the CLI command to activate the Ray Cluster.
Verify from the logs that all Ray runtimes successfully deploy!
- tests.cli_test.test_run_container()[source]#
This test will use the CLI to create a Ray Cluster on SLURM with containers. Runs the following steps:
Cleanup any files from previous runs.
Use the CLI command to setup a head node. (containerized)
Use the CLI command to setup some CPU nodes. (containerized)
Use the CLI command to setup some GPU nodes. (containerized)
Use the CLI command to activate the Ray Cluster.
Verify from the logs that all Ray runtimes successfully deploy!
tests.ip_test module#
- tests.ip_test.test_ip_container()[source]#
Perform the following steps:
Allocate CPU nodes from SLURM.
Copy container to SLURM nodes.
Start Ray Cluster from containers.
Get IP from each node on Ray Cluster.
tests.multi_partition_test module#
- tests.multi_partition_test.test_multi_partition()[source]#
This test will attempt to create a multi-node, multi-partition Ray Cluster. Multi-node means more than one node. Multi-partition means hetergenous nodes (nodes have different hardware). This requires setting up the head node, CPU nodes, and GPU nodes on different computing devices and aggregating them into a single Ray Cluster.
After bringing up the multi-partition cluster, you will need to ssh into the head node and execute the multi_partition_script.sh to run the test.