Interactive and Batch Jobs
Interactive Jobs
An interactive session on the cluster allows users to access a computing node in real time for tasks that require direct interaction, exploration, or debugging. If you have a quick job or you are preparing software to use later, an interactive session is the best choice.
Request an interactive job with the `salloc` command. Let's start an interactive job on the `ckpt` partition. We will specify a single CPU with the flag `--cpus-per-task=1`, 5G of RAM with `--mem=5G`, and a maximum time of 20 minutes with `--time=20:00`. The job will end automatically after 20 minutes unless we end it earlier with the command `exit`.

```bash
salloc --partition=ckpt --cpus-per-task=1 --mem=5G --time=20:00
```
The output will look something like this:
```
salloc: Pending job allocation 18981043
salloc: job 18981043 queued and waiting for resources
salloc: job 18981043 has been allocated resources
salloc: Granted job allocation 18981043
salloc: Nodes n3424 are ready for job
```
Finally, your shell prompt will show that you are no longer on the login node; it will look something like this:

```
[UWNetID@n3424 basics]$
```

Here `UWNetID` will be replaced with your UW NetID, and `n3424` will be replaced with the node Slurm assigned to your interactive job. Finally, `basics` is the name of your current directory (your location on the filesystem), which may be different for you. For this tutorial, you should be in the `basics` directory that you copied in the previous section; it contains the tutorial materials and will be your working directory for this tutorial.
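As a quick sanity check, you can print the machine's hostname (a standard Linux utility; the exact name format may differ on your system) to confirm you are on the compute node rather than the login node:

```bash
hostname
# should report the node Slurm assigned (e.g., n3424), not the login node
```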
Using Locator in interactive mode
Now that we have a job open on a compute node, we can work interactively in the container and test out our code. If the container allows it (most do), you can open a shell within the container and access the software that is installed there, run software-specific commands, and test and debug your code before submitting jobs to run unsupervised. This may also be your preferred method to run a job with a short runtime.
Before we do that, we need a directory where our Locator results will be stored. For this tutorial, make a directory called `out` to hold your Locator results.

```bash
mkdir out
```

Next, open a shell inside the Locator container, `locator.sif`, with the following command.

```bash
apptainer shell --cleanenv --bind /gscratch/ locator.sif
```
Let's break this command down into its parts to understand it:

- `apptainer shell` - Apptainer is the container runtime program on Hyak, and with `shell` we are asking Apptainer to open a shell into the container.
- `--cleanenv` - Containers have their own environment variables that must be set for the containerized software to work properly. However, environment variables inherited from the host can conflict with those of the container. The `--cleanenv` flag instructs the container to ignore environment variables from the host.
- `--bind /gscratch/` - The `--bind` flag mounts the Hyak filesystem `/gscratch/` into the container so that your data files can be computed against. By default, containers don't include your project data unless you specifically build them with a copy of your data. Mounting `/gscratch` means that the container can access data files that do not exist inside the container.
- `locator.sif` - The last part of the full command is the name of the Locator container image, passed to Apptainer.
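Beyond `shell`, Apptainer can also run a single command inside the container non-interactively with `apptainer exec` and the same flags; the batch script later in this tutorial uses exactly this form. A minimal example, printing the container's Python version:

```bash
apptainer exec --cleanenv --bind /gscratch locator.sif python --version
```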
You will know that you are inside the container when your shell prompt looks like the following:

```
Apptainer>
```

Let's explore within the container by listing the root directory `/`:

```bash
ls /
bin boot dev environment etc gscratch home lib lib64 locator media mmfs1 mnt opt proc root run sbin scr singularity srv sys tmp usr var
```
Notice that we have all the directories we would have if we listed the root directory of `klone`, but now we have a directory `locator/`, which contains the files associated with the Locator GitHub repository. Let's list the Locator program files:

```bash
ls /locator/
LICENSE.txt README.md data locator_py out req.txt scripts setup.py
```
Specifically, the `/locator/scripts/` subdirectory contains a file called `locator.py`, which is the Python script used to run the Locator neural network.

```bash
ls /locator/scripts/
install_R_packages.R locator.py locator_phased.py plot_locator.R vcf_to_zarr.py
```
Additionally, we have a version of Python within the container, which we can start as follows:

```bash
python
```

```
Python 3.8.13 (default, Mar 29 2022, 14:56:46)
[GCC 8.3.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>>
```
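As a quick sanity check before running Locator, you can confirm that TensorFlow (the neural-network library whose messages we will see in Locator's output below) is importable from the container's Python. This is an optional aside, and the version in your container may differ:

```python
>>> import tensorflow as tf  # the deep-learning backend Locator runs on
>>> tf.__version__           # prints the TensorFlow version in the container
```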
Use `exit()` or press `Ctrl`+`D` to exit Python.
Next, we can run Locator with the Populus trichocarpa dataset. If you haven't already, it is critical that you complete the setup instructions to follow along.
First, let's take a look at the data.

```bash
wc -l data/potr_genotypes1000.txt
# The genotypes matrix has 425 lines:
# one row per individual tree plus a header
425 data/potr_genotypes1000.txt
```

```bash
head -3 data/potr_genotypes1000.txt
### Truncated for website view
"ALAA.20.1" 1 0 0 0 0 0 0 0 0 2 0 0 2 0 1 0 0 0 0 NA 0 1 1 0 0 0
"BELA.18.2" 0 0 0 0 1 0 0 0 0 0 1 0 0 NA 2 0 1 0 1 0 0 1 0 0 1 NA
### Truncated for website view
```
The genotypes matrix is composed of 0s, 1s, 2s, or NA. For this demonstration, I subsampled the full matrix (over 32,000 columns) such that this matrix has genotypes from 1000 sites in the genome.
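If you would like to verify that subsampling yourself, a minimal check (assuming the first line of the file is the header) is to count the fields in the first data row:

```bash
awk 'NR==2 {print NF; exit}' data/potr_genotypes1000.txt
# expect 1001 fields: 1 sample ID + 1000 genotype columns
```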
Let's look at the sample data files as well.

```bash
wc -l data/potr_m_pred1.txt
425 data/potr_m_pred1.txt
```
The sample data files each have 425 lines: one row per individual tree, plus a header.
```bash
head data/potr_m_pred1.txt
"sampleID" "x" "y"
"BELA.18.3" -126.166667 52.416667
"BLCG.28.1" -125.183333 49.833333
"BULG.11.4" -126.8 54.45
"CEDA.10.4" -128.916667 54.95
"CHKD.19.3" -127.2 51.766667
"CHWH.27.5" NA NA
"CNYH.28.5" -125.066667 49.666667
"DENA.17.4" -126.616667 52.766667
"DENC.17.4" NA NA
```
The sample data contains:

- the sample ID
- the origin as longitude (x) and latitude (y) coordinates in decimal degrees

Each row is the origin of an individual tree.
10% of the tree origins in the sample data were randomly replaced with NA. These trees will serve as the test set. Locator will train the neural network based on the genotypes of 90% of the trees of known origin, validate the neural network on the remaining 10% of the trees of known origin, and then predict the origins of the trees in the test set, providing a set of longitudes and latitudes that can be compared with the true origins of the test set trees.
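You can count the test-set trees yourself with a minimal check, assuming unknown origins are encoded as the literal string `NA NA` as in the `head` output above (10% of 424 trees is roughly 42):

```bash
grep -c "NA NA" data/potr_m_pred1.txt
```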
Let's test the code by running Locator on one test set, `data/potr_m_pred1.txt`:

```bash
python /locator/scripts/locator.py --matrix data/potr_genotypes1000.txt --sample_data data/potr_m_pred1.txt --out out/potr_predictions1
# you should see the Epochs begin to compute after 10-30 seconds
```
Let's break this command down into its parts to understand it:

- `python /locator/scripts/locator.py` - executes the `locator.py` Python script.
- `--matrix data/potr_genotypes1000.txt` - `--matrix` is the argument that indicates the provided file `data/potr_genotypes1000.txt` is the genotype matrix.
- `--sample_data data/potr_m_pred1.txt` - `--sample_data` is the argument that indicates the provided file `data/potr_m_pred1.txt` is the sample data.
- `--out out/potr_predictions1` - `--out` is the argument that indicates that results should be saved into the `out/` directory and that the files should have the prefix `potr_predictions1`.
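`locator.py` accepts more options than the three shown here. Assuming it uses a standard Python argument parser (most command-line scripts do), you can list them from inside the container with:

```bash
python /locator/scripts/locator.py --help
```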
You'll know it is working when it starts printing messages. The first messages are warnings from TensorFlow about missing GPU support; since we won't use a GPU here, they can be safely ignored. Output like the following indicates a successful start of a Locator run:
```
loaded (1000, 424, 2) genotypes
filtering SNPs
running on 989 genotypes after filtering
### Truncated for website view
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
Epoch 1/5000
11/11 [==============================] - ETA: 0s - loss: 1.3765
Epoch 1: val_loss improved from inf to 0.70498, saving model to out/potr_predictions1_weights.hdf5
11/11 [==============================] - 2s 85ms/step - loss: 1.3765 - val_loss: 0.7050 - lr: 0.0010
Epoch 2/5000
11/11 [==============================] - ETA: 0s - loss: 0.8562
Epoch 2: val_loss improved from 0.70498 to 0.66874, saving model to out/potr_predictions1_weights.hdf5
11/11 [==============================] - 1s 64ms/step - loss: 0.8562 - val_loss: 0.6687 - lr: 0.0010
Epoch 3/5000
11/11 [==============================] - ETA: 0s - loss: 0.6853
Epoch 3: val_loss improved from 0.66874 to 0.63751, saving model to out/potr_predictions1_weights.hdf5
### Truncated for website view
predicting locations...
R2(x)=0.9011147471513719
R2(y)=0.9758116274801546
mean validation error 0.6421174928291902
median validation error 0.48744820589723886
run time 0.5494037707646687 minutes
```
With that command, you trained a neural network based on genotypes of Populus trichocarpa trees sampled across western North America, and you have predicted origins for a test set of Populus trichocarpa trees based on their DNA alone. Of course, this tutorial uses only a subset of 1000 genomic sites to predict the locations of the "unknown" cases, so the error is not acceptable for a true predictive application. You are free to explore training with the full dataset `data/potr_genotypes.txt`.
Let's look at your results.

```bash
ls out/
potr_predictions1_fitplot.pdf potr_predictions1_history.txt potr_predictions1_params.json potr_predictions1_predlocs.txt
```
See the Locator publication (Battey et al. 2020) and the Locator GitHub repository for a full explanation of the output files.
The `potr_predictions1_predlocs.txt` file shows longitude and latitude positions for all individuals that were treated as unknowns in the test.

```bash
head out/potr_predictions1_predlocs.txt
x,y,sampleID
-134.91964819211591,58.435803669001785,ALSC.1.4
-122.8459170387414,45.644072664997694,CARS.29.3
-128.12848901483463,54.95290102466758,CDRE.10.3
-127.26174811400172,51.72250932734256,CHKD.19.1
-121.6691954064578,49.14120451469648,CHWH.27.5
-125.04810237947252,49.641106599108824,CNYH.28.4
-126.72995371793931,52.81267201275188,DENB.17.1
-126.78770090586458,52.82414669580753,DENC.17.4
-126.79467698648334,52.83947566564575,DEND.17.4
```
These data can be used to calculate the Haversine distance (a.k.a. "as the crow flies" distance) between the true origin and the predicted origin of each individual, providing a data point for model prediction error. Combining the model error for all individuals provides a distribution of model error that can help us assess the model for predicting the origin of black cottonwood trees using DNA alone, but that data analysis is the topic for a different tutorial.
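As a minimal sketch of that error calculation in Python (using only the standard library; the "true" origin below is hypothetical, since the real origins of the test-set trees were withheld as NA):

```python
import math

def haversine_km(lon1, lat1, lon2, lat2):
    """Great-circle ("as the crow flies") distance in km between two points."""
    r = 6371.0  # mean Earth radius in km
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))

# Predicted origin of CHWH.27.5 from predlocs.txt vs. a hypothetical true origin
pred = (-121.6691954064578, 49.14120451469648)
true = (-121.75, 49.25)  # hypothetical; the real origin was withheld as NA
print(f"prediction error: {haversine_km(*pred, *true):.1f} km")
```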
Exit the container with `exit`; your command prompt should return.

```bash
Apptainer> exit
exit
[UWNetID@n3162 basics]$
```

In the next section, we'll prepare a Slurm batch script to execute this same command unsupervised as a batch job.
Batch Jobs
Next we are going to execute the exact same code, but as a single batch job and with the second test set of individuals, `potr_m_pred2.txt`. Batch jobs are ideal for operations that take longer to run. These jobs are submitted to the Slurm job scheduler and run in the background until completed.
We made a Slurm batch script for this tutorial. You can use it as a template for submitting a single job to Slurm, replacing the main command with your own command(s). For this demonstration, we wrote the script for a general user, so it submits the job to the `ckpt` partition; this can be changed. If you want to make further changes for a different task, read the comments in the script carefully and edit it to fit your needs.
Use the text editor `nano` to edit it as needed.

```bash
nano locator_NN_job.slurm
# exit nano by holding Ctrl and pressing X; then save by pressing Y
```

```bash
#!/bin/bash
#SBATCH --job-name=locator_job
#SBATCH --partition=ckpt
#SBATCH --nodes=1
#SBATCH --ntasks-per-node=1
#SBATCH --mem=5G
#SBATCH --time=10:00
#SBATCH -o log/%x_%j.out

# command:
apptainer exec --cleanenv --bind /gscratch locator.sif python /locator/scripts/locator.py --matrix data/potr_genotypes1000.txt --sample_data data/potr_m_pred2.txt --out out/potr_predictions2

#### Truncated for website view
```
The command being executed is the same as the one explained above, but with a different `--sample_data` input file and a distinct prefix for the output files. What this exercise demonstrates are the lines in the script beginning with `#SBATCH`, known as "sbatch directives": flags passed to `sbatch` that give instructions about the job we are requesting. This script requests a single node (a single server/computer) and a single task, with 5G of RAM (memory), for a maximum time of 10 minutes. See the Slurm sbatch documentation for the full list of options. Remember to use `hyakalloc` to find which accounts and partitions are available to you. If you have a `compute` or `cpu-g2` partition available, replace `--partition=ckpt` with `--partition=compute` or `--partition=cpu-g2`, and your job will be scheduled faster because you will be requesting resources on which your group has priority access.
Once you have edited the script to fit your needs, you can submit it with `sbatch`.

```bash
sbatch locator_NN_job.slurm
# the following is an example result
sbatch: No account specified, defaulting to: account
Submitted batch job 12345678
# Slurm assigns a JobID when the job is submitted
# it will likely be an 8-digit number, but not 12345678
```
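If you spot a mistake after submitting, you can cancel a job with `scancel` and the JobID that `sbatch` reported:

```bash
scancel 12345678  # replace with your actual JobID
```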
Monitoring the Slurm Job Queue
In this section, it is often useful to have two terminal windows open and logged into `klone`: one for editing scripts and issuing commands, and one for monitoring active jobs in the queue. Open a second terminal and use `ssh` to log into Hyak. In this terminal, monitor jobs using the command:

```bash
# Below, replace the word "UWNetID" with your UW NetID.
watch squeue -u UWNetID
```

`watch` will issue the given command (`squeue -u UWNetID`) every 2 seconds, allowing you to see the jobs you have submitted enter the queue and change states. Right now the queue is likely empty because we haven't requested any jobs yet, but they will appear in this window as we continue with the tutorial.
The state of each job is listed under the "ST" column in this window. Some of the most common job states are:

- PD: Pending job
- R: Running job
- S: Suspended job
- CG: Completing job
- CD: Completed job
For the next exercises, leave this terminal open running `watch -n 10 squeue -u UWNetID` (the `-n 10` flag sets the refresh interval to 10 seconds). Continue with the exercises in the other terminal window.
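Note that `squeue` only lists pending and running jobs. Once a job finishes and leaves the queue, you can review it with Slurm's accounting command `sacct` (a minimal example; the available fields can vary by site configuration):

```bash
sacct -j 12345678 --format=JobID,JobName,State,Elapsed,MaxRSS
```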
As soon as the job begins, Slurm will create a file called `log/locator_job_12345678.out`, where the number is replaced with the JobID Slurm assigned to your job (per the `-o log/%x_%j.out` directive: `%x` is the job name and `%j` is the JobID). The output that would normally be printed to the screen while Locator is running (which we saw when we ran Locator interactively) is saved to this file instead. View this file with `cat` or `more`:

```bash
more log/locator_job_12345678.out
```

Or follow the messages in real time with the `tail` command and the flag `--follow`.

```bash
tail --follow log/locator_job_12345678.out
# Use Ctrl + C to exit the tail command
```
Congratulations, you just trained a neural network based on genotypes of Populus trichocarpa trees and predicted origins for a second test set of Populus trichocarpa trees based on their DNA alone. This time, you did it with an unsupervised batch job. Let's look at your results.
```bash
ls out/
potr_predictions1_fitplot.pdf potr_predictions1_params.json potr_predictions2_fitplot.pdf potr_predictions2_params.json
potr_predictions1_history.txt potr_predictions1_predlocs.txt potr_predictions2_history.txt potr_predictions2_predlocs.txt
```
```bash
head out/potr_predictions2_predlocs.txt
x,y,sampleID
-126.6824556618361,52.28918750153857,BELC.18.1
-126.832089065367,52.31092787450795,BELC.18.5
-123.01705302953444,46.47589331561,CARS.29.2
-127.29940751539401,51.750880602889794,CHKD.19.5
-121.72059406365925,49.29831855856583,CHWH.27.5
-121.80840511339338,49.19749305063466,CHWK.27.2
-125.10795643505986,49.613891258205996,CNYH.28.5
-126.81512181108964,52.73697246518585,DENA.17.2
-123.04024738823856,44.4420668151814,HALS.30.4
```
The Slurm job ran entirely in the background, meaning that we could have submitted the job, ended our connection to `klone` by logging out, and returned later to view the progress or results. You can instruct Slurm to send email notifications about job status by adding the following sbatch directives to your Slurm script, replacing the word `UWNetID` with your UW NetID:

```bash
#SBATCH --mail-type=ALL
#SBATCH --mail-user=UWNetID@uw.edu
```
In the next section, we will use a Slurm batch script to submit multiple jobs as an array to be executed in the background in parallel.