Data Transfer
Storage on Hyak is physically separate from the servers used for computation. On supercomputers, it is best practice for storage to live on its own high-performance system so that it can handle the bandwidth and read/write operations required by the many compute nodes attached to it. These storage systems are typically parallel file systems (e.g., GPFS, Lustre, BeeGFS).
On klone, the storage system (e.g., /gscratch/) is then mounted (i.e., accessible) from every compute node of the cluster.
Using scp and rsync to transfer data
Transfer data from klone to your local computer with "server copy" or scp
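A minimal sketch of such a download, run from a terminal on your local computer rather than on klone. The NetID and the file name results.txt are hypothetical placeholders, and /path/to/directory stands in for wherever the file lives on klone.

```shell
# Run on your LOCAL computer.
NETID="your_netid"   # hypothetical placeholder: replace with your UW NetID

# Copy one file from klone into the current local directory (".").
# results.txt is a hypothetical file name.
scp "${NETID}@klone.hyak.uw.edu:/path/to/directory/results.txt" .
```

The general pattern is `scp SOURCE DESTINATION`, where the remote side is written as `user@host:path`.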
Transfer data from your local computer to klone with "server copy" or scp
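An upload reverses the two arguments: the local path comes first and klone is the destination. This is a sketch only; the NetID, my_file.txt, and data_directory are hypothetical placeholders.

```shell
# Run on your LOCAL computer.
NETID="your_netid"   # hypothetical placeholder: replace with your UW NetID

# Copy a single local file into a directory on klone.
scp my_file.txt "${NETID}@klone.hyak.uw.edu:/path/to/directory/"

# Copy a whole directory; -r tells scp to recurse into it.
scp -r data_directory "${NETID}@klone.hyak.uw.edu:/path/to/directory/"
```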
Transfer data with the rsync command
Similarly, data can be transferred using the rsync command. rsync and scp can be paired with other options such as -v (verbose), which provides detailed output of the transfer process, or -z (-C for scp) to compress data during transfers.
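The following sketch shows how these options might be combined. The NetID and the file and directory names are hypothetical placeholders. Note that scp's compression flag is the capital -C; lowercase -c selects a cipher instead.

```shell
# Run on your LOCAL computer.
NETID="your_netid"   # hypothetical placeholder: replace with your UW NetID

# rsync: -a archive mode (recursive, keeps timestamps and permissions),
#        -v verbose output, -z compress data in transit.
rsync -avz data_directory "${NETID}@klone.hyak.uw.edu:/path/to/directory/"

# scp: -v verbose output, -C compress data in transit.
scp -v -C my_file.txt "${NETID}@klone.hyak.uw.edu:/path/to/directory/"
```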
important concept: scp vs rsync
rsync is generally used for larger file transfers and file synchronization. Unlike scp, which always transfers the entire file, rsync will only transfer the parts of the file that changed. rsync can also resume aborted transfers from lost internet connections. scp works well for quick file transfers that do not require the additional features rsync provides.
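As one way to use the resume behavior, the sketch below passes --partial so that an interrupted transfer leaves its partial file in place. The NetID and large_file.dat are hypothetical placeholders.

```shell
# Run on your LOCAL computer.
NETID="your_netid"   # hypothetical placeholder: replace with your UW NetID

# --partial  keep a partially transferred file if the connection drops
# --progress show transfer progress for each file
rsync -av --partial --progress large_file.dat "${NETID}@klone.hyak.uw.edu:/path/to/directory/"

# If the transfer is interrupted, run the exact same command again;
# rsync reuses the data that already arrived instead of starting over.
```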
For Large Files: Compression prior to transfer
For larger data files, you can compress them into tar files to facilitate transfer. First, identify which files or directories you would like to transfer on your local device and compress them into tar archives:
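A minimal sketch of the archiving step. The sample directory and file created at the top are hypothetical and exist only so the example runs anywhere; in practice you would point tar at your own data. Plain -cvf bundles files without shrinking them; adding -z (an addition not covered in the text here) applies gzip compression as well.

```shell
# Hypothetical sample data so the example is self-contained;
# skip these two lines and use your own directory instead.
mkdir -p data_directory
echo "sample contents" > data_directory/example.txt

# -c create an archive, -v list files as they are added,
# -f name the archive file (here data.tar).
tar -cvf data.tar data_directory

# Optional check: -t lists the archive contents without extracting.
tar -tf data.tar
```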
The -c flag creates the tar archive, -v prints out all the files being processed, and -f allows you to name the archive. Once the files are compressed, transfer the tar file to klone from your local device with scp or rsync:
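Either tool can move the archive; the sketch below shows both forms, of which you would run only one. The NetID is a hypothetical placeholder.

```shell
# Run on your LOCAL computer, in the directory that contains data.tar.
NETID="your_netid"   # hypothetical placeholder: replace with your UW NetID

# Option 1: scp
scp data.tar "${NETID}@klone.hyak.uw.edu:/path/to/directory/"

# Option 2: rsync (-a archive mode, -v verbose)
rsync -av data.tar "${NETID}@klone.hyak.uw.edu:/path/to/directory/"
```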
data.tar will save to the specified /path/to/directory. To check if it transferred, log into Hyak and navigate to the target directory:
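These steps are typed interactively, one at a time, rather than run as a script. The NetID is a hypothetical placeholder.

```shell
# On your LOCAL computer: log in to klone.
ssh your_netid@klone.hyak.uw.edu

# Then, in the klone session: go to the target directory and list its contents.
cd /path/to/directory
ls
```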
On klone, under default user profile settings, data.tar should be color coded red, indicating that it is an archive file. From here, you can extract the tar file with the command:
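A sketch of the extraction step, run in the directory on klone that holds the archive. The existence check is an added safeguard rather than something the text requires.

```shell
ARCHIVE="data.tar"

if [ -f "$ARCHIVE" ]; then
  # -x extract, -v list files as they are extracted, -f name of the archive.
  tar -xvf "$ARCHIVE"
else
  echo "$ARCHIVE not found in $(pwd)" >&2
fi
```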
Tar Options
tar -c: Creates a new tar archive.
tar -x: Extract files from a tar archive.
tar -v: Stands for "verbose". Displays the list of files being archived or extracted.
tar -f: Allows you to specify the name of your archive file.
Use tar --help for a detailed list of tar options
Alternatively, you can create a tar archive and copy it to klone from your local device in one step:
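One common way to do this is to stream the archive through ssh instead of writing it to local disk first. This is a sketch under that assumption, with a hypothetical NetID.

```shell
# Run on your LOCAL computer.
NETID="your_netid"   # hypothetical placeholder: replace with your UW NetID

# "-f -" makes tar write the archive to standard output instead of a file.
# The pipe sends that stream to ssh, and cat on klone writes it to data.tar.
tar -cvf - data_directory | ssh "${NETID}@klone.hyak.uw.edu" "cat > /path/to/directory/data.tar"
```

No intermediate data.tar is created on the local device, which helps when local disk space is limited.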
This will create an archive of data_directory and send it directly to klone. The archive will be saved under the name data.tar and will be located at the /path/to/directory that you specify, where it can then be extracted and used.
Check if it transferred to klone by navigating to the directory it saved to:
If it successfully transferred, extract it using tar and the -xvf flag:
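If you prefer not to open an interactive session, the check and the extraction can be sketched as a single command sent over ssh. The NetID is a hypothetical placeholder; the interactive equivalent on klone is cd, ls, then tar -xvf data.tar.

```shell
# Run on your LOCAL computer.
NETID="your_netid"   # hypothetical placeholder: replace with your UW NetID

# && runs each step only if the previous one succeeded, so tar only
# runs when data.tar is actually present in the target directory.
ssh "${NETID}@klone.hyak.uw.edu" "cd /path/to/directory && ls -lh data.tar && tar -xvf data.tar"
```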
A Graphical User Interface for File Transfers: Cyberduck
- To use Cyberduck, you will need to download it from the Cyberduck website onto your local computer. Once it has downloaded, open Cyberduck from your file explorer and open a connection.

- A window with a drop down menu should appear. Select the SFTP (SSH File Transfer Protocol) option.


- Type in klone.hyak.uw.edu as the server name and enter your UW NetID and password. You can now click the "Connect" button on the bottom right side of the window.

After pressing "Connect", you should get a DUO two-factor authentication prompt.
After completing the two-factor authentication, you should be able to see all of your files located in your home directory on Hyak. You can navigate between directories using this visual interface.

- Cyberduck automatically creates a bookmark for the connection, so the next time you open it you can reconnect quickly by clicking the clock icon on the upper left side of the screen.
