Basic Linux Commands II

In this section, we will review more commands to get you comfortable doing basic things on Hyak. This tutorial has sampled some of the data and exercises from The Unix Shell by Software Carpentry, but have been tailored to fit most Hyak users. Sampled materials are under the Copyright of Software Carpentry and are made available under the Creative Commons Attribution license (CC BY 4.0).

Tutorial:#

`head`, `tail`, `more`, `less`#

These four commands are other ways to view file contents, great for viewing smaller chunks of longer files.

head basilisk.dat # displays the first part of a file

tail basilisk.dat # displays the last part of a file

more basilisk.dat # displays file contents one page at a time
# exit the more command with Q

less basilisk.dat # displays file contents one page at a time
# exit the less command with Q

By default, the head and tail commands display the first and last 10 lines of a file. To view a specific number of lines use -n followed by the desired number of lines.

head -n 20 basilisk.dat

`cp`#

"copy" files with `cp`#

Move to top of working directory.

cd /gscratch/scrubbed/UWNetID
pwd

Copy animals.csv to your current directory using the shorthand . to mean "here"

cp /mmfs1/sw/hyak101/basics/data/animals.csv .

Copy a directory with all its contents using recursive copy

cp -r /mmfs1/sw/hyak101/basics/data/ .

`mv`#

"move" file to a new name (rename) with `mv`#

mv animals.csv dataset.csv

`rm`#

"remove" a file with `rm`#

warning

rm permanently deletes a file. This action is irreversible.

rm dataset.csv

Remove a directory with recursive rm.

cd shell-lesson-data/exercise-data/writing/
ls
rm -r thesis

`touch`#

Create an empty file with `touch`#

cd ../../../ 
pwd 
# prints gscratch/scrubbed/UWNETID
touch file1.txt file2.txt file3.txt
ls # make sure the .txt files were created

`touch` is also useful for updating the timestamp of a file#

ls -l file1.txt # -l will list the current timestamp of a file
touch file1.txt # touch will update the timestamp

`?` and `[]`#

Wildcards are special characters used as a shorthand#

# ? represents any singular character
ls file?.txt  
# lists all file.txt files with a singular character where ? is
ls file[2-3].txt
# lists all file2.txt and file3.txt
rm file[1-3].txt

`*`

cd shell-lesson-data/exercise-data/alkanes
ls
ls *.pdb # lists all files ending with .pdb
ls /sw/hyak101/basics/*.slurm
ls /sw/hyak101/basics/locator*
ls /sw/hyak101/basics/locator_NN*

`wc`#

apply "word count" with `wc`#

cd alkanes
ls 
wc cubane.pdb
wc --help
wc *.pdb

`wc -l` counts lines#

wc -l *.pdb

`>`#

"redirect" output to a file with `>`#

wc -l *.pdb > lengths.txt
ls lengths.txt
ls
cat lengths.txt

caution

If the file already exists, it will be overwritten.

`>>`#

"append" output to a file with `>>`#

First, change to the animal-counts/ directory and print the contents with the cat command

cd shell-lesson-data/exercise-data/animal-counts/
cat animals.csv

The expected output is as follows:

2012-11-05,deer,5
2012-11-05,rabbit,22
2012-11-05,racoon,7
2012-11-06,rabbit,19
2012-11-06,deer,2
2012-11-06,fox,4
2012-11-07,rabbit,16
2012-11-07,bear,1

Using wc -l to count the lines of the above output

wc -l animals.csv

8 animals.csv

To create a subset of animals.csv, we can combine head, tail, >, and >> commands.

head -n 3 animals.csv > animals-subset.csv
cat animals-subset.csv

animals-subset.csv should now contain the first 3 lines of animals.csv:

2012-11-05,deer,5
2012-11-05,rabbit,22
2012-11-05,racoon,7

Because we do not want to override animals-subset.csv file contents, use >> to append rather than >:

tail -n 2 animals.csv >> animals-subset.csv
cat animals-subset.csv

Now, the subset reads as follows:

2012-11-05,deer,5
2012-11-05,rabbit,22
2012-11-05,racoon,7
2012-11-07,rabbit,16
2012-11-07,bear,1

Counting the lines once more:

wc -l animals-subset.csv

Expected output:

5 animals-subset.csv

`sort`#

"sort" numbers with `sort`#

cd ../
# You should now be in the exercise-data directory
pwd
# /gscratch/scrubbed/NETID/shell-lesson-data/exercise-data
cat numbers.txt

The expected output:

The sort command by itself will sort a text file's contents in ascending order:

sort numbers.txt

Notice how the output is sorted in lexicographical order:

To sort the numbers based on their numerical value, use the -n option

sort -n numbers.txt

Now, the file contents will be interpreted as numbers rather than strings and numbers.txt will be sorted in ascending numerical order:

tip

The sort command will sort lines alphabetically or numerically. Lines with numbers and letters are sorted as follows:

Numbers
Capital letters
Lowercase letters

Other sorting options include -r which sorts lines in reverse order and
-u which removes duplicate lines.

Going back to the alkanes directory

cd alkanes
cat lengths.txt
sort -n lengths.txt

Expected output:

methane.pdb
ethane.pdb
propane.pdb
cubane.pdb
pentane.pdb
octane.pdb
total

Create a sorted list of the alkane lengths and check the first output line:

sort -n lengths.txt > sorted-lengths.txt
head -n 1 sorted-lengths.txt

9 methane.pdb

caution

Empty file resulting from >. The following code will cause the contents of lengths.txt to be deleted:

sort -n lengths.txt > lengths.txt
cat lengths.txt

Notice how lengths.txt is now an empty file. To get lengths.txt back, backtrack to the original command:

 wc -l *.pdb > lengths.txt

`|`#

Use a "pipe" to string commands together#

Alternatively, the alkane lengths can sorted by using one line of code:

wc -l *.pdb | sort -n

wc -l *.pdb | sort -n > sorted_lengths_v2.txt

cat sorted_lengths_v2.txt

The output should be the same at sorted-lengths.txt:

methane.pdb
ethane.pdb
propane.pdb
cubane.pdb
pentane.pdb
octane.pdb
total

`grep`#

Search for a PATTERN in a FILE with `grep`#

grep finds and prints lines in files that match a pattern

cd shell-lesson-data/exercise-data/writing
cat haiku.txt
grep not haiku.txt 

Only the lines containing "not" will print out. Now try the follwing grep command:

grep The haiku.txt

The Tao that is seen
"My Thesis" not found.

Notice how longer words containing "The" are also printed out. Use the -w option to ensure that grep is only matching whole words:

grep -w The haiku.txt

The Tao that is seen

grep also works with multiple words at a time:

grep -w "is not" haiku.txt

Today it is not working

Other options include -n which displays line numbers and -r which recursivly searches for patterns:

grep -n "it" haiku.txt

With searching comes loss
Yesterday it worked
Today it is not working

Combining options -n and -w:

grep -n -w "the" haiku.txt

2: Is not the true Tao, until
6: and the presence of absence:

The -r (recursive) option:

grep -r Yesterday .

This searches for the word Yesterday in your current directory (writing).

`history`#

View your history of commands with `history`#

history

Use history, |, and grep together to find all times the cat command was used.

history |grep cat

`find`#

"find" files or files matching a pattern with `file`#

cd shell-lesson-data/exercise-data/
find . -name numbers.txt

This will print out the location of numbers.txt :

./numbers.txt

Use * to find all .txt files starting in your current ditectory :

find . -name "*.txt"

Use grep and find togther to search for files with specific words or phrases:

grep "searching" $(find . -name "*.txt")

The output shows the file location and the lines with the word "searching" :

./writing/haiku.txt: With searching comes loss
./writing/LittleWomen.txt: sitting on the top step, affected to be searching...

`scp`#

Transfer your data to `klone` with "server copy" or `scp`#

From your local computer#

Find a file you would like to transfer. Let's say it is called text.txt. Use the following to transfer text.txt from your local computer to klone. We will transfer it to our working directory in gscratch/scrubbed

scp text.txt UWNetID@klone.hyak.uw.edu:/gscratch/scrubbed/UWNetID

Transfer data from `klone` to your local computer with "server copy" or `scp`#

To your local computer#

scp UWNetID@klone.hyak.uw.edu:/gscratch/scrubbed/UWNetID/text.txt .

`rsync`#

Similarly, data can be transferred using the rysnc command

# From Klone to your local computer
rysnc UWNetID@klone.hyak.uw.edu:/gscratch/scrubbed/UWNetID/text.txt .
# use -a to preserve original file permissions, timestamp, etc
rysnc -a UWNetID@klone.hyak.uw.edu:/gscratch/scrubbed/UWNetID/text.txt .

rsync and scp can be paired with other options such as -v (verbose) which provides a detailed output of the transfer process or -z ( -c for scp ) to compress data during transfers.

important concept: `scp` vs `rsync`

rsync is generally used for larger file transfers and file synchronization. Unlike scp which always transfers the entire file, rsync will only transfer the parts of the file that changed. rsync can also resume aborted transfers from lost internet connections. scp works well for quick file transfers that do not require the additional features rsync provides.

Well done!#

You made it through the tutorial! If you have any questions about the content covered here or feedback to pass along about this tutorial, please open a ticket by emailing help@uw.edu with "Hyak" in the subject line.

Tutorial:#

head, tail, more, less#

cp#

"copy" files with cp#

mv#

"move" file to a new name (rename) with mv#

rm#

"remove" a file with rm#

warning

touch#

Create an empty file with touch#

touch is also useful for updating the timestamp of a file#

? and []#

Wildcards are special characters used as a shorthand#

*

wc#

apply "word count" with wc#

wc -l counts lines#

>#

"redirect" output to a file with >#

caution

>>#

"append" output to a file with >>#

sort#

"sort" numbers with sort#

tip

caution

|#

Use a "pipe" to string commands together#

grep#

Search for a PATTERN in a FILE with grep#

history#

View your history of commands with history#

find#

"find" files or files matching a pattern with file#

scp#

Transfer your data to klone with "server copy" or scp#

From your local computer#

Transfer data from klone to your local computer with "server copy" or scp#

To your local computer#

rsync#

important concept: scp vs rsync

Well done!#

`head`, `tail`, `more`, `less`#

`cp`#

"copy" files with `cp`#

`mv`#

"move" file to a new name (rename) with `mv`#

`rm`#

"remove" a file with `rm`#

`touch`#

Create an empty file with `touch`#

`touch` is also useful for updating the timestamp of a file#

`?` and `[]`#

`*`

`wc`#

apply "word count" with `wc`#

`wc -l` counts lines#

`>`#

"redirect" output to a file with `>`#

`>>`#

"append" output to a file with `>>`#

`sort`#

"sort" numbers with `sort`#

`|`#

`grep`#

Search for a PATTERN in a FILE with `grep`#

`history`#

View your history of commands with `history`#

`find`#

"find" files or files matching a pattern with `file`#

`scp`#

Transfer your data to `klone` with "server copy" or `scp`#

Transfer data from `klone` to your local computer with "server copy" or `scp`#

`rsync`#

important concept: `scp` vs `rsync`