Skip to main content

2 posts tagged with "Slurm"

View All Tags

January 2026 Maintenance Update

· 5 min read
Kristen Finch
Director of Research Computing Solutions

During this month’s scheduled maintenance window, we completed several system upgrades and routine updates across Klone and Tillicum to improve stability, performance, and security. The next maintenance is scheduled for Tuesday, February 10, 2026 (the second Tuesday of the month).

Notable Updates

In addition to routine image updates and security patches, we upgraded:

  • Klone login node process enforcement to Arbiter 3 and paused user notifications and Klone and Tillicum (read more below).
  • Klone has been upgraded from cgroups v1 to cgroups v2. This upgrade shouldn't be a noticeable for most users, but in some cases memory accounting under cgroups v2 is unified and includes all file-backed page cache. This difference may make memory usage appear higher or "inflated" compared to v1 in niche cases and some specific Java-based applications.
  • Slurm version 25.11.1 on both clusters.
  • Duo 2FA on Klone (Tillicum already up to date). This change aligns with ongoing UW security upgrades.
Action Required

All University of Washington technology users must update Duo Mobile to version 4.85.0 or later on all registered devices by February 2, 2026. Users who cannot update their devices must register a platform authenticator, a new phone/tablet, or request a hardware token from UWIT. The Duo phone call authentication method is being phased out by the University and no longer available for logging into Klone or Tillicum. 

Check your device’s Duo Mobile application version.

Login Node Usage & Arbiter Enforcement (Important Reminder)

Login nodes are shared community resources intended for:

  • Transferring data
  • Navigating the filesystem
  • Editing and developing code
  • Submitting jobs to the scheduler

As part of this maintenance, Klone was upgraded to Arbiter 3, a tool which automates login node monitoring and enforces usage limits to ensure stability and ensure fair access.

Arbiter monitors resource usage on login nodes and will:

  • Slow or halt processes that exceed permitted thresholds.
  • Terminate processes outright if necessary.

Arbitor email notifications halted

Previously, users received email notifications for each offending process when Arbiter thresholds were exceeded. We have found these notifications are not an effective way to communicate our policies. To avoid notification fatigue, we have stopped sending these emails.

This does not mean enforcement has stopped. Arbiter continues to actively limit, halt, or kill processes on login nodes as needed.

Be a good HPC-citizen - do not connect to a login node

Connecting directly to the login node using tools like VS Code Remote-SSH frequently leads to Arbiter intervention and could cause login node instability since background server processes persist beyond an active session.

Instead, follow our recommended best practices and set up your ProxyJump to connect your local VS Code to Klone or use the streamlined option offered by our Open OnDemand interactive application for VS Code.

Winter 2026 Computing Workshops

Stay informed by subscribing to our mailing list and the UWIT Research Computing Events Calendar.

Office Hours

Additional Training Opportunities

Having trouble? Get Research Computing support.

Happy Computing,

Hyak Team

January 2025 Maintenance Details

· 5 min read
Kristen Finch
HPC Staff Scientist

Our January maintenance is complete, and Klone is back in operation. The next maintenance is scheduled for Tuesday February 11, 2025 (AKA the 2nd Tuesday of the month).

Notable Updates

  • Slurm update: We updated to version 24.11.0. If any user software was built against the older Slurm libraries (e.g., openmpi), then users may experience errors, and it may be necessary to rebuild their software against the newer Slurm libraries.
  • Compute node updates: The compute node OS images were updated to address any security patches in underlying core packages.
  • Mathematica module installed using the license maintained by the UW Physics Department. Learn how to launch Mathematica on Hyak.
  • MATLAB application has been added to Hyak Open OnDemand Beta. Learn how to launch MATLAB with Open OnDemand.

Upcoming Training

Hyak: Open OnDemandOpen OnDemand (OOD) is an open-source web portal for HPC centers to provide users with an easy-to-use web interface to HPC clusters. For the last year, the Hyak team has been adding features to OOD. This workshop will demonstrate OOD's main features such as exploring the filesystem, composing jobs, and launching interactive applications like Jupyter. This workshop will be held in person 10am - 11:30am on Friday January 31, 2025 in CSE2 (Gates Center) Room 371 (3800 E Stevens Way NE, Seattle, WA 98195). Click here to learn more and register for this event.

Winter 2025 Office Hours

If you would like to request 1 on 1 help, please send an email to help@uw.edu with "Hyak Office Hour" in the subject line to coordinate a meeting.

Opportunities

The eScience Institute offers the annual Winter School to students and lecturers interested in developing basic skills and knowledge of the tools used in data science. Gaining literacy in topics such as Python, R, Jupyter, and reproducible environments can be beneficial beyond STEM, including areas like global or public health, public policy, social sciences, social work, international relations, and business management. Apply by January 24, 2025. Learn more!

Summer Internship Opportunity at Purdue - The Rosen Center for Advanced Computing (RCAC) is seeking students for Research Experience for Undergraduates (REU) paid internships for an 11-week onsite summer REU program. This program aims at developing the next generation workforce in advanced computing and cyberinfrastructure technologies. It offers students from diverse backgrounds the opportunity to gain the knowledge and skills necessary to build and support advanced research computing systems and scientific applications. As part of RCAC's decade long successful student apprentice program, the REU students will learn by doing, working on the National Science Foundation funded Anvil system in a team environment and mentored by cyberinfrastructure professionals. Open to undergraduate students from all backgrounds and undergraduate programs within and beyond Purdue. Each student will present their work to Purdue staff, faculty, students, collaborators, and researchers at the end of the program. Students may present at a national conference as part of the program. This onsite program at Purdue University in West Lafayette, Indiana runs from May through August. Learn more and apply NOW! Interviews start in January!

April 2-3, 2025, for AI Unlocked: Empowering Higher Education through Research and Discovery, a workshop designed for individuals across all disciplines and career stages in higher education. This workshop provides access to cutting-edge computational resources and expertise to researchers, students, faculty, and practitioners, especially those from Minority-Serving Institutions (MSIs). Hosted by the National Artificial Intelligence Research Resource (NAIRR) Pilot User Experience Working Group, this event introduces AI fundamentals, hands-on experience with pre-configured AI tasks, and guidance for tailoring AI models to your projects. Travel support is available for selected participants to the Westin Denver Downtown, and virtual attendance options are also offered. Apply now to advance your AI skills and collaboration in higher education. Apply by January 31, 2025. Acceptances will be notified by the end of February.

The University of Alaska Fairbanks (UAF) Alaska Center for Energy and Power (ACEP) summer internship is a 10-week program for students to gain hands-on research experience and skill development in the energy industry. Our program offers two internship strands: AUSI and REU. Regardless of strand, all interns will receive:

  • A specific research project with 1:1 mentorship from an ACEP researcher
  • Collaborative workspace at ACEP
  • Travel to and from Alaska
  • Field trips related to energy in Alaska

Applications due January 24, 2025.

If you have any questions about using Hyak, please start a help request by emailing help@uw.edu with "Hyak" in the subject line.

Happy Computing,

Hyak Team