Senior HPC System Architect in Phoenix, AZ at nTech Workforce

Date Posted: 9/15/2022

Job Snapshot

Job Description

Terms of Employment




  • W2 Contract-to-Hire, 3 Months

  • Public Trust/Other Required: NACI (T1)

  • Location: Onsite at Phoenix, AZ

  • Travel: Less than 10% travel required



Overview of Program



WCOSS provides NOAA the operational High Performance Computing (HPC) resources essential to process sophisticated numerical models used to predict and understand atmospheric and oceanic phenomena for weather and climate operational use. Operating 24/7, the next 10-year WCOSS program will deliver significant computational capability that will evolve over time to keep pace with NOAA’s growing environmental modeling needs.



Overview of Position



For our client, people are the differentiator and they believe every challenge is an opportunity. Their work depends on a Senior HPC Systems Engineer joining their team to support the National Oceanic and Atmospheric Administration (NOAA), Weather and Climate Operational Supercomputer System (WCOSS). They are looking for individuals to join their team to deploy, operate and support leading-edge technology for WCOSS (specific technology training will be provided).



Responsibilities




  • Applying current HPC systems administrative skills, desire to learn and deploy new technologies.

  • Developing and deploying monitoring capabilities.

  • Developing and implementing tools for cluster administration.

  • Providing technical support with team of HPC System & Storage Administrators to resolve operational issues.

  • Providing off-hour on-call support on a rotating basis.


Required Skills & Experience





  • Bachelor’s degree or equivalent

  • 10+ years of experience with HPC systems operations.

  • Experience working in a 24X7 operational environment.

  • Ability to work both independently and as part of a team.


Preferred Skills & Experience


  • Demonstrated experience to deploy and manage large-scale HPC systems using OS provisioning tools (e.g., xCat, HPCM).

  • Demonstrated experience using configuration management tools (e.g., Ansible, Puppet).

  • Linux system administration experience (e.g., SLES, RedHat or CentOS).

  • Batch management/scheduling experience (PBSpro preferred).

  • Parallel file system configuration and monitoring experience (e.g., Lustre, NFS).

  • Network interconnect configuration and monitoring experience (e.g., Infiniband, Ethernet).

  • Programming or scripting in at least two languages (e.g., Bash, Perl, Python, C).

  • Strong writing skills for technical documents, system procedures, user wiki’s and FAQs.

  • Experience developing regression tests (e.g. pavilion, ReFrame)



nTech is an equal opportunity employer. All offers of employment are contingent upon pre-employment drug and background screenings. Only candidates who meet all of the above client requirements will be contacted by a recruiter

CHECK OUT OUR SIMILAR JOBS

  1. Architect Jobs
  2. Systems Engineer Jobs