Job Details

Storage System Administrator

  2025-04-11     Collins Consulting     Berkeley,CA  
Description:

Must be a US Citizen or Green Card holder. This is a hybrid position.

Statement of Work:
GBS requires the services of a Storage System Administrator III to provide labor services to support the DOE National Energy Research Scientific Computing Center (NERSC) Storage Systems Group (SSG) Team's hardware and software environment at the Lawrence Berkeley National Laboratory's NERSC facilities in Berkeley, CA. The hardware and software is part of a High Performance Computing (HPC) system environment and includes storage systems, servers in support of storage systems, storage services, software, and network components. The work will require active interaction/participation with clients and the Team to troubleshoot and resolve technical issues with production storage system.

Baseline Hardware and Software Environment Support:

  • 14 racks of Elastic Storage System computer storage - Community File System - manufactured by IBM
  • 43 disk arrays - NetApp
  • 80 storage servers - Supermicro
  • 12 elastic storage system enclosures - IBM
  • 44 storage servers - test development environment - Supermicro
  • 48 mid-range servers - HPE
  • 164 enterprise tape drives - installed in IBM tape libraries
  • 3 tape libraries - manufactured by IBM
  • 3 director level fiber channel switches - Brocade
Baseline Software:
  • IBM Spectrum Scale
  • IBM Red Hat Linux, Centos
  • High Performance Storage System
Required skills/Level of Experience:
  • Bachelor's degree or equivalent experience and a minimum of three years of computing or storage experience; or equivalent experience
  • Strong understanding of Linux fundamentals including file systems, networking, and automation tools like Ansible or Puppet
  • Experience using one or more interpreted programming or scripting languages such as Python and Bash to automate system management tasks.
  • Ability to work effectively and collaboratively on a team and on technical projects, as well as give and receive constructive feedback to foster communication and trust.
  • Experience with hardware installation and replacement, running cables, cable management, racking systems, and labeling
  • Strong organizational skills and ability to effectively manage priorities across many projects ranging from immediate problem resolution to long-term strategic planning.
  • Strong written and verbal communication skills and the ability to document and describe complex tasks to audiences of varying familiarity with storage technologies.
Task Description:
Team Interaction/Participation:
  • Participate in weekly team meetings to maintain awareness of open projects and goals.
  • Monitor Slack for direct messages and other channels for issues related to storage systems.
  • Respond to email in a timely manner as determined by the University Technical Representative.
  • Participate as a proactive team member.
  • Potential participation in on-call 24/7 responsibilities.
  • Potential participation in production storage system problem determination and resolution.
Hardware activities:
  • Communicate discovered and suspected hardware issues to the storage team.
  • Monitor for and respond to hardware issues on all systems from multiple vendors as needed.
  • Amber light walk at least weekly.
  • Work with on-site technicians as needed from the University and vendors.
  • Install/de-install hardware as needed.
Software activities:
At the Client's discretion -
  • Determine for all storage system components when updates are needed.
  • Identify areas for routine process optimization and implement solutions.

Nice to have skills:
  • Has demonstrated contributions to the high-performance storage community.
  • Understanding of file system internals, prior work developing storage systems, or experience troubleshooting and optimizing parallel I/O.
#J-18808-Ljbffr


Apply for this Job

Please use the APPLY HERE link below to view additional details and application instructions.

Apply Here

Back to Search