Hardware Break / Fix Engineer needed for an opportunity with SOC’s client to work in Los Alamos, New Mexico.
DOE Q Clearance is required or must have held one in the past 3 years (DoD Top Secret will be considered)
Responsibilities :
- Maintain the HPC systems availability to the customer
- Create and document site procedures, system diagrams, and other configuration or support documents
- Monitoring and maintaining system health on the HPC system(s) compute, network and storage
- Reviewing, resolving and responding to client tickets
- Creating, monitoring and closing all support cases
- Maintaining availability reports for tracking SLA’s
- Maintaining the system security posture required by the client
- Troubleshooting and repairing hardware issues
- Tracking / documenting the hardware repairs as well as opening, tracking, closing part cases and returning replaced parts.
- Maintaining the on-call schedule to support our 24x7x2 / 4 contracts
- Assisting with hardware and system installation activities in new systems
- Maintain system software and firmware revisions, including patches, updates, and OS upgrades
- Solve system hardware, software, and third-party software issues, and provide detailed and thoughtful analysis of problem and solution
- Gather data, perform analysis, and escalate problems to higher-level product support groups and appropriate management when necessary to ensure timely resolution of system or customer issues
- Provide solutions and implement repair or workarounds, when possible, fully documenting steps taken when required
- Answer customer inquiries concerning system software versions, product lifecycles, new releases, and third-party applications
- Works with minimal direction from the technical lead and with customer nominated representatives to accomplish assigned tasks.
- Participates as part of a team and maintains good relationships with team members and customers
Qualifications
3 - 5 years of technical experience and a Bachelor of Arts / Science or equivalent degree in computer science or related area of study;
without a degree, two additional years of relevant professional experience (5-7 years in total).
- Previous experience working on Servers, Storage, and Networking in a Datacenter environment.
- Understanding of a Data Center IT Operations environment
- Ability to lead and work effectively in a team environment
- Extensive knowledge and experience with Linux operating systems (RHEL or SLES), workload management systems, parallel file systems, networking and security
- Ability to maintain system software, utilizing debugging tools for problem isolation; will perform software builds, software upgrades, and patch installation as needed
- Excellent interpersonal, customer relations and problem management skills, with the ability to stay calm and professional under pressure while working to strict deadlines
- Experience with project planning and management, process management, and team or project leadership preferred
- Able to clearly document processes and procedures with a focus toward mentoring and knowledge sharing
- Professional written and verbal communication skills.
- Attention to detail with a focus on customer satisfaction
- Occasional travel for training is required
Desired Qualifications
- CompTIA A+ or Server+ Certification
- Security+ Certification
- Vendor Certifications
- HPC - High Performance Computing.
- Scripting languages (e.g. Bash, Perl, Python, etc.)
- Familiarity with ticket-tracking software (any ticket tracking is good)
In compliance with the Equal Pay for Equal Work Act, the annual salary range for this role is $80, to $,. This is not a guarantee of compensation or salary, as final offer amount may vary based on factors including but not limited to experience and geographic location.
Employment Pre-requisites
The following requirements must be met to be eligible for this position : Successful completion of a background investigation, and drug urinalysis.
SOC, a Day & Zimmermann company, is an Equal Opportunity Employer,