Search jobs > Columbus, OH > Site reliability engineer

Site Reliability Engineer

Huntington National Bank
Columbus, OH
Full-time

Description

Summary :

The Site Reliability Engineer provides technical and consultative support on the most complex technical matters.

Responsibilities :

  • Extensive expertise within production environments (AWS / On Premise), covering security, deployment, automation, and serverless technologies.
  • Apply deep knowledge of SRE principles to ensure the scalability and reliability of systems.
  • Utilize extensive experience in Configuration Management and Infrastructure as Code.
  • Drive process enhancements, streamline communication, and optimize delivery through innovative practices, from discovery to deployment.
  • Participate with other developers and / or contractors in troubleshooting and identifying problems.
  • Provide technical support on technical matters
  • Show drive and interest in learning new systems, applications, technologies, and most importantly business domains
  • Work in a fast-paced environment and will be good at multitasking
  • Provide Incident and Problem Management support for all Digital applications
  • Research and fulfillment of On Demand / Ad hoc requests by multiple business areas for all Digital applications
  • Good to have Digital Application Monitoring utilizing the following tools : DynaTraceSPLUNKDynatrace SyntheticsZenoss
  • Production Assurance for Infrastructure Releases
  • Disaster Recovery support for annual Testing
  • Maintenance window monitoring
  • Work with Digital Development teams to document production issues that require code fixes and assist with validating and closing out any defects
  • Status update in Daily Service Review Meetings
  • Update and Manage Runbooks and Maintenance reference Manuals on Support wiki site
  • Provide Level 1 & Level 2 support for application and platform issues
  • Ensuring monitoring alerts and systems events are assessed, prioritized, and assigned
  • Manage customer impacting incidents including business impact assessment, technical resolution, engagement, and communications
  • Own incident resolution and keep user informed of status
  • Provide expected time of availability for delayed streams and processes
  • Update ticket resolution status and details in the ticket management system(s)
  • Respond to user requests and queries
  • Escalate incidents to Level 2 / 3 development team as needed with summary analysis
  • Escalate incidents to appropriate interfacing support team or external teams such as product vendors
  • Update knowledgebase with support information (ex. Known errors and solutions for these errors)
  • Provide clarifications on data issues identified in the application

Basic Qualification :

  • Bachelor's Degree or 10 years equivalent experience
  • 5 years of experience in Software Engineering
  • 3 years of experience in Site Reliability

Preferred Qualifications :

  • Working knowledge of SQL Server or equivalent
  • Self-directed / Independent problem solving
  • Excellent oral and written communication skills
  • Experience operating in a large-scale environment
  • Exhibit best practices, follow standards and present suggestions while remaining flexible and open
  • In-depth knowledge of different SDLC methodologies including Waterfall, Agile, etc.
  • HTML 5, CSS 3, JQuery UI, ASP.net MVC, website accessibility standards
  • Professional experience with the .NET framework (C#, ASP.NET, XML) or Java
  • Performance analysis and tune JVM based services
  • Professional experience with scripting languages (e.g., PowerShell, Python, etc.)
  • Knowledge of TFS to manage an Agile development lifecycle
  • Experience in financial industry and in a Regulatory and Compliance environment preferred
  • Familiarity with large scale system monitoring and alerting frameworks
  • Expertise utilizing Cloud Infrastructure such as Azure, AWS
  • Experience creating effective resource plans that ensure a high level of performance
  • Experience developing repeatable processes and metrics that maximum uptime, reliability, and predictability
  • Experience with Agile and DevOps methodologies

Exempt Status : (Yes not eligible for overtime pay) ( No eligible for overtime pay)

Workplace Type :

Huntington is an equal opportunity and affirmative action employer and is committed to providing equal employment opportunities for all regardless of race, color, religion, sex, national origin, age, disability, sexual orientation, veteran status, gender identity and expression, genetic information, or any other basis protected by local, state, or federal law.

Tobacco-Free Hiring Practice : Visit Huntington's Career Web Site for more details.

Agency Statement : Huntington does not accept solicitation from Third Party Recruiters for any position

30+ days ago
Related jobs
Promoted
Huntington National Bank
Columbus, Ohio

The Google Cloud Platform (GCP) Site Reliability Engineer (SRE) Manager is responsible for supporting the GCP framework and consumers of the platform. The Google Cloud Platform (GCP) Site Reliability Engineer (SRE) Manager is responsible for supporting the GCP framework and consumers of the platform...

Veeva Systems
Columbus, Ohio

Veeva is seeking a talented and motivated Senior Systems Engineer - Site Reliability to join our dynamic team. As an SRE, you are innately curious, have a penchant for problem-solving, and will play a crucial role in ensuring the reliability, scalability, and performance of our systems. Our mission ...

WELLS FARGO BANK
Columbus, Ohio

Site Reliability Engineers leverage their experience as software and systems engineers to ensure applications onboarded to SRE are available, have full stack observability, introduce continuous improvement through code and automation, provide operational insight through analytics, continuously test,...

JPMorgan Chase & Co.
Columbus, Ohio

As a Lead Site Reliability Engineer at JPMorgan Chase within the Enterprise technology, network operations team, you hold a leadership role in your team, demonstrate strong knowledge across multiple technical domains, and advise others on the technical and business issues facing them. Formal trainin...

JP Morgan Chase & Co.
Columbus, Ohio

As a Lead Site Reliability Engineer at JPMorgan Chase within the Enterprise technology, network operations team, you will solve complex and broad business problems with simple and straightforward solutions. Formal training or certification on Site Reliability Engineering concepts and 5+ years of app...

JPMorgan Chase Bank, N.A.
Columbus, Ohio

Job responsibilities * Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate * Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous i...

JP Morgan Chase & Co.
Columbus, Ohio

Proficient in site reliability culture and principles and familiarity with how to implement site reliability within an application or platform. Site Reliability Engineer III. Supports the adoption of site reliability engineering best practices within your team. Formal training or certification on si...

JPMorgan Chase Bank, N.A.
Columbus, Ohio

Job responsibilities * Demonstrates and champions site reliability culture and practices and exerts technical influence throughout your team * Leads initiatives to improve the reliability and stability of your team's applications and platforms using data-driven analytics to impro...

Huntington National Bank
Ohio

The Google Cloud Platform (GCP) Site Reliability Engineer (SRE) Manager is responsible for supporting the GCP framework and consumers of the platform. The qualified candidate will collaborate with the CDO, Application, Incident, Security, and Change Management teams to manage the ITIL process, reduc...

JP Morgan Chase & Co.
Columbus, Ohio

Familiar with site reliability concepts, principles, and practices. Play a key role in ensuring system reliability at one of the world's most iconic and largest financial institutions. Leverages technology to solve business problems by writing high quality, maintainable, and robust code following be...