Job Summary
Apply fast, check the full description by scrolling below to find out the full requirements for this role.
Plans, installs, configures and tests monitoring, observability and automation solutions for the Enterprise. Ensures tools, processes and technical solutions are tailored to maximize functionality.
Prepares and maintains operational documentation and standards. Leads development of automation and monitoring solutions for all technology components (APM, Infrastructure, RUM, Synthetics, custom integrations etc.
Ensures that services and components meet and continue to meet all of their agreed performance targets and service levels.
Progress and maintain automation in business uses cases and technology incident management. Manages and monitors cloud-based infrastructure, ensures security and compliance, collaborating with other teams.
Maintains support processes, identifies issues and resolves problems relating to applications and infrastructure.
Primary Activities and Responsibilities
Plans and implements service request work and has a significant role in the delegation of responsibilities. Works under broad direction.
Work is often self-initiated. Is fully accountable for meeting allocated technical objectives.
Establishes milestones for project related work and makes decisions which impact the success of projects and team objectives.
Analyzes, designs, plans, executes and evaluates work to time, cost, and quality targets.
Investigates and resolves complex problems requiring an understanding of the relationship between own specialism and wider customer / organizational requirements.
Influences organization, customers, suppliers, partners and peers on the contribution of his own specialism.
- Performs an extensive range of complex technical or professional work activities. Undertakes work which requires the application of fundamental IT principles in a wide and often unpredictable range of contexts.
- Maintains an awareness of developments in the industry. Takes initiative to keep skills up to date. Mentors colleagues.
- Follows best practices around developing and maintaining secure systems.
- Miscellaneous activities and responsibilities as assigned by manager.
Minimum Qualifications
- Bachelor's degree from an accredited institution required in Computer Science, Computer Engineering, Software Engineering, Information Systems / Technology or related major field of study.
- 5 or more years of experience required in working in the observability, operations, or DevOps domains.
Equivalent Minimum Qualifications
- High School diploma / GED.
- 7 or more years of experience required in working in the observability, operations, or DevOps domains.
Preferred Qualifications
- Master's degree from an accredited institution required in Computer Science, Computer Engineering, Software Engineering, Information Systems / Technology or related major field of study.
- 3+ or more years of experience in working in the observability, operations, or DevOps domains.
Knowledge and Skills
- Proficient in observability, monitoring, and logging tools like Datadog, SolarWinds, Azure Monitor.
- Experience with API and test automation (i.e., Insomnia, Postman).
- Knowledge of system and software quality assurance best practices and methodologies and experience in performance testing including requirements, test plan, scripting, execution and analysis.
- Experience with automation tools such as Ansible, Puppet or Terraform.
- Deep understanding of IT infrastructure monitoring and observability best practices.
- Programming skills in languages such as Perl, Shell, or JavaScript.
- Solid understanding of agile methodologies such as CI / CD, application resiliency, and security.
- Knowledge of Microsoft Azure.
- Executes software solutions, design, development, and technical troubleshooting with ability to think beyond routine or conventional approaches to build solutions or break down technical problems.
- Experience with installation, setup and configuration of monitoring tools like Datadog, SolarWinds, Azure Monitor.
- Knowledgeable about traditional ITSM principles and modern incident management processes and platforms (Opsgenie, PagerDuty, or MIR3).
- Works with product owners to define functional and non-functional requirements.
J-18808-Ljbffr