Lead Site Reliability Engineer

AppOmni
Little Ferry, New Jersey, US
Full-time

As the Lead Site Reliability Engineer (SRE) Engineer, you will ensure the reliability, scalability, and performance of our systems and infrastructure.

Key duties include monitoring system availability, implementing automation for deployment and maintenance tasks, and proactively identifying areas for optimization.

You will also collaborate with the development team to establish and refine service level objectives, as well as drive incident response and postmortem analysis to minimize service disruptions.

Your work will have a direct and meaningful impact on the integrity and security of our customers and their customers' data - including your own! Our core values are customer experience, quality, and trust.

We succeed when our customers can confidently understand and manage the security and configuration critical to their business.

Scroll down to find an indepth overview of this job, and what is expected of candidates Make an application by clicking on the Apply button.

You have

  • 7+ years of relevant experience
  • Excellent technical and non-technical communication skills
  • Prior Experience as an SRE or related disciple responsible for maintaining the high availability of a cloud-based application, troubleshooting performance bottlenecks, configuring monitoring and alerting, and conducting incident response in a blameless environment
  • A knack for reducing manual toil tasks with automation and systematic thinking
  • Prior experience working with CI / CD tools and processes, pipelines-as-code (GitHub Actions, CircleCI)
  • At least 5+ years of hands-on experience with Python or Golang
  • A solid background in configuration management and infrastructure-as-code(Terraform)
  • Solid experience in monitoring / observability systems, including Synthetic and APM monitoring (Grafana, Prometheus, Scout, etc.)
  • Solid experience in troubleshooting and handling outages & incidents.
  • Demonstrated knowledge of Container orchestration ( Kubernetes / GKE)
  • Experience managing Kubernetes platforms and resources, and using Kubernetes deployment tools and patterns ( Helm, GitOps, Knative)

You might also have

  • Experience in FedRAMP or similar secure environments
  • Expertise working within highly controlled environments containing sensitive information.
  • Experience designing and maintaining CI / CD pipelines using commercial solutions
  • Experience working on and within GCP and / or AWS

We have

A flexible, remote-first company with a team of talented individuals who love answering questions, guided by a high bar for quality and commitment to self-improvement and personal growth.

An open mind for new ideas and methodologies, offering competitive salary and benefit options and opportunities and support for massive career growth.

Culture

We believe in cultivating excellence - within ourselves and in the work that we do. Our team of customer-centric, data-driven experts is brought together by the shared passion to create tools for the greater good.

Our tribe is determined to make a difference to positively impact our way of life by securing the technology that is changing the world.

We believe in being a trusted and transparent partner to our customers. We are fervent about providing them with high-quality, usable, and dependable software focused on the human experience, built out of a culture of competition and a deep understanding of their needs and goals.

We value our people and know that wellness and a healthy work / life balance enable you to thrive and bring us your best.

An autonomous schedule, flexible commute, and freedom from punching a clock mean you are empowered to enjoy life, work when inspired, and be available when needed.

About AppOmni

AppOmni is a leading provider of SaaS Security Management software. The company was founded by a team of security veterans from top SaaS providers and cybersecurity vendors, and its customer base includes global leaders across technology, healthcare, banking, and finance, as well as many of the most well-known cybersecurity providers.

AppOmni's patented technology scans APIs, security controls, and configuration settings to compare the current state of enterprise SaaS deployments against best practices and business intent.

The solution offers fast deployment and instant visibility and makes it easy for security and IT teams to secure their entire SaaS environment from each vendor to every end user.

As SaaS applications evolve, AppOmni stays current with all updates and releases to keep customer environments secure.

https : / / appomni.com / about /

AppOmni is an equal-opportunity employer. Applicants will not be discriminated against because of race, color, creed, sex, sexual orientation, gender identity or expression, age, religion, national origin, citizenship status, disability, ancestry, marital status, veteran status, medical condition, or any protected category prohibited by local, state or federal laws.

J-18808-Ljbffr

12 days ago
Related jobs
Promoted
Capital One
Newark, New Jersey

As a Capital One Lead Software Engineer, Site Reliability Engineer you’ll have the opportunity to be on the forefront of driving a major transformation within Capital One. Lead Software Engineer, Site Reliability (Bank Tech). New York City (Hybrid On-Site): $201,400 - $229,900 for Lead Software Engi...

Promoted
RBC Capital Markets, LLC
Jersey City, New Jersey

The Lead Support SRE will be responsible for supporting and spearheading the development and implementation of Site Reliability Engineering solutions for all applications within City National Bank (CNB), an RBC company. Spearhead the development of SRE solutions (monitoring and alerting, machine lea...

Promoted
JPMorgan Chase & Co.
Jersey City, New Jersey

Lead Site Reliability Engineer. Take lead and conduct resiliency design reviews, break up complex problems into digestible work for other engineers, act as a technical lead for medium to large-sized products, and provide advice and mentoring to other engineers. Advanced knowledge in site reliability...

Promoted
AppOmni
Little Ferry, New Jersey

As the Lead Site Reliability Engineer (SRE) Engineer, you will ensure the reliability, scalability, and performance of our systems and infrastructure. AppOmni is a leading provider of SaaS Security Management software. The company was founded by a team of security veterans from top SaaS providers an...

Promoted
Automatic Data Processing, Inc.
Roseland, New Jersey

ADP is hiring a Senior Software Engineer. In this role, you will collaborate with a team of software engineers to create microservice features for Lifion by ADP's Human Capital Management software platform. You will work alongside a team of intelligent and creative engineers to complete sprints iden...

Promoted
ClickJobs.io
Newark, New Jersey
Remote

Site Reliability Engineer - Backend, Shopping (Remote-Eligible). As a Capital One Senior Lead Software Engineer, you’ll have the opportunity to be on the forefront of driving a major transformation within Capital One. We are seeking Full Stack Software Engineers who are passionate about marrying dat...

Promoted
Devexperts LLC
Jersey City, New Jersey

We are looking for a Senior Site Reliability Engineer (SRE) to fill the open position in a team that develops and supports proprietary trading platforms for large scale clients. Make key decisions for scalability, reliability, and accessibility. ...

Promoted
Cloudtel Technologies
Paterson, New Jersey

A little about ADP: We are a global leader in HR technology, offering the latest AI and machine learning-enhanced payroll, tax, HR, benefits, and much more. Systems Reliability: Working in two-week sprints, you’ll ensure that our software services and applications are monitored appropriately to dete...

IDEXX
US, NJ, Virtual

Are you interested in working on a fast-paced Agile team, building modern & global LIMS platform? Do you want to work on a product that makes a difference in the day-to-day life of lab operations, veterinarians, and pet owners? Are you a self-starter individual? We are looking for a motivated engine...

Royal Bank of Canada>
Jersey City, New Jersey

The Application Support SRE will be responsible for the support, development, and implementation of Site Reliability Engineering solutions for all applications within City National Bank (CNB), an RBC company. Development of SRE solutions (monitoring and alerting, machine learning anomaly detection, ...