Site Reliability Engineering Manager

iSeatz
Boston, MA, US
$9 an hour
Full-time

Job Description

Job Description

Our Mission

iSeatz provides digital commerce and loyalty tech solutions that enable travel and lifestyle bookings to global customers including American Express, Expedia, and IHG Hotels.

Our proprietary platform processes more than $9B a year in transactions.

We have a history of long-term trusted relationships and innovation that drives tangible value to our customers through a customizable, scalable, and secure platform, a global third-party marketplace, and loyalty integration.

We aspire to put our customers at the heart of every decision and exceed their expectations with best-in-class solutions and business-value innovations.

What you’ll do

The Site Reliability Engineering (SRE) Manager reports to the Manager of Platform Services and leads full-time and contractor team members.

In this role, you will ensure the reliability of our sites and infrastructure to meet our SLAs around latency, incidents, and uptime.

Your impact

Lead, manage and mentor a team of Site Reliability Engineers to ensure the reliability, scalability, and performance of our services.

Develop and implement SRE best practices, processes, and tools to improve system reliability and efficiency.

Utilize SumoLogic for monitoring, logging, and troubleshooting to ensure the health and performance of our systems.

Collaborate with cross-functional teams, including development, operations, and security, to design and implement reliable and scalable infrastructure solutions.

Drive incident response, root cause analysis, and post-mortem processes to identify and address system issues.

Continuously improve system observability, automation, and incident management processes.

Stay up-to-date with industry trends and emerging technologies to drive continuous improvement.

What you bring to the table

Bachelor's degree in Computer Science, Engineering, or a related field or equivalent experience.

Proven experience as an SRE Manager or in a similar leadership role.

Extensive experience with SumoLogic for monitoring, logging, and analytics.

Strong understanding of SRE principles, practices, and tools.

Experience with cloud platforms (AWS, GCP, Azure), container orchestration (Kubernetes and ECS) and serverless technologies (lambda, SNS, SQS, etc.).

Proficiency in programming and scripting languages (TypeScript, Python, Rust, Bash, TypeScript, etc.).

Excellent problem-solving skills and a proactive approach to identifying and addressing issues.

Strong leadership and communication skills, with the ability to collaborate effectively across teams.

Experience with incident management and on-call rotations.

Bonus points

Certification in cloud platforms or related technologies.

Experience with CI / CD pipelines and infrastructure as code (CloudFormation, CDK, Terraform, etc.).

Familiarity with security best practices and compliance requirements.

Location

This role is remote-first and can be located anywhere inside the continental United States. iSeatz is a New Orleans-based company with Central Time Zone business hours, but feel free to work from your home office, from the beach, or from the cottage you rented for the summer!

What we bring to the table

iSeatz is among the most prominent tech employers in New Orleans. With employee engagement and community impact at the forefront of our culture, we have been named a 2020 Top Workplace by nola.

com and honored as a CityBusiness Best Places to Work since 2008, including a 1st place award in 2020, at the height of a global pandemic.

iSeatz is committed to ensuring all employees are given every opportunity to succeed and grow within and beyond their current roles and responsibilities.

We work diligently to build and maintain trust among our workforce in everything we do, beginning with fostering an autonomous and thought-provoking work environment.

Micromanagement does not have a place at iSeatz. You will be trusted to use the knowledge and experience that brought you to iSeatz in tandem with the support of your manager and those around you, as needed, to deliver a high-quality end product.

We value a diverse workplace

We are committed to building and maintaining a culture of support, awareness, and sensitivity about the importance and impact of our differences and leverage these differences to build a stronger iSeatz.

If reasonable accommodation is needed to participate in the job application or interview process, to perform essential job functions, and / or to receive other benefits and privileges of employment, please contact the People Operations Team at [email protected].

A note about joining our workforce

At iSeatz, we’re looking for candidates who are genuinely excited about joining our fast-paced and motivated team. If you’re not enthusiastic about the opportunity to be a significant contributor;

to lead with confidence, discipline, impact, thoughtfulness, innovation, and accountability; and to bring your passion and drive for this specific role to the table, we ask that you kindly refrain from applying.

On the other hand, if this all sounds like you, we can’t wait to hear from you! Come help us shape the future of the travel and loyalty tech industry.

14 days ago
Related jobs
Promoted
Klaviyo
Boston, Massachusetts

As a Senior Site Reliability Engineer you will own multiple foundational Klaviyo services and make a big impact on the productivity of our product engineering teams. Ship foundational services to enable Klaviyo engineering to move faster with confidence. Prototype and advocate for architectural impr...

Klaviyo
Boston, Massachusetts

Site Reliability Engineering (SRE) is what you get when you treat system operations as a software engineering problem. The mission of the Site Reliability Engineering group is to provide services, tooling, and guidance to Klaviyo's product engineers to make them more productive and ensure their serv...

Promoted
Klaviyo
Boston, Massachusetts

As a Senior Site Reliability Engineer you will own multiple foundational Klaviyo services and make a big impact on the productivity of our product engineering teams. Internally, we call this role Senior Site Reliability Engineer on the Security SRE team. Check out this quick video with Sean Lutner, ...

iSeatz
Boston, Massachusetts

The Site Reliability Engineering (SRE) Manager reports to the Manager of Platform Services and leads full-time and contractor team members. Lead, manage and mentor a team of Site Reliability Engineers to ensure the reliability, scalability, and performance of our services. In this role, you will ens...

Promoted
Klaviyo
Boston, Massachusetts

As a Senior Site Reliability Engineer you will own multiple foundational Klaviyo services and make a big impact on the productivity of our product engineering teams. Ship foundational services to enable Klaviyo engineering to move faster with confidence. Prototype and advocate for architectural impr...

Klaviyo
Boston, Massachusetts

Site Reliability Engineering (SRE) is what you get when you treat system operations as a software engineering problem. The mission of the Site Reliability Engineering group is to provide services, tooling, and guidance to Klaviyo's product engineers to make them more productive and ensure their serv...

Bright Horizons
Newton, Massachusetts

The Senior Manager, Site Performance Engineering will lead our efforts to enhance the performance and overall reliability of our software applications. Lead the Site Performance and Testing Engineering teams, fostering a culture of accountability, innovation, and team spirit. Develop and implement s...

NetApp
Waltham, Massachusetts

The Site Reliability Engineering (SRE) Manager will lead a dynamic team responsible for ensuring our critical systems' reliability, performance, and efficiency. Title: Mgr, Site Reliability Engineering. This role involves a strategic blend of engineering and operations and requires a strong backgrou...

CIRCLE
Boston, Massachusetts

As a Senior Site Reliability Engineer at Circle, you will design, build, and maintain Circle’s infrastructure estate to meet the growing worldwide customer base on public cloud providers across multiple regions. Staff Site Reliability Engineer (IV). Senior Site Reliability Engineer (III). Senior Sit...

RISE Robotics
Somerville, Massachusetts

At this pivotal point in company growth, RISE is looking for a passionate and team-oriented Senior Manager, Test Engineering & Reliability to join us in shaping the future of heavy-duty linear actuation and heavy machinery. Lead a multidisciplinary test engineering team to validate the performan...