Search jobs > Chicago, IL > Senior site reliability

Senior DevOps / Site Reliability Engineer

Teragonia
Chicago, IL, US
$160K-$180K a year
Full-time
Quick Apply

Department : Engineering

Location : Chicago / New York / Dallas (we will cover eligible relocation expenses)

Base Salary : $160,000 - $180,000

Other compensation : Performance bonus, LTIP and various other benefits

Company Overview

Teragonia develops Analytics Engineering and AI solutions for private equity portfolio companies and founder-owned businesses, supported by a team of technologists, data scientists, and business experts with first-hand private equity experience as investors, operators, and M&A advisors.

We create bespoke data-to-dollars’ value creation playbooks using Decision Intelligence Systems (DIS) that deliver real-time, system-level insights to enhance EBITDA.

We supplement the DIS with advisory services to help business leaders operationalize analytics-driven strategies and work closely with deal teams, operating partners, and management to embed analytics in strategic planning and tactical monitoring for timely course corrections.

Our Data-to-Dollars playbooks and solutions are designed to augment returns during private equity investment lifecycles and span i) Infrastructure Enhancement, ii) Business Growth and Profitability Diagnostics, iii) Value Creation, and iv) Exit Prep.

In each of the core solutions we leverage cutting edge tools and technology such as SQL, Python, Tableau, Power BI, Dataiku, DBT, and more to translate technical challenges into business value for our clients.

We deliver value to clients across three teams : i) Value Creation & Analytics, ii) Data Science & AI, and iii) Software Engineering.

Teragonia offers a comprehensive career development platform through cross-learning within our multidisciplinary structure.

We nurture a diverse, inclusive, collaborative, respectful, and collegial environment. Our competitive compensation package aligns with the technology industry leaders, encompassing 401k contribution matching, medical insurance, and additional benefits.

Job Summary

We are seeking a DevOps / Site Reliability Engineer to join our engineering team at Teragonia. This role is crucial for building, maintaining, and securing highly scalable operational systems across our cloud-based infrastructure.

As part of a cross-functional team, you will engage in the development, deployment, and continuous improvement of server-based solutions, data handling, and cybersecurity measures.

You will also participate in working with Teragonia’s customers to provide engineering support in ingesting data from client’s systems into Teragonia’s managed analytics platform.

This position is ideal for individuals passionate about leveraging DevOps principles such as collaboration, automation, and customer-centric action, and who thrive in innovative and fast-paced environments.

As an operations-oriented member of our team, you will be participating in on-call rotations, which involve being available during off-hours to handle highly exceptional cases where there is a loss of availability, security incident, or other critical incident.

We are committed to the growth and development of our team through focused education, cross-functional experiences, and mentorship.

We are particularly interested in candidates who are committed to a long-term career with us and are excited to be part of our thriving, collaborative, and forward-thinking team.

Responsibilities

  • Architect & deploy new services on Linux servers and Kubernetes hosted on Google Cloud.
  • Use Terraform to deploy cloud-based infrastructure in a modular manner.
  • Monitor system performance, conduct regular security assessments, and execute necessary updates or upgrades to maintain system integrity and reliability.
  • Manage firewall policies & networking for cloud-based infrastructure.
  • Oversee and execute regular maintenance tasks on all server-based solutions, including timely patching and updates to ensure security and efficiency.
  • Implement and manage comprehensive monitoring and alerting systems to track platform performance, proactively identify issues, and enable swift resolution to maintain system reliability and minimize downtime.
  • Design and implement CI / CD pipelines, including testing & security scanning, to facilitate seamless code transitions from development through QA to production, ensuring high-quality deployments..
  • Integrate best security practices, such as diligent secrets management, into all aspects of system architecture and data handling, for both off-the-shelf and custom-developed applications.
  • Be available for scheduled on-call rotations, which involve being available during off-hours to handle highly exceptional cases where there is a loss of availability, security incident, or other critical incident.
  • Create and maintain comprehensive documentation for deployment and operational procedures, fostering a culture of knowledge sharing.
  • Assist with product development as needs arise.
  • Mentor junior team members.
  • Perform other responsibilities as needed from time to time due to the nature of Teragonia being a startup

Requirements

Necessary Qualifications

  • Bachelor's degree in Management of Information Systems, Computer Science, Computer Engineering, or other related degree.
  • 6+ years of experience as a Site Reliability Engineer, DevOps Engineer, Cloud Engineer, or other relevant position in technology operations.
  • Knowledge of Linux systems administration and troubleshooting.
  • Knowledge of Kubernetes.
  • Proficiency in cloud infrastructure & services, especially in GCP.
  • Experience with Terraform.
  • Experience with version control best practices using Git.
  • Experience with Network Administration.
  • Expertise in containerization technologies such as Docker and Kubernetes.
  • Programming skills in Python and bash script development for automation and data handling.
  • Ability to participate in scheduled on-call rotations.
  • Availability for occasional late evening or early morning meetings to collaborate with international teams.

Preferred Qualifications

  • Certifications in Google Cloud, cybersecurity, or network administration.
  • Experience in Azure or AWS.
  • Experience with SOC II or ISO 27001 Compliance.

Benefits

  • Competitive Total Compensation - We constantly benchmark our pay scales against the market to maintain competitiveness.
  • Restricted Stock Awards - We offer restricted stock awards to select positions to ensure our teams’ long-term commitment and hard work are rewarded by sharing our collective success.
  • Visa Sponsorship - We offer visa sponsorship for eligible employees.
  • 401k Plan with Employer Contributions - We help secure your future with matching of 401k contributions up to 3.5% of your annual base salary, rising to 6% on the calendar year after your first work anniversary.
  • Professional Development Opportunities - We will sponsor company-selected training sessions, conferences, and certifications to foster your growth.
  • Meal Reimbursement - Reimbursement for lunch at the office, and dinner too if you are working past 7 pm, up to a limit.
  • Gym Membership - Reimbursement for fitness & wellness programs, such as gym memberships, up to a limit.
  • Commuter Benefits - We offer the opportunity to leverage the available tax concessions for commuting expenses.
  • Relocation Assistance - Available to those who move to the Chicago, Dallas, New York City or Houston metropolitan area as required by their job role.
  • Paid Time Off (PTO) - You will be eligible for 15 to 20 days PTO, depending on level, accrued according to company policies.
  • Annual Quiet Week - From Dec 24 to Jan 2, we anticipate a quiet week. During this period, all employees may work remotely, and we put in our best effort to keep workload as light as possible in light of balancing business needs.
  • Firm Holidays - 12 firm-designated holidays annually, in addition to the eligible PTO.
  • Comprehensive Healthcare, Vision, and Dental Insurance - We prioritize your health and wellness with insurance coverage through leading national providers.
  • Gynecology, fertility, and family-building benefits through leading providers.
  • Paid Parental Leave of 12 weeks after 6 months of service with the company.
  • Flexible Spending Account (FSA) / Health Savings Account (HSA) Plan - We provide the opportunity to manage your healthcare expenses efficiently through FSA / HSA plans.
  • Mental Health Support - Enroll in our healthcare plan and gain access to mental health support through Talkspace.
  • HealthAdvocate - Expert assistance to help you navigate the complexities of the healthcare system.
  • Disability Insurance - We sponsor short term disability insurance through a leading national provider.
  • Sick Days - We adhere to applicable state regulations, ensuring you have time to recover when you are not feeling well.

Flexible Work Policy

We empower you to choose the work environment that best balances your personal needs with the business needs. We recognize that a remote location may allow you to better focus and balance your personal needs.

We also believe brainstorming, planning, and team-building exercises are most effective in person and would expect you to be in the office, except when specific remote working arrangements are agreed due to personal situations.

For candidates living in the Chicago, Dallas, or New York City metropolitan areas or within commutable distance, we expect your presence in the office based on business or team needs or team-building activities that are most productive in person.

We do not impose specific days for in-office attendance.

We will consider specific remote working arrangements for candidates living outside the Chicago, Dallas, or New York City metropolitan areas.

We ask that you travel to our headquarters for important events, such as critical brainstorming and planning sessions, training, or team-building events.

If we open an office near you in the future, the same flexible work policy would apply as for our employees living in cities with current Teragonia offices.

Diversity, Equity, and Inclusion Statement

Teragonia strives for diversity and inclusion in the workforce and does not tolerate harassment or discrimination of any kind.

We make employment decisions based on the needs of our business and the qualifications of the individual and do not discriminate based upon race, religion, color, national origin, gender (including pregnancy or other medical conditions / needs), family or parental status, marital, civil union or domestic partnership status, sexual orientation, gender identity, gender expression, personal appearance, age, veteran status, disability, genetic information, or any legally protected characteristic not otherwise covered here.

We encourage all to apply.

2 days ago
Related jobs
Promoted
VirtualVocations
Chicago, Illinois

A company is looking for a Senior Cloud DevOps Engineer to lead the implementation and management of DevOps practices in Azure and Google Cloud environments. Key Responsibilities:Lead deployment and management of DevOps practices utilizing Kubernetes and TerraformOversee technology projects ensuring...

Promoted
Capital One
Chicago, Illinois

Senior Platform Engineer (DevOps, Mobile). New York City (Hybrid On-Site): $134,100 - $153,000 for Senior Platform Engineer. We are seeking Platform Engineers who are passionate about creating and supporting DevOps tools with emerging technologies to join our team. If you have visited our website in...

Promoted
VirtualVocations
Chicago, Illinois

...

Promoted
Cardinal Intellectual Property
Chicago, Illinois

In your role as the Senior DevOps Engineer, you will lead the implementation of tools and processes that accelerate the development and deployment of software. Create, configure, and manage CI/CD pipelines in Azure DevOps to enable efficient and automated software delivery. Configure and maintain Az...

Promoted
Capital One
Summit, Illinois
Remote

What You''ll Do:Lead a portfolio of diverse technology projects and a team of developers with deep experience in machine learning, distributed microservices, and full stack systems to create solutions that help meet regulatory needs for the companyShare your passion for staying on top of tech trends...

CIRCLE
Chicago, Illinois

As a Senior Site Reliability Engineer at Circle, you will design, build, and maintain Circle’s infrastructure estate to meet the growing worldwide customer base on public cloud providers across multiple regions. Senior Site Reliability Engineer (III). Senior Site Reliability Engineer (III). All the ...

Oak Street Health
Chicago, Illinois

As a Site Reliability Engineer, you will be instrumental to the stability and performance of a new kind of platform for healthcare, one built specifically for the clinical team. From design to implementation, you will partner with our stellar software engineering teams in a fast-paced, agile environ...

Enova Financial
Chicago, Illinois

The Site Reliability engineer will join a team of fellow SRE and Observability engineers, working together to make Enova’s reliability best of breed. As a Site Reliability engineer you will help maintain the reliability of our consumer business from a technology and operational standpoint, and will ...

Gusto
Chicago, Illinois

Staff Site Reliability Engineer. Gusto’s Infrastructure Engineering team enables our product teams to build impactful products by building secure, resilient, and accessible systems, using tools like AWS, terraform, and Kubernetes. Establish standards and build deterministic automation while optimizi...

The Hartford
Chicago, Illinois

Hands-on DevOps and CI/CD tool chains ( GitHub, Jenkins, Docker, K8s etc. Executes on Production Engineering process and practices such as incident management, root cause analysis and problem solving. Bachelors degree in Computer Science, Math, or any Engineering. Experience with continuous integrat...