Search jobs > Bloomington, MN > Site reliability engineer

Sr Site Reliability Engineer

SAS
Bloomington, Minnesota, United States
Full-time

Sr Site Reliability Engineer

Job Locations US-MN-Bloomington Requisition ID 20060806 Category IDeaS (a SAS company) Position Type Contractor

Passionate people. Loyal clients. Leading solutions.

With a rich culture of creative collaboration and professional growth, IDeaS’ team members build successful careers with us.

IDeaS is proud to be a global powerhouse of innovation and excellence; challenge and reward. No matter where we’re working, our teams come together to create leading revenue management solutions that accelerate our clients’ growth through revenue optimization.

Now we just need you!

We are seeking a Senior Site Reliability Engineer at IDeaS, a SAS Company. You will play a pivotal role in ensuring the reliability, scalability, and performance of our revenue science software solutions.

With a minimum of eight years of experience, you bring a wealth of knowledge and expertise in software development and infrastructure operations.

You will serve as a go-to expert in ensuring the stability and efficiency of our systems, collaborating closely with cross-functional teams to address complex challenges.

Your strong communication skills will be instrumental as you proactively build relationships and streamline processes to enhance system reliability and performance.

You are persistent in the face of roadblocks, dispatch them efficiently, and pull in others when necessary, taking the initiative to ensure stability of the production environments while creating a backlog to reduce re-occurrences of issues and ensure long-term scalability.

Our systems are data-intensive and require a strong focus on data and machine-learning pipelines.

What you’ll be doing...

  • Collaborate closely with our development and operations teams to design, implement, and maintain highly available, scalable, and resilient software solutions, with a particular focus on data and ML pipelines.
  • Utilize your expertise in cloud computing and microservices architecture to enhance the reliability and performance of our data-intensive systems.
  • Engage with stakeholders to understand system requirements and ensure that our solutions meet rigorous reliability and performance standards, especially in the context of data processing and machine learning.
  • Actively participate in project scoping, scheduling, and task tracking, identifying potential reliability issues and implementing solutions to address them within our data-centric environment.
  • Implement Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to measure and monitor the reliability and performance of our data and ML pipelines, ensuring that they meet agreed-upon targets.
  • Collaborate with the performance engineering team to design and implement performance regression test suites tailored to data and ML workloads, ensuring that system performance is continuously monitored and optimized in these critical areas.
  • Take ownership of the reliability and performance of our codebase, providing support to internal and external users as needed, particularly in the context of data processing and ML applications.
  • Collaborate closely with subject matter experts to gain domain-specific insights into data and ML pipelines and document system designs and configurations accordingly.
  • Utilize tools like Jira, Datadog, and GitHub to manage projects, track issues, and collaborate effectively with team members, with a focus on supporting data-intensive workflows.
  • Define success metrics and monitor system performance to ensure that our solutions meet or exceed reliability and performance targets, especially in the context of data processing and ML applications.
  • Proactively identify and address potential reliability issues before they impact system performance, with a particular emphasis on maintaining the integrity and efficiency of our data and ML pipelines.
  • Perform other duties, as assigned

What you’ll bring to us

  • Bachelor's degree in Computer Science, Engineering, or a related.
  • Minimum of eight years of experience in software development and / or infrastructure operations.
  • Strong interpersonal skills and excellent communication abilities, with a focus on proactive relationship-building.
  • Proficiency with cloud services and architectures, particularly AWS.
  • Hands-on experience with relational databases such as SQL Server, PostgreSQL, and MySQL.
  • Understanding of web technologies and frameworks, with experience in Angular being a plus.
  • Experience with performance monitoring and optimization tools like Datadog.
  • Proficiency in version control systems like Git / GitHub.
  • Experience with infrastructure as code tools like Terraform.
  • Knowledge of agile methodologies and best practices in software development and operations.

We Support Who You Are .

As a global company, we strive to create an inclusive environment where diverse perspectives spark innovation and meet the challenges of an evolving world.

Whether you’re launching a new career or expanding your current one, IDeaS is a company where you can balance great work with all other aspects of your life.

At IDeaS, we also aspire to live our values each day by being Accountable, Curious, Passionate and Authentic. And we continue our quest to build a more inclusive environment that attracts, represents and provides a place for diverse ideas, unique perspectives, and authentic voices.

30+ days ago
Related jobs
Promoted
Granicus
Saint Paul, Minnesota

With comprehensive cloud-based solutions for communications, government website design, meeting and agenda management software, records management, and digital services, Granicus empowers stronger relationships between government and residents across the U. Experience with software engineering best ...

Patterson Companies, Inc.
Saint Paul, Minnesota
Remote

Site Reliability Engineer (SRE) is responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning. Plan, design, deploy, and operate Site Reliability Engineering capabilities for cloud products & services. DevOps and Site ...

Promoted
Granicus
Saint Paul, Minnesota

With comprehensive cloud-based solutions for communications, government website design, meeting and agenda management software, records management, and digital services, Granicus empowers stronger relationships between government and residents across the U. ...

SAS
Bloomington, Minnesota

We are seeking a Senior Site Reliability Engineer at IDeaS, a SAS Company. You will play a pivotal role in ensuring the reliability, scalability, and performance of our revenue science software solutions. Your strong communication skills will be instrumental as you proactively build relationships an...

WELLS FARGO BANK
Minneapolis, Minnesota

Site Reliability Engineers leverage their experience as software and systems engineers to ensure applications onboarded to SRE are available, have full stack observability, introduce continuous improvement through code and automation, provide operational insight through analytics, continuously test,...

Tata Consultancy Services
Minneapolis, Minnesota

Bridge between Platform and app engineering/ partners with application SRE. SRE and knowledge of Platform ( AWS/Kubernetes ) . Deep knowledge of platform (AWS/ Kubernetes etc) as platform engineer. ...

Federal Reserve System
Minneapolis, Minnesota

As a Senior Cloud Reliability Engineer in the SRE chapter, you will be accountable for implementing reliability practices using software as means for the cloud foundational product line in the Federal Reserve. The SRE Chapter is part of the Cloud Solutions & Services department and has the overall r...

Thomson Reuters
Eagan, Minnesota

Thomson Reuters is seeking a Senior Site Reliability Engineer to join our Service Management, Technology team. In this opportunity as Senior Site Reliability Engineer, you will:. You're a fit for the role of Senior Site Reliability Engineer if your background includes:. DevOps Engineer, Cloud Engine...

Wipro
Minneapolis, Minnesota

BN USD WE’RE PRESENT IN 66 COUNTRIES OVER 1,400 ACTIVE GLOBAL CLIENTS Role: Site Reliability Engineer (DevOps) Location - USA JOB/ROLE DESCRIPTION Site Reliability Engineer responsibilities include monitoring computer systems and building alerts for various operational issues that computer systems c...

Federal Reserve System
Minneapolis, Minnesota
Remote

As a Senior Engineer of the SRE / Production Operations team for FedNow, you will operate the production environment for the program. The team uses open source and proprietary software to support Engineering, DevOps, and DevSecOps tools, services, and solutions. The SRE / Production Operations team ...