Search jobs > Bloomington, MN > Site reliability engineer

Sr Site Reliability Engineer

SAS
Bloomington, Minnesota, United States
Full-time

Sr Site Reliability Engineer

Job Locations US-MN-Bloomington Requisition ID 20060806 Category IDeaS (a SAS company) Position Type Contractor

Passionate people. Loyal clients. Leading solutions.

With a rich culture of creative collaboration and professional growth, IDeaS’ team members build successful careers with us.

IDeaS is proud to be a global powerhouse of innovation and excellence; challenge and reward. No matter where we’re working, our teams come together to create leading revenue management solutions that accelerate our clients’ growth through revenue optimization.

Now we just need you!

We are seeking a Senior Site Reliability Engineer at IDeaS, a SAS Company. You will play a pivotal role in ensuring the reliability, scalability, and performance of our revenue science software solutions.

With a minimum of eight years of experience, you bring a wealth of knowledge and expertise in software development and infrastructure operations.

You will serve as a go-to expert in ensuring the stability and efficiency of our systems, collaborating closely with cross-functional teams to address complex challenges.

Your strong communication skills will be instrumental as you proactively build relationships and streamline processes to enhance system reliability and performance.

You are persistent in the face of roadblocks, dispatch them efficiently, and pull in others when necessary, taking the initiative to ensure stability of the production environments while creating a backlog to reduce re-occurrences of issues and ensure long-term scalability.

Our systems are data-intensive and require a strong focus on data and machine-learning pipelines.

What you’ll be doing...

  • Collaborate closely with our development and operations teams to design, implement, and maintain highly available, scalable, and resilient software solutions, with a particular focus on data and ML pipelines.
  • Utilize your expertise in cloud computing and microservices architecture to enhance the reliability and performance of our data-intensive systems.
  • Engage with stakeholders to understand system requirements and ensure that our solutions meet rigorous reliability and performance standards, especially in the context of data processing and machine learning.
  • Actively participate in project scoping, scheduling, and task tracking, identifying potential reliability issues and implementing solutions to address them within our data-centric environment.
  • Implement Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to measure and monitor the reliability and performance of our data and ML pipelines, ensuring that they meet agreed-upon targets.
  • Collaborate with the performance engineering team to design and implement performance regression test suites tailored to data and ML workloads, ensuring that system performance is continuously monitored and optimized in these critical areas.
  • Take ownership of the reliability and performance of our codebase, providing support to internal and external users as needed, particularly in the context of data processing and ML applications.
  • Collaborate closely with subject matter experts to gain domain-specific insights into data and ML pipelines and document system designs and configurations accordingly.
  • Utilize tools like Jira, Datadog, and GitHub to manage projects, track issues, and collaborate effectively with team members, with a focus on supporting data-intensive workflows.
  • Define success metrics and monitor system performance to ensure that our solutions meet or exceed reliability and performance targets, especially in the context of data processing and ML applications.
  • Proactively identify and address potential reliability issues before they impact system performance, with a particular emphasis on maintaining the integrity and efficiency of our data and ML pipelines.
  • Perform other duties, as assigned

What you’ll bring to us

  • Bachelor's degree in Computer Science, Engineering, or a related.
  • Minimum of eight years of experience in software development and / or infrastructure operations.
  • Strong interpersonal skills and excellent communication abilities, with a focus on proactive relationship-building.
  • Proficiency with cloud services and architectures, particularly AWS.
  • Hands-on experience with relational databases such as SQL Server, PostgreSQL, and MySQL.
  • Understanding of web technologies and frameworks, with experience in Angular being a plus.
  • Experience with performance monitoring and optimization tools like Datadog.
  • Proficiency in version control systems like Git / GitHub.
  • Experience with infrastructure as code tools like Terraform.
  • Knowledge of agile methodologies and best practices in software development and operations.

We Support Who You Are .

As a global company, we strive to create an inclusive environment where diverse perspectives spark innovation and meet the challenges of an evolving world.

Whether you’re launching a new career or expanding your current one, IDeaS is a company where you can balance great work with all other aspects of your life.

At IDeaS, we also aspire to live our values each day by being Accountable, Curious, Passionate and Authentic. And we continue our quest to build a more inclusive environment that attracts, represents and provides a place for diverse ideas, unique perspectives, and authentic voices.

30+ days ago
Related jobs
WELLS FARGO BANK
Minneapolis, Minnesota

Site Reliability Engineers leverage their experience as software and systems engineers to ensure applications onboarded to SRE are available, have full stack observability, introduce continuous improvement through code and automation, provide operational insight through analytics, continuously test,...

SAS
Bloomington, Minnesota

We are seeking a Senior Site Reliability Engineer at IDeaS, a SAS Company. You will play a pivotal role in ensuring the reliability, scalability, and performance of our revenue science software solutions. Your strong communication skills will be instrumental as you proactively build relationships an...

Thomson Reuters
Eagan, Minnesota

Thomson Reuters is seeking a Senior Site Reliability Engineer to join our Service Management, Technology team. In this opportunity as Senior Site Reliability Engineer, you will:. You're a fit for the role of Senior Site Reliability Engineer if your background includes:. DevOps Engineer, Cloud Engine...

Wipro
Minneapolis, Minnesota

BN USD WE’RE PRESENT IN 66 COUNTRIES OVER 1,400 ACTIVE GLOBAL CLIENTS Role: Site Reliability Engineer (DevOps) Location - USA JOB/ROLE DESCRIPTION Site Reliability Engineer responsibilities include monitoring computer systems and building alerts for various operational issues that computer systems c...

Inspire Medical Systems
Golden Valley, Minnesota

Senior Software Engineer, Site Reliability – Minneapolis, MN. Senior Software Engineer, Site Reliability. As an integral part of our DevOps team, you will work closely with our engineers and scientists to debug applications and develop solutions for our next generation Inspire products. Bachelor’s D...

Dayforce Corporation
Lakeville, Minnesota

DRE will be required to work with senior leadership within Dayforce and a combination of machine learning engineers, data engineers, and site reliability engineers. As part of the Data Reliability Engineering (DRE) team, you will be responsible for ensuring that all Dayforce data pipelines, storage,...

Novon Consulting
Minneapolis, Minnesota

We are seeking a Senior Site Reliability Engineer that will be at the forefront of establishing and driving best practices in system reliability, performance optimization, and observability. Five years of experience in a site-reliability-focused role responsible for establishing reliability standard...

Smartthings
Minneapolis, Minnesota

Facilitate a community of practice for operations and site reliability concepts to extend the capabilities of service teams through a culture of trust and team empowerment Mentor engineers on Site Reliability Engineering principles, practices, and toolsDevelop Platform Reliability Operational Health...

Federal Reserve System
Minneapolis, Minnesota
Remote

As a Senior Engineer of the SRE / Production Operations team for FedNow, you will operate the production environment for the program. The team uses open source and proprietary software to support Engineering, DevOps, and DevSecOps tools, services, and solutions. The SRE / Production Operations team ...

Novon Consulting
Minneapolis, Minnesota

We are seeking a Senior Site Reliability Engineer that will be at the forefront of establishing and driving best practices in system reliability, performance optimization, and observability. Five years of experience in a site-reliability-focused role responsible for establishing reliability standard...