Search jobs > Detroit, MI > Temporary > Site reliability engineer

Cloud Site Reliability Engineer

Strategic Staffing Solutions
Detroit, MI, US
Full-time

STRATEGIC STAFFING SOLUTIONS (S3) HAS AN OPENING!

Hit Apply below to send your application for consideration Ensure that your CV is up to date, and that you have read the job specs first.

Job Title : Cloud Site Reliability Engineer

Location : Detroit, MI Hybrid-3 days / week on site in Detroit, 2 days remote

Duration : 2 year+ contract

Role Type : W2 only, no corp to corp

Highly competitive rate, with benefits available

Job Summary

The Cloud Site Reliability Engineer (SRE) works closely with cloud development team, IT operations team and business partners to streamline and implement enhanced monitoring and alerting capability across infrastructure, application layers.

By leveraging automation tools, SREs address and resolve issues, minimizing manual workload and enhancing system scalability and reliability.

Their core focus lies in standardization and automation to build and run fault-tolerant systems. Typically, SREs possess a background in software engineering, system engineering, or system administration, coupled with substantial IT operations experience.

SREs oversee availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning.

Key Accountabilities

  • Writing and developing code to automate processes, such as analyzing logs, testing production environments and responding to any issues
  • Collaborates with agile teams and business partners to develop specifications that resolve problems and enhancement needs, including focusing on monitoring, and metrics for operational readiness
  • Identify bottlenecks in development and deployment processes and designs automation solutions to mitigate
  • Develop new capabilities in displaying / monitoring / alerting on key performance indicators by tracking business transactions in real-time
  • Maintain and grow knowledge of platform configuration management, monitoring of established metrics, and troubleshooting
  • Provides continuous feedback to development teams on system stability, defect analysis, and system enhancements
  • Design and develop alert escalation and incident response automation
  • Provide production support for cloud service outages and incidents and work on both tactical and strategic plans for outage prevention
  • Provide feedback on resiliency and maintainability of solutions to Cloud and App architects
  • Conduct disaster recovery scenario generation and testing
  • Implement sustainable, audit-ready processes that support information technology controls, including deployment execution, access management, audits, incident management and related requirements.

Must-have technical skills :

  • Should have at least 3 years’ experience as a site reliability engineer on a cross functional agile team working in Azure.
  • Have working knowledge of agile development methodologies (scrum, sprints, KanBan etc.) and tools (Azure DevOps etc.)
  • Have at least 3 years hands-on experience using IaC tools Terraform, Github, Ansible and Packer
  • Proven experience across testing, integration, source code management, deployment and containerization
  • Sound problem-solving skills with the ability to quickly process complex information and present it clearly and simply
  • Experience with cloud technologies and services including those for Compute, Storage, Databases and API Management
  • On-premise to cloud migration experience

The S3 Difference

The global mission of S3 is to build trusting relationships and deliver solutions that positively impact our customers, our consultants, and our communities.

The four pillars of our company are to :

  • Set the bar high for what a company should do
  • Create jobs
  • Offer people an opportunity to succeed and change their station in life
  • Improve the communities where we live and work through volunteering and charitable giving
  • 1 day ago
Related jobs
Promoted
Strategic Staffing Solutions
Detroit, Michigan

The Cloud Site Reliability Engineer (SRE) works closely with cloud development team, IT operations team and business partners to streamline and implement enhanced monitoring and alerting capability across infrastructure, application layers. Job Title: Cloud Site Reliability Engineer. Should have at ...

Promoted
Dynatrace
Detroit, Michigan

Site Reliability Engineering at Dynatrace focuses on the enablement of our engineering teams to autonomously operate their services. If you have a passion for large-scale deployments in a highly dynamic environment and are interested in growing your skills around Site Reliability Engineering, then j...

Promoted
Canonical - Jobs
Detroit, Michigan

As an Senior SRE & Gitops engineer you'll be in a unique position to drive operations automation to the next level, both in our own private clouds as well as in the public clouds. As a Senior Site Reliability / Gitops Engineer you will. Automate software operations for re-usability and consi...

Promoted
https:/wayup.com/sitemap.xml
Detroit, Michigan

Collaborates with motivated engineers from diverse backgrounds in software engineering, system engineering, and product management. Automate monitoring and alerting to improve efficiency, security, and reliability of cloud infrastructure. Works in a dynamic, secure cloud environment with expertise i...

Promoted
Amtex Enterprises Inc
Detroit, Michigan

The Cloud Site Reliability Engineer (SRE) works closely with the cloud development team, IT operations team, and business partners to streamline and implement enhanced monitoring and alerting capability across infrastructure and application layers. Job Title: Site Reliability Engineer. At least 3 ye...

Tekvivid Inc
Dearborn, Michigan

W2 Requirement!!</p> <p>Looking for Full Stack/Site Reliability Engineer</p> <p>Location: Dearborn, MI - Hybrid</p> <p> </p> <p>Skills Required:</p> <p>· Understanding of gRPC & RESTful APIs, and microservices platform</p...

Dynatrace
Detroit, Michigan

Collaborates with motivated engineers from diverse backgrounds in software engineering, system engineering, and product management. Automate monitoring and alerting to improve efficiency, security, and reliability of cloud infrastructure. Works in a dynamic, secure cloud environment with expertise i...

Stefanini North America and APAC
Southfield, Michigan

Full Stack / Site Reliability Engineer. The primary focus of this role will be on ensuring the stability and scalability of the Internal Developer Platform that hosts the cloud applications that power our customer's connected vehicle experiences. The secondary focus of this role will be to facilitat...

Ford Motor Company
Dearborn, Michigan

We are seeking a talented Full Stack Software Engineer / Site Reliability Engineer to play a key role in developing Bedrock, a comprehensive Internal Developer Platform (IDP) that includes CI/CD pipelines, managed infrastructure, observability, and a developer portal. The primary focus of this role ...

Stefanini
Dearborn, Michigan

Improve reliability, quality, and time-to-market of our suite of software solutions. Provide primary operational and engineering Support for multiple large, distributed software applications. Identify and reduce or eliminate toil via automation to maximize the time spent on engineering and innovatio...