Search jobs > Detroit, MI > Site reliability engineer

Site Reliability Engineer – Hybrid

Amtex Enterprises Inc
Detroit, Michigan, US
$85 an hour
Full-time

Job Title : Site Reliability Engineer

Ready to make your application Please do read through the description at least once before clicking on Apply.

Location : Detroit, MI

Hybrid-3 days / week on site

Rate : $85 / hr

Visa : USC, GC

Job Summary :

The Cloud Site Reliability Engineer (SRE) works closely with the cloud development team, IT operations team, and business partners to streamline and implement enhanced monitoring and alerting capability across infrastructure and application layers.

By leveraging automation tools, SREs address and resolve issues, minimizing manual workload and enhancing system scalability and reliability.

Their core focus lies in standardization and automation to build and run fault-tolerant systems. Typically, SREs possess a background in software engineering, system engineering, or system administration, coupled with substantial IT operations experience.

SREs oversee availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning.

Key Accountabilities :

  • Writing and developing code to automate processes, such as analyzing logs, testing production environments, and responding to any issues.
  • Collaborating with agile teams and business partners to develop specifications that resolve problems and enhancement needs, including focusing on monitoring and metrics for operational readiness.
  • Identifying bottlenecks in development and deployment processes and designing automation solutions to mitigate.
  • Developing new capabilities in displaying, monitoring, and alerting on key performance indicators by tracking business transactions in real-time.
  • Maintaining and growing knowledge of platform configuration management, monitoring of established metrics, and troubleshooting.
  • Providing continuous feedback to development teams on system stability, defect analysis, and system enhancements.
  • Designing and developing alert escalation and incident response automation.
  • Providing production support for cloud service outages and incidents and working on both tactical and strategic plans for outage prevention.
  • Providing feedback on resiliency and maintainability of solutions to Cloud and App architects.
  • Conducting disaster recovery scenario generation and testing.
  • Implementing sustainable, audit-ready processes that support information technology controls, including deployment execution, access management, audits, incident management, and related requirements.

Must-have technical skills :

  • At least 3 years’ experience as a site reliability engineer on a cross-functional agile team working in Azure.
  • Working knowledge of agile development methodologies (scrum, sprints, KanBan, etc.) and tools (Azure DevOps, etc.).
  • At least 3 years hands-on experience using IaC tools Terraform, Github, Ansible, and Packer.
  • Proven experience across testing, integration, source code management, deployment, and containerization.
  • Sound problem-solving skills with the ability to quickly process complex information and present it clearly and simply.
  • Experience with cloud technologies and services including those for Compute, Storage, Databases, and API Management.
  • On-premises to cloud migration experience.

Interested candidates email your resume to [email protected] & [email protected] .

J-18808-Ljbffr

Remote working / work at home options are available for this role.

21 hours ago
Related jobs
Promoted
Amtex Enterprises Inc
Detroit, Michigan

Job Title: Site Reliability Engineer. The Cloud Site Reliability Engineer (SRE) works closely with the cloud development team, IT operations team, and business partners to streamline and implement enhanced monitoring and alerting capability across infrastructure and application layers. At least 3 ye...

Promoted
Dynatrace
Detroit, Michigan

Site Reliability Engineering at Dynatrace focuses on the enablement of our engineering teams to autonomously operate their services. If you have a passion for large-scale deployments in a highly dynamic environment and are interested in growing your skills around Site Reliability Engineering, then j...

Promoted
Canonical - Jobs
Detroit, Michigan

As a Site Reliability / Gitops Engineer engineer you will. As an SRE & Gitops engineer you'll be in a unique position to drive operations automation to the next level, both in our own private clouds as well as in the public clouds. Provide assistance and work with globally distributed engine...

Promoted
Electric Reliability Council of Texas
Taylor, Michigan

JOB SUMMARYJob Summary GMS Application Engineer-Markets: Provides support for Market Management Systems (MMS) applications portfolio such as Security Constrained Economic Dispatch (SCED), Day-Ahead Market (DAM), Reliability Unit Commitment (RUC), Congestion Revenue Rights (CRR), QSE Training Simulat...

Stefanini
Dearborn, Michigan

Improve reliability, quality, and time-to-market of our suite of software solutions. Provide primary operational and engineering Support for multiple large, distributed software applications. Identify and reduce or eliminate toil via automation to maximize the time spent on engineering and innovatio...

Dynatrace
Detroit, Michigan

Collaborates with motivated engineers from diverse backgrounds in software engineering, system engineering, and product management. Automate monitoring and alerting to improve efficiency, security, and reliability of cloud infrastructure. We are a one-product software company with a flat hierarchy, ...

Ford Motor Company
Dearborn, Michigan

We are seeking a talented Full Stack Software Engineer / Site Reliability Engineer to play a key role in developing Bedrock, a comprehensive Internal Developer Platform (IDP) that includes CI/CD pipelines, managed infrastructure, observability, and a developer portal. Years experience in software en...

Strategic Staffing Solutions
Detroit, Michigan

STRATEGIC STAFFING SOLUTIONS (S3) HAS AN OPENING!Job Title: Cloud Site Reliability EngineerLocation: Detroit, MI Hybrid-3 days/week on site in Detroit, 2 days remoteDuration: 2 year+ contractRole Type: W2 only, no corp to corpHighly competitive rate, with benefits availableJob SummaryThe Cloud Site ...

Federal Reserve System
Detroit, Michigan
Remote

As a Senior Engineer of the SRE / Production Operations team for FedNow, you will operate the production environment for the program. The team uses open source and proprietary software to support Engineering, DevOps, and DevSecOps tools, services, and solutions. You will work closely with Engineers ...

Dynatrace
Detroit, Michigan

Collaborates with motivated engineers from diverse backgrounds in software engineering, system engineering, and product management. Automate monitoring and alerting to improve efficiency, security, and reliability of cloud infrastructure. If your disability makes it difficult for you to use this sit...