Azure Site Reliability Engineer II / Sandy Springs, GA / Hybrid

Motion Recruitment
Sandy Springs, Georgia, United States
Full-time

Exciting opportunity in Sandy Springs, GA! This company sells software for an e-commerce website focused on the retail industry.

They are seeking an experienced Azure Site Reliability Engineer to join their team. This is a full-time, hybrid position.

In this role, you'll work with cutting-edge technologies such as Azure Services and Datadog!

Our client is looking for hard-working individuals who work well on a team. Here, you will have the chance to grow your skills, work on meaningful projects, and enjoy a supportive work-life balance.

If that sounds like you, then this is the place for you!

Required Skills & Experience

  • Proficiency with Azure services
  • Strong experience with Datadog
  • 5+ years of experience in site reliability engineering
  • Proficient in scripting languages like Python, PowerShell, or Bash
  • Strong skills in diagnosing, troubleshooting, and optimizing system performance issues across large-scale environments.

Desired Skills & Experience

  • Knowledge of Datadog integrations for Azure services, Kubernetes, and CI/CD pipeline monitoring.
  • Familiarity with managing and optimizing databases such as Azure SQL, Cosmos DB, or MySQL.
  • Knowledge of SRE principles such as error budgets, automation, and incident postmortems.
  • Familiarity with IaC (Terraform and Ansible)
  • Understanding of compliance standards (ISO, SOC 2, GDPR) and security practices specific to cloud environments.
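
For readers less familiar with the term, an error budget is the amount of unreliability an availability target permits over a given window. The sketch below is only an illustration: the 99.9% target, the 30-day window, and both function names are assumptions chosen for the example, not figures or tooling from this role.

```python
def error_budget_minutes(slo_target: float, window_days: int = 30) -> float:
    """Return the downtime (in minutes) an availability SLO allows over the window.

    slo_target is a fraction, e.g. 0.999 for "three nines" (assumed example value).
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    total_minutes = window_days * 24 * 60
    return (1.0 - slo_target) * total_minutes


def budget_remaining(slo_target: float, window_days: int, downtime_minutes: float) -> float:
    """Return the fraction of the error budget still unspent (negative if overspent)."""
    budget = error_budget_minutes(slo_target, window_days)
    return (budget - downtime_minutes) / budget


if __name__ == "__main__":
    # Hypothetical numbers: a 99.9% SLO over 30 days with 10.8 minutes of downtime so far.
    print(f"budget: {error_budget_minutes(0.999, 30):.1f} min")
    print(f"remaining: {budget_remaining(0.999, 30, 10.8):.0%}")
```

A team that has spent most of its budget would typically slow down risky releases until the window resets, which is why the concept sits alongside automation and incident postmortems.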

What You Will Be Doing

Tech Breakdown:

  • Core services: Azure Kubernetes Service (AKS), Azure Functions, Azure App Services.
  • Set up Datadog to monitor Azure resources, including Virtual Machines, AKS clusters, and storage accounts.
  • Use Datadog’s dashboards and anomaly detection features to proactively detect and resolve system issues before they impact users.
  • Monitor deployments through Datadog to detect any application errors or performance issues introduced during updates.
  • Develop and optimize CI/CD pipelines for efficient, reliable application deployment.

Daily Responsibilities

  • Automate resource provisioning, updates, and deployment with Infrastructure as Code (IaC) tools such as Terraform, Ansible, or ARM templates.
  • Continuously monitor Azure infrastructure and applications using Datadog for performance, uptime, and resource utilization.
  • Develop, maintain, and improve CI/CD pipelines to automate Docker image builds and Kubernetes deployments.
  • Respond to system alerts, production issues, and incidents. Work to resolve outages quickly and perform root cause analysis to prevent future incidents.

The Offer

Bonus or commission eligible

You will receive the following benefits:

  • Medical, Dental, and Vision Insurance
  • Vacation Time
  • 401(k) with a company match, commuter benefits, paid holidays, PTO, quarterly bonuses, and more
  • Health Insurance

Applicants must be currently authorized to work in the US on a full-time basis now and in the future.

Posted 15 hours ago