Search jobs > San Jose, CA > Site reliability engineer

Site Reliability Engineer - Kubernetes

Ajmera Infotech Inc.
San Jose, CA, us
Permanent
Full-time
Quick Apply

Job Description

Job Title : Site Reliability Engineer - Kubernetes

Location : San Jose, California

Experience : 5-10 Years

Job Type : Full-time, Permanent Role

Job Overview : We are seeking a seasoned Senior Azure DevOps Engineer with extensive experience in Kubernetes to lead our cloud infrastructure initiatives.

As a senior member of our DevOps team, you will be instrumental in designing, implementing, and optimizing our Azure-based solutions while leveraging Kubernetes for container orchestration.

The ideal candidate will possess a strong background in Azure cloud services, CI / CD pipelines, infrastructure as code, and Kubernetes administration, with a proven track record of delivering scalable and reliable cloud solutions.

Responsibilities :

  • Lead the design and implementation of CI / CD pipelines on Azure DevOps, ensuring automated build, test, and deployment processes.
  • Architect, deploy, and manage highly available Kubernetes clusters on Azure Kubernetes Service (AKS) or other Kubernetes platforms.
  • Develop and maintain infrastructure as code (IaC) using tools like Terraform or ARM templates for Azure resource provisioning.
  • Collaborate closely with development teams to containerize applications and optimize their performance for Kubernetes orchestration.
  • Implement comprehensive monitoring, logging, and alerting solutions for Kubernetes clusters and Azure infrastructure using tools such as Prometheus, Grafana, Azure Monitor, and Log Analytics.
  • Ensure the security and compliance of Kubernetes clusters and Azure services through proper configuration and adherence to best practices and policies.
  • Mentor junior team members, provide technical guidance, and promote knowledge sharing within the team.
  • Stay abreast of emerging technologies and industry trends in cloud computing, DevOps practices, containers, and Kubernetes.

Requirements

  • Bachelor’s degree in Computer Science, Engineering, or a related field.
  • Minimum of 5 years of experience in a DevOps role with a strong focus on Azure cloud services and Kubernetes.
  • Deep expertise in Azure services such as Azure App Service, Azure Functions, Azure Container Registry, Azure Blob Storage, Azure SQL Database, etc.
  • Proficient in containerization technologies including Docker and Kubernetes, with hands-on experience in cluster deployment and management.
  • Strong scripting skills in PowerShell, Bash, or Python for automation and infrastructure management tasks.
  • Demonstrated experience with infrastructure as code tools such as Terraform, ARM templates, or Ansible.
  • Thorough understanding of CI / CD concepts and experience with CI / CD tools, preferably Azure DevOps.
  • Familiarity with monitoring, logging, and alerting solutions like Prometheus, Grafana, Azure Monitor, and Log Analytics.
  • Excellent problem-solving abilities and a proactive attitude towards troubleshooting and issue resolution.
  • Effective communication skills and the ability to collaborate with cross-functional teams.
  • Azure certifications (e.g., AZ-104, AZ-400) and Kubernetes certifications (e.g., CKAD, CKA) are highly desirable.

Benefits

Benefits :

  • Competitive compensation package including salary and performance-based bonuses.
  • Comprehensive benefits package including health, dental, and vision insurance.
  • Retirement savings plan with employer match.
  • Opportunities for professional development and training.
  • Flexible work hours and remote work options.
  • Collaborative and inclusive work culture.
  • Company-sponsored social events and team-building activities.
  • Cutting-edge technology environment with access to the latest tools and resources.

You should know :

  • Potential new employees must successfully complete a drug screen and background check which includes criminal search, education certification, and employment verification before hire.
  • Applicants must be authorized to work for any employer in the U.S. We are unable to sponsor or take over sponsorship of an employment Visa currently.
  • The position will be posted until a final candidate is selected for the requisition or the requisition has enough applications.
  • Legal Compliance : Ensure compliance with all federal, state, and local regulations, including but not limited to E-Verify requirements for employment eligibility verification.

Requirements

Minimum of 5 years of experience in a DevOps role with a strong focus on Azure cloud services and Kubernetes. Deep expertise in Azure services such as Azure App Service, Azure Functions, Azure Container Registry, Azure Blob Storage, Azure SQL Database, etc.

Proficient in containerization technologies including Docker and Kubernetes, with hands-on experience in cluster deployment and management.

Strong scripting skills in PowerShell, Bash, or Python for automation and infrastructure management tasks. Demonstrated experience with infrastructure as code tools such as Terraform, ARM templates, or Ansible.

Thorough understanding of CI / CD concepts and experience with CI / CD tools, preferably Azure DevOps. Familiarity with monitoring, logging, and alerting solutions like Prometheus, Grafana, Azure Monitor, and Log Analytics.

Excellent problem-solving abilities and a proactive attitude towards troubleshooting and issue resolution. Effective communication skills and the ability to collaborate with cross-functional teams.

Azure certifications (e.g., AZ-104, AZ-400) and Kubernetes certifications (e.g., CKAD, CKA) are highly desirable.

7 days ago
Related jobs
Promoted
VirtualVocations
Santa Clara, California

...

Ajmera Infotech Inc.
San Jose, California

Site Reliability Engineer - Kubernetes. Architect, deploy, and manage highly available Kubernetes clusters on Azure Kubernetes Service (AKS) or other Kubernetes platforms. We are seeking a seasoned Senior Azure DevOps Engineer with extensive experience in Kubernetes to lead our cloud infrastructure ...

Apple
Cupertino, California

We are looking for seasoned software and systems engineers to join the Block Storage SRE team at Apple. This engineer’s work will affect hundreds of millions of users and be essential to the success of some of the most visible current and future Apple features. We think critically and strive to bala...

Atlassian
Mountain View, California

As a Site Reliability Engineer (SRE) you will actively work to improve the performance and reliability of services as well as address root causes of incidents and reduce incident rates. Love staying ahead of the growth curve and experimenting with new software and environments? Get on board as an At...

Akraya
Sunnyvale, California

We are seeking a motivated and skilled Site Reliability Engineer (SRE) for a 5-month hybrid contract with a possibility for conversion. This role requires a proactive approach to problem-solving, with a focus on balancing speed of development and reliability. Ensure service reliability while meeting...

ByteDance
San Jose, California

Our data infrastructure Site Reliability Engineering (SRE) team is a pioneer in innovation. Develop and manage components of cloud-managed data infrastructure, encompassing technologies such as Kubernetes, Redis, MySQL, Flink, and more. Establish sustainable mechanisms for scaling systems, such as a...

TikTok
Mountain View, California

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures. As a site reliability engineer in the Ads data platform area, you will have the opportunity to manage the services and infrastructures in one...

NVIDIA
Santa Clara, California

Join our team at NVIDIA as a Senior Site reliability engineer focused on HPC storage and play a crucial role in designing, implementing, and optimizing on-prem High-Performance Computing (HPC) storage solutions while harnessing the power of cloud computing. You will collaborate closely with engineer...

SIS-Systems Integration Solutions, Inc.
Sunnyvale, California

Role:Site Reliability EngineerTerms:12mos+Loc:Sunnyvale,CASkill Sets Kafka - At least 1 year Is RequiredAWS -At least 1 year Is Required MongoDB - At least 1 year Is Required Core Java - 5-10 years Is Required Elastic Search - At least 1 year Is Required Skills:-Skilled at writing clean, high-perfor...

ByteDance
San Jose, California

TEAM INTRODUCTION Our data infrastructure Site Reliability Engineering (SRE) team is a pioneer in innovation. Develop and manage components of cloud-managed data infrastructure, encompassing technologies such as Kubernetes, Redis, MySQL, Flink, and more. Establish sustainable mechanisms for scaling ...