Talent.com
Site Reliability Engineer (Ex - Fidelity Exp)

Site Reliability Engineer (Ex - Fidelity Exp)

Axiom Software Solutions LimitedUS
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
  • serp_jobs.filters.remote
  • serp_jobs.filters_job_card.quick_apply
job_description.job_card.job_description

Role : Site Reliability Engineer (Ex - Fidelity Exp)

Location : Remote

Position Type : Contract

Key Responsibilities

  • Design, implement, and manage Kubernetes environments from deployment to configuration, monitoring, and troubleshooting
  • Build and maintain scalable and reliable infrastructure using infrastructure as code principles
  • Develop comprehensive monitoring solutions and implement alerting strategies
  • Analyze system performance bottlenecks and implement improvements
  • Implement and maintain CI / CD pipelines for seamless deployments
  • Conduct incident response, root cause analysis, and implement preventative measures
  • Create and enhance automation tools leveraging AI / ML where applicable
  • Collaborate with development teams to improve application reliability and performance

Required Qualifications

  • 5-7 years of experience in SRE or DevOps roles
  • Strong expertise with Kubernetes ecosystem and container orchestration
  • Deep understanding of Linux / Unix operating systems and performance analysis tools (NMON, etc.)
  • Experience with log analysis, monitoring systems, and observability tools
  • Proficiency in database administration and performance tuning (Oracle, SQL Server)
  • Strong programming skills in at least one of : Python, Go, Java, or Node.js
  • Experience developing automation tools and frameworks
  • Proven track record of proactive problem identification and resolution
  • Preferred Qualifications

  • Experience with AI / ML integration into operational workflows
  • Cloud platform experience (AWS, GCP, Azure)
  • Knowledge of service mesh technologies
  • Experience with distributed systems architecture
  • Familiarity with security best practices and compliance requirements
  • Personal Qualities

  • Proactive mindset with strong analytical and problem-solving abilities
  • Collaborative approach to working across development and operations teams
  • Excellent communication skills and ability to explain complex technical concepts
  • Self-motivated with the ability to work independently and as part of a team
  • Passion for continuous improvement and learning
  • serp_jobs.job_alerts.create_a_job

    Site Reliability Engineer • US