Search jobs > Seattle, WA > Site reliability engineer

Site Reliability Engineer

Prodigy Resources
Seattle, WA, United States
Full-time

About Us :

Prodigy is seeking an SRE to join our client's organization which is leading the charge in fintech innovation, providing state-of-the-art solutions that drive financial success and empower our clients.

As they embark on an exciting Greenfield project, they're seeking an experienced Site Reliability Engineer to join their team.

This role is critical to ensuring the reliability, scalability, and performance of their systems as we build and deploy new technologies.

Role Overview :

We are looking for a talented Site Reliability Engineer with a strong background in Python, Django, Flask, and AWS. In this role, you will be pivotal in maintaining the uptime, performance, and reliability of our systems.

You'll work closely with development teams to integrate reliability engineering practices into the software development lifecycle, ensuring our services meet high standards for reliability and performance.

Key Responsibilities :

  • System Reliability : Ensure the availability, performance, and scalability of backend services and APIs. Develop and implement reliability engineering practices and tools.
  • Incident Management : Respond to and resolve incidents, perform root cause analysis, and implement preventive measures to avoid future issues.
  • Monitoring & Metrics : Set up and manage monitoring, logging, and alerting systems using tools such as AWS CloudWatch, and ensure comprehensive visibility into system performance.
  • Automation : Automate operational tasks and processes to improve efficiency and reduce manual intervention. Develop and maintain CI / CD pipelines.
  • Capacity Planning : Work on capacity planning and performance tuning to handle increasing loads and ensure system resilience.
  • Collaboration : Collaborate with development teams to design, deploy, and manage infrastructure and applications. Provide guidance on reliability best practices and performance optimizations.
  • Documentation : Create and maintain documentation for systems, processes, and incident response procedures.
  • Continuous Improvement : Stay updated with industry trends and emerging technologies to continuously improve our reliability and performance practices.

Qualifications :

  • Experience : Minimum of 5 years of experience in Site Reliability Engineering or a related field, with a solid background in Python, Django, Flask, and AWS.
  • Technical Skills : Proficiency in Python and experience with Django and Flask frameworks. Hands-on experience with AWS services (EC2, S3, RDS, Lambda, etc.).
  • Reliability Practices : Strong understanding of SRE principles, including SLAs, SLOs, and error budgets. Experience with incident management and disaster recovery.
  • Monitoring Tools : Experience with monitoring and observability tools (e.g., AWS CloudWatch, Prometheus, Grafana).
  • Automation : Proven experience in automating tasks and managing CI / CD pipelines.
  • Problem-Solving : Excellent analytical and troubleshooting skills, with the ability to resolve complex technical issues.
  • Communication : Strong verbal and written communication skills, with the ability to convey technical concepts to both technical and non-technical audiences.
  • Fintech Experience : Experience in the fintech industry or similar regulated environments is highly desirable.

Why Join Us?

  • Innovative Projects : Contribute to a transformative Greenfield project that will shape the future of fintech.
  • Dynamic Environment : Engage in a fast-paced, collaborative environment focused on continuous improvement and innovation.
  • Growth Opportunities : Access to ongoing learning and career development opportunities.
  • Competitive Compensation : Enjoy a competitive salary and comprehensive benefits package.
  • 21 days ago
Related jobs
Promoted
VirtualVocations
Seattle, Washington

A company is looking for an Associate Site Reliability Engineer to support identity risk operations and enhance operational efficiency. ...

Promoted
Microsoft
Redmond, Washington

Senior Site Reliability Engineer. Site Reliability Engineering IC4 - The typical base pay range for this role across the U. Can identify and recommend configurations optimal of cloud technology solutions and modify the code base that defines systems or cloud technologies to improve the reliability a...

Promoted
VirtualVocations
Seattle, Washington

A company is looking for a Site Reliability Engineer in Remote Kentucky. ...

Promoted
The Dignify Solutions, LLC
Bellevue, Washington

Take the next step in your career now, scroll down to read the full role description and make your application.Windows Servers, Digital: Microsoft Azure.Windows Powershell, Digital: DevOps.Windows Server 2012 - 2019 Administration Microsoft Azure.Maintain and update documentation of projects and sta...

Promoted
VirtualVocations
Seattle, Washington

...

Redfin
Seattle, Washington

Ability to engage in complex technical discussions with a variety of audiences, including Software and Systems Engineers, and Senior Management. Bachelor's degree in Computer Science, Computer or Electrical Engineering, or equivalent work experience. ...

Tata Consultancy Services
Bellevue, Washington

Digital : BigData and Hadoop Ecosystems; Digital : Kafka; Digital : HBase.Supporting HDInsight product team in SRE support.Helping customer in providing resolutions to end customer by resolving the ICM.Responsibilities would include –.Run prototype to migrate Big-data workloads .Performance tuning f...

Circle
Seattle, Washington

As a Senior Site Reliability Engineer at Circle, you will design, build, and maintain Circle's infrastructure estate to meet the growing worldwide customer base on public cloud providers across multiple regions. Staff Site Reliability Engineer (IV). Senior Site Reliability Engineer (III). Senior Sit...

Carman Solutions Group
WA, United States

SRE ( Site Reliability Engineer) </b></p> <p><b>Location - Seattle WA- - needs to come to office 3 days a week. Requires 10-12 years experience in the IT industry </li> <li>Requires 9+ years of software and DevOps development engineering</li> <li>Expe...

Microsoft
Redmond, Washington

We're seeking an Site Reliability Engineer II to join us in this mission to power the biggest AI training workloads imaginable. As a Site Reliability Engineer II in our team, you will get exposed to some of the biggest AI infrastructure in the world and you will help us build the most reliable AI tr...