Talent.com
Site Reliability Engineer (SRE)
Site Reliability Engineer (SRE)ShiftPixy Resources Inc • District of Columbia, WA, United States
Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

ShiftPixy Resources Inc • District of Columbia, WA, United States
job_description.job_card.variable_hours_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
  • serp_jobs.filters_job_card.quick_apply
job_description.job_card.job_description

Responsibilities Deployment & Automation

  • Implement and maintain CI / CD pipelines using tools such as GitHub Actions, AWS CodePipeline, and Jenkins.
  • Automate infrastructure provisioning and management using Infrastructure-as-Code (IaC) with Terraform, CloudFormation, or AWS CDK.
  • Develop robust automation scripts and self-service tooling to minimize toil and enhance operational efficiency.

Capacity, Performance & Cost Optimization

  • Lead and implement operational cost optimization initiatives across cloud infrastructure and data platforms.
  • Configure, maintain, and tune auto-scaling policies and performance thresholds.
  • Develop and execute Resiliency Test plans and provide critical support for Performance testing efforts.
  • Incident Management & SRE Principles

  • Serve as a production on-call responder, employing strong troubleshooting skills to quickly resolve complex incidents.
  • Proficiently utilize ITIL framework concepts and ITSM tools (e.g., ServiceNow) for incident and change management.
  • Develop high-quality Root Cause Analysis (RCA) documentation and Knowledge articles to prevent future recurrence.
  • Implement and enforce SRE principles, including the definition and tracking of Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Error Budgets.
  • Observability & Monitoring

  • Manage and leverage advanced observability platforms (Dynatrace preferred, AppDynamics, ELK, etc.).
  • Implement distributed tracing with accurate context propagation across data services and applications.
  • Optimize monitoring queries, and configure actionable dashboards, alerts, and anomaly detectors using tools like Dynatrace and Kibana.
  • Data Analytics Platform Reliability

  • Ensure the reliability, performance tuning, and access control for Databricks cluster management and data pipelines.
  • Maintain Informatica workflow orchestration, connector reliability, and error handling for critical data flows.
  • Manage Power BI gateway health, access control, and ensure reliable data refresh processes.
  • Security & Compliance

  • Manage service accounts, access permissions, and roles following the principle of least privilege.
  • Create, deploy, and manage digital certificates and TLS / SSL configurations.
  • Execute effective remediation tasks and respond to security incidents as part of the operational team.
  • Qualifications Education & Experience

  • Bachelor's degree in Computer Science, Engineering, or a related technical field.
  • 2 to 4 years of hands-on experience in a DevOps, Site Reliability Engineering (SRE), or Cloud Infrastructure role.
  • Practical, working experience with major cloud platforms, specifically AWS and Azure.
  • Technical Skills

  • Mid-level proficiency in Python or other scripting languages (e.g., Bash, Go) for automation tasks.
  • Mid-level proficiency with Configuration Management tools, including Ansible.
  • Strong knowledge of containerization technologies (Docker, Kubernetes / ECS).
  • Solid understanding of Linux systems and networking fundamentals (TCP / IP, DNS, Load Balancing).
  • Working knowledge of relational, cloud-native (e.g., AWS RDS), and NoSQL database technologies.
  • Direct hands-on experience supporting and maintaining data platforms like Databricks, Informatica, or Power BI is highly desirable.
  • Professional Attributes

  • Excellent written and verbal communication skills, with a proven ability to document complex systems.
  • Demonstrated ability to work independently, manage shifting priorities, and drive initiatives to completion.
  • Availability for on-call duties and to work outside of standard business hours as required to support a 24 / 7 production environment.
  • serp_jobs.job_alerts.create_a_job

    Site Reliability Engineer • District of Columbia, WA, United States

    Job_description.internal_linking.related_jobs
    Staff Site Reliability Engineer

    Staff Site Reliability Engineer

    Visa • Ashburn, VA, United States
    serp_jobs.job_card.full_time
    Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Cloud Site Reliability Engineer

    Cloud Site Reliability Engineer

    VirtualVocations • Alexandria, Virginia, United States
    serp_jobs.job_card.full_time
    A company is looking for a Cloud Site Reliability Engineer (AWS).Key Responsibilities Design, deploy, and maintain AWS cloud infrastructure for high availability and fault tolerance Administer M...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Site Reliability Engineer - Redmond WA

    Site Reliability Engineer - Redmond WA

    Redis Enterprise • Washington, DC, United States
    serp_jobs.job_card.full_time
    We built the product that runs the fast apps our world runs on.If you checked the weather, used your credit card, or looked at your flight status online today, you’re welcome.At Redis, you’ll work ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    SE L2 - Requirements Engineer

    SE L2 - Requirements Engineer

    Technology Resource Experts LLC • Fort Meade, MD, US
    serp_jobs.job_card.full_time
    Technology Resource Experts, LLC is looking for an experienced Systems Engineer to join their rapidly growing team.Analyzes user requirements, concept of operations documents, and high level system...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Principal Site Reliability Engineer

    Principal Site Reliability Engineer

    VirtualVocations • Rockville, Maryland, United States
    serp_jobs.job_card.full_time
    A company is looking for a Principal Site Reliability Engineer.Key Responsibilities Lead the technical direction of the team while contributing to the design and implementation of self-service to...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Site Reliability Engineer- Talent Day

    Site Reliability Engineer- Talent Day

    Visa • Ashburn, VA, United States
    serp_jobs.job_card.full_time
    Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Site Facilities Engineer

    Site Facilities Engineer

    KBR, Inc. • Springfield, VA, US
    serp_jobs.job_card.full_time
    KBR is in search of a skilled Site Facilities Engineer to lead overall operations, maintenance, and performance of our government customer's sites. You will manage a team of around 60 specialists, f...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Sr. Manager - Site Reliability Engineer

    Sr. Manager - Site Reliability Engineer

    Visa • Ashburn, VA, United States
    serp_jobs.job_card.full_time
    Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    VirtualVocations • Alexandria, Virginia, United States
    serp_jobs.job_card.full_time
    A company is looking for a Senior Site Reliability Engineer to help scale its platform and ensure system reliability.Key Responsibilities Act as a first responder for system incidents and outages...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    DevOps Site Reliability Engineer

    DevOps Site Reliability Engineer

    VirtualVocations • Baltimore, Maryland, United States
    serp_jobs.job_card.full_time
    A company is looking for a DevOps / Site Reliability Engineer (Remote).Key Responsibilities Configure, manage, and improve CI / CD pipelines for application deployments Monitor application perform...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_1_day • serp_jobs.job_card.promoted
    Site Implementation Engineer

    Site Implementation Engineer

    Leidos Inc • Reston, VA, United States
    serp_jobs.job_card.full_time
    Leidos Digital Modernization Sector is looking for a Site Implementation Engineer to work on the Army Global Unified Network (AGUN) - Increment 1 (INC1) program. The Global Enterprise Network Modern...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    costar • Arlington, VA, US
    serp_jobs.job_card.full_time
    Senior Site Reliability Engineer.CoStar Group (NASDAQ : CSGP) is a leading global provider of commercial and residential real estate information, analytics, and online marketplaces.Included in the S...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CSCI Consulting • Quantico, VA, United States
    serp_jobs.job_card.full_time
    CSCI Consulting is looking for a.Site Reliability Engineer (SRE).This role combines deep systems engineering knowledge with DevOps automation, proactive monitoring, and incident response practices....serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Site Implementation Engineer, Senior

    Site Implementation Engineer, Senior

    Leidos Inc • Reston, VA, United States
    serp_jobs.job_card.full_time
    Leidos Digital Modernization Sector is looking for a Site Implementation Engineer to work on the Army Global Unified Network (AGUN) - Increment 1 (INC1) program. The Global Enterprise Network Modern...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Site Reliability Engineer

    Site Reliability Engineer

    VirtualVocations • Washington, District of Columbia, United States
    serp_jobs.job_card.full_time
    A company is looking for a Site Reliability Engineer (SRE).Key Responsibilities Design, build, and maintain scalable and reliable infrastructure using cloud platforms and automation tools Implem...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    DevSecOps Site Reliability Engineer - Clearance Preferred

    DevSecOps Site Reliability Engineer - Clearance Preferred

    LMI Consulting, LLC • Tysons, VA, United States
    serp_jobs.job_card.full_time
    DevSecOps Site Reliability Engineer - Clearance Preferred.Salaried High Fringe / Full-Time.DevSecOps Site Reliability Engineer. United States Army delivers software.The DevSecOps Site Reliability Engi...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Site Reliability Engineer III

    Site Reliability Engineer III

    Verisign • Reston, VA, United States
    serp_jobs.job_card.full_time
    Verisign helps enable the security, stability, and resiliency of the internet.We are a trusted provider of internet infrastructure services for the networked world and deliver unmatched performance...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Site Reliability Engineer - Remote

    Senior Site Reliability Engineer - Remote

    Donnelley Financial, LLC • Rockville, MD, United States
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    Join a dynamic team at the pulse of global markets, where we deliver innovative software and service solutions for essential financial reporting and capital markets transactions.At DFIN, we are a v...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted