Search jobs > San Jose, CA > Site reliability engineer

Site Reliability Engineer (SRE) - Intermediate

Equifax, Inc.
San Jose, California, US
Full-time

Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems.

SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles.

Do not pass up this chance, apply quickly if your experience and skills match what is in the following description.

SREs in our team take an engineering approach to building and running our Equifax Security production systems we engineer solutions to operational problems.

Our SREs are responsible for overall system operation and we use a breadth of tools and approaches to solve a broad set of problems.

What you’ll do

  • Engage in and improve the software development lifecycle from inception and design, through development, deployment, operation and refinement.
  • Influence and design infrastructure, architecture, standards and methods for large-scale systems.
  • Support services prior to production via infrastructure design, software platform development, load testing, capacity planning and launch reviews.
  • Maintain services during deployment and in production by measuring and monitoring key performance and service level indicators including availability, latency, and overall system health.
  • Automate system scalability and continually work to improve system resiliency, performance and efficiency.
  • Remediate tasks within the corrective action plan via sustainable, preventative, and automated measures whenever possible.
  • Practice sustainable incident response as part of an on-call rotation and through blameless postmortems
  • Responsible for vulnerability and penetration testing remediation.
  • On call rotational support (1 week a month)

What experience you need

  • BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent job experience required.
  • 1+ years of experience developing and / or administering software in public cloud
  • 1+ years experience in monitoring infrastructure and application uptime and availability to ensure functional and performance objectives.
  • 1+ years experience in languages such as Python, Bash, Java, Go JavaScript and / or node.js
  • 1+ years experience with cross-functional knowledge with systems, storage, networking, security and databases
  • 1+ years experience with system administration, including automation and orchestration of Linux / Windows using Terraform, Chef, Ansible and / or containers (Docker, Kubernetes, etc.)
  • 1+ years experience with CI / CD tooling and practices

What could set you apart

  • Experience implementing CI / CD Pipelines with automation and orchestration of builds / deployments
  • Experience in Jenkins Pipelines & Kubernetes Deployments
  • Experience with Cloud Security Tools such as Twistlock, Qualys, Fortify, SentinelOne
  • Experience with system administration, including automation and orchestration of Linux / Windows using Chef, Puppet, Ansible, Salt Stack and / or containers (Docker, Kubernetes, etc.)

J-18808-Ljbffr

4 days ago
Related jobs
Promoted
Equifax, Inc.
San Jose, California

Site Reliability Engineering (SRE). SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles. SREs in our team take an engineering approach to building and running our Equifax Security production systems ...

Promoted
DICE
San Jose, California

We seek a highly skilled and dynamic Site Reliability Engineer. Onsite - 2 days a week / 3 days Remote. Maintain and improve the reliability, performance, and availability of software systems. Act as a bridge between traditional IT operations and software development, bringing a software engineering...

Promoted
Zscaler
San Jose, California

Position: Staff Site Reliability Engineer. Resolve escalations and help prevent reiteration of incidents with process, monitoring and reliability improvements. Relevant experience preferably in an Operations or Engineering environment. ...

E-Solutions
California, United States

Site Reliability Engineer (SRE). We are seeking a skilled Site Reliability Engineer (SRE) to join our dynamic team. You will be responsible for ensuring the availability and reliability of our SaaS products, which host customer data and require 24x7 uptime. Ensure the reliability, availability, and ...

CENTRL
Mountain View, California

Balance feature development speed and reliability with well-defined service level objectives. Previous success in technical engineering. ...

TikTok
Mountain View, California

Team Insight:CDN Site Reliability Engineering combines software and network engineering with system operations to build and run large-scale, massively distributed infrastructure. CDN performance and traffic engineering, network solution architecting or network-focused site reliability engineering ro...

Ajmera Infotech Inc.
San Jose, California

Site Reliability Engineer - Kubernetes. We are seeking a seasoned Senior Azure DevOps Engineer with extensive experience in Kubernetes to lead our cloud infrastructure initiatives. Bachelor’s degree in Computer Science, Engineering, or a related field. ...

TikTok
Mountain View, California

The USDS Video Platform team is seeking an experienced Site Reliability Engineer to help us continue improving TikTok's video system. The teams within USDS that deliver on this commitment daily span across Trust & Safety, Security & Privacy, Engineering, User & Product Ops, Corporate Functions and m...

Splunk Inc
California, United States

Learn more aboutSplunkcareers and how you can become a part of our journey!Role:Splunk is looking for a TechOps Engineer with the ability to provide day-to-day technical expertise for our Splunk Cloud Azure TechOps team and the Splunk organization. As a TechOps Engineer, you will be interfacing with...

TikTok
Mountain View, California

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures. As a site reliability engineer in the Ads data platform area, you will have the opportunity to manage the services and infrastructures in one...