Senior Site Reliability Engineer

Tek Ninjas
Sunnyvale, CA, US
Temporary

Job Description

Job Description

Job Title : Senior Site Reliability Engineer

Location : Onsite 2x / week in Sunnyvale, CA

Duration : 12 month contract

International Tech

Top Skills : Java

Java

Python

NodeJS

  • Need to be a strong coder
  • DevOps Engineer should work here too

Main Responsibilities :

  • Create automation suite for In-Market BCDR systems
  • Enable SRE Best practices and standardization for Brick & Mortar Systems, In-Market systems
  • Incorporate changes in GCP Projects for auto-scaling and optimal Storage Utilization

Job Description :

This is a coder position and we are looking for development engineers with experience in at least either Java or Python

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems.

SRE ensures that Walmart applications have reliability, uptime appropriate to customer's needs and a fast rate of improvement.

Additionally SRE's will keep an ever-watchful eye on our systems capacity and performance.

Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation.

On the SRE team, you'll have the opportunity to manage the complex challenges of scale which are unique to Walmart while using your expertise in coding, algorithms, complexity analysis and large-scale system design.

SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives.

We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

Job Duties :

With your technical expertise you will manage project priorities, deadlines, and deliverables

You will design, develop, test, deploy, maintain, and enhance software solutions

Write product or system development code

Review code developed by other engineers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency)

Contribute to existing documentation or educational content and adapt content based on product / program updates and user feedback

Triage product or system issues and debug / track / resolve by analyzing the sources of issues and the impact on hardware, network, or service operations and quality

Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies

1 day ago
Related jobs
Promoted
eTek IT Services, Inc.
Mountain View, California

Role: Site Reliability Engineer. Participate in technical operations and rotations in response to performance and reliability issues. ...

Promoted
Palo Alto Networks
Santa Clara, California

As a Senior Staff DevOps Engineer for the CDL/SLS team, you will be part of a team supporting the services running on this infrastructure. This includes automation, architecture, performance, observability, troubleshooting, security, and reliability. Infrastructure, Operations, DevOps, or System Eng...

Apple
Cupertino, California

The Apple Services Engineering (ASE) team is one of the most exciting examples of Apple’s long-held passion for combining art and technology. These engineers build secure, end-to-end solutions. Thanks to Apple’s unique integration of hardware, software, and services, engineers here partner to get be...

Hireio, Inc.
San Jose, California

Site Reliability Engineering(SRE) team. Scale systems sustainably through mechanisms such as automation; evolve systems reliability, efficiency, and velocity by pushing for changes. ...

Qcells
Santa Clara, California

Master’s or PhD degree in Reliability Engineering, or ASQ Certified Reliability Engineer is a plus. Apply solid knowledge of reliability methods and power electronics systems to design accelerated test plans for design validation, burn-in testing, environmental stress screening (ESS), ongoing reliab...

Palo Alto Networks
Santa Clara, California

As a Senior Staff DevOps Engineer for the CDL/SLS team, you will be part of a team supporting the services running on this infrastructure. This includes automation, architecture, performance, observability, troubleshooting, security, and reliability. Infrastructure, Operations, DevOps, or System Eng...

Zoom
San Jose, California

You will also design and implement reliability best practices to accomplish a highly available service ( Additionally, you will identify and fix problems in Kubernetes operators, submitting code fixes to OSS if needed. ...

TikTok
Mountain View, California

Team Insight:CDN Site Reliability Engineering combines software and network engineering with system operations to build and run large-scale, massively distributed infrastructure. CDN performance and traffic engineering, network solution architecting or network-focused site reliability engineering ro...

ByteDance
San Jose, California

Therefore, we set up an engineer team with high talent density, mainly focusing on AI technology and Privacy&Security in CapCut. ...

Apple
Cupertino, California

The Apple Services Engineering (ASE) team is one of the most exciting examples of Apple’s long-held passion for combining art and technology. These engineers build secure, end-to-end solutions. Thanks to Apple’s unique integration of hardware, software, and services, engineers here partner to get be...