Site Reliability Engineer - Remote

https:/www.energyjobline.com/sitemap.xml

Burbank, California, US

Remote

Full-time

At EFG (ESL FACEIT Group), we create worlds beyond gameplay where players and fans become community. We pride ourselves in having a corporate social responsibility, which is that IT’S NOT GG (Good Game), UNTIL IT’S GG FOR ALL .

We are passionate about the culture we foster that ultimately helps to create and shape the world of esports, gaming tournaments, leagues, events, and holistic ecosystems staged for our millions of players, fans, and heroes.

Apply fast, check the full description by scrolling below to find out the full requirements for this role.

The Team :

As a Site Reliability Engineer at EFG, you will be designing, analyzing, and troubleshooting large-scale distributed systems.

You will demonstrate a systematic problem-solving approach, and the ability to debug and optimize code and to automate routine tasks.

You will ensure that EFG’s services and systems are reliable, that they have uptime appropriate to users' needs, and they have a fast rate of improvement.

Apart from monitoring our systems' capacity and performance, you will also focus on optimizing existing systems, on building infrastructure, and on eliminating work through automation.

You will work collaboratively with the software engineering teams to deploy and operate our systems, and you will help to automate and streamline our operations and processes.

Within this role, you will be given real responsibilities, and you have the opportunity to drive change and have a big impact on our products and platform.

What you will do :

Maintaining and improving the monitoring and observability tools (Grafana / Prometheus / Thanos / Jaeger);
Working closely with your team and with other cross-functional teams to help design, maintain, and operate systems at scale;
Developing and driving adoption of SRE best practices across the company;
Leading on incident management process and adoption;
Using your troubleshooting skills to help identify and fix operational issues;
Working with Cloud Native technologies such as Kubernetes, Envoy, Istio, Prometheus, and Helm;
Working with the Hashi Stack (terraform, packer, vault);
Experimenting with and introducing cutting edge technologies.

Requirements :

Proven experience as a Site Reliability Engineer, DevXP Engineer, or Software Engineer, focusing on building and maintaining scalable infrastructures;
Excellent working knowledge on at least one of the major cloud providers (GCP / AWS / Azure);
You have experience with cluster management systems (Kubernetes);
Knowledge of incident management : ability to investigate, troubleshoot, recover and prevent the recurrence of incidents that interfere with the normal delivery of IT services;
Proficient in Go language and some level of proficiency in at least another language : Java, Python, Rust ;
You have knowledge of GitOps practices;
You have production scale experience with one of the following : MongoDB, Redis, MySQL;
Experience contributing to open source technologies would be an added bonus.

J-18808-Ljbffr

Remote working / work at home options are available for this role.

2 days ago

Related jobs

Promoted

Senior Associate Site Reliability Engineer

VirtualVocations

Burbank, California

A company is looking for a Senior Associate Site Reliability Engineer responsible for designing, building, and maintaining infrastructure for highly available solutions. ...

Promoted

Site Reliability Engineer 2 (mid-level)

https:/www.energyjobline.com/sitemap.xml

Burbank, California

As a Site Reliability Engineer you will be responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of all production systems and services. You will work together with other DevOps, Product and Engineering teams to d...

Promoted

Sr. Site Reliability Engineer. Cloud

Abbott Laboratories

Los Angeles, California

Site Reliability Engineer (Sr Engineer, DevOps Engineering). You will also contribute to our existing operations hosted on-prem which includes managing several java applications, support middleware technologies, continuous automation, adapt Site reliability engineering (SRE) principles to operations...

Promoted

Site Reliability Engineer (SRE for Datacenter)

Glow Networks, Inc.

Culver City, California

Follow test plans, follow test site scheduling, understand equipment calibration and setup, RF signal measurement and basic data analysis. ...

Promoted

Staff Site Reliability Engineer - Network Management Platform

Fastly

Los Angeles, California

The Network Management Platform Team is looking for a talented Site Reliability and/or System Development Engineer with experience in designing, building and operating distributed systems that are scalable, fault tolerant and easy to manage. You'll be joining a dynamic and highly collaborative team ...

Promoted

Site Reliability Engineer

Circle

Los Angeles, California

As a Site Reliability Engineer at Circle, you’ll build out and maintain Circle’s infrastructure estate to meet the growing worldwide customer base across multiple regions on public cloud providers. Circlers are consistently evolving in a remote world where strength in numbers fuels team success. You...

Senior Site Reliability Engineer

Disney Entertainment & ESPN Technology

Burbank, California

The Senior Site Reliability Engineer is a key member of our Performance and Reliability embedded teams. Our Performance and Reliability teams are leading the improvements, optimization, and availability of applications across the Disney organization and business units, taking a consultative approach...

Site Reliability Principal Engineer

City National Bank

Los Angeles, California

SITE RELIABILITY PRINCIPAL ENGINEER. WHAT IS THE OPPORTUNITY? As an SRE, you will utilize your software, systems engineering, and operations background to build and run large-scale, fault-tolerant systems. Your role is to ensure the reliability, scalability and maximum uptime of CNB systems in the D...

Senior Staff Engineer- Observability and Reliability Platform Engineering (REMOTE)

GEICO

Los Angeles, California

Remote

Our Staff Engineer works with our Sr Staff Engineer and Sr. GEICO is seeking an experienced Staff Engineer with a passion for building high-performance, low maintenance, zero-downtime platforms, and applications. You will help drive our insurance business transformation as we transition from a tradi...

Site Reliability Engineer

City National Bank

Los Angeles, California

SITE RELIABILITY ENGINEER WHAT IS THE OPPORTUNITY? As an SRE, you will utilize your software, systems engineering, and operations background to build and run large-scale, fault-tolerant systems. Your role is to ensure the reliability, scalability and maximum uptime of CNB systems in the Data Center ...