Site Reliability Engineer - SRE Lille

Scaleway
Newport Beach, California, US
Full-time

Founded in 1999, Scaleway is the cloud subsidiary of the Iliad group, one of Europe's leading telecommunications companies. Our mission is to foster a more responsible digital industry by helping developers and businesses build, deploy, and scale applications on any infrastructure.

From our offices in Paris and Lille, we refine the Scaleway cloud ecosystem every day, and we are its first users.

Our roughly 25,000 customers choose us for our multi-AZ redundancy, smooth user experience, carbon-neutral data centers, and native tools for managing multi-cloud architectures.

Our products include fully managed solutions for bare metal, containerization, and serverless architectures, offering a responsible choice in cloud computing.

Join our dynamic team of nearly 600 people from diverse backgrounds, in a stimulating, international environment that combines technical excellence, creativity, and sharing.

About the job

Scaleway is looking for a Site Reliability Engineer to join our teams. Reporting to a Lead SRE, you will be responsible for ensuring that we can reliably serve our products to users around the world.

We expect you to have a strong background in development and system administration. Our systems evolve constantly, and the tools needed to observe them and act to ensure their resilience must evolve accordingly.

Minimum qualifications

  • Previous experience as a developer in Go, Python or Rust
  • Experience in system programming with common scripting languages (Bash, Python)
  • Demonstrated ability to troubleshoot production system failures
  • A great attitude and desire to work with a team
  • Passion for incremental improvements to tooling and a love of all things automation
  • Experience with Linux systems (Ubuntu / Debian)
  • Experience with cloud environment architecture (bare metal, virtual machines, containers, orchestrators)
  • Good understanding of computer networks: TCP/IP, DNS, load balancing, IPv6, BGP, and network virtualisation
  • Understanding of written and spoken English, including the ability to write technical documentation in English and to speak English when needed

Preferred qualifications

  • Experience with infrastructure as code and continuous deployment
  • Experience dealing with physical hardware automation
  • Experience with monitoring & logging systems
  • Experience administering relational databases
  • Knowledge of at least one cloud platform and related use cases
  • Initiative to propose new solutions and defend them
  • Team player, willing to share knowledge, opinions, and participate in regular team rituals
  • Good communication and coaching skills

Responsibilities

  • Create new tools and documentation, or optimize existing ones, to help identify, diagnose, and remediate production incidents, automating as much as possible
  • Troubleshoot high-impact issues working with multiple engineering teams
  • Take on-call responsibilities, mitigate issues encountered in production, and provide the best real-time response to our customers
  • Ensure a high quality of service for our customers by leveraging observability and monitoring technologies
  • Manage the lifecycle of products in production
  • Help implement best practices in stability, resiliency, scalability, security, and performance across our systems

Technical Stack

  • Python, Go, Rust
  • RabbitMQ
  • PostgreSQL
  • HAProxy, Nginx, REST APIs / Flask
  • S3 API
  • Sentry, Prometheus, Grafana, Elasticsearch, Fluentd, Kibana
  • Ansible, AWX, Foreman, Salt
  • GitLab, Nexus
  • Ubuntu, Debian, CentOS
  • Jira, Confluence, Slack, GSuite

Location

This position is based in our offices in Paris or Lille (France).
