Search jobs > New York, NY > Site reliability engineer

Software Engineer - Site Reliability

Datadog
New York, NY
Full-time

Software Engineer - Site Reliability

Paris, France; Madrid, Spain; Nantes, France; Bordeaux, France; Sophia Antipolis, France; Lyon, France; Grenoble, France; Montpellier, France;

The Site Reliability teams at Datadog are responsible for ensuring that our high-volume, low-latency environments continue to perform around the clock.

These teams collaborate closely with our product engineers to ensure that Datadog can monitor millions of servers and containers, ensuring our customers always have dependable and actionable data at their fingertips.

You’ll be responsible for shaping the infrastructure of our data-intensive, real-time services as we continue to grow at petabyte scale.

At Datadog, we place value in our office culture - the relationships that it builds, the creativity it brings to the table, and the collaboration of being together.

We operate as a hybrid workplace to ensure our employees can create a work-life harmony that best fits them.

What You’ll Do :

  • Keep our service reliable, available and fast as a member of the operations team.
  • Respond to, investigate and fix service issues, whether they be deep in the OS kernel or in the application code.
  • Design, build and maintain the infrastructure we need to support orders of magnitude more customers.

Who You Are :

  • You have a track record as an engineer in the operations of a large site
  • You value correctness and efficiency; you leave no stone unturned when diagnosing production issues
  • You handle infrastructure with code because automation lets you focus on the more difficult and rewarding problems
  • You have production experience with distributed compute / storage tools, e.g. zookeeper, cassandra, postgres, kafka, elasticsearch, redis

Datadog values people from all walks of life. We understand not everyone will meet all the above qualifications on day one.

That's okay. If you’re passionate about technology and want to grow your skills, we encourage you to apply.

Benefits and Growth :

  • New hire stock equity (RSUs) and employee stock purchase plan (ESPP)
  • Continuous professional development, product training, and career pathing
  • Intradepartmental mentor and buddy program for in-house networking
  • An inclusive company culture, ability to join our Community Guilds (Datadog employee resource groups)
  • Access to Inclusion Talks, our Internal panel discussions
  • Free, global mental health benefits for employees and dependents age 6+
  • Competitive global benefits

Benefits and Growth listed above may vary based on the country of your employment and the nature of your employment with Datadog.

LI-MF2

30+ days ago
Related jobs
Promoted
Hispanic Technology Executive Council
New York, New York

In your role as a Site Reliability Engineer, youll use your skills to help instrument our systems so they can be easily built, observed, monitored, tested, and deployed at scale, and ensure Skytaps services perform well for enterprise customers. In order to be effective in this role as a Site Reliab...

Promoted
Mondrian Alpha
New York, New York

We are looking for someone with 5+ years of engineering experience in Site Reliability / Trading Systems Engineering, preferably who has worked in the buy-side, and a deep and comprehensive understanding of Linux / Unix and Python / C++ / Java. This individual will be joining a rapidly growing, dyna...

Promoted
Alpha Search Advisors
New York, New York

Our Storage Engineering team architects, builds, and operates high throughput storage platforms, using a combination of vendor storage appliances and internally developed automation. Provide escalation support to operations teams and software developers for storage related performance and availabili...

Promoted
Motion Recruitment Partners LLC
Queens, New York

One of the largest brewing companies in the world is hiring for a Site Reliability Engineer to join their team. ...

Datadog
New York, New York

Senior Software Engineer - Site Reliability (Lisbon). The Site Reliability teams at Datadog are responsible for ensuring that our high-volume, low-latency environments continue to perform around the clock. These teams collaborate closely with our product engineers to ensure that Datadog can monitor ...

S&P Global
New York, New York

We are seeking a highly motivated and experienced Site Reliability Engineer (SRE) to join the Enterprise Solutions SRE team. Site Reliability Engineer or equivalent in a similar role. We develop large scale technology platforms and enterprise software to produce global financial data with focus on a...

MongoDB
New York, New York

The Cloud Site Reliability Engineering Team designs and builds the global infrastructure on which we deploy our services. MongoDB’s mission is to empower innovators to create, transform, and disrupt industries by unleashing the power of software and data. ...

CIRCLE
New York, New York

As a Senior Site Reliability Engineer at Circle, you will design, build, and maintain Circle’s infrastructure estate to meet the growing worldwide customer base on public cloud providers across multiple regions. Staff Site Reliability Engineer (IV). Senior Site Reliability Engineer (III). Senior Sit...

Gemini
New York, New York
Remote

The Role: Senior Site Reliability Engineer. Given the need to build and integrate more of our software in the cloud, the ideal engineer will have extensive experience in automating and building out cloud-based software (e. The infrastructure team at Gemini creates and manages software tools and plat...

WarnerMedia Services, LLC
New York, New York

Deep knowledge of databases is a core competency for our engineers, but we take inspiration from the Reliability Engineering discipline and we invest significantly in cloud automation. You solve business problems with simple and straightforward solutions, applying appropriate technologies and softwa...