Search jobs > Mountain View, CA > Staff software engineer

Staff Software Engineer - Distributed Data Systems

Databricks
Mountain View, California
$192K-$260K a year
Full-time

P-186

At Databricks, we are obsessed with enabling data teams to solve the world's toughest problems, from security threat detection to cancer drug development.

We do this by building and running the world's best data and AI infrastructure platform, so our customers can focus on the high value challenges that are central to their own missions.

Founded in 2013 by the original creators of Apache Spark™, Databricks has grown from a tiny corner office in Berkeley, California to a global organization with over 1000 employees.

Thousands of organizations, from small to Fortune 100, trust Databricks with their mission-critical workloads, making us one of the fastest growing SaaS companies in the world.

Our engineering teams build highly technical products that fulfill real, important needs in the world. We constantly push the boundaries of data and AI technology, while simultaneously operating with the resilience, security and scale that is critical to making customers successful on our platform.

We develop and operate one of the largest scale software platforms. The fleet consists of millions of virtual machines, generating terabytes of logs and processing exabytes of data per day.

At our scale, we regularly observe cloud hardware, network, and operating system faults, and our software must gracefully shield our customers from any of the above.

Modern data analysis employs sophisticated methods such as machine learning that go well beyond the roll-up and drill-down capabilities of traditional SQL query engines.

As a software engineer on the Runtime team at Databricks, you will be building the next generation distributed data storage and processing systems that can outperform specialized SQL query engines in relational query performance, yet provide the expressiveness and programming abstractions to support diverse workloads ranging from ETL to data science.

Below are some example projects :

Apache Spark™ : Develop the de facto open source standard framework for big data.

Data Plane Storage : Deliver reliable and high performance services and client libraries for storing and accessing humongous amount of data on cloud storage backends, e.

g., AWS S3, Azure Blob Store.

Delta Lake : A storage management system that combines the scale and cost-efficiency of data lakes, the performance and reliability of a data warehouse, and the low latency of streaming.

Its higher level abstractions and guarantees, including ACID transactions and time travel, drastically simplify the complexity of real-world data engineering architecture.

Delta Pipelines : It's difficult to manage even a single data engineering pipeline. The goal of the Delta Pipelines project is to make it simple and possible to orchestrate and operate tens of thousands of data pipelines.

It provides a higher level abstraction for expressing data pipelines and enables customers to deploy, test & upgrade pipelines and eliminate operational burdens for managing and building high quality data pipelines.

Performance Engineering : Build the next generation query optimizer and execution engine that's fast, tuning free, scalable, and robust.

What we look for :

  • BS in Computer Science, related technical field or equivalent practical experience.
  • Optional : MS or PhD in databases, distributed systems.
  • Comfortable working towards a multi-year vision with incremental deliverables.
  • Driven by delivering customer value and impact.
  • 8+ years of production level experience in either Java, Scala or C++.
  • Strong foundation in algorithms and data structures and their real-world use cases.
  • Experience with distributed systems, databases, and big data systems (Apache Spark™, Hadoop).

Pay Range Transparency

Databricks is committed to fair and equitable compensation practices. The pay range(s) for this role is listed below and represents base salary range for non-commissionable roles or on-target earnings for commissionable roles.

Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to job-related skills, depth of experience, relevant certifications and training, and specific work location.

Based on the factors above, Databricks utilizes the full width of the range. The total compensation package for this position may also include eligibility for annual performance bonus, equity, and the benefits listed above.

For more information regarding which range your location is in visit our page .

Local Pay Range$192,000 $260,000 USD

30+ days ago
Related jobs
Promoted
NetApp
San Jose, California

Globally recognized as domain expert in software and system design for highly scalable distributed storage systems, control and data plane architectures, core operating systems fundamentals required to fuel large scale AI infrastructure for GenAI and AI as a Service. The ONTAP team drives the produc...

Promoted
LinkedIn
Mountain View, California

Experience with industry, open-source projects and/or academic research in large-data, parallel and distributed systems-Experience leading high-impact, cross-organization initiativeSuggested Skills:-Distributed Systems-Technical Leadership-Databases -Storage-Open Source DevelopmentLinkedIn is commit...

Promoted
Palo Alto Networks
Santa Clara, California

We're seeking innovators - engineers who seek to design new products, designing state-of-the-art products that do not exist today. These engineers love to code with a drive to build global products and bring new ideas to develop security disciplines to solve real-world problems. Collaboration is at ...

Promoted
Intuit Inc.
Mountain View, California

Come join Intuit’s Identity platform team as a Staff Software Engineer. Experience developing systems that process data at large scale. Ensure the highest standards for engineering design, implementation, and testing. Mentor engineers on technology, process, people, and product skills. ...

Pioneer Data Systems
Redwood City, California

Position Details:Job Title: Software Engineer (C++ / Python, Embedded Software) / Medical DeviceDuration: 6+ months contract, extendable up to 24 monthsLocation: Redwood City, CAFully Onsite (Redwood City In 2024, Santa Clara In 2025)Note:The client has the right-to-hire you as a permanent employee ...

NVIDIA
Santa Clara, California
Remote

NVIDIA Cloud Functions team is looking for a motivated, product-minded Senior Distributed Systems Software Engineer with an observability focus. Our product enables and scales AI inferencing workloads using globally distributed orchestration of workloads on GPU-backed cloud-agnostic Kubernetes clust...

BILL
San Jose, California

Experience crafting and architecting distributed systems, concurrent programming, and coding data structures. Experience in complex problem-solving in large-scale distributed systems, performance optimization, and high-availability systems. Passion for software architecture, APIs and high performanc...

Hireio, Inc.
San Jose, California

Working industry experience with Big Data systems and projects • Experience in building large scale distributed systems in a product environment. As a software engineer in experimentation and evaluation team, you will have the opportunity to build, optimize and grow one of the largest data platforms...

Oracle
Santa Clara, California

As a member of the software engineering division, you will apply intermediate to advanced knowledge of software architecture to perform software development tasks associated with developing, debugging or designing software applications or operating systems according to provided design specifications...

Aurora
Mountain View, California

We’re searching for a Staff Systems Engineer - Hardware Systems to be an integral part of the Hardware Engineering team that is working to deliver the next generation Aurora Driver Hardware Kit from concept to scale production by:. Write design documents to evaluate requirements compatibility with a...