Oracle Senior Principal Software Engineer - Austin, Texas
Sr. Principal Member of Technical Staff, OCI Storage
Want to apply Read all the information about this position below, then hit the apply button.
Are you interested in delivering large-scale, high performance, fault tolerant solutions? Oracle’s Cloud Infrastructure team is building a next generation Infrastructure-as-a-Service that supports the most demanding mission-critical customer requirements, and operates at cloud scale to provide a secure, distributed multi-tenant cloud environment.
We're looking for hands-on engineers with a passion for solving difficult problems in distributed systems, virtualized infrastructure, and highly available services.
Joining Oracle will give you the opportunity to design and build innovative new systems from the ground up and operate services at scale.
Our engineers have significant technical and business impact while delivering critical enterprise level features.
Responsibilities
As a Sr. Principal Member of Technical Staff, you will work with other senior architects and product management to define requirements for OCI’s upcoming AI / ML storage infrastructure services.
You have deep experience with Lustre parallel filesystems operating in large scale Linux environments. You ideally possess a working understanding of the Lustre architecture and codebase and have used your knowledge to troubleshoot issues, modify code or contribute improvements back to the Lustre git tree.
Expertise in one or more Public Cloud offerings is a plus. You will be expected to make substantial contributions towards our design and architecture and will implement proof of concepts.
You have excellent communication skills and can clearly explain complex technical concepts. As a technical leader on your team, you will mentor and demonstrate core values for other more junior engineers.
You will write code, review code written by your peers, and write test automations. You should value simplicity and scale, work comfortably in a collaborative, agile environment, and be excited to learn.
Qualifications
- 10+ years experience delivering and operating large scale, highly available distributed systems.
- Deep code-level or system administration experience with Lustre filesystems operating in large scale Linux environments.
- Strong proficiency with C and C++. Python and / or Java is a plus.
- Expertise in one or more Public Cloud offerings (OCI, AWS, GCP, Azure) is a plus.
- Experience with other high-throughput I / O architectures like DAOS / SPDK is a strong plus.
- Background in RMDA and high-performance networking (SmartNICs, NVMe / TCP, RoCEv2) is a plus.
- Familiarity with AI / ML frameworks (Tensorflow / Keras, PyTorch, Scikit-Learn, XGBoost, Caffe) as well as MLOps and Kubernetes is a plus.
- Strong knowledge of data structures, algorithms, operating systems, and distributed systems fundamentals.
- Strong troubleshooting and performance tuning skills.
- Self-motivation to thrive in a fast-paced environment.
- Bachelors or Masters in Computer Science, Computer Engineering, or related field.
Disclaimer : Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.
About Us
As a world leader in cloud solutions, Oracle uses tomorrow’s technology to tackle today’s problems. True innovation starts with diverse perspectives and various abilities and backgrounds.
Oracle is an Equal Employment Opportunity Employer*. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law.
J-18808-Ljbffr