Talent.com
Senior Site Reliability Engineer Cloud Platform

Senior Site Reliability Engineer Cloud Platform

ZillizRedwood City, California, United States, 94061
job_description.job_card.variable_hours_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Senior Site Reliability Engineer Cloud Platform

Zilliz is a fast-growing startup developing the industrys leading vector database company for enterprise-grade AI. Founded by the engineers behind Milvus, the worlds most popular open-source vector database , the company builds next-generation database technologies to help organizations quickly create AI applications. On a mission to democratize AI, Zilliz is committed to simplifying data management for AI applications and making vector databases accessible to every organization.

What you will do :

  • Work at the intersection of development and site reliability. Creating SRE tools and systems, as well as supporting existing infrastructure and platforms.
  • Ensure the reliability, availability, and performance of Zillizs distributed database systems.
  • Develop and implement strategies for monitoring, incident management, and disaster recovery.
  • Automate system operations and maintenance tasks to improve efficiency and reduce manual intervention.
  • Design and build tools to manage and monitor infrastructure, ensuring scalability and robustness.
  • Collaborate with software engineers to enhance system reliability, scalability, and performance.
  • Maintain and improve the CI / CD pipeline to ensure smooth and rapid deployment of changes.
  • Actively contribute to the Milvus Vector Database open-source community, focusing on improving reliability and operational efficiency.

What we are looking for :

  • 4+ years of experience in site reliability engineering or similar roles with a focus on cloud-native systems.
  • Proficiency in scripting languages such as Python, Go, or Java.
  • Strong knowledge of container orchestration technologies like Kubernetes and Docker.
  • Expertise with cloud platforms such as AWS, GCP, or Azure, and their respective monitoring and management tools.
  • Experience with infrastructure as code tools such as Terraform or Ansible.
  • Familiarity with CI / CD tools such as Jenkins, GitLab CI, or Argo.
  • Proven ability to troubleshoot complex distributed systems and resolve issues promptly.
  • Bachelors degree or above in computer science, software engineering, or other relevant disciplines.
  • Ability to thrive in a fast-paced, startup environment and handle multiple projects simultaneously.
  • Experience with Open Source Milvus Vector Database is nice to have
  • Zilliz is an Equal Opportunity Employer and welcome people from all backgrounds, experiences, abilities, and perspectives. All qualified applicants will receive consideration for employment regardless of race, color, national origin, religion, sexual orientation, gender, gender identity, age, physical disability, or length of time spent unemployed.

    $175,000 - $225,000 a year

    Compensation details : 175000-225000 Yearly Salary

    PI72c72197086a-30511-37867191

    serp_jobs.job_alerts.create_a_job

    Senior Site Reliability Engineer • Redwood City, California, United States, 94061

    Job_description.internal_linking.related_jobs
    • serp_jobs.job_card.promoted
    Site Reliability Engineer - SRE at Descope Los Altos, CA

    Site Reliability Engineer - SRE at Descope Los Altos, CA

    Itlearn360Los Altos, CA, United States
    serp_jobs.job_card.full_time
    Site Reliability Engineer - SRE job at Descope.Descope R&D group is a skilled team of developers with a unique DNA of creativity,flexibility,anopen mindset. We are looking for a passionate SRE to jo...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Tarana WirelessMilpitas, CA, US
    serp_jobs.job_card.full_time
    serp_jobs.filters_job_card.quick_apply
    Join the Team That's Redefining Wireless Technology At Tarana , we're more than just a fast-growing tech company—we’re a team of bold innovators on a mission to revolutionize broa...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    DevOps Engineer / Site Reliability Engineer

    DevOps Engineer / Site Reliability Engineer

    HyperFiSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    We're building the kind of platform we always wanted to use : fast, flexible, and built for making sense of real-world complexity. Behind the scenes is a robust, event-driven architecture that connec...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    • serp_jobs.job_card.new
    Site Reliability Engineer I

    Site Reliability Engineer I

    prosper.comSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    As a Site Reliability Engineer I at Prosper, you will play a crucial role in enhancing the reliability, scalability, and maintainability of our technology platform. This entry-level position is desi...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
    • serp_jobs.job_card.promoted
    Senior / Staff Site Reliability Engineer, Storage

    Senior / Staff Site Reliability Engineer, Storage

    FluidstackSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Fluidstack is building GPU supercomputers for top AI labs, governments, and enterprises.Our customers include Mistral, Poolside, Black Forest Labs, Meta, and more. Our team is small, highly motivate...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Senior Site Reliability Engineer, Storage

    Senior Site Reliability Engineer, Storage

    Epoch BiodesignSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Crusoe Energy is on a mission to unlock value in stranded energy resources through the power of computation.Take a look at what we do! - https : / / www. We aim to align the long term interests of the c...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Senior / Staff Site Reliability Engineer, Compute

    Senior / Staff Site Reliability Engineer, Compute

    FluidstackSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Fluidstack is building GPU supercomputers for top AI labs, governments, and enterprises.Our customers include Mistral, Poolside, Black Forest Labs, Meta, and more. Our team is small, highly motivate...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    Infrastructure, DevOps & Reliability Engineer (Multiple Roles, Remote & On-Site)

    Infrastructure, DevOps & Reliability Engineer (Multiple Roles, Remote & On-Site)

    MLabsSan Francisco, CA, US
    serp_jobs.job_card.full_time
    serp_jobs.filters_job_card.quick_apply
    We’re recruiting Infrastructure, DevOps, and Reliability Engineers for high-growth startups including .AirGarage, Dyno Therapeutics, Codex Health, and Banquet Health.These roles focus on scali...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Senior Software Engineer - Site Reliability

    Senior Software Engineer - Site Reliability

    Ironclad Inc.San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Every dollar earned, relationship formed, and advantage gained comes down to the contract that makes it real.But getting a contract done is more complicated than it should be.And when contract data...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Software Engineer (Site Reliability Engineer)

    Software Engineer (Site Reliability Engineer)

    CerebrasSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    San Francisco or Palo Alto, CA.At Anyscale, we take a market-based approach to compensation.We are data-driven, transparent, and consistent. As the market data changes over time, the target salary f...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Site Reliability Engineer (SRE) - grok.com & API

    Site Reliability Engineer (SRE) - grok.com & API

    Pantera CapitalPalo Alto, CA, United States
    serp_jobs.job_card.full_time
    AI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excelle...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Senior Software Engineer, Site Reliability Engineer (SRE)

    Senior Software Engineer, Site Reliability Engineer (SRE)

    harvey.aiSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    At Harvey, we’re transforming how legal and professional services operate — not incrementally, but end-to-end.By combining frontier agentic AI, an enterprise-grade platform, and deep domain experti...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Site Reliability Engineer

    Site Reliability Engineer

    WritemedSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Would you like to join one of the fastest-growing organizations with a goal of using the latest AI, GenAI, LLM, Cloud, and Digital Technologies to advance drug development and improve patient care ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    Air AppsSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    At Air Apps, we believe in thinking bigger—and moving faster.We’re a family-founded company on a mission to create the world’s first AI-powered Personal & Entrepreneurial Resource Planner (PRP), an...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Principal Site Reliability Engineer - Enterprise AI Platform

    Principal Site Reliability Engineer - Enterprise AI Platform

    NVIDIA CorporationSanta Clara, CA, United States
    serp_jobs.job_card.full_time
    Principal Site Reliability Engineer - Enterprise AI Platform page is loaded.Principal Site Reliability Engineer - Enterprise AI Platform. Apply locations US, CA, Santa Clara time type Full time post...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    • serp_jobs.job_card.new
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Gridware Technologies Inc.San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Gridware is a San Francisco-based technology company dedicated to protecting and enhancing the electrical grid.We pioneered a groundbreaking new class of grid management called active grid response...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
    • serp_jobs.job_card.promoted
    • serp_jobs.job_card.new
    Site Reliability Engineer - Cybersecurity

    Site Reliability Engineer - Cybersecurity

    Pantera CapitalPalo Alto, CA, United States
    serp_jobs.job_card.full_time
    AI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excelle...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
    • serp_jobs.job_card.promoted
    Founding Site Reliability Engineer

    Founding Site Reliability Engineer

    Relevance AISan Francisco, CA, United States
    serp_jobs.job_card.full_time
    San Francisco, USA (Hybrid 3 days / week).At Relevance AI, our mission is to empower anyone to delegate work to the AI workforce. We’re building a new category of AI automation, enabling teams to crea...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day