Overview
The Mission
We\'re building an AI-powered insurance brokerage that is transforming the $900 billion commercial insurance market by automating processes that currently run on pre-internet systems. Fresh off our $8M seed round, we\'re looking for an exceptional AI Platform Engineer who can architect and develop the core infrastructure that powers our entire AI ecosystem.
You\'ll build the foundational platform that enables our AI agents to operate across growth, sales, operations, and customer service. This includes extending our proprietary AI Grid context engineering system, developing evaluation infrastructure, building ML models for market-making and reasoning, and creating the systems that enable massive operational leverage (enabling one person to do the work of thousands). You\'ll be responsible for both ambient agents (background processes with context / memory) and the core systems that enable frontier agents (human-AI interfaces) to deliver exceptional experiences.
We\'re committed to "Staying REAL" with our AI systems - building agents that are Reliable, Experience-focused, Accurate, and have Low latency. You will work directly with the CEO and CTO to execute on our AI vision with a bias toward action. We live by core principles : "There is no try, there is just do," "Actions lead to information, always default to action," and "Strong opinions lead to information." We need engineers who build and ship, not just plan and strategize.
Responsibilities / What You\'ll Do
- Extend and enhance our proprietary AI Grid context engineering system that combines ETL with LLM pipelines and graphs
- Build robust evaluation systems with datasets and human-annotated data for supervised fine-tuning (SFT) and reinforcement learning (RL)
- Develop ML models for critical systems including underwriter load balancing (our market-making engine) and reasoning systems
- Architect and maintain data pipelines that pull from multiple diverse sources and push to various destination systems (ClickHouse, vector databases, ML platforms, etc.)
- Build MCP (Model Context Protocol) servers that expose memory and tools to AI agents across the platform
- Create integrations with payment providers and financial systems for seamless transaction processing
- Develop growth engineering infrastructure including agents that determine campaign strategies and optimize outreach
- Build voice AI systems for follow-ups, information collection, and cold outbound campaigns
- Create customer service AI infrastructure enabling one person to manage thousands of leads and customers
- Design ETL / ELT pipelines that handle both batch and real-time processing at scale
- Implement Lambda architecture patterns combining event streaming with batch processing
- Partner with forward deployed engineers to ensure platform capabilities meet business needs
- Build comprehensive observability systems to monitor agent reliability and performance
You're Our Person If
You're exceptional at one or more of : distributed systems, AI agents, context engineering, or data engineeringYou have experience building evaluation systems and working with human-annotated datasets for ML trainingYou understand how to build ML models for complex systems like market-making and reasoning enginesYou have deep expertise with modern data warehouse and ML platforms (Databricks, Astronomer, or similar)You can architect MCP servers and understand how to expose memory and tools to AI systemsYou have experience with payment provider integrations and financial systemsYou understand how to build voice AI infrastructure for outbound campaigns and information collectionYou can architect systems that enable massive operational leverage (1 : 1000s ratios)You have experience with data enrichment and building DAG-based orchestration systemsYou understand CAP theorem tradeoffs and can make appropriate architectural decisionsYou're equally comfortable with TypeScript / Node.js and Python / ML frameworksYou ship features daily and take immediate action instead of overthinkingYou embrace "there is no try, there is just do" as your engineering mantraHard Requirements
Strong experience with both TypeScript / Node.js and PythonDeep understanding of distributed systems principles, CAP theorem tradeoffs, and event sourcing architectureExperience with modern data warehouse solutions (Databricks, Astronomer, Snowflake, or similar)Proven track record building ML modeling infrastructure and ETL / ELT pipelinesExperience with evaluation systems and human-annotated datasets for ML trainingExperience building ML models for complex systems (recommendation engines, market-making, reasoning)Experience with MCP server architecture and protocol implementationExperience with voice AI systems and conversational interfacesExperience with payment provider integrations and financial systemsExperience designing data pipelines that serve multiple destination systemsExperience with temporal.io workflows or similar durable execution frameworksExperience with data enrichment and DAG-based orchestrationProven track record building production AI / ML systems at scaleExperience with vector databases (Qdrant, Pinecone, Weaviate) and RAG systemsStrong understanding of context engineering for AI systemsAdvanced usage of Cursor or WindSurf coding IDEMust be based in San Francisco and work in-office 5.5 days per week (relocation assistance provided)Our Tech Stack
AI Agent Infrastructure
AI Grid - our proprietary context engineering system combining ETL with LLM pipelines and graphsTemporal.io for durable workflow orchestration across agent systemsPydantic-AI for type-safe agent development with structured validationMCP servers for exposing memory and tools to AI agentsEvaluation systems with human-annotated datasets for SFT and RLEvent sourcing architecture with Redis streams and PostgreSQLLambda architecture combining real-time event streams and batch processingRAG systems with rigorous evaluation frameworksClaude (Anthropic), GPT-4 (OpenAI), and select open source modelsVoice AI infrastructure for campaigns and customer interactionsLogfire for comprehensive agent observabilityData & ML Infrastructure
Modern data warehouse solutions (Databricks / Astronomer) for ML modeling and ETLML models for underwriter load balancing (market-making) and reasoning systemsEvaluation infrastructure with human-annotated datasets for SFT / RLMultiple destination systems including ClickHouse for analyticsApache Airflow, Temporal, Airbyte, and N8N for pipeline orchestrationVector databases for AI context storage and retrievalCustom data enrichment pipelines for growth engineeringPayment provider integrations for transaction processingPostHog for product analytics and event trackingRedis streams and PostgreSQL for operational dataCore Engineering
TypeScript / Node.js for robust application developmentPython for AI systems and ML workflowsNext.js / React for frontend experiencesEvent-driven architecture with distributed systems designWhat You\'ll Build in Your First 90 Days
First Month
Extend and enhance our AI Grid context engineering system with new capabilitiesBuild initial evaluation datasets and implement human annotation workflowsSet up MCP servers for memory and tool exposure across AI agentsCreate voice AI infrastructure for outbound campaigns and information collectionDesign ML models for underwriter load balancing and reasoning systemsEstablish comprehensive observability and monitoring systemsSecond Month
Develop sophisticated data enrichment pipelines for growth engineeringBuild agents that determine and optimize campaign strategiesImplement payment provider integrations for seamless transactionsExpand evaluation systems with automated dataset generationCreate customer service AI infrastructure for 1 : 1000s operational leverageImplement advanced orchestration patterns for complex workflowsThird Month
Scale AI Grid to handle increasing complexity and volumeOptimize ML models for market-making and reasoning performanceBuild comprehensive growth engineering platform with campaign automationImplement advanced voice AI features for cold outbound and follow-upsFine-tune models using human-annotated datasets (SFT / RL)Integrate all systems into a unified, observable platform architectureOur AI Philosophy
Context is King : The quality of AI decisions directly correlates with the richness of available context through AI Grid10x Platform Impact : Build infrastructure that enables forward deployed engineers to create 10x business leverageEvaluation-Driven Development : Use human-annotated data and rigorous evaluation to continuously improveMulti-System Integration : Design for multiple sources and destinations from day oneEvent-Driven Architecture : React to events and state changes for maximum responsivenessDistributed & Durable : Create fault-tolerant systems that maintain state and recover from failuresBusiness-Enabling Infrastructure : Every platform capability should unlock new business opportunitiesAction Orientation : Always default to action - ship code, gather data, and iterateExecution Focus : There is no try, there is just do - we value engineers who build and shipJoin Us To Transform the $900B Insurance Market
This is an early-stage role at a fast-moving startup, and you\'ll often experience the crawl-walk-run approach to building. You\'ll quickly prototype systems and then push them into productionized platforms that can scale. We\'re looking for people who can be creative in providing impact first, then take learnings from that impact and push them back into the system.
You should ideally have worked in an early-stage startup environment and understand the pacing. This is a fast-paced environment where we value ownership and quick, rapid feedback loops within the team. You\'ll work directly with the CEO and CTO to execute on our AI vision with a bias toward action.
We require you to be in San Francisco and work from our office 5.5 days per week. We\'ll cover relocation costs and believe the best teams collaborate intensively in person.
Skills
TypeScript, Node.js, Python, Distributed Systems, Context Engineering, AI Grid, Data Engineering, Databricks, Astronomer, ETL / ELT, ML Infrastructure, Data Enrichment, DAG Orchestration, Temporal.io, Pydantic-AI, MCP Servers, Evaluation Systems, SFT / RL, Voice AI, Payment Integration, Event Sourcing, CAP Theorem, Lambda Architecture, Apache Airflow, Vector Databases, RAG Systems, Redis Streams, PostgreSQL, AI Agent Development, System Architecture, Market-Making Systems, Reasoning Engines
#J-18808-Ljbffr