A company is looking for a MCP & Tools Python Developer - Agent Evaluation Infrastructure.
Key Responsibilities
Developing and maintaining MCP-compatible evaluation servers
Implementing logic to check agent actions against scenario definitions
Creating or extending tools that writers and QAs use to test agents
Required Qualifications
4+ years of Python development experience, ideally in backend or tools
Solid experience building APIs, testing frameworks, or protocol-based interfaces
Understanding of Docker, Linux CLI, and HTTP-based communication
Ability to integrate new tools into existing infrastructures
Familiarity with how LLM agents are prompted, executed, and evaluated
Python Developer • Visalia, California, United States