Job Title : Site Reliability Engineer (SRE) eBPF Expert
Location : Remote
Type : Full-Time
Urgency : Immediate Hire
About the Role :
We are urgently seeking a highly skilled Site Reliability Engineer (SRE) with deep expertise in eBPF (Extended Berkeley Packet Filter) to join our remote team. This is a critical role focused on enhancing observability, performance, and security across our infrastructure using cutting-edge eBPF technologies.
Key Responsibilities :
- Design, implement, and maintain eBPF-based observability and security tools.
- Improve system reliability, scalability, and performance across distributed systems.
- Collaborate with DevOps, Security, and Engineering teams to integrate eBPF into CI / CD pipelines and monitoring stacks.
- Troubleshoot complex infrastructure issues using eBPF tracing and profiling.
- Automate operational tasks and build resilient systems using SRE best practices.
Required Skills :
Proven experience with eBPF development and tooling (BCC, libbpf, Cilium, etc.).Strong background in Linux systems, networking, and kernel internals.Proficiency in programming languages such as C, Go, or Rust.Experience with cloud-native environments (Kubernetes, Docker, etc.).Familiarity with observability tools (Prometheus, Grafana, Jaeger).Excellent problem-solving and communication skills.Nice to Have :
Contributions to open-source eBPF projects.Experience with security monitoring and threat detection using eBPF.Knowledge of performance profiling and tracing tools (perf, ftrace, etc.).