Job Title: Senior Performance Engineer (Consultant)
Location: Pittsburgh, Pennsylvania, United States
Term: 3-Month Contract
The Opportunity
We are seeking an elite Senior Performance Engineer for a short-term consulting engagement to drive critical optimizations in our autonomous vehicle’s onboard software stack.
This role requires a deep-dive analysis of our near real-time Linux-based system, identifying and eliminating bottlenecks to ensure maximum efficiency, reliability, and deterministic performance. This is a unique opportunity to apply your expertise to one of the most challenging problems in technology today, with a direct and measurable impact on the performance of our autonomous fleet.
What You'll Do
- Conduct comprehensive performance analysis across the entire software stack, from the Linux kernel and drivers to the application layer.
- Use advanced profiling and tracing tools such as perf, eBPF, and ftrace to diagnose CPU, GPU, memory, I/O, and network bottlenecks.
- Optimize system and thread scheduling to meet near real-time deadlines and minimize latency in critical processing pipelines.
- Analyze Inter-Process Communication (IPC) mechanisms to increase throughput and reduce overhead.
- Identify and remediate architectural pitfalls that can cause performance degradation, memory fragmentation, or unpredictable timing.
- Collaborate closely with core engineering teams to implement architectural improvements.
- Develop and document best practices and performance-aware design patterns for engineers.
- Establish KPIs and monitoring tools to prevent performance regressions.
Who You Are
- Linux expert with deep knowledge of kernel internals, including scheduling, memory management, and networking.
- Proficient in performance profiling and tracing tools: perf, eBPF, and ftrace.
- Systems programmer with expertise in C++ and other high-performance, multithreaded application languages.
- Strong understanding of modern CPU and GPU architectures and experience leveraging hardware-specific acceleration such as CUDA and TensorRT.
- Experienced in optimizing low-level software stacks, from hardware interrupts to application logic.
- Familiarity with real-time Linux patches (e.g., PREEMPT_RT) and deterministic scheduling.
- Able to communicate complex technical issues clearly to both engineers and leadership.
Preferred Qualifications
- Prior experience in autonomous vehicles, robotics, aerospace, or high-frequency trading (HFT).
- Experience with robotics middleware (e.g., ROS, DDS) and performance tuning.
- Knowledge of GPU optimization pipelines and hardware-specific acceleration.
- Demonstrated ability to design custom performance analysis tooling when needed.
Skills
- Linux systems and performance analysis tools (perf, eBPF, ftrace)
- Systems programming and multithreaded application optimization in C++
- Hardware-specific acceleration and GPU programming (CUDA, TensorRT)
- Performance tuning across CPU, GPU, memory, I/O, and network subsystems
- Real-time Linux patches (PREEMPT_RT) and deterministic scheduling
- IPC optimization, low-latency pipelines, and performance KPIs
- Strong analytical, statistical, and benchmarking skills
- Excellent communication and documentation skills