Are you passionate about driving innovation in quality engineering and eager to work with cutting-edge technologies? At Cigniti, a leading force in the quality engineering space and a proud member of the Coforge family, we’re not just shaping the future; we’re defining it.
If you're ready to elevate your career and make a significant impact on global enterprises, this is your opportunity.
About the Role
We are seeking a highly skilled and experienced Senior Chaos Engineering / Performance Testing Lead with expertise in chaos engineering, resilience testing, and chaos engineering tools (Gremlin, Chaos Native, Litmus, etc.) to join our dynamic team. The successful candidate will be responsible for designing and developing automated / continuous chaos engineering experiments, and for implementing and leading execution of the chaos engineering lifecycle.
Responsibilities and Requirements
- Overall IT experience of 12+ years
- Experience in performance testing, with knowledge of Java development
- 4 years of relevant experience in chaos engineering / resilience / high availability testing is a must
- Implement and lead execution of the chaos engineering lifecycle: chaos test planning, chaos test design, and reporting
- Ensure recovery and resilience testing is scheduled, staffed, executed, and documented, including remediation and closure of issues
- Ability to analyze the architecture and identify weak areas that are prone to failures / outages
- Ability to work with business and technology teams to identify and report on resilience / high availability requirements
- Ability to work with enterprise architecture and development teams to architect applications for high availability and resiliency
- Design, develop and execute automated / continuous Chaos Engineering experiments
- Ability to troubleshoot failures in CI / CD pipelines
- Automate chaos experiments through chaos engineering tools (Gremlin / Chaos Native / Litmus, etc.) to run continuously
- Hands-on experience in Unix / Linux OS environments, including operating system internals, file systems, disk / storage, and networking protocols
- Strong knowledge of public cloud platforms (AWS, GCP, Azure)
- Java Development
- Knowledge of monitoring, alerting, and logging
- Knowledge of VPCs, proxies, load balancers, and availability zones
- Ability to diagnose and debug complex distributed systems
- Tools (any of these): Gremlin, Chaos Native, Litmus
- Strong leadership skills and ability to work in a cross-functional environment
- Strong interpersonal, oral, and written communication skills
- Strong analytical, organizational, and decision-making skills