The Opportunity :
We, at Flywire, are looking for an experienced engineer to join the Site Reliability Engineering team in North America to help drive reliability, automation and performance in our cloud-based infrastructure.
At Flywire, the SRE team is responsible for the lifecycle of production systems and works across multiple levels. SREs are embedded in the development teams, enabling and empowering them to ship reliable and operable systems at full speed.
They also work at a global scale driving initiatives to achieve production excellence.
- Design, build and maintain core infrastructure pieces with a focus on availability, latency, performance, and capacity. Reduce toil and improve deployment timelines. (40%)
- Be embedded on a development team, support them with daily tasks, help drive towards production excellence and advocate for best practices. (40%)
- Be part of an on-call rotation. Debug production issues across services and levels of the stack and practice incident response and blameless postmortems. (15%)
- Engage and collaborate with other disciplines in the design, deployment, operation, and optimization of services. (5%)
What’s next in our team? :
- Push SRE practice to the next level
- Keep fast release pace whilst keeping the environment secure
- Migrate legacy custom deployments to managed cloud provider offerings (e.g., RDS)
- Keep pursuing full DevSecOps
Here’s What We’re Looking For :
- 5+ years of experience as an SRE or in a similar role. Experience as a Software Engineer or Systems Engineer is also valuable.
- We are aiming to build a multidisciplinary and balanced team based on "t-shaped" individuals. As such we are looking for people comfortable with the idea of being or becoming a generalizing specialist.
- Software engineering is an important part of our work; we actively use and support many different platforms and languages. Experience with at least one programming language is needed, and experience with testing techniques such as TDD or BDD will be highly valued.
- Being familiar with the container ecosystem, cloud infrastructure, build systems and CI / CD tools is key to being successful in this role.
- You will need to be comfortable taking ownership of complex systems challenges and helping uncover opportunities for improvement.
- In SRE we are enablers: we empower and encourage our colleagues, so you will need strong communication and collaboration skills and, most importantly, empathy.
- Strong preference for candidates located near our geo-clusters and hubs in the following locations : Boston, New York City, Portland, Charlotte, Chicago, Austin, Dallas, Minneapolis, Kansas City, FL, PA, RI, & TN
Bonus :
- Experience with PCI-compliant infrastructure (particularly in a modern CI / CD environment)
- Expertise with Infrastructure as Code tooling.
Technologies We Use :
- These are some of the technologies that we use, but we are always learning, experimenting and open to change :
- Ruby, Bash / Shell, Java, Kotlin, Go, Node, Python, PHP
- AWS : EC2, ECS, Lambda, CloudWatch, SQS, RDS, Kinesis, S3, Elasticsearch, DocumentDB
- Linux, Docker, Terraform, Make, Chef, Ansible
- GitLab, Jenkins (CI / CD)
- Sentry, Sumo Logic, New Relic, Grafana, OTEL (OpenTelemetry)
What We Offer :
- Competitive compensation, including Restricted Stock Units
- Employee Stock Purchase Plan (ESPP)
- Flying Start - Our immersive Global Induction Program (Meet our Execs & Global Teams)
- Work with brilliant people who will keep you on your toes; learn more about their journeys by checking out #InsideFlywire on social media
- Dynamic & Global Team (we have been collaborating virtually for years!)
- Wellbeing Programs (Mental Health, Wellness, Yoga / Pilates / HIIT Classes) with Global FlyMates
- Be a meaningful part of our success - every FlyMate makes an impact
- Competitive time off, including FlyBetter Days to volunteer for a cause you believe in and Digital Disconnect Days!
- Great Talent & Development Programs (Managers Taking Flight for new or aspiring managers!)