Company Description
Devexperts has been working for nearly two decades consulting and developing for the financial industry. We solve complex technological challenges facing the most well-respected financial institutions worldwide.
By becoming a part of Devexperts, you’ll become a part of a company that fosters self-improvement and actively seeks out-of-the-box ideas.
Our teams work together to create the next generation of financial software solutions. We welcome all candidates who believe, as we do, that innovation is grounded in education.
Job Description
We are looking for a Senior Site Reliability Engineer (SRE) to fill the open position in a team that develops and supports proprietary trading platforms for large scale clients.
You will help the existing team to ensure access to various markets to end users from a lot of countries. You will be responsible for maintaining availability, automating release / deploy process, seamless monitoring, and alerting of all the solutions.
- work closely with developers for prototyping, and designing new features as part of the infrastructure
- deploy, install, configure and maintain sophisticated Trading / Finance and related software
- configure bare metal & сloud instances by using Infrastructure as Code
- make key decisions for scalability, reliability and accessibility
- install and manage in-house developed and external well-known monitoring systems
- design, deploy and configure cloud-based servers and networks provision servers and storage, configure firewalls, VPN, monitoring, etc.
- administrate UNIX / Cloud infrastructure installation, configuration and maintenance
- work with the Nexus and GIT repositories
Qualifications
- 5+ years of experience in UNIX / Linux administration
- 5+ years of experience in Networking
- experience as an SRE or DevOps
- strong experience with OS-level administration on Linux and / or UNIX
- hands-on scripting experience with Bash, Python, and / or Groovy
- experience with configuring TeamCity CI / CD pipelines
- IAAS solutions using Ansible (AWX), Terraform
- experience with Docker containers orchestrating (K8S / OpenShift / Hashicorp)
- know how to read and analyze errors
- in-depth knowledge of TCP / IP and ISO / OSI stack
- experience with monitoring and logging tools (Zabbix, Elasticsearch, or OpenSearch, Grafana, Kibana, Dynatrace, Prometheus, etc.)
- experience in working with Apache, Nginx, HAproxy, Envoy, etc
- strong ability to solve problems using code and scripting
- understanding of ITIL processes and routines
- Excellent English (written and verbal)