Site Reliability Engineer (SRE)

Devexperts
Jersey City, US
Full-time

Company Description

Devexperts has spent nearly two decades consulting and developing software for the financial industry. We solve complex technological challenges facing the most well-respected financial institutions worldwide.

By joining Devexperts, you’ll become part of a company that fosters self-improvement and actively seeks out-of-the-box ideas.

Our teams work together to create the next generation of financial software solutions. We welcome all candidates who believe, as we do, that innovation is grounded in education.

Job Description

We are looking for a Senior Site Reliability Engineer (SRE) to join a team that develops and supports proprietary trading platforms for large-scale clients.

You will help the existing team ensure that end users in many countries have access to various markets. You will be responsible for maintaining availability, automating the release and deployment process, and providing seamless monitoring and alerting for all solutions.

  • work closely with developers on prototyping and designing new features as part of the infrastructure
  • deploy, install, configure and maintain sophisticated Trading / Finance and related software
  • configure bare-metal and cloud instances using Infrastructure as Code
  • make key decisions for scalability, reliability and accessibility
  • install and manage in-house developed and external well-known monitoring systems
  • design, deploy and configure cloud-based servers and networks: provision servers and storage, configure firewalls, VPN, monitoring, etc.
  • administer UNIX / Cloud infrastructure: installation, configuration and maintenance
  • work with Nexus and Git repositories

Qualifications

  • 5+ years of experience in UNIX / Linux administration
  • 5+ years of experience in Networking
  • experience as an SRE or DevOps engineer
  • strong experience with OS-level administration on Linux and / or UNIX
  • hands-on scripting experience with Bash, Python, and / or Groovy
  • experience with configuring TeamCity CI / CD pipelines
  • experience with IaaS solutions using Ansible (AWX) and Terraform
  • experience with Docker container orchestration (K8s / OpenShift / HashiCorp)
  • ability to read and analyze errors
  • in-depth knowledge of TCP / IP and ISO / OSI stack
  • experience with monitoring and logging tools (Zabbix, Elasticsearch or OpenSearch, Grafana, Kibana, Dynatrace, Prometheus, etc.)
  • experience working with Apache, Nginx, HAProxy, Envoy, etc.
  • strong ability to solve problems using code and scripting
  • understanding of ITIL processes and routines
  • excellent English (written and verbal)