Staff Site Reliability Engineer (US)

TD Bank, N.A.
5900 North Andrews Avenue
Full-time

Description

Site reliability engineering, or SRE, is the intersection of software engineering and systems operations. It's about designing, building, and maintaining systems, services, and applications.

It's more than just keeping the lights on: it's proactively identifying potential issues and implementing solutions to prevent them from happening in the first place.

At TD, SRE ensures that our services, both internally critical and externally visible systems, have the reliability and uptime needed to deliver our legendary customer experience.

In this role, you'll be challenged to think ahead and find ways to prevent the unexpected. This is a thrilling and challenging field that requires a combination of technical skills, creativity and problem-solving.

Be on the front line, ensuring that millions of customers can access the information and services they need, when they need them.

This is an opportunity to make a real-world impact. Join us for this truly exciting journey.

Responsibilities

Identify optimal ways to improve the design and operation of systems to make them more scalable, more reliable, and more efficient.

The candidate should also be able to implement the required changes.

  • Work within product teams to support TD's business objectives and operational support goals, providing domain expertise on strategic infrastructure as well as business project-related activities.
  • Review technical deliverables throughout the design and development phase to ensure systems adhere to SRE best practices.
  • Lead the definition and implementation of service-level objectives (SLOs) for key technical and business-driven measures.
  • Influence and partner with key technology and product team members in the design and development of solutions that promote automation, innovation, and the reduction of toil.
  • Guide and educate the technology organization around SRE practices and the need for scalable and resilient production systems.
  • Write / contribute to existing documentation or educational content and adapt content based on product / program updates and user feedback.
  • Actively participate in, or lead, design and code reviews with peers and stakeholders to examine system components for resiliency issues.

Depth & Scope:

  • Expert Site Reliability Engineering role with comprehensive expertise in leading-edge theories, engineering practices, and extensive coding and scripting
  • Advanced and highly specialized knowledge of applications, systems, networks, innovation models, design activities, best practices, business / organization, and Bank standards; may fulfill a governance role
  • Engineering specialist assigned to work autonomously on high profile, complex and / or high-risk technology initiatives with significant impact to the organization
  • Provides technical leadership / consulting / direction to multiple businesses and product teams, growing capability across the organization
  • Resolves unique and complex problems that have a broad impact on the business
  • Authoritative expert on site reliability issues within area of specialization
  • Understands the journey of an enterprise transformation where there is a hybrid cloud / non-cloud operating model.
  • Drives end-to-end accountability of products and services across the enterprise through collaboration and transparency
  • Primarily works at the product umbrella, segment, LOB or Product Family level

Education & Experience:

  • University degree in Computer Science or related technical field involving systems engineering or equivalent practical experience.
  • 10+ years of engineering experience (e.g., software or platform)

Preferred Qualifications:

  • Several years of experience leading projects and designing, analyzing, and troubleshooting distributed systems.
  • Ability to program in Java, JavaScript, or .NET, or in other modern programming languages such as Python.
  • SRE mindset: blameless culture, data-driven decision-making, endless curiosity, continuous improvement, bias for action, collaboration, and partnership.
  • Strong leadership and communication skills; ability to influence without authority.
  • Innovative and comfortable working with unknowns, change advocate.
  • Performance engineering knowledge and Chaos Engineering principles.
  • System administration experience is an asset.
  • Tools: experience with Git, Splunk, Datadog, and Dynatrace.

Who We Are:

TD is one of the world's leading global financial institutions and is the fifth largest bank in North America by branches / stores.

Every day, we deliver legendary customer experiences to over 27 million households and businesses in Canada, the United States and around the world.

More than 95,000 TD colleagues bring their skills, talent, and creativity to the Bank, those we serve, and the economies we support.

We are guided by our vision to Be the Better Bank and our purpose to enrich the lives of our customers, communities and colleagues.

TD is deeply committed to being a leader in customer experience. That is why we believe that all colleagues, no matter where they work, are customer facing.

As we build our business and deliver on our strategy, we are innovating to enhance the customer experience and build capabilities to shape the future of banking.

Whether you’ve got years of banking experience or are just starting your career in financial services, we can help you realize your potential.

From regular leadership and development conversations to mentorship and training programs, we’re here to support you as you work toward your goals.

As an organization, we keep growing and so will you.

Our Total Rewards Package

Our Total Rewards package reflects the investments we make in our colleagues to help them and their families achieve their financial, physical and mental well-being goals.

Total Rewards at TD includes base salary and variable compensation / incentive awards (e.g., eligibility for cash and / or equity incentive awards, generally through participation in an incentive plan) and several other key plans such as health and well-being benefits, savings and retirement programs, paid time off (including Vacation PTO, Flex PTO, and Holiday PTO), banking benefits and discounts, career development, and reward and recognition.

Additional Information:

We’re delighted that you’re considering building a career with TD. Through regular development conversations, training programs, and a competitive benefits plan, we’re committed to providing the support our colleagues need to thrive both at work and at home.

Colleague Development

If you’re interested in a specific career path or are looking to build certain skills, we want to help you succeed. You’ll have regular career, development, and performance conversations with your manager, as well as access to an online learning platform and a variety of mentoring programs to help you unlock future opportunities.

Whether you have a passion for helping customers and want to expand your experience, or you want to coach and inspire your colleagues, there are many different career paths within our organization at TD and we’re committed to helping you identify opportunities that support your goals.

Training & Onboarding

We will provide training and onboarding sessions to ensure that you’ve got everything you need to succeed in your new role.

Interview Process

We’ll reach out to candidates of interest to schedule an interview. We do our best to communicate outcomes to all applicants by email or phone call.

Accommodation

If you are an applicant with a disability and need accommodations to complete the application process, email the TD Bank US Workplace Accommodations Program at .

Include your full name, best way to reach you, and the accommodation needed to assist you with the application process.

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.

12 hours ago