TD Bank Group is looking for a Site Reliability Engineer in None – Apply Here!

Deal Score0
Deal Score0

TD Bank, America’s Most Convenient Bank, is one of the 10 largest banks in the U.S., providing more than 8 million customers with a full range of retail, small business and commercial banking products and services at approximately 1,300 convenient locations throughout the Northeast, Mid-Atlantic, Metro D.C., the Carolinas and Florida. In addition, TD Bank and its subsidiaries offer customized private banking and wealth management services through TD Wealth®, and vehicle financing and dealer commercial services through TD Auto Finance. TD Bank is headquartered in Cherry Hill, N.J.

TD Bank, America’s Most Convenient Bank, is a member of TD Bank Group and a subsidiary of The Toronto-Dominion Bank of Toronto, Canada, a top 10 financial services company in North America. The Toronto-Dominion Bank trades on the New York and Toronto stock exchanges under the ticker symbol TD .

Department Overview

Job Profile Summary

The Site Reliability Engineer provides technical leadership and integrated guidance across business, product and technology teams/partners to improve the design and operation of systems, making them secure, stable, scalable, fault tolerant, resilient, observable and efficient while ensuring performance and high availability.

The role sets the direction and influences the development and implementation of production systems and services to address emerging business needs and resiliency strategies while advancing the overall design architecture and technology capabilities in accordance with technology standards, and industry developments. SREs considers the performance, resiliency, fault tolerance and stability of production systems their primary focus, yet at the same time is committed to designing scalable and operational improvement through the application of software engineering practices.

Job Description

Depth & Scope:
• Expert Site Reliability Engineering role with comprehensive expertise in leading-edge theories, engineering practices, extensive coding and scripting
• Advanced and highly specialized knowledge of TD applications, systems, networks, innovation models, design activities, best practices, business/organization, Bank standards, and may fulfill a governance role
• Engineering specialist assigned to work autonomously on high profile, complex and/or high-risk technology initiatives with significant impact to the organization
• Provides technical leadership/consulting/direction to multiple businesses and product teams, growing capability across the organization
• Resolves unique and complex problems that have a broad impact on the business
• Authoritative expert on site reliability issues within area of specialization
• Understands the journey of an enterprise transformation where there is a hybrid cloud/non-cloud operating model.
• Drives end/end accountability of products and services across the enterprise through collaboration and transparency
• Primarily works at the product umbrella, segment, LOB level
• Typically reports to the Site Reliability practice executive

• Must be eligible for employment under regulatory standards applicable to the position.

Customer Accountabilities:
• Provides technical leadership to improve the design and operation of systems in alignment reliability engineering best practices and overall Technology and Bank strategies, applying the practices of computer science and software engineering to the design and development of large, complex systems
• Drives and influences integrated DevOps solutions across business, product, platform, infrastructure, development, support/DevOps teams that improve the design and operation of systems, making them scalable, reliable, and efficient while ensuring performance and high availability of products/services
• Ensures availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of products/service(s) including enterprise systems that may serve multiple services and applications/segments
• Influences and partners with key technology and product team members in the design and development of solutions that promote automation, innovation and the elimination of toil; identify optimal ways to improve the design and operation of systems to make them more scalable, more reliable, and more efficient and have the ability to implement the required changes
• Defines and prioritizes problems to solve with applications/products/services and respective systems and drives the resolution/remediation across technology areas
• Balances engineering and development priorities, providing expertise on automating the systems of their respective services and/or applications, and coding complex fixes and solutions in response to a major issue, toil or a new product/service feature
• Has ownership of the strategic planning for capacity and its provisioning activities
• Develops deep relationships with Product Owners, Ops, Tech Leads and Executives to build transparency and help foster end/end accountability of products and services
• Works in close partnership with technology teams to support TD’s business objectives and operational support goals providing domain expertise on strategic Infrastructure as well as Business project related activities (including both Change the Bank and Run the Bank programs)
• Engages executive stakeholders appropriately to review progress and obtain input, validation and approval of key decisions
• Anticipates client needs to identify appropriate solutions and to influence the development of innovative solutions
• Shareholder Accountabilities:
• Ensures adherence of Operational (Production) Readiness practices of respective products and services
• Sets service-level objectives (SLO) that defines availability of a particular product or service and exercise key decision rights of the SRE role (eg supporting release to production, rejecting software that is operationally substandard and directing developers to improve the code etc.)
• Implements the observability requirements to monitor and assure that our systems measure to the expected service levels and perform with the appropriate operational characteristics
• Focuses on reliability, scalability, and the development of the production computing infrastructure; including highly complex and scalable systems
• Develops observability standards to ensure that production systems operate under known conditions and transparently provides these measurements to anticipate when errors or failures can arise.
• Engineers solutions through problem post-mortem reviews to ensure that problem solutions are complete and that errors will not manifest again.
• Anticipates internal and external business challenges, helping teams find solutions through continuously improving on process and technologies
• Leads interaction with governance and control groups, (eg regulatory/operational risk, compliance and audit) to provide subject matter expertise and consult on risk issues related to Engineering technology and tools


Education & Experience:
• University degree in Computer Science or related technical field involving systems engineering or equivalent practical experience.
• 10+ years of engineering experience (eg Software or platform)




At TD, we are committed to fostering an inclusive, accessible environment, where all employees and customers feel valued, respected and supported. We are dedicated to building a workforce that reflects the diversity of our customers and communities in which we live in and serve, and creating an environment where every employee has the opportunity to reach their potential.

Apply Here

We will be happy to hear your thoughts

      Leave a reply

      Tech Jobs Here

      Get Alerts on the Latest Job Posts in your Inbox- Daily!




      We will not spam you. Don't forget to add us to your contacts!