Job Summary :
The individual in this position will join the Data Stores team in an exciting and fast-paced environment where they will help build and maintain highly available relational database systems operating across multiple cloud and on-premise environments.
These databases support many of the systems powering DE&ET’s digital media supply chain and consumer facing products.
We’re looking for an individual who can apply software engineering principles to database engineering and administration to build processes and automation to help deliver infrastructure faster, safer, more consistently, and with the right level of observability required to operate at scale.
Responsibilities and Duties of the Role :
- Responsible for building, deploying, and ensuring all DE&ET database infrastructure is available 24 / 7 / 365.
- Leverage software development and automation to design, modernize, and deliver database infrastructure.
- Participates in setting the architectural direction for database platforms and projects.
- Manage multiple competing priorities in a fast-paced, deadline-oriented environment.
- Analyze, design, and deploy fault-tolerant, distributed, and highly available database infrastructure.
- Proactively plan and implement infrastructure changes through capacity forecasting, software release cycles, and right sizing.
- Provide database expertise through performance tuning, troubleshooting and administration.
- Develop, enhance, and adhere to engineering and administration standards.
- Develop automation and tooling to increase operational efficiency while ensuring system reliability and security.
- Build infrastructure and systems for scalability, resiliency, availability, and recovery though infrastructure as code and configuration management.
- Provide relevant insights of data store infrastructure through metrics, monitoring, and alerting.
- Maintain thorough and well-written documentation.
- Participate in live event support and on-call rotation.
- May provide oversight and direction to junior team members.
- Builds relationships with engineering teams and leads.
Required Education, Experience / Skills / Training :
Basic Qualifications
- 5+ years of related work experience with Microsoft SQL Server, Amazon RDS for SQL Server, Azure SQL, and Azure SQL MI.
- Fundamental understanding of Microsoft SQL Server database internals.
- Experience working in Agile software development.
- Experience with source control management tools (Git, GitLab, GitHub).
- Intermediate to advanced level of expertise in one or more programming languages such as Python, Java, or Go.
- General understanding and experience with Windows operating system, network, and containers.
- Excellent verbal and written communication skills.
- Experience designing and deploying fault-tolerant, distributed, and highly available database infrastructure.
- Experience in database availability monitoring and status reporting using native monitoring tools.
- Well-versed in SQL Server backup, restore, and recovery strategies.
- Experience keeping a large environment compliant by deploying SQL Server patches and upgrades.
- Experience with disaster recovery planning and implementation.
- Carries out assignments with little coaching or guidance from others.
Preferred Qualifications
- Experience operating within a database reliability engineering (DRE) and / or systems reliability engineering (SRE) role.
- Experience running, deploying, and maintaining production systems in Azure and Amazon Web Services.
- Experience with infrastructure as code (Terraform, CloudFormation).
- Experience building a proper path to production leveraging multiple lifecycles, testing, integration, and CI / CD pipelines.
- Experience with the configuration and implementation of SQL Server Always On availability groups.
- Experience with configuration management (Ansible, Chef).
- Comfortable collaborating with cross-functional teams providing guidance in SQL Server best practices.
- Experience with proactively identifying problems / areas of improvement and designing creative solutions to successfully remediate issues.
Required Education
- Bachelor’s in computer science or related field and 5 + years relevant experience