The Site Reliability Engineer will be joining a team responsible for developing and maintaining tools, alerts, and dashboards to support the Technical Operations team in monitoring application health and performance.
The engineer should be familiar with monitoring tools such as Splunk, AppDynamics, Dynatrace, AWS CloudWatch or similar.
The engineer will also be responsible for implementing process improvements that strengthen site reliability and incident response.
- Provide analysis of application performance and user behavior to support design, architecture and operations decisions.
- Create and maintain alerts, dashboards and reports using Dynatrace, Splunk, AWS CloudWatch and other monitoring tools.
- Collaborate with Technical Operations, Technical Architecture and Development teams to develop improved logging and monitoring practices.
- Build, improve and maintain tools to support the Technical Operations Teams.
Minimum Qualifications
- Bachelor’s Degree in Information Technology, Computer Science or a related field or equivalent relevant experience.
- 4-6 years of experience in information technology, systems administration or another IT-related field.
Other Job Specific Skills
- Bash, Python or other scripting languages.
- Familiarity with operations monitoring tools such as Dynatrace, Splunk, AppDynamics or AWS CloudWatch.
- Understanding of website architecture and performance.
- Knowledge of Agile Framework.
- AWS Certification is a plus.
- Exceptional customer service skills.
Site Reliability Engineer
The Site Reliability Engineer is responsible for supporting the core infrastructure including compute, cloud, identity, and application support. ...
Site Reliability Engineer
Collaborate with Service Management team to ensure production systems are engineered to cost-effectively meet established SLAs/SLOs. Experience applying modern software engineering best practices to the management of IT infrastructure and platforms. ...
Senior Software Engineer, Site Reliability Engineer (Remote)
Software Engineer, you will be part of a dynamic team with engineers of all experience levels who help each other build and grow technical and leadership skills while creating, deploying, and supporting production applications. Software Engineers may be involved in product and tool selection, config...
Licensed Civil Engineer - Site Design (Remote)
As a Licensed Civil Engineer - Site Design with our Dallas or Fort Worth office, you will perform project management duties on small to medium sized projects, prepare planning and design documents, and process design calculations. Olsson provides multidisciplinary design services for mixed...
Tech Ops-Site Reliability Engineer - 30264
Learn more about Splunk careers and how you can become a part of our journey! Role: Splunk is looking for a TechOps Engineer with the ability to provide day-to-day technical expertise for our Splunk Cloud Azure TechOps team and the Splunk organization. As a TechOps Engineer, you will be interfacing with...
Project Civil Engineer - Site Design (Remote)
As a Civil Project Engineer in either our Dallas or Fort Worth office, you will apply diversified knowledge of engineering principles and practices to a broad variety of assignments and related fields. The project engineer is a registered professional engineer, whose supervision and guidance relate ...
Senior Staff Engineer- Observability and Reliability Platform Engineering (REMOTE)
Our Staff Engineer works with our Sr Staff Engineer and Sr. GEICO is seeking an experienced Staff Engineer with a passion for building high-performance, low maintenance, zero-downtime platforms, and applications. You will help drive our insurance business transformation as we transition from a tradi...
Tech Ops-Site Reliability Engineer - 29523
Learn more about Splunk careers and how you can become a part of our journey! Role: Splunk is looking for a TechOps Engineer with the ability to provide day-to-day technical expertise for our Splunk Cloud Azure TechOps team and the Splunk organization. As a TechOps Engineer, you will be interfacing wi...
Kafka Site Reliability Engineer
Job Title: Kafka Site Reliability Engineer. Location: Hybrid - Austin, TX (LOCALS ONLY). Experience: 14+ years. What are the top 3 skills required for thi...