Site Reliability Engineer

Authentic8
Portland, Oregon, US
Full-time
We are sorry. The job offer you are looking for is no longer available.

We are a leading cybersecurity company with multiple offices (San Francisco and Redwood City, CA; Herndon, VA; and Washington, D.

C.) working to fundamentally disrupt the way organizations deliver and access the web, as well as conduct digital investigations.

The world’s most at-risk organizations rely on Authentic8 to completely eliminate the risk of using the web. More than 700 government agencies and commercial enterprises trust Authentic8’s 100% cloud-native, Silo Web Isolation Platform to protect their most at-risk data and missions.

We have an immediate need for multiple Site Reliability Engineers. The successful candidates will have a passion for automating day-to-day operations on the infrastructure.

We are embarking upon a significant scale expansion that requires a new low-touch platform to enable us to scale to our current and future growth rate.

Key Areas of Responsibility :

  • Manage all aspects of our production service : Google Compute Engine, Chef, Kubernetes, Docker, HAProxy, RDP, AWS, Gitlab, Active Directory.
  • Participate in the design and implementation of a low-touch infrastructure platform expansion to support current and future scale.
  • Participate in a tier one / two on-call rotation.
  • Automate common operator actions to minimize the need for human interaction to resolve common incidents.
  • Develop and implement new monitoring strategies to ensure awareness of any customer impacting incidents.
  • Collaborate with Eng / QA / Operations to support the effort for zero-downtime deployments.
  • Work closely with our Software Developer and QA teams as part of the SDLC to provide bug feedback into future releases.
  • Work closely with our Software Developer teams to generate self-service tools to assist in debugging / fixing broken builds and provide proactive visibility into the build / CI service.
  • Work closely with the QA team to integrate / leverage, as needed, existing QA test suites as part of the build pipeline.
  • Integrate our deployment tools (and collaborate to enhance them) into the pipeline to auto build sandbox environments for Engineering, Operations and QA.
  • Review, analyze, and recommend solutions and tools to improve the overall software development process.

Qualifications :

  • BS or MS in Computer Science or equivalent degree.
  • 1 year of industry experience as a Site Reliability Engineer or similar hands-on experience automating and managing all aspects of a large-scale production web service.
  • Excellent communication and interpersonal skills.
  • Excellent problem-solving and debugging skills.
  • UNIX or Linux system administration background.

Experience With :

  • Experience with configuration management systems (Chef, Ansible, etc.).
  • Competent scripting experience : Ruby, Python, and Shell.
  • Experience with containerization technology : Docker, Kubernetes, Amazon ECS, AKS.
  • Experience with on-prem and cloud-based monitoring and visualization : Icinga, Splunk, Grafana, ThousandEyes, Pingdom, New Relic, Datadog.
  • Experience with tools / frameworks : git, Gitlab, Django, Nginx, and Postgresql.
  • Experience with cloud services platforms : GCP, Azure, and AWS.
  • Experience building Windows, Mac OSX, Linux, and iOS packages.
  • Some background with virtualization technologies : VirtualBox, VMWare, or Citrix.

Authentic8 offers competitive benefits, including medical, dental and vision, flexible PTO, a 401k program and stock options.

It is the policy of Authentic8 to provide equal employment opportunity (EEO) to all persons regardless of age, color, national origin, citizenship status, physical or mental disability, race, religion, creed, gender, sex, sexual orientation, gender identity and / or expression, genetic information, marital status, status with regard to public assistance, veteran status, or any other characteristic protected by federal, state or local law.

Scroll down the page to see all associated job requirements, and any responsibilities successful candidates can expect.

J-18808-Ljbffr

Remote working / work at home options are available for this role.

3 days ago
Related jobs
Promoted
VirtualVocations
Portland, Oregon

A company is looking for a Senior Site Reliability Engineer to contribute to the operational success and growth of their cloud infrastructure. ...

Promoted
Open Systems Technologies
Portland, Oregon

Develop new tools and libraries for broader use by SaaS Operations and Engineering teams. Enable engineering teams and understand problems quicker. Assist engineering teams in deep troubleshooting and application code review to find opportunities to improve performance and scalability. Work with Eng...

Promoted
VirtualVocations
Portland, Oregon

A company is looking for a Lead Site Reliability Engineer (SRE) to ensure system reliability, scalability, and performance. ...

Promoted
CDK Global
Portland, Oregon

Software Engineer - (SRE - Site Reliability Engineer). Work with internal groups such as Product Engineering, Tools and QA to adopt SRE best practices. ...

Promoted
VirtualVocations
Portland, Oregon

A company is looking for a Staff Site Reliability Engineer - Incident Response. ...

Promoted
Cerbo
Portland, Oregon

As the Site Reliability Engineer (SRE), you will play a pivotal role managing the future of our technology. Site Reliability Engineering or similar role. You will work with our current SRE and engineering team to tune, optimize and enhance our Amazon Web Services Infrastructure. Collaborate with dev...

Promoted
Block
Portland, Oregon

As a Senior Staff Site Reliability Engineer at Block, you will be a key player in maintaining and improving the reliability of our systems. The blocks that form our foundational teams — People, Finance, Counsel, Hardware, Information Security, Platform Infrastructure Engineering, and more — provide ...

Token Metrics
Beaverton, Oregon
Remote

Candidate should possess extensive experience in administration including system administration for cloud infrastructure (AWS primarily and knowledge of multi-cloud infrastructure), process automation, site reliability and the ability to optimize the performance of our IT infrastructure. ...

Square
Portland, Oregon

As a Senior Staff Site Reliability Engineer at Block, you will be a key player in maintaining and improving the reliability of our systems. The blocks that form our foundational teams — People, Finance, Counsel, Hardware, Information Security, Platform Infrastructure Engineering, and more — provide ...

Token Metrics
Beaverton, Oregon
Remote

Candidate should possess extensive experience in administration including system administration for cloud infrastructure (AWS primarily and knowledge of multi-cloud infrastructure), process automation, site reliability and the ability to optimize the performance of our IT infrastructure. ...