Join our Talent Network
Skip to main content

Site Reliability Engineer, HAP Tech

Location: , United States
Date Posted:

Share:

Description

HAP Tech, a subgroup of BRG’s Healthcare Analytics practice (HAP), is one of the firm’s largest and fastest growing teams. This innovative group is currently looking for talented and dynamic professionals to join us as we continue to grow! HAP Tech supports and advises pharmaceutical manufacturers on how to navigate the challenges and complexities of the 340B program as well as other areas of the healthcare ecosystem. Our team is the established market leader in data and technology solutions for 340B-related issues and we support an impressive client base which includes the largest pharmaceutical manufacturers in the US as well as early-stage biotech companies. Beyond our syndicated solutions, we also integrate and synthesize data to deliver unparalleled analytics and insights into various aspects of the 340B program and the pharmaceutical supply chain.
 
The Site Reliability Engineer will provide skilled problem-solving measures to ensure the scalability, performance, and reliability of large-scale, cloud-based applications and infrastructure.   
 
Responsibilities
  • Provide operational support for full-stack software applications.
  • Collaborate with product, development, QA, and Operation teams to create, monitor, and troubleshoot the system infrastructure.
  • Increase system resilience with expert-level coding, bulletproof release, and change management skills.
  • Develop service-level indicators and objectives to automate release validation.
  • Improve automation and increase the system’s self-healing capability.
  • Collect operating system data and report performance metrics to stakeholders.
  • Improve reliability, quality, and time-to-market of our suite of software solutions.
  • Manage cloud and database system maintenance, debugging production issues as they arise.
  • Partner with security and product teams to define and publish policies, processes, and playbooks to facilitate rapid and effective handling of alerts and incidents.
Qualifications:
  • Bachelor’s degree in computer science or similar field.
  • Five years’ experience as a site reliability engineer or similar role.
  • Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.
  • Experience working with cloud-native infrastructure (Azure Cloud Services, AWS, or GCP).
  • Experience working with observability and incident management tools (Datadog, OpsGenie, Pagerduty).
  • Experience scripting operating system tasks with Infrastructure as Code.
  • Impeccable creative and communication skills.
  • Ability to problem solve in a fast-paced, high-stakes environment.
Candidate must be able to submit verification of his/her legal right to work in the United States, without company sponsorship. 
 
Salary Range: $140,000-$200,000 per year. 
 
#LI-JQ1
#LI-Remote
Share:

We look for highly motivated problem solvers who have strong analytical abilities and a desire to advance within their careers. Stay up to date on our career opportunities by joining our talent network.

Join our Talent Network