Site Reliability Engineer

Job Title:Site Reliability Engineer

Department: Engineering

Location: Remote

Company: RhythmScience Inc.

Salary Range: 80k-150k

About Us

RhythmScience is revolutionizing virtual cardiac care by developing advanced remote monitoring solutions for hypertension, heart failure, and rhythm management. Our mission is to enhance patient outcomes by enabling seamless collaboration between clinicians and patients through cutting-edge healthcare technology.

Our Rhythm360® platform serves as a comprehensive gateway for cardiology care, integrating real-time remote monitoring, standardized reporting, and streamlined workflows. By leveraging implanted and external cardiac devices, we provide healthcare teams with actionable data to improve patient management. We are committed to ensuring that clinicians have the tools they need to make informed, timely decisions that can prevent hospitalizations and improve lives.

Why We Do It

We believe that healthcare should be proactive, not reactive. Research shows that continuous remote monitoring can reduce hospital visits and improve patient survival rates by over 50%. Our solutions ensure that cardiac patients receive timely care, minimizing risks and enhancing quality of life.

Learn more about us here: http://rhythm360.io

About the Role

We are looking for a Site Reliability Engineer (SRE) to build, maintain, and scale our infrastructure while ensuring high availability, security, and performance. In this role, you will be responsible for deploying, monitoring, and optimizing our cloud environments while working closely with our engineering teams to support mission-critical healthcare applications.

As an SRE at RhythmScience, you’ll play a pivotal role in ensuring our infrastructure remains secure, resilient, and scalable, while helping shape best practices in DevOps and automation.

Key Responsibilities

Infrastructure & Cloud Management

  • Configure and maintain cloud resources using AWS, Docker Swarm, and Terraform
  • Manage security controls, access permissions, and data protection policies
  • Monitor and optimize system performance, ensuring infrastructure scalability and efficiency

Reliability & Performance

  • Support production and development environments to ensure high availability
  • Implement fault-tolerant architectures and troubleshoot networking issues
  • Develop monitoring solutions to identify and resolve performance bottlenecks

Automation & Engineering

  • Develop and maintain automation scripts for deployment, scaling, and monitoring
  • Optimize ETLs and database performance (PostgreSQL)
  • Support test automation and continuous integration processes

Incident Response & Support

  • Participate in weekend rotational support for site uptime and issue resolution
  • Investigate system failures, security incidents, and service degradations
  • Work closely with engineering teams to implement proactive solutions

Required Qualifications

  • 4+ years of experience in SRE, DevOps, or IT infrastructure roles
  • Hands-on experience with AWS, Docker Swarm, and Terraform
  • Expertise in ETLs and database management (PostgreSQL)
  • Experience handling Protected Health Information (PHI) and securing healthcare data
  • Strong Linux & Windows Server administration skills
  • Ability to work asynchronously and communicate effectively

Preferred Qualifications

  • Experience with healthcare data feeds (HL7 Engines, Mirth, Rhapsody)
  • Proficiency in object-oriented programming (Python preferred)
  • Strong SQL and business intelligence tooling experience
  • Knowledge of IPSec and network security best practices

Compensation & Benefits

  • Paid Time Off & Holidays
  • Comprehensive Healthcare (medical, vision, dental)

As part of our commitment to a safe and secure workplace, all successful candidates will undergo a background and reference check as part of the hiring process.

Apply Now