Site Reliability Engineer

Job Title: Site Reliability Engineer

Department: Engineering

Location: Remote

Company: RhythmScience Inc.

Salary Range: 80k-150k

About Us

RhythmScience is revolutionizing virtual cardiac care by developing advanced remote monitoring solutions for hypertension, heart failure, and rhythm management. Our mission is to enhance patient outcomes by enabling seamless collaboration between clinicians and patients through cutting-edge healthcare technology.

Our Rhythm360® platform serves as a comprehensive gateway for cardiology care, integrating real-time remote monitoring, standardized reporting, and streamlined workflows. By leveraging implanted and external cardiac devices, we provide healthcare teams with actionable data to improve patient management. We are committed to ensuring that clinicians have the tools they need to make informed, timely decisions that can prevent hospitalizations and improve lives.

Why We Do It

We believe that healthcare should be proactive, not reactive. Research shows that continuous remote monitoring can reduce hospital visits and improve patient survival rates by over 50%. Our solutions ensure that cardiac patients receive timely care, minimizing risks and enhancing quality of life.

Learn more about us here: http://rhythm360.io

About the Role

We are looking for a Site Reliability Engineer (SRE) to build, maintain, and scale our infrastructure while ensuring high availability, security, and performance. In this role, you will be responsible for deploying, monitoring, and optimizing our cloud environments while working closely with our engineering teams to support mission-critical healthcare applications.

As an SRE at RhythmScience, you’ll play a pivotal role in ensuring our infrastructure remains secure, resilient, and scalable, while helping shape best practices in DevOps and automation.

Key Responsibilities

Infrastructure & Cloud Management

Configure and maintain cloud resources using AWS, Docker Swarm, and Terraform
Manage security controls, access permissions, and data protection policies
Monitor and optimize system performance, ensuring infrastructure scalability and efficiency

Reliability & Performance

Support production and development environments to ensure high availability
Implement fault-tolerant architectures and troubleshoot networking issues
Develop monitoring solutions to identify and resolve performance bottlenecks

Automation & Engineering

Develop and maintain automation scripts for deployment, scaling, and monitoring
Optimize ETL pipelines and PostgreSQL database performance
Support test automation and continuous integration processes

Incident Response & Support

Participate in rotating weekend support for site uptime and issue resolution
Investigate system failures, security incidents, and service degradations
Work closely with engineering teams to implement proactive solutions

Required Qualifications

4+ years of experience in SRE, DevOps, or IT infrastructure roles
Hands-on experience with AWS, Docker Swarm, and Terraform
Expertise in ETL pipelines and PostgreSQL database management
Experience handling Protected Health Information (PHI) and securing healthcare data
Strong Linux & Windows Server administration skills
Ability to work asynchronously and communicate effectively

Preferred Qualifications

Experience with healthcare data feeds (HL7 Engines, Mirth, Rhapsody)
Proficiency in object-oriented programming (Python preferred)
Strong SQL and business intelligence tooling experience
Knowledge of IPsec and network security best practices

Compensation & Benefits

Paid Time Off & Holidays
Comprehensive Healthcare (medical, vision, dental)

As part of our commitment to a safe and secure workplace, all successful candidates will undergo a background and reference check as part of the hiring process.
