Staff Data Engineer – RCM

October 30

Apply Now
Logo of Rula

Rula

Marketplace • B2C • Real Estate

Rula is a platform that connects individuals seeking home improvement services with skilled professionals in their area. By using Rula, customers can easily find, compare, and hire trusted professionals for various projects such as renovation, repairs, and maintenance. The platform aims to simplify the process of hiring contractors and ensures transparency and quality in the services provided.

201 - 500 employees

🏪 Marketplace

👥 B2C

🏠 Real Estate

📋 Description

• Oversee the design and implementation of a greenfield near real-time data platform, starting with micro-batching pipelines using Kafka to deliver critical operational reports and evolving into a scalable Apache Flink architecture for sub-second analytics. • Power real-time dashboards and insights that enable our providers, leadership, and operational teams to make data-driven decisions, ultimately improving patient outcomes. • Join a collaborative data team nested within the broader engineering organization, working closely with business analysts, product managers, and data experts to transform raw event streams into reliable, actionable reporting data. • Build fault-tolerant pipelines, ensure data accuracy, and optimize for low-latency delivery, laying the foundation for Rula’s near real-time data capabilities. • Own a strategic transition from micro-batching to a Flink-based streaming architecture, driving innovation in how we harness data to support our mission.

🎯 Requirements

• Data Pipeline Development (8+ yrs). Experience designing and maintaining scalable ETL/ELT pipelines for operational reporting using Kafka, Glue, dbt, Dagster, and Airflow. Leveraging Python and SQL for data transformation and quality checks, and working with Flink and Spark Streaming to build low-latency, near real-time pipelines. • Cloud Infrastructure & Data Warehousing (8+ yrs overall, 4+ yrs in AWS). Proficiency building and optimizing data pipelines using AWS services such as S3, Redshift, Glue, IAM, Kinesis, and EMR. Experience across GCP (BigQuery, Dataflow) and Azure (Synapse, Data Factory). Optimizing data warehouses (Redshift, Snowflake, BigQuery) and managing Data Lakes (S3, Delta Lake) for scalable, low-latency analytics. Ensuring cost efficiency, scalability, and compliance (CPRA, HIPAA) while supporting a migration toward Flink-based near real-time architecture. • Data Quality & Governance (8+ Years). Experience implementing scalable data validation, quality checks (e.g., deduplication, consistency), and error-handling mechanisms tailored for operational reporting pipelines, ensuring high-fidelity data for real-time dashboards and analytics. Proficiency in designing and enforcing data governance practices, including metadata management, lineage tracking for auditable reporting, and compliance with regulations like CPRA or HIPAA in Data Lake environments (e.g., AWS S3, Delta Lake). • Performance Optimization (3+ Years). Experience optimizing data pipelines, queries, and large-scale datasets for efficiency and scalability in operational reporting systems, with a focus on achieving low-latency delivery. Proficiency in tuning high-throughput streaming systems, including optimizing resource usage and implementing best practices for partitioning, caching, and indexing. • Security & Compliance (3+ Years). Experience implementing data security measures, including encryption, role-based access control (RBAC), and data masking, to protect sensitive data in operational reporting pipelines and Data Lakes (e.g., AWS S3, Delta Lake). Strong understanding of compliance standards such as HIPAA and CPRA, with hands-on expertise in applying these standards to streaming systems like Apache Kafka and Apache Flink. Demonstrated ability to ensure auditability and security in data workflows, supporting reliable and compliant near real-time analytics during the transition from micro-batching to a Flink-based architecture. • Collaboration & Communication (5+ Years). Strong ability to work cross-functionally with business analysts, product managers, leadership, and other stakeholders to define and deliver operational reporting requirements. Exceptional communication skills to translate complex technical concepts into clear, actionable insights for non-technical audiences. Proven adaptability to thrive in a fast-paced startup environment, collaborating effectively to support the rapid development and evolution of a near real-time data platform while aligning with Rula’s mission to improve mental health care outcomes.

🏖️ Benefits

• 100% remote work environment (US-based only): Working hours to support a healthy work-life balance, ensuring you can meet both professional and personal commitments • Attractive pay and benefits: Full transparency of pay ranges regardless of where you live in the United States • Comprehensive health benefits: Medical, dental, vision, life, disability, and FSA/HSA • 401(k) plan access: Start saving for your future • Generous time-off policies: Including 2 company-wide shutdown weeks each year for self-care (for most employees) • Paid parental leave: Available for all parents, including birthing, non-birthing, adopting, and fostering • Employee Assistance Program (EAP): Support for your mental and physical health • New hire home office stipend: Set up your workspace for success • Quarterly department stipend: Fund team-building activities or in-person gatherings • Wellness events and lunch & learns: Explore a variety of engaging topics • Community and employee resource groups: Participate in groups that celebrate employee identity and lived experiences, fostering a sense of community and belonging for all

Apply Now

Similar Jobs

October 28

ClimateWorks Foundation

51 - 200

🤲 Charity

🤝 Non-profit

Data Engineer responsible for building and managing data architecture and pipelines at ClimateWorks. Collaborating across teams to enhance data integration and governance strategies.

🇺🇸 United States – Remote

💵 $120k - $140k / year

⏰ Full Time

🟠 Senior

🔴 Lead

🚰 Data Engineer

🦅 H1B Visa Sponsor

October 28

Bestow

51 - 200

💳 Fintech

☁️ SaaS

Technical leader and strategic advisor for Bestow's data infrastructure, mentoring engineers and influencing decisions. Drive the vision for a scalable data platform in a leading technology company.

🇺🇸 United States – Remote

💵 $190k - $210k / year

⏰ Full Time

🔴 Lead

🚰 Data Engineer

October 28

McKesson

10,000+ employees

⚕️ Healthcare Insurance

💊 Pharmaceuticals

🧬 Biotechnology

Clinical Data Architect managing data architecture and clinical data integration for healthcare solutions. Collaborating with teams to ensure integrity and quality of ophthalmology data.

🇺🇸 United States – Remote

💵 $141k - $235k / year

⏰ Full Time

🟠 Senior

🔴 Lead

🚰 Data Engineer

🦅 H1B Visa Sponsor

October 26

Workiva

1001 - 5000

☁️ SaaS

💸 Finance

📋 Compliance

Director of Data Engineering managing Workiva’s multi-tenant data platform in the US and Europe. Driving strategy and execution of internal and external data products with measurable customer value.

🇺🇸 United States – Remote

💵 $177k - $284k / year

⏰ Full Time

🔴 Lead

🚰 Data Engineer

October 24

insightsoftware

1001 - 5000

💸 Finance

☁️ SaaS

🏢 Enterprise

Principal Data Engineer specializing in database architecture and optimization for insightsoftware. Leading modernization efforts for database systems to enhance performance and integration.

🇺🇸 United States – Remote

💰 Private Equity Round on 2021-07

⏰ Full Time

🔴 Lead

🚰 Data Engineer

🦅 H1B Visa Sponsor

Developed by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com