Data Engineer

Job not on LinkedIn

🕒 February 17

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Ripjar

Ripjar

51 - 200 employees

💸 Finance

📋 Compliance

🤖 Artificial Intelligence

Finance • Compliance • Artificial Intelligence

Ripjar is a company focused on countering financial crime through advanced data analytics and artificial intelligence solutions. Their Labyrinth platform offers services in name screening, adverse media screening, threat investigations, and data fusion, targeting industries such as banking, financial services, and gaming. The platform allows institutions to enhance their KYC and AML operations by automatically identifying risks from various data sources. Ripjar supports global companies and governments in detecting and responding to criminal behaviors such as money laundering, bribery, and fraud. Their solutions enable users to integrate and fuse structured and unstructured data, providing a comprehensive view of threats. By leveraging advanced automation and machine learning, Ripjar delivers real-time alerts and insights for threat intelligence and compliance.

📋 Description

• Engineer distributed ingestion services that reliably pull data from diverse sources, handle messy real-world edge cases, and deliver clean, well-structured outputs to multiple downstream products. • Build high-throughput processing components (batch and/or near-real-time) with a focus on performance, scalability, and predictable cost, using strong profiling and measurement practices. • Design and evolve data contracts (schemas, validation rules, versioning, backward compatibility) so downstream teams can build with confidence. • Own production quality: write maintainable code, strong unit/integration tests, and add the observability you need (metrics/logs/tracing) to diagnose issues quickly. • Improve platform reliability by hardening pipelines against partial failures, retries, rate limits, data drift, and infrastructure issues—then codify those learnings into better tooling and guardrails. • Contribute to CI/CD and developer experience: faster builds, better test signal, safer releases, and automated operational checks. • Participate in design reviews, code reviews, incident retrospectives, and iterative delivery—making pragmatic trade-offs and documenting them clearly.

🎯 Requirements

• 2+ years building and operating production software systems • Fluency in at least one programming language (Python/Node.js a plus) • Experience debugging moderately complex systems and improving reliability/performance • Strong fundamentals: data structures, testing, version control, Linux basics • Spark/PySpark experience • Hadoop ecosystem exposure (HDFS/HBase) • Workflow orchestration (Airflow/Dagster/NiFi) • Search/indexing (OpenSearch, MongoDB) • Kubernetes and infrastructure-as-code • Degree in Computer Science or numerical degree

🏖️ Benefits

• Competitive salary DOE • 25 days annual leave + your birthday off, in addition to bank holidays, rising to 30 days after 5 years of service. • Remote working • Private Family Healthcare. • 35 hour working week. • Employee Assistance Programme. • Company contributions to your pension. • Pension salary sacrifice. • Enhanced maternity/paternity pay. • The latest tech including a top of the range MacBook Pro.

Apply Now

Similar Jobs

🕒 October 21, 2025

Olive Jar Digital

11 - 50

☁️ SaaS

🤝 B2B

Data Engineer supporting MDM initiative and associated data governance activities. Implementing data pipelines, metadata management, and data quality solutions within Azure environments.

Azure

Cloud

PySpark

Python

SQL

🕒 October 3, 2025

Veeva Systems

1001 - 5000

☁️ SaaS

⚕️ Healthcare Insurance

💊 Pharmaceuticals

Data Engineer focusing on data pipelines for life sciences at Veeva, improving data processing and collaboration with data science teams.

Apache

AWS

Cloud

Google Cloud Platform

Java

PySpark

Python

Spark

🕒 August 19, 2025

Burq

11 - 50

🛍️ eCommerce

☁️ SaaS

Data Engineer at Burq designs scalable data pipelines powering analytics and AI capabilities; collaborates with product, ops, and engineering teams.

Airflow

Amazon Redshift

AWS

Azure

BigQuery

Cloud

ETL

Google Cloud Platform

IoT

Java

Kafka

Python

Scala

SQL

🕒 July 30, 2025

Kainos

1001 - 5000

Join Kainos as a Data Architect, providing guidance on data architecture to drive impactful solutions.