Mid-Level Data Engineer

51 - 200 employees

💼 Consulting

🏥 Healthcare

📦 Logistics

Consulting • Healthcare • Logistics

Simple Technology Solutions is a HUBZone small business that specializes in IT modernization and digital experience for government operations. They focus on digitalizing government processes using cloud-native technologies and Agile practices to deliver full-stack digital solutions. The company emphasizes security, scalability, and interoperability in its enterprise approach. They work on enhancing cloud environments, migrating legacy IT systems, and promoting DevSecOps practices. Additionally, Simple Technology Solutions develops enterprise data management strategies using machine learning and AI, modernizes applications, and provides cloud contact center services. They primarily serve federal government agencies, particularly in law enforcement and public safety missions.

Mid-Level Data Engineer

Job not on LinkedIn

🕒 June 10

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

🚰 Data Engineer

Airflow

Amazon Redshift

Apache

AWS

ETL

Oracle

Postgres

PySpark

Python

Spark

SQL

Apply Now

Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Simple Technology Solutions

51 - 200 employees

💼 Consulting

🏥 Healthcare

📦 Logistics

Consulting • Healthcare • Logistics

📋 Description

• Develop new ETL pipelines and data ingestion processes alongside senior engineers using AWS Glue (Spark-based, PySpark), MWAA (Airflow), Lambda, and SNS • Integrate the agency's ETL Common Library into Glue jobs for standardized orchestration, error handling, metadata recording, and SNS notifications for all success and error job events • Ingest structured and semi-structured datasets (CSV, XML, JSON, Avro, pipe-delimited) into S3 landing, raw, and curated zones using Apache Iceberg tables • Configure static ETL metadata in the centralized PostgreSQL metadata store; ensure dynamic metadata records job status and timestamps for all key execution steps • Monitor assigned production jobs and participate in operations support rotations • Ensure ETL Load Reports are populated in real-time and ETL Gap Reports are updated on a weekly basis • Build and maintain materialized views and semantic layer objects in Trino and Athena to ensure optimized query performance and consistent business logic • Produce and maintain required documentation for each assigned dataset: Business Requirements, ETL Design Documents, Data Models, Data Dictionaries, Mapping Documents, Deployment Documents, O&M Guides, and ETL Test Plans • Write unit and integration tests achieving the 90% minimum code coverage threshold; complete security scans at least once per sprint • Deploy ETL resources using CloudFormation templates through the agency CICD pipeline • Support transition of ETL jobs from other agency teams and disaster recovery exercises

🎯 Requirements

• US Citizenship is required • Bachelor's Degree is required • minimum of 3-5 years' position related experience is required • Hands-on experience with Python (PEP 8), PySpark, and SQL for ETL pipeline development • Experience with AWS services including Glue, S3, MWAA (Airflow), Lambda, SNS, and SQS • Familiarity with Apache Iceberg, Parquet, and ORC file formats and S3 data lake zone concepts • Experience with PostgreSQL and basic familiarity with Redshift or Oracle • Familiarity with Trino or Athena for query and semantic layer development • Experience with CloudFormation, GitHub branching workflows, and CI/CD-integrated deployments • Ability to produce clear ETL documentation including data models (Mermaid format) and data dictionaries • Understanding of ETL metadata concepts including static and dynamic metadata, load reports, and gap reports • Experience in agile development environments with sprint-based delivery • Experience supporting IV&V and/or User Acceptance Testing (UAT) processes in a federal or technical program environment • Experience with automated testing frameworks; ability to write unit and integration tests achieving defined code coverage thresholds • Familiarity with FISMA, NIST 800-53, and OWASP ASVS Level 2 is a plus • Must be able to work 8am-5pm Eastern Time regardless of home location • Active federal public trust suitability determination or ability to obtain one required

🏖️ Benefits

• Flexible work arrangements • Continuous learning • Professional development • Special incentives for team members living in qualified HUBZones

Apply Now

Similar Jobs

Data Warehouse Developer

🕒 June 10

Texas Windstorm Insurance Association

201 - 500

💼 Consulting

📦 Logistics

🛡️ Insurance

Data Warehouse Developer leveraging expertise in Data Warehousing and Business Intelligence to design resilient data solutions for TWIA/TFPA. Collaborating across teams to transform data into meaningful insights for decision-making.

🇺🇸 United States – Remote

⏰ Full Time

🟠 Senior

🔴 Lead

🚰 Data Engineer

ETL

Guidewire

Informatica

SDLC

SSIS

Tableau

Senior Data Engineer

🕒 June 10

Samsara

1001 - 5000

📦 Logistics

🏗️ Construction

🏥 Healthcare

Senior Data Engineer developing scalable data pipelines for IoT systems at Samsara. Designing data models and collaborating with cross-functional teams to enhance data analysis efficiency.

🇺🇸 United States – Remote

💵 $119.6k - $201k / year

💰 Seed Round on 2014-08

⏰ Full Time

🟠 Senior

🚰 Data Engineer

🦅 H1B Visa Sponsor

Airflow

ETL

Python

Spark

SQL

Senior Manager, Data Engineering

🕒 June 10

AssistRx

501 - 1000

🏥 Healthcare

💼 Consulting

📦 Logistics

Senior Manager leading teams in data engineering for scalable data solutions at AssistRx. Engaging with stakeholders to ensure successful project delivery and team development.

🇺🇸 United States – Remote

⏰ Full Time

🟠 Senior

🚰 Data Engineer

🦅 H1B Visa Sponsor

AWS

Azure

Cloud

ETL

Hadoop

Informatica

Spark

SQL

Vault

Data Migration Analyst

🕒 June 10

NikoHealth

51 - 200

🏥 Healthcare

💼 Consulting

⚕️ Healthcare Insurance

Data Migration Analyst at NikoHealth overseeing transitions from legacy platforms into SaaS solutions. Ensure data accuracy and provide customer support during the migration process.

🇺🇸 United States – Remote

⏰ Full Time

🟢 Junior

🟡 Mid-level

🚰 Data Engineer

ETL

SQL

Lead Data Engineer

🕒 June 10

Egen

501 - 1000

💼 Consulting

📦 Logistics

📣 Marketing

Lead Data Engineer at Egen focusing on building and optimizing cloud-native data platforms on Google Cloud. Mentoring engineers and designing sustainable data architectures with a data-first mindset.

🇺🇸 United States – Remote

💵 $143.4k - $168.7k / year

⏰ Full Time

🟠 Senior

🚰 Data Engineer

🦅 H1B Visa Sponsor

Apache

BigQuery

Cloud

Spark

Terraform