Senior Data Engineer

Job not on LinkedIn

🔥 13 minutes ago

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Simple Technology Solutions

Simple Technology Solutions

51 - 200 employees

🏛️ Government

🤖 Artificial Intelligence

Government • Cloud • Artificial Intelligence

Simple Technology Solutions is a HUBZone small business that specializes in IT modernization and digital experience for government operations. They focus on digitalizing government processes using cloud-native technologies and Agile practices to deliver full-stack digital solutions. The company emphasizes security, scalability, and interoperability in its enterprise approach. They work on enhancing cloud environments, migrating legacy IT systems, and promoting DevSecOps practices. Additionally, Simple Technology Solutions develops enterprise data management strategies using machine learning and AI, modernizes applications, and provides cloud contact center services. They primarily serve federal government agencies, particularly in law enforcement and public safety missions.

📋 Description

• design, build, and maintain well-architected ETL pipelines • work at enterprise scale and process terabytes of financial data • ensure data is timely, accurate, and stored within contractually required timelines • build and maintain ingestion pipelines using AWS Glue • process and ingest large-volume XML filings • store curated zone data in Apache Iceberg tables • prevent duplicate data loads across target destinations • integrate the agency's ETL Common Library into Glue jobs • ensure all ETL jobs populate ETL Load Reports in real-time • implement technical data quality standards • develop and maintain the semantic layer • deploy ETL resources using CloudFormation templates • create and maintain documentation suite for each dataset • support operationalization of statistical outputs and derived data products

🎯 Requirements

• US Citizenship is required • Bachelor's Degree is required • minimum of 6 years' position related experience is required • strong expertise in AWS Glue (Spark-based), PySpark, and Python (PEP 8) • direct experience building large-scale ETL pipelines on AWS • experience with Apache Iceberg, Parquet, ORC, and Avro file formats • experience with PostgreSQL, Redshift, and Oracle; familiarity with NoSQL, knowledge bases, and vector stores • experience with Trino, Athena, and Hive • experience parsing and ingesting large-volume XML datasets with schema evolution handling using PySpark • proficiency with CloudFormation, GitHub branching workflows, CI/CD pipelines • demonstrated ability to produce complete ETL documentation

🏖️ Benefits

• flexibility to help them thrive personally and professionally • collaboration, continuous learning, and excellence • special incentives for team members living in qualified HUBZones

Apply Now

Similar Jobs

🔥 57 minutes ago

Samsara

1001 - 5000

🏢 Enterprise

🚗 Transport

🔐 Security

Senior Data Engineer developing scalable data pipelines for IoT systems at Samsara. Designing data models and collaborating with cross-functional teams to enhance data analysis efficiency.

🔥 4 hours ago

AssistRx

501 - 1000

⚕️ Healthcare Insurance

💊 Pharmaceuticals

☁️ SaaS

Senior Manager leading teams in data engineering for scalable data solutions at AssistRx. Engaging with stakeholders to ensure successful project delivery and team development.

🔥 4 hours ago

Sardine

51 - 200

🔒 Cybersecurity

📋 Compliance

💳 Fintech

Data Engineer building and owning internal data infrastructure for analytics at Sardine. Integrating systems into a scalable data warehouse to drive decision-making and insights.

🔥 13 hours ago

Ochsner Health

10,000+ employees

⚕️ Healthcare Insurance

🤝 Non-profit

📚 Education

Storage and Data Engineer leading enterprise storage migrations to AWS, Azure, and Rackspace. Ensuring smooth transitions into steady state run operations with strong resilience and compliance.

🔥 13 hours ago

Ad Hoc LLC

501 - 1000

🏛️ Government

🤖 Artificial Intelligence

🔌 API

Senior Data Architect at Ad Hoc collaborating on federal digital services. Leading data architecture strategy and guiding teams in complex data migrations and cloud solutions.