Fullstack Data Engineer

🕒 April 23

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Codvo.ai

Codvo.ai

51 - 200 employees

Founded 2019

🔒 Cybersecurity

☁️ SaaS

AI • Cybersecurity • SaaS

Codvo. ai is a technology company that specializes in delivering strategic enterprise solutions through advanced AI-driven innovation. They focus on transforming enterprise data into measurable value by helping businesses accelerate growth with custom AI implementations tailored to meet the specific challenges of various industries. Their extensive service offerings include AI/ML automation, application development, data analytics, cybersecurity, and digital transformation, ensuring that organizations can thrive in a rapidly evolving digital landscape.

📋 Description

• Design, build, and maintain Databricks data pipelines (ETL/ELT) for ingestion, transformation, and orchestration using Spark/Delta Lake/Databricks Workflows. • Operationalize machine learning models by building inference pipelines that invoke models authored by data scientists (batch or real-time), ensuring consistency between training and inference environments. • Ensure data reliability, quality, and observability through robust validation, monitoring, alerting, and automated recovery mechanisms. • Collaborate closely with data scientists to productionize models, manage model deployment lifecycles, and optimize inference performance and cost. • Implement best-practice DevOps/MLOps processes such as CI/CD for pipelines, model versioning, environment promotion, and infrastructure-as-code. • Optimize performance and cost across compute clusters, jobs, and storage layers. • Implement and manage the enterprise data catalog, including schema design, table ownership, lineage, governance, and documentation using Unity Catalog. • Experience with some Databricks infrastructure. • Experience with building BI dashboards and visualization. • Experience with coding agents and best practices (spec-driven development, etc.)

🎯 Requirements

• 8+ yrs experience • Databricks platform experience • Python development for data processing and ETL pipelines • Unity Catalog knowledge • AWS data services (S3, IAM, VPC, potentially Glue/Lambda) • Data lake/lakehouse architecture patterns • Dashboard building experience • RESTful API design and development (Flask, FastAPI, or similar) • Authentication/authorization patterns (OAuth, API keys, IAM roles) • Query optimization and performance tuning • PySpark optimization experience • ML/AI pipeline experience • Databricks AI/BI

Apply Now

Similar Jobs

🕒 April 23

Shyft6

201 - 500

👥 HR Tech

🎯 Recruiter

🤝 B2B

Data Engineer supporting large-scale Facets migration project in the healthcare sector. Responsible for data management, entry, and migration to ensure data accuracy and integrity across systems.

🕒 April 23

Quantiphi

1001 - 5000

🤖 Artificial Intelligence

🏢 Enterprise

📚 Education

As Senior Data Engineer at Quantiphi, design and build scalable data platforms and pipelines. Collaborate with teams to build modern ecosystems with a focus on healthcare data.

🕒 April 23

Guidehouse

10,000+ employees

Senior technical leader for enterprise data pipelines and metadata foundations at Palantir. Leading design, development, and governance within Palantir Foundry and integrated data platforms.

🕒 April 22

Imagine Pediatrics

51 - 200

🧘 Wellness

👥 B2C

Staff Data Engineer on a hybrid team at Imagine Pediatrics, defining data processes for clinical and operational analytics. Collaborating across engineering functions to enhance healthcare data management.

🕒 April 22

Arch Capital Group Ltd.

5001 - 10000

💸 Finance

🏢 Enterprise

Director, Data Architect designing and building enterprise data platforms for Arch Insurance Group to enhance analytics-driven decision-making. Collaborating with teams to deliver scalable data solutions.