Senior Autonomy Data Engineer

Job not on LinkedIn

🔥 0 minutes ago

⚔️ Virginia – Remote

info

💵 $160.8k - $193k / year

⏰ Full Time

🟠 Senior

🚰 Data Engineer

🦅 H1B Visa Sponsor

info
Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Torc Robotics

Torc Robotics

501 - 1000 employees

Founded 2007

🚗 Transport

🔧 Hardware

🤖 Artificial Intelligence

Transport • Hardware • Artificial Intelligence

Torc Robotics is an innovative company focused on commercializing self-driving trucks for long-haul freight transportation. As an independent subsidiary of Daimler Truck, the company is developing autonomous technology, primarily focusing on the Freightliner Cascadia. Torc is committed to safe transportation, continuously improving its solutions through rigorous testing and integration of industry-leading sensors. It collaborates with fleet management companies to deploy real-world autonomous solutions, aiming to lead the industry in autonomous trucking.

📋 Description

• Own the design and organization of the program’s data lake, including schema definitions, partitioning strategy and metadata indexing. • Design and maintain end-to-end pipelines that ingest high-bandwidth sensor logs from vehicles into cloud storage with high reliability and tolerant of ad-hoc and intermittent connectivity mechanisms. • Develop data validation and integrity checks that can detect corrupted information, missing sensors, and inconsistent calibration prior to the data being processed by downstream systems. • Implement retention, tiering and lifecycle policies for data to balance storage costs with development value. • Build tooling to query raw logs to produce curated training and evaluation datasets. • Build automation to run cost-effective pseudo-labeling workflows at the scale of data ingest. • Implement data quality and model performance metrics that are used to direct labeling effort toward the highest-value examples. • Deploy and maintain data visualization tooling to support log review, annotation QA, and autonomy debugging workflows. • Build integrations between the visualization tooling and the data lake so engineers can navigate from a dataset entry or model failure directly to the origin log data • Work with autonomy engineers to define and surface custom visualization panels and implement metrics for analyzing unstructured operating environments. • Build dashboards that provide the autonomy engineers visibility into data coverage by terrain type, operating environment and geographic region. • Establish and document data contracts between the data services and model training consumers. • Partner with perception, planning and embedded engineers across the data lifecycle: from shaping the logging schemas and collection triggers to defining the dataset interfaces that supply model training and evaluation. • Define data engineering standards, best practices, and tooling choices for an innovative and fast-paced team. • Contribute to the data roadmap and provide input to technical leadership on investment priorities. • Mentor junior engineers and raise the team’s capabilities in data infrastructure scalability and operational hygiene.

🎯 Requirements

• Bachelor’s degree in Computer Science, Computer Engineering, Software Engineering, Electrical Engineering or a related field with 6+ years of data engineering experience or a Master’s with 4+ years. • Strong proficiency in Python and SQL, with demonstrated ability to build production-quality data pipelines • Deep experience with cloud data infrastructure (AWS preferred: S3, Glue Athena, redshift, or equivalent) and infrastructure-as-code tools (Terraform, Cloud Formation). • Solid understanding of data partitioning strategies and columnar storage formats (Parquet, Orc, etc.) • Experience building and operating data pipelines that process time-series and binary data. • Proven ability to evaluate and integrate open-source tooling when appropriate versus building from scratch. • Strong instincts for delivering data quality through first-class implementations of monitoring, validation and lineage tracking.

🏖️ Benefits

• A competitive compensation package that includes a bonus component and stock options • 100% paid medical, dental, and vision premiums for full-time employees • 401K plan with a 6% employer match • Flexibility in schedule and generous paid vacation (available immediately after start date) • Company-wide holiday office closures • AD+D and Life Insurance

Apply Now

Similar Jobs

🔥 13 minutes ago

SS&C Technologies

10,000+ employees

🏦 Banking

💳 Fintech

Data Migration Specialist directing data migration projects at SS&C for Intralinks. Collaborating with customers to analyze and oversee data transitions into Intralinks ecosystem.

🔥 43 minutes ago

Samsara

1001 - 5000

🏢 Enterprise

🚗 Transport

🔐 Security

Senior Software Engineer designing and operating high-scale data infrastructures at Samsara. Collaborating with cross-functional teams to drive AI and analytics roadmap.

🔥 4 hours ago

DICK'S Sporting Goods

10,000+ employees

🛒 Retail

⚽ Sports

🛍️ eCommerce

Senior Data Engineer building and evolving data pipelines for personalized customer experiences at DICK'S Sporting Goods. Collaborating with teams to ensure data integrity and hygiene.

🔥 5 hours ago

Blue Orange Digital

51 - 200

🤖 Artificial Intelligence

🤝 B2B

🏢 Enterprise

Data Engineer designing and implementing scalable data solutions for Blue Orange Digital, a boutique data & AI consultancy. Working with clients across Private Equity, Financial Services, Healthcare, and Retail.

🇺🇸 United States – Remote

💵 $7.4k / month

💰 $700k Corporate round on 2022-05

⏰ Full Time

🟡 Mid-level

🟠 Senior

🚰 Data Engineer

🔥 5 hours ago

eSimplicity

51 - 200

⚕️ Healthcare Insurance

📡 Telecommunications

🤖 Artificial Intelligence

Senior Data Engineer designing, developing, and maintaining scalable data pipelines for Medicaid program at eSimplicity. Building data workflows and implementing data quality measures in a remote role.