Senior Staff Engineer, Lustre

DDN

1001 - 5000 employees

Founded 1998

🤖 Artificial Intelligence

💰 $10M Funding Round on 2011-06

Artificial Intelligence • Data Center and Cloud Computing • High Performance Computing

DDN is a global leader in AI data intelligence solutions, providing high-performance computing and sophisticated data management technologies. With a focus on accelerating AI deployments and advanced data analytics, DDN's products, including the Data Intelligence Platform and advanced storage systems, serve diverse sectors such as healthcare, financial services, and government. DDN is committed to transforming enterprise data infrastructure to leverage the full potential of AI and drive operational efficiency.

📋 Description

• Provide deep technical leadership across LustreFS subsystems including llite, MDS/MDT, OSS/OST, LDLM, recovery and LNet.
• Own complex root-cause analysis for difficult customer, scale and production issues across kernel, filesystem, network and transport layers.
• Lead design and implementation of new features, reliability improvements, scale enhancements and performance optimizations in LustreFS.
• Drive architectural reviews for kernel-space and user-space changes with strong attention to correctness, backward compatibility and operability.
• Define debugging and observability strategies for complex distributed failure scenarios including failover, recovery storms, lock contention and transport degradation.
• Partner with principal engineers, support, QE, DevOps and release teams to improve product quality, test depth and release confidence.
• Mentor senior and mid-level engineers; create structured learning paths, review standards and subsystem ownership models to build redundancy.
• Promote use of AI-assisted workflows for issue triage, log analysis, code review assistance, knowledge capture and design acceleration with appropriate engineering guardrails.

🎯 Requirements

• 15+ years of experience in distributed systems, filesystems, Linux kernel development or storage infrastructure engineering.
• Strong hands-on expertise in LustreFS internals and production operations, including one or more of: metadata services, object storage services, client/llite, locking, recovery or LNet.
• Strong C systems programming skills and deep Linux debugging experience using tools such as gdb, crash, perf, ftrace, eBPF, systemtap and core analysis.
• Strong understanding of Linux kernel concurrency, memory management, I/O paths, networking and performance tuning.
• Experience with high-performance networking and transports such as InfiniBand, RDMA, RoCE and/or TCP at scale.
• Proven ability to diagnose complex cross-layer issues spanning kernel, storage, networking and distributed coordination.
• Experience leading design discussions, code reviews and subsystem-level technical decisions.
• Excellent written and verbal communication skills with the ability to guide senior technical audiences and influence cross-functional teams.

🏖️ Benefits

• Career development opportunities
• Flexible work arrangements
• Health insurance
• Paid time off
• Professional development
