Principal Systems Engineer – HPC/AI, Storage Engineer, Monitoring Expert, Solution Architect, Security/Provisioning Engineer, Multi-discipline Expert

Job not on LinkedIn

10 hours ago

Apply Now
Logo of General Dynamics Information Technology

General Dynamics Information Technology

Defense • Cybersecurity • Artificial Intelligence

General Dynamics Information Technology is a company at the forefront of technological innovation, offering a wide range of services including consulting, digital modernization, and application services. The company is heavily involved in implementing solutions related to artificial intelligence, cloud computing, cybersecurity, high-performance computing, and quantum technologies. GDIT is committed to supporting government and defense sectors, providing mission-critical services such as logistics and supply chain management, intelligence, and homeland security. The company also focuses on diverse and inclusive hiring practices and actively promotes employee well-being. Through its digital accelerator solutions and pioneering use of emerging technologies, GDIT aims to propel agencies' missions forward and address complex technological challenges.

📋 Description

• Lead/Manage/Support the day-day operations, sustainment, HPC services delivery, and incremental enhancements of two, geographically separated HPC clusters • Collaborate with the GDIT WCOSS team as a senior-level HPC functional expert addressing intricate and multifaceted HPC challenges • Drive and prioritize resource utilization towards continuously improving customer satisfaction with GDIT's HPC service delivery • Utilize past experience, team collaboration, system management and troubleshooting applications, and ingenuity to support customer operations

🎯 Requirements

• 8+ years of related experience • Highly proficient with Linux (RockyOS, SLES, etc) • Scripting in Python, Perl, or Bash • Networking concepts and technology such as Ethernet, InfiniBand and Slingshot, TCP/IP networking, basic routing, and network services • Programming in Python, C/C++, or Fortran • Administrating PBSpro, SLURM or other batch systems in an HPC cluster • System performance monitoring and tuning in an HPC cluster environment (e.g., Opensearch, Grafana, Prometheus) • Security clearance level: must complete a satisfactory background investigation • US citizenship required • Expected to perform as individual SME contributor, functional lead, or project/task leader responsible for work product delivery • Extensive experience in troubleshooting, diagnosing and repairing hardware failures to component level on servers; coordinating with vendors to resolve hardware and software problems

🏖️ Benefits

• Comprehensive benefits and wellness packages • 401K with company match • Competitive pay and paid time off • Full-flex work week to own your priorities at work and at home • Variety of paid time off plans, including vacation, sick and personal time, holidays, paid parental, military, bereavement and jury duty leave

Apply Now

Similar Jobs

Yesterday

Manager of R&D Software Engineering leading a team for next-generation clinical trial technologies. Overseeing solution engineering with a focus on cloud-native engineering and technical excellence.

Angular

AWS

Azure

Cloud

Oracle

Postgres

SQL

.NET

Yesterday

Consulting Solution Architect developing comprehensive security solutions for CDW customers. Collaborating with teams to provide technical scoping and support for managed services offerings.

Azure

Cloud

Cyber Security

Yesterday

Solutions Architect responsible for designing and enabling new service offerings on Rubrik's data protection and recovery portfolio. Support strategic presales with global MSPs.

AWS

Azure

Cloud

Google Cloud Platform

VMware

Yesterday

Solutions Architect guiding clients in their modern data journey with Trace3. Delivering strategic insights, technical expertise, and effective collaboration on data architecture and analytics initiatives.

AWS

Azure

BigQuery

Cloud

ETL

Google Cloud Platform

Yesterday

Solutions Architect responsible for data architecture and analytics strategy for enterprise clients at Trace3. Driving modernization and platform transformation through consultative engagement.

AWS

Azure

BigQuery

Cloud

ETL

Google Cloud Platform

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com