Lead AI Engineer, FM Hosting, LLM Inference

🕒 3 days ago

🏢🏡 New York City – Hybrid

💵 $197.3k - $245.6k / year

⏰ Full Time

🟠 Senior

🤖 AI Engineer

🦅 H1B Visa Sponsor

info
Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Capital One

Capital One

WebsiteLinkedIn

10,000+ employees

🏦 Banking

💳 Fintech

💸 Finance

💰 Post-IPO Equity on 2023-05

Banking • Fintech • Finance

Capital One is a leading financial services company that specializes in offering credit cards, auto loans, banking, and savings accounts. With a focus on innovation and technology, Capital One aims to change banking for good by providing customer-friendly solutions and fostering a diverse and inclusive workforce. The company is known for its commitment to creating a positive impact in the banking industry through advanced digital tools and customer service excellence.

📋 Description

• Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One. • Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. • Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more. • Invent and introduce state-of-the-art LLM optimization techniques to improve the performance — scalability, cost, latency, throughput — of large scale production AI systems. • Contribute to the technical vision and the long term roadmap of foundational AI systems at Capital One.

🎯 Requirements

• Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies, or a Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 2 years of experience developing AI and ML algorithms or technologies • At least 4 years of experience programming with Python, Go, Scala, or Java • 6 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud) • Experience designing, developing, delivering, and supporting AI services • Experience developing AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang • Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost • Passion for staying abreast of the latest AI research and AI systems, and judiciously apply novel techniques in production.

🏖️ Benefits

• comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being.

Apply Now

Similar Jobs

🕒 3 days ago

Guardian Life

5001 - 10000

💸 Finance

🧘 Wellness

WebsiteLinkedIn

Lead AI Engineer driving AI and Generative AI solutions for Guardian. Collaborating across teams to optimize automation and improve operational excellence.

🏢🏡 New York City – Hybrid

💵 $119k - $195.5k / year

💰 Non Equity Assistance on 2016-08

⏰ Full Time

🟠 Senior

🤖 AI Engineer

🦅 H1B Visa Sponsor

info

🕒 6 days ago

PulseRise Technologies

1 - 10

🤝 B2B

🎯 Recruiter

🔒 Cybersecurity

WebsiteLinkedIn

Senior ML/AI Engineer developing models and systems for intelligent enterprise decision-making. Focused on applied AI, data analytics, and production-level machine learning integrations.

🏢🏡 New York City – Hybrid

⏰ Full Time

🟠 Senior

🤖 AI Engineer

🕒 May 27

Socure

501 - 1000

🤖 Artificial Intelligence

🔐 Security

💸 Finance

WebsiteLinkedIn

Senior AI Engineer defining and building AI-driven identity solutions. Collaborate within Socure to enhance digital trust and operational efficiency through innovative automation.

🏢🏡 New York City – Hybrid

💵 $200k - $230k / year

💰 $450M Series E on 2021-11

⏰ Full Time

🟠 Senior

🤖 AI Engineer

🦅 H1B Visa Sponsor

info

🕒 May 24

Pear VC

11 - 50

🤖 Artificial Intelligence

🧬 Biotechnology

WebsiteLinkedIn

Founding Applied AI Lead designing and owning AI agents and workflows for Paxos Health. Transforming healthcare through AI to help patients access medical technologies with high reliability.

🏢🏡 New York City – Hybrid

💵 $110k - $175k / year

⏰ Full Time

🟠 Senior

🤖 AI Engineer

🕒 May 16

Pfizer

10,000+ employees

WebsiteLinkedIn

Scientific AI Engineer working on AI solutions that impact Oncology R&D decision-making. Collaborating with scientists and clinicians to deliver AI-enabled insights.

🏢🏡 New York City – Hybrid

💵 $139.1k - $231.9k / year

💰 Post-IPO Debt on 2023-05

⏰ Full Time

🟠 Senior

🤖 AI Engineer

🦅 H1B Visa Sponsor

info