Mid-Level Data Scientist

Simple Technology Solutions

51 - 200 employees

🏛️ Government

🤖 Artificial Intelligence

Government • Cloud • Artificial Intelligence

Simple Technology Solutions is a HUBZone small business that specializes in IT modernization and digital experience for government operations. The company digitalizes government processes using cloud-native technologies and Agile practices to deliver full-stack digital solutions, and it emphasizes security, scalability, and interoperability in its enterprise approach. Its work includes enhancing cloud environments, migrating legacy IT systems, and promoting DevSecOps practices. It also develops enterprise data management strategies using machine learning and AI, modernizes applications, and provides cloud contact center services. Its customers are primarily federal government agencies, particularly those with law enforcement and public safety missions.

📋 Description

• Build and maintain knowledge bases, vector stores, and Retrieval Augmented Generation (RAG) pipelines using Amazon Bedrock and Amazon OpenSearch Service to make financial and regulatory datasets AI-ready for advanced analytics and machine learning consumption
• Support the development, validation, and operationalization of statistical outputs and derived data products; coordinate with the agency data science team and SME data scientists to implement Airflow DAGs and AWS Glue jobs that ensure automated, recurring updates
• Support transition of data science outputs into production by validating accuracy, completeness, and reporting readiness; ensure all production data products are incorporated into the agency's ETL load and gap reporting infrastructure
• Develop and validate machine learning models and analytical pipelines using large-scale financial and regulatory datasets in the data lake
• Leverage AI-assisted development tools for code generation, debugging, and performance tuning; adhere to agency security standards and applicable federal AI governance requirements
• Write Python 3.10 code conforming to PEP 8; integrate analytical pipelines with the agency's ETL metadata infrastructure and produce required load and gap reporting outputs
• Support entity resolution work to ensure consistent identification and linkage of records across high-volume financial datasets
• Produce required documentation for all analytical models and pipelines: methodology, data lineage, model assumptions, refresh schedules, and IV&V Questionnaires
• Write automated tests achieving the 90% minimum code coverage threshold; complete security scans at least once per sprint as part of the Definition of Done per OWASP ASVS Level 2
• Participate in 2-week sprint ceremonies, quarterly PI planning, backlog refinement, and agile delivery using Jira and GitHub

🎯 Requirements

• US citizenship is required
• Bachelor's degree is required
• Minimum of 3-5 years of position-related experience is required
• Proficiency in Python 3.10 (PEP 8) including pandas, NumPy, scikit-learn, and related libraries
• Hands-on experience with Amazon Bedrock, knowledge bases, vector stores, and RAG pipeline design on AWS
• Experience with Amazon OpenSearch Service or equivalent vector/search infrastructure
• Experience with Apache Airflow (MWAA) for DAG-based pipeline orchestration
• Familiarity with AWS Glue, S3, and Apache Spark for large-scale data processing
• Experience with SQL and query tools such as Trino, Athena, or Redshift
• Experience working with large-scale financial or regulatory datasets is strongly preferred
• Knowledge of federal AI governance requirements and responsible AI practices in a government setting
• Experience with agile development, CI/CD pipelines, GitHub, and sprint-based delivery
• Familiarity with FISMA, NIST 800-53, and Zero Trust principles
• Must be able to work 8am-5pm Eastern Time regardless of home location
• Active federal public trust suitability determination, or the ability to obtain one, is required

🏖️ Benefits

• Special incentives for team members living in qualified HUBZones
• Flexibility to help team members thrive personally and professionally
