Guidehouse
AI

AI / ML Engineer

Guidehouse · Huntsville, AL, US

Actively hiring Posted 6 months ago

Responsibilities

  • Serves as the lead AI/ML engineer responsible for developing, optimizing, and operationalizing advanced LLM-driven workflows for the FBI adjudication platform. Drives design and implementation of inference pipelines, RAG workflows, retrieval systems, prompt architectures, and model lifecycle processes.
  • Leads development of dual-path model operations supporting self-hosted open‑weight LLMs in AWS GovCloud and FedRAMP‑High managed endpoints. Engineers GPU-based inference infrastructure, model containerization, distributed inference strategies, and performance‑optimized reasoning workflows.
  • Designs and maintains continuous learning systems including SFT, LoRA/QLoRA adapters, dataset curation, automated evaluation suites, hallucination detection, bias evaluation, and model drift monitoring. Ensures models are safe, accurate, reliable, and aligned to SEAD‑4 adjudication criteria.
  • Ensures all model operations adhere to FedRAMP High, RMF, CJIS, and FBI ATO requirements, including controls for logging, access, explainability, evidence provenance, and data protection.
  • Develop and maintain LLM inference pipelines supporting long‑document reasoning, multi‑document fusion, entity extraction, anomaly detection, SEAD‑4 scoring, and structured memo generation.
  • Build and manage advanced prompt architectures including system prompts, instruction sets, retrieval‑augmented prompts, multi-step reasoning flows, and output‑schema enforcement to ensure accuracy and stability.
  • Implement distributed GPU inference frameworks (vLLM, TGI, DeepSpeed, Sagemaker) and optimize workloads with KV caching, tensor parallelism, dynamic batching, and memory efficiency strategies.
  • Develop output‑validation routines enforcing schema correctness, key‑evidence referencing, structured scoring, and quality controls for all model‑generated adjudicative content.
  • Implement RAG architectures including embedding generation, vector indexing, long‑context retrieval, and retrieval scoring to support evidence‑grounded outputs for 300–400‑page investigative files.
  • Optimize chunking strategies, ranking models, hybrid search pipelines, and retrieval heuristics to ensure accurate and contextually aligned LLM output.
  • Develop retrieval pipelines that reduce hallucination risk, enforce evidence provenance, and provide structured citation‑linked responses consistent with adjudication standards.
  • Lead development of supervised fine‑tuning (SFT) pipelines using adjudicator examples, SEAD‑4 scoring decisions, historical memos, and SME‑curated datasets.
  • Build LoRA/QLoRA fine‑tuning workflows for secure GovCloud environments, enabling high‑fidelity model specialization without full retraining cycles.
  • Design evaluation suites measuring guideline adherence, evidence alignment, factual consistency, hallucination probability, and reasoning stability across adjudicative categories.
  • Implement model drift detection, scoring distribution monitoring, and automated retraining triggers tied to analyst feedback and dataset evolution.
  • Ensure ML operations align with FedRAMP High and RMF requirements, including encryption, boundary isolation, identity controls, inference logging, and auditable model‑output trails.
  • Establish secure input‑validation flows, restricted‑context enforcement, prompt sanitization, and runtime protections to mitigate security and data‑integrity risks.
  • Develop telemetry pipelines capturing query metadata, retrieval context, response confidence, scoring variances, and override patterns for audit and monitoring.
  • Integrate LLM inference services with backend APIs, scoring engines, memo‑generation modules, entity‑resolution tools, and analyst‑facing UI workflows.
  • Develop supporting microservices for prompt routing, retrieval assembly, evaluation probes, model profiling, and inference orchestration.
  • Collaborate with backend engineers to optimize throughput, latency, concurrency, and reliability for high‑volume adjudication workflows.
  • Work with the AI Solutions Architect to maintain coherence between ML pipelines and system‑wide architecture.
  • Collaborate with adjudicators, SEAD‑4 SMEs, and mission stakeholders to translate adjudicative logic into prompts, features, and structured model outputs.
  • Mentor junior engineers, lead experimentation cycles, participate in design reviews, and contribute to Guidehouse AI/ML engineering best practices.

Basic qualifications

  • An ACTIVE and MAINTAINED "TOP SECRET" Federal or DoD security clearance and obtained and maintain TS/SCI clearance.
  • Minimum of Eight (8) years of experience in AI/ML engineering with 4+ years focused on NLP, LLMs, or MLOps.
  • Bachelor' s Degree or Four (4) additional Years of experience in lieu of degree.
  • Expertise in PyTorch, HuggingFace Transformers, vLLM, DeepSpeed, or equivalent frameworks.
  • Strong background in retrieval systems, embeddings, RAG pipelines, vector databases, and long‑context optimization.
  • Experience implementing MLOps workflows, evaluation frameworks, drift detection, and responsible‑AI safeguards.
  • Experience delivering ML systems in secure federal environments subject to FedRAMP High or RMF controls.
  • Experience supporting adjudication, continuous vetting, background investigations, or SEAD‑4 scoring workflows.
  • Experience deploying open‑weight LLMs in GovCloud or secure enclaves.
  • Experience with citation‑grounding pipelines, evidence‑verification workflows, or structured model‑output evaluation.
  • AWS Machine Learning Specialty, Solutions Architect Professional, or GPU Compute certifications.
  • Experience with explainability tooling, guardrails, reasoning verification, or adversarial evaluation.

Benefits

  • Medical, Rx, Dental & Vision Insurance
  • Personal and Family Sick Time & Company Paid Holidays
  • Parental Leave
  • 401(k) Retirement Plan
  • Group Term Life and Travel Assistance
  • Voluntary Life and AD&D Insurance
  • Health Savings Account, Health Care & Dependent Care Flexible Spending Accounts
  • Transit and Parking Commuter Benefits
  • Short-Term & Long-Term Disability
  • Tuition Reimbursement, Personal Development, Certifications & Learning Opportunities
  • Employee Referral Program
  • Corporate Sponsored Events & Community Outreach
  • Care.com annual membership
  • Employee Assistance Program
  • Supplemental Benefits via Corestream (Critical Care, Hospital Indemnity, Accident Insurance, Legal Assistance and ID theft protection, etc.)
  • Position may be eligible for a discretionary variable incentive bonus

Tags & focus areas

Used for matching and alerts on DevFound
Ai Ai Engineer Machine Learning Data Science Generative Ai
Common Questions

Frequently asked questions

Quick answers about how DevFound's AI matching, resumes, and referrals work.

DevFound's AI Copilot ingests your profile, goals, and live job data to deliver curated matches in seconds. Every match includes a resume variant, suggested referrals, and interview prep so you can act immediately. The more feedback you provide, the sharper the Copilot becomes.

AI-led job searches shrink the hours spent sifting through boards and formatting resumes. DevFound pairs automation with your personal outreach, so you reserve energy for interviews and negotiation. Traditional networking still matters, but AI gives you a lift before you even send a message.

Modern AI roles expect comfort with production-grade code, data fluency, and practical ML tooling. The strongest candidates pair deep technical chops with storytelling—translating model impact to product, GTM, and exec partners. Continuous learning keeps you ahead as stacks evolve.

DevFound rewards active seekers. Keep your profile fresh, respond to match quality prompts, and enable alerts so you never miss a role. The AI prioritizes companies and teams that align with your feedback, accelerating both introductions and interview invites.

High-density tech hubs continue to host the deepest AI talent pools, yet distributed teams are catching up fast. Use DevFound filters to hone in on onsite, hybrid, or fully remote roles and watch openings expand across time zones.

DevFound aggregates thousands of remote AI openings and flags the nuances—core hours, async culture, and visa needs—up front. The Copilot also recommends how to position your distributed work experience so hiring managers know you can thrive on a remote team.