AgentVersity
AI

Generative AI Engineer

AgentVersity · Huntersville, NC

Actively hiring Posted 6 months ago

Company Description

AgentVersity is dedicated to building a community of skilled AI professionals prepared to address the challenges of the future. Our mission is to make AI education accessible, engaging, and effective, empowering individuals to develop impactful AI-driven solutions. With a network of over 8,000 AI professionals from leading companies like Google, Microsoft, and Meta, we provide the latest updates, exclusive training, and exceptional support. Join us to advance your AI expertise and be part of an innovative and dynamic organization.

Role Description

This is a full-time, on-site role for a Generative AI Engineer located in Huntersville, NC. The role involves developing, training, and optimizing generative AI models. Daily responsibilities include researching state-of-the-art AI techniques, designing machine learning algorithms, collaborating with interdisciplinary teams, and testing and deploying AI solutions. As a Generative AI Engineer, you will also help in identifying new AI use cases and driving innovation to meet strategic business objectives.

Responsibilties :

  • Design and implement AI agents using frameworks such as LangGraph, Google ADK, or other agentic frameworks
  • Build Retrieval-Augmented Generation (RAG) systems using vector databases and structured data sources
  • Develop evaluation frameworks for:
  • Agent behavior and decision quality
  • RAG retrieval accuracy and answer faithfulness
  • Fine-tune language models using LoRA / QLoRA for task-specific performance improvements
  • Deploy and host small and medium language models using vLLM for high-throughput inference
  • Optimize inference performance (latency, memory, GPU utilization)
  • Integrate tools, APIs, and external systems into agent workflows
  • Work with cloud infrastructure on AWS, GCP, or Azure to deploy GenAI workloads
  • Collaborate with DevOps / SRE teams to ensure reliability, observability, and scalability

Required SKills and Experience :

  • Strong experience building Generative AI applications using LLMs
  • Hands-on experience with AI agents and agent orchestration frameworks:
  • LangGraph, Google ADK, or similar agentic frameworks
  • Solid understanding of RAG architectures , including:
  • Embeddings, vector databases, chunking strategies, and retrieval evaluation
  • Experience implementing evaluation pipelines for LLMs, agents, and RAG systems
  • Practical experience with LoRA / QLoRA fine-tuning
  • Experience hosting models using vLLM or similar inference servers
  • Proficiency in Python
  • Experience working in at least one cloud environment:
  • AWS, GCP, or Azure
  • Familiarity with Docker and basic Kubernetes concepts

Nice to have

  • Experience with multi-agent systems and tool-using agents
  • Knowledge of prompt engineering and structured output generation
  • Experience with GPU-based workloads and memory optimization
  • Familiarity with observability tools for GenAI systems
  • Prior experience deploying GenAI systems in production

What we value :

  • Strong problem-solving mindset
  • Ability to think in systems, not just prompts
  • Focus on production-ready AI , not demos
  • Clear communication and collaboration skills

**Preference will be given to USC and GC holders. No H1B. OPT and H4 EAD can apply. NO C2C

Tags & focus areas

Used for matching and alerts on DevFound
Fulltime Ai Ai Engineer Machine Learning Generative Ai
Common Questions

Frequently asked questions

Quick answers about how DevFound's AI matching, resumes, and referrals work.

DevFound's AI Copilot ingests your profile, goals, and live job data to deliver curated matches in seconds. Every match includes a resume variant, suggested referrals, and interview prep so you can act immediately. The more feedback you provide, the sharper the Copilot becomes.

AI-led job searches shrink the hours spent sifting through boards and formatting resumes. DevFound pairs automation with your personal outreach, so you reserve energy for interviews and negotiation. Traditional networking still matters, but AI gives you a lift before you even send a message.

Modern AI roles expect comfort with production-grade code, data fluency, and practical ML tooling. The strongest candidates pair deep technical chops with storytelling—translating model impact to product, GTM, and exec partners. Continuous learning keeps you ahead as stacks evolve.

DevFound rewards active seekers. Keep your profile fresh, respond to match quality prompts, and enable alerts so you never miss a role. The AI prioritizes companies and teams that align with your feedback, accelerating both introductions and interview invites.

High-density tech hubs continue to host the deepest AI talent pools, yet distributed teams are catching up fast. Use DevFound filters to hone in on onsite, hybrid, or fully remote roles and watch openings expand across time zones.

DevFound aggregates thousands of remote AI openings and flags the nuances—core hours, async culture, and visa needs—up front. The Copilot also recommends how to position your distributed work experience so hiring managers know you can thrive on a remote team.