Senior AI Engineer

Trustlab · San Mateo, CA, US

Actively hiring · Posted 6 months ago

Who we are:

TrustLab deploys cutting-edge solutions to evaluate AI agents, models, and apps for enterprise customers. With a 5-year track record of working with large and small clients, including social media companies and digital marketplaces, and guided by founders who previously held senior leadership positions at Google, YouTube, TikTok, and Reddit, we are creating industry-leading, LLM-based solutions for agentic system evaluation and labeling. Our approach includes human-in-the-loop and LLM-as-a-judge technologies, with a focus on rapid innovation and production-level scaling. You’ll join a small, mission-driven team where your contributions have a direct impact on real-world issues.

What you’ll do:

At TrustLab, your work won’t live in theory; it will power live systems used at large scale. You’ll develop, tune, and optimize LLM-driven solutions that interpret and reason about complex digital content, experimenting rapidly from design to deployment and seeing immediate feedback from real-world use cases. Partnering closely with other engineers, researchers, and product leaders, you’ll pioneer new approaches to model training and evaluation and take ownership from early R&D through production launch, so that your work directly shapes how millions of people experience AI-powered content.

Key Responsibilities:

  • Train, evaluate, and monitor new and improved LLMs and other algorithmic models.
  • Test and deploy content moderation models in production, and iterate based on real-world performance metrics and feedback loops.
  • Develop a medium- to long-term vision for content-understanding R&D, working with management, product, policy and operations, and engineering teams.
  • Take ownership of results delivered to customers, pushing for change in approach where needed and taking the lead on cross-functional execution.

What we’re looking for:

  • Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field. Ph.D. is a plus.
  • Proficiency in Python. Experience with AWS and CI/CD processes and tools is a strong plus.
  • Experience with prompt-engineering techniques and familiarity with multiple LLM providers.
  • Several years of industry experience in NLP or Computer Vision, or in making LLMs work in production for non-trivial use cases, including familiarity with evaluation metrics for classification tasks and best practices for handling imbalanced datasets.
  • Hands-on experience debugging issues in production environments, especially on AWS.
  • Strong track record of delivering results under time and resource pressure.

Why Join Us?

  • Work with a group of renowned industry leaders in AI and Online Safety to shape the future of the industry.
  • Ample opportunity and support for growth, whether as a technical individual contributor or as a manager.
  • Apply AI technology to real-world business use cases at a significant scale, with blue-chip customers.
  • Work as part of a team where you can know everyone, but don’t have to do everyone’s job.
  • Competitive compensation, comprehensive benefits, and a hybrid in-office policy.
