P-1 AI logo

P-1 AI

HIRING · 2 OPEN AI

About

Own the training pipeline for large-scale LLM fine-tuning and post-training workflows
Configure, launch, monitor, and debug multi-node distributed training jobs using FSDP, DeepSpeed, or custom wrappers
Contribute to upstream and internal forks of training frameworks like TorchTune, TRL, and Hugging Face Transformers
Tune training parameters, memory footprints, and sharding strategies for optimal throughput
Work closely with infra and systems teams to maintain the health and utilization o...

Updates & Highlights

Senior Machine Learning Engineer

location_on Remote

Fulltime Remote Ai

Machine Learning Engineer - Training & Infrastructure

location_on San Francisco, CA

Fulltime Ai Ai Engineer