Nous Research

HIRING · 2 OPEN

About

Architect and implement efficient ML inference pipelines for large language models.

Responsibilities:

- Design and implement high-performance inference pipelines
- Optimize model serving for throughput, latency, and cost across different workloads
- Collaborate with research and product teams to integrate inference into real-world applications
- Help enhance and manage the deployment pipeline and monitor production clusters
- Debug production inference issues
- Stay up-to-date with the latest in inferen...

Updates & Highlights

Full Time · Research Scientist

Saratoga, CA, USA

Scientist Solana Python

Full Time · ML Ops Engineer

Saratoga, CA, USA

Engineer AWS Kubernetes