NVIDIA Triton Deployment Engineer 

Apply for this job

Name(Required)
Earliest start date for a full-time, permanent position(Required)
Please confirm your current work authorization status as a US Citizen or US Permanent Resident?(Required)
How many years of professional experience do you have?(Required)
Max. file size: 10 MB.

OUR PROJECT

The NVIDIA Triton Deployment Engineer will support the deployment of NVIDIA-accelerated AI models to edge devices in regulated, real-world environments. This role focuses on production-grade model deployment, automation, and lifecycle management, rather than research or model development. You will own the end-to-end process of converting, optimizing, packaging, deploying, and updating trained AI models on edge systems using NVIDIA Triton Inference Server and AWS-based automation.

This is a contract position that will be 100% remote and you can expect very competitive pay.

WHO WE ARE LOOKING FOR

We are looking for a NVIDIA Triton Deployment Engineer with deep experience operationalizing AI models in GPU and edge environments. The ideal candidate is comfortable managing the full deployment lifecycle—from model optimization to edge delivery—while automating workflows in the cloud. You have a strong understanding of Triton, TensorRT/ONNX integration, and GPU inference, and can collaborate effectively with ML, platform, and systems teams to deliver production-ready AI solutions.

We are interviewing qualified candidates immediately and will move into the offer stage quickly. If you are interested, please apply with an updated resume.

QUALIFICATIONS

  • Hands-on experience with NVIDIA Triton Inference Server, including repository structure, ensembles, versioning, TensorRT, and ONNX.
  • Proven experience deploying NVIDIA-accelerated models to edge devices.
  • Experience converting, optimizing, packaging, and deploying AI models for production inference.
  • Experience automating deployments using AWS (IAM, VPC, S3, KMS).
  • Strong understanding of GPU-based inference and production deployment considerations.

Effective written and verbal communication skills are absolutely required for this role. You must be able to work LEGALLY in the United States as NO SPONSORSHIP will be provided. NO 3rd PARTIES.

Browse other posts

Lead Equipment Engineer

We are seeking an experienced Lead Equipment Engineer with a strong background in semiconductor manufacturing equipment and proven leadership experience.

Process Engineering Manager

We are looking for a Process Engineering Manager with deep semiconductor process expertise and a strong people-management foundation.

Plasma Equipment Engineer  

We are looking for a Plasma Equipment Engineer who thrives in a collaborative manufacturing environment and leads through influence, data, and trust.

NVIDIA Triton Deployment Engineer 

We are looking for a NVIDIA Triton Deployment Engineer with deep experience operationalizing AI models in GPU and edge environments.

Software Engineer – Defense & Aerospace (Cloud Systems)

We are seeking a Software Engineer with defense or avionics experience who combines strong backend development skills with modern cloud and microservices expertise.

Principal Design Engineer – Missile Nozzle & Propulsion

We are seeking a Principal Design Engineer with extensive experience in rocket propulsion and nozzle physics.