NVIDIA Triton Deployment Engineer 

Apply for this job

Name(Required)
Earliest start date for a full-time, permanent position(Required)
Please confirm your current work authorization status as a US Citizen or US Permanent Resident?(Required)
How many years of professional experience do you have?(Required)
Max. file size: 10 MB.

OUR PROJECT

The NVIDIA Triton Deployment Engineer will support the deployment of NVIDIA-accelerated AI models to edge devices in regulated, real-world environments. This role focuses on production-grade model deployment, automation, and lifecycle management, rather than research or model development. You will own the end-to-end process of converting, optimizing, packaging, deploying, and updating trained AI models on edge systems using NVIDIA Triton Inference Server and AWS-based automation.

This is a contract position that will be 100% remote and you can expect very competitive pay.

WHO WE ARE LOOKING FOR

We are looking for a NVIDIA Triton Deployment Engineer with deep experience operationalizing AI models in GPU and edge environments. The ideal candidate is comfortable managing the full deployment lifecycle—from model optimization to edge delivery—while automating workflows in the cloud. You have a strong understanding of Triton, TensorRT/ONNX integration, and GPU inference, and can collaborate effectively with ML, platform, and systems teams to deliver production-ready AI solutions.

We are interviewing qualified candidates immediately and will move into the offer stage quickly. If you are interested, please apply with an updated resume.

QUALIFICATIONS

  • Hands-on experience with NVIDIA Triton Inference Server, including repository structure, ensembles, versioning, TensorRT, and ONNX.
  • Proven experience deploying NVIDIA-accelerated models to edge devices.
  • Experience converting, optimizing, packaging, and deploying AI models for production inference.
  • Experience automating deployments using AWS (IAM, VPC, S3, KMS).
  • Strong understanding of GPU-based inference and production deployment considerations.

Effective written and verbal communication skills are absolutely required for this role. You must be able to work LEGALLY in the United States as NO SPONSORSHIP will be provided. NO 3rd PARTIES.

Browse other posts

Pricing Data Analyst

We are seeking a motivated and technically skilled Pricing Data Analyst who thrives in a fast-paced retail environment.

Technical Recruiting Manager

This role would be ideal for a Recruiting Manager, Recruiting Lead, or Account Manager with 6-10 years of professional management experience in recruiting or account management in the technical staffing industry. You’re excited to lead and mentor a team!

Financial Analyst – Life Sciences

We are looking for a Financial Analyst to support IT budgeting, forecasting, variance analysis, and financial reporting in the Chicago area.

Senior Cloud Architect – Defense & Aerospace – Melbourne, FL

We are seeking a Senior Cloud Architect with defense or avionics experience who combines strong backend development skills with modern cloud and microservices expertise.

Optical Engineer

We are looking for a Senior Optical Engineer who is technically rigorous, and a hands-on leader that thrives in a fast-paced aerospace environment.

Principal Navigation Engineer (Kalman Filters)

We are looking for a Principal Navigation Engineer (Kalman Filters) who will lead the design, implementation, and validation of advanced state estimation algorithms for spacecraft relative motion.