Job Description

Quadric has created an innovative general purpose neural processing unit (GPNPU) architecture. Quadric’s co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. Unlike other NPUs or neural network accelerators in the industry today that can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code.

Role:

The AI Inference Engineer in Quadric is the key bridge between the world of AI/LLM models and Quadric unique platforms. The AI Inference Engineer at Quadric will [1] port AI models to Quadric platform; [2] optimize the model deployment for efficient inference; [3] profile and benchmark the model performance. This senior technical role demands deep knowledge of AI model algorithms, system architecture and AI toolchains/frameworks.

Responsibilities:

  • Quantize, prune and convert models for deployment
  • Port models to Quadric platform using Quadric toolchain
  • Optimize inference deployment for latency, speed
  • Benchmark and profile model performance and accuracy
  • Develop tools to scale and speed up the deployment
  • Make Improvement to SDK and runtime
  • Provide technical support and documents to customers and developer community

Requirements:

  • Bachelorโ€™s or Masterโ€™s in Computer Science and/or Electric Engineering.

  • 5+ years of experience in AI/LLM model inference and deployment frameworks/tools

  • experience with model quantization (PTQ, QAT) and tools

  • experience with model accuracy measures

  • experience with model inference performance profiling

  • experience with at least one of the following frameworks: onnxruntime, Pytorch, vLLM, huggingface-transformer, neural-compressor, llamacpp

  • Proficiency in C/C++ and Python

  • Demonstrate good capability in problem solving, debug and communication

  • Health Care Plan (Medical, Dental & Vision)

  • Retirement Plan (401k, IRA)

  • Life Insurance (Basic, Voluntary & AD&D)

  • Paid Time Off (Vacation, Sick & Public Holidays)

  • Family Leave (Maternity, Paternity)

  • Short Term & Long Term Disability

  • Training & Development

  • Work From Home

  • Free Food & Snacks

  • Stock Option Plan

Share this job:
Please let Quadric know you found this job on Remote First Jobs ๐Ÿ™

521 similar remote jobs

Explore latest remote opportunities and join a team that values work flexibility.

Remote companies like Quadric

Explore remote-first companies similar to Quadric. Discover other top-rated employers that offer flexible schedules and work-from-anywhere options.

d-Matrix Logo

d-Matrix

Develops AI inference computing platforms and accelerators for datacenters, focusing on generative AI performance and efficiency.

View company profile โ†’
IT Labs Logo

IT Labs

Crafting high-performing teams

View company profile โ†’
Streamline Logo

Streamline

IT solutions and strategy

View company profile โ†’
Untether AI Logo

Untether AI

Energy-centric AI inference acceleration hardware with an at-memory architecture for edge to cloud applications.

View company profile โ†’
Telnyx Logo

Telnyx

A full-stack platform for real-time conversational AI, integrating global telephony and AI infrastructure.

View company profile โ†’
Semaphore Solutions Logo

Semaphore Solutions

Engineers digital transformation for complex clinical and life science laboratories.

View company profile โ†’

Project: Career Search

Rev. 2026.3

[ Remote Jobs ]
Direct Access

We source jobs directly from 21,000+ company career pages. No intermediaries.

01

Discover Hidden Jobs

Unique jobs you won't find on other job boards.

02

Advanced Filters

Filter by category, benefits, seniority, and more.

03

Priority Job Alerts

Get timely alerts for new job openings every day.

04

Manage Your Job Hunt

Save jobs you like and keep a simple list of your applications.

21,000+ SOURCES UPDATED 24/7
Apply