Quadric, Inc.

via Workable

AI Kernel Engineer

Anywhere

full-time

Posted 11/24/2025

Key Skills:

C/C++

Python

AI kernel development

Performance profiling

Hardware architecture

Compiler toolchain

CUDA

DSP

NEON

Assembly language

Compensation

Salary Range

$120K - $160K a year

Responsibilities

Develop and optimize AI/LLM kernels on the Quadric platform, profile performance, improve the toolchain, and support customers.

Requirements

5+ years in AI kernel development, proficiency in C/C++ and Python, experience with CUDA/DSP/NEON, and strong problem-solving skills.

Full Description

Quadric has created an innovative general-purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware target neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery-operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. Unlike other NPUs or neural network accelerators in the industry today, which can accelerate only a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code.

Role:

The AI Kernel Engineer at Quadric plays a key role in enabling a large number of AI kernels/operators to run efficiently on the Quadric platform. The AI Kernel Engineer will [1] develop a highly efficient Quadric kernel library for a variety of AI/LLM models and [2] analyze performance and optimize the kernels for different hardware configurations. This senior technical role demands deep knowledge of hardware architecture, compiler toolchains, and optimization techniques.

Responsibilities:

Develop AI/LLM kernels/operators on the Quadric platform for efficient inference

Optimize kernel performance for different hardware configurations and workloads

Profile and analyze kernel performance in terms of compute, data, and parallelism; identify micro-architecture and software bottlenecks and provide optimization solutions

Optimize kernel C/C++ code to maximize hardware utilization

Make improvements to the Quadric toolchain, compiler, and runtime

Provide technical support and documentation to customers and the developer community

Requirements:

Bachelor’s or Master’s in Computer Science and/or Electrical Engineering

5+ years of experience in AI kernel development and optimization

Experience with model and kernel inference performance profiling

Experience with at least one of the following for compute development: CUDA, DSP, NEON, Triton-lang

Proficiency in C/C++ and Python; experience with assembly language a plus

Strong problem-solving, debugging, and communication skills

Benefits:

Health Care Plan (Medical, Dental & Vision)

Retirement Plan (401k, IRA)

Life Insurance (Basic, Voluntary & AD&D)

Paid Time Off (Vacation, Sick & Public Holidays)

Family Leave (Maternity, Paternity)

Short Term & Long Term Disability

Training & Development

Work From Home

Free Food & Snacks

Stock Option Plan

This job posting was last updated on 11/26/2025
