$120K - 180K a year
Lead design and optimization of AI inference software stack leveraging specialized hardware, collaborating across hardware and ML teams.
5+ years in low-level systems programming with expertise in kernel development, parallel programming, and ML frameworks.
At d-Matrix, we are focused on unleashing the potential of generative AI to power the transformation of technology. We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. Our culture is one of respect and collaboration. We value humility and believe in direct communication. Our team is inclusive, and our differing perspectives allow for better solutions. We are seeking individuals passionate about tackling challenges and are driven by execution. Ready to come find your playground? Together, we can help shape the endless possibilities of AI. Location: Remote The Role: AI Software Architect, Staff d-Matrix is redefining the future of AI compute with groundbreaking in-memory computing technologies. We're looking for a highly skilled and experienced AI Software Architect to lead the design of our next-generation AI software stack, purpose-built for cutting-edge inference workloads. This role requires deep expertise in ML model architectures and low-level kernel development, combined with a systems-level mindset to drive performance across the stack. What you will do: • Architect Scalable Systems: Lead the design of a scale-up and scale-out inference software stack that fully leverages d-Matrix's unique hardware capabilities. • Full-Stack Problem Solving: Tackle challenges across multiple software layers—from runtime and compilers to kernel-level performance—delivering innovative, end-to-end solutions. • Cross-Functional Collaboration: Partner with hardware engineers, ML researchers, and product teams to define software requirements and deliver tightly integrated systems. • Optimize for Performance: Develop and implement advanced optimization strategies to ensure ultra-low latency and high throughput in distributed and high-performance computing environments. • Elevate Code Quality: Champion best practices in code design, testing, and peer reviews to maintain the highest standards of quality and maintainability. • Document for Scale: Create robust technical documentation to support ongoing development, deployment, and scaling of our software systems. What you will bring: • Bachelor’s degree in Computer Science or related field (Master’s preferred) • Minimum 5+ years of professional experience in software development, with a focus on low-level systems programming and performance optimization Technical Skills: • Deep expertise in CPU/GPU/xPU kernel development and low-level programming (C/C++, CUDA, etc.) • Strong background in parallel and concurrent programming • Proven experience in performance bottleneck analysis and system-level optimization • Familiarity with modern ML frameworks like PyTorch, ONNX Runtime (ORT), or JAX • Skilled in using development tools and frameworks for building, profiling, and deploying large-scale applications Soft Skills: • Exceptional analytical and problem-solving capabilities • Self-starter who thrives in ambiguity and fast-paced environments • Collaborative team player with excellent communication and interpersonal skills Equal Opportunity Employment Policy d-Matrix is proud to be an equal opportunity workplace and affirmative action employer. We’re committed to fostering an inclusive environment where everyone feels welcomed and empowered to do their best work. We hire the best talent for our teams, regardless of race, religion, color, age, disability, sex, gender identity, sexual orientation, ancestry, genetic information, marital status, national origin, political affiliation, or veteran status. Our focus is on hiring teammates with humble expertise, kindness, dedication and a willingness to embrace challenges and learn together every day. d-Matrix does not accept resumes or candidate submissions from external agencies. We appreciate the interest and effort of recruitment firms, but we kindly request that individual interested in opportunities with d-Matrix apply directly through our official channels. This approach allows us to streamline our hiring processes and maintain a consistent and fair evaluation of al applicants. Thank you for your understanding and cooperation.
This job posting was last updated on 9/26/2025