Kairos
Back to jobs

Sr Engineer, Server Inference

On-site
TenstorrentBelgrade, RS / Serbia11 months agoWebsite
Senior
Product Software Engineering

Compensation

Salary undisclosed
Apply
Share

Description

Join our Inference Server Technologies team, where we develop software that powers state-of-the-art AI inferencing on Tenstorrent’s cutting-edge hardware. Our team builds the layer that works on top of the Tenstorrent ML libraries - designing APIs, deploying workloads, and benchmarking end-to-end inference speed. You’ll help us shape how developers consume and scale model execution on Tenstorrent’s stack.

This role is hybrid based in Belgrade, Serbia.

We welcome candidates at various experience levels. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.

 

Who You Are

  • An engineer who enjoys designing modern APIs and improving how ML models are deployed in production.
  • Curious about performance gains through techniques like batching, caching, and model parallelism.
  • Passionate about clean software architecture and effective abstraction layers.
  • Motivated to deliver backend systems that developers trust and rely on.

 

What We Need

  • Backend engineers who enjoy solving performance bottlenecks and scaling infrastructure.
  • Experience with web technologies, protocols, and system design.
  • Familiarity with Python, Docker, and Linux-based environments.
  • Strong coding practices and a clear ability to break down complex problems into high-quality, maintainable code.

 

What You Will Learn

  • How to optimize end-to-end ML inference on custom silicon.
  • Strategies for building scalable, reliable software interfaces for real-world AI applications.
  • How to shape the experience developers have when using Tenstorrent’s hardware for AI workloads.

 

Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.

Stack

PythonMachine LearningDocker
Posted
Jul 18, 2025
Last seen
Jun 25, 2026
First seen
Jun 25, 2026
Status
active
Sr Engineer, Server Inference at Tenstorrent | Kairos