Full-time

Edge AI Inference Engineer: Kernel & Speed Optimization

Posted by Tether Operations Limited • June 07, 2026

📍 workfromhome, bogotá, distrito capital, Colombia
Apply Now

Description

Tether Operations Limited is seeking a specialized candidate for a fully remote position focusing on the design and deployment of state-of-the-art model serving architectures. The ideal applicant should have a PhD in NLP or Machine Learning, with a solid AI R&D track record and expertise in GPU kernels.

The role includes optimizing and integrating frameworks for low-resource devices, ensuring high performance, and diagnosing computational bottlenecks. If you have deep technical knowledge and proven experience in inference optimization, this opportunity is for you.

#J-18808-Ljbffr

Ready to Seal the Deal?

Submit your application today and take the next step in your career with Tether Operations Limited.

Apply for this Job