AI Model Optimization Specialist
We are looking for an experienced AI Model Engineer with deep expertise in kernel development, model optimization, fine-tuning, and GPU acceleration.
* Implement and optimize custom inference and fine-tuning kernels for small and large language models across multiple hardware backends.
* Implement and optimize full and LoRA fine-tuning for small and large language models across multiple hardware backends.
The role requires hands-on experience with quantization techniques, LoRA architectures, Vulkan backend, and mobile GPU debugging. This position plays a critical role in pushing the boundaries of desktop and on-device inference and fine-tuning performance for next-generation SLM/LLMs.
To succeed in this position, you will need to stay up-to-date with advancements in AI model optimization and GPU acceleration technologies. Strong problem-solving skills and ability to work independently are essential. The ideal candidate will have a strong background in computer science and software engineering.
Key Responsibilities:
* Optimize AI model performance for various hardware platforms.
* Develop and implement new algorithms for AI model optimization.
* Collaborate with cross-functional teams to integrate optimized models into production environments.
Requirements:
* Bachelor's degree in Computer Science or related field.
* 5+ years of experience in AI model optimization and GPU acceleration.
* Strong understanding of AI frameworks and programming languages (e.g., TensorFlow, PyTorch).
Benefits:
* Competitive salary and benefits package.
* Opportunity to work with cutting-edge technologies.
* Collaborative and dynamic work environment.
About Us:
We are a leading provider of AI solutions, committed to delivering high-quality products and services that meet our customers' needs. We are seeking talented individuals who share our passion for innovation and excellence.