Project description
Luxoft is searching for talented developers with GPU compute and performance profiling experience to join the rapidly growing team. We are seeking an experienced individual proficient in GPGPU applications to join our team. The primary responsibility of this role will be to lead the effort in optimizing HIP kernels on AMD GPUs. The candidate should possess a strong background in GPU computing, parallel programming, and a deep understanding of CUDA or HIP frameworks. Additionally, familiarity with optimization techniques is highly desirable.
Responsibilities
- The main task will be to help optimize HIP kernels for specific AMD hardware
- Collaborate with development teams to optimize and enhance GPU-accelerated applications
- Debug, profile, and fine-tune code for performance improvements
- Stay updated with the latest advancements in GPU architectures and programming models
SKILLS
Must have
Proficiency with C++ and low-level programming (at least C++ 17)Proficiency in CUDA or HIP / ROCm programmingSolid understanding of GPU architectures, parallel programming models, and optimization techniquesStrong problem-solving skills and the ability to work in a collaborative environmentOne of AI / ML / DL / NN / NLP / Computer Vision experiencePythonNice to have
LinuxCPU Intrinsics (AVX / SSE)GPU AssemblerProfilinggdb / LLDBJinja2 or similar templating engines#J-18808-Ljbffr