Innotech Vietnam

33 Ba Vi, TP Hồ Chí Minh

Company Size : 25-99

View more

Job Summary

25-99

Outsourcing

Việt Nam

Senior AI Engineer

Innotech Vietnam

Tan Binh, TP Hồ Chí Minh

  • English
  • Experienced (Non-Manager)
  • Full Time
  • Negotiable
  • Posted:03/03/2025
  • 1

Job description

Overview of job

As an AI Engineer Level 3, you will lead LLM development, AI infrastructure scaling, and distributed training. You will optimize high-performance AI systems, manage multi-GPU environments, and deploy AI models at scale. Build and lead a team, enhancing overall technical capabilities and performance. 

Key Responsibilities: 

  • Make a proposal of AI solution in align with a set of customer requirements and goal. 
  • Build and lead a team, enhancing overall technical capabilities and performance. 
  • Optimize Large Language Models (LLMs) & AI models, including: 
  • Efficient training of LLMs (DeepSpeed, FSDP, LoRA) 
  • Deploying models with Kubernetes, Ray, Triton Inference Server 
  • Optimizing model inference speed with ONNX, TensorRT, GGUF, vLLM 
  • Implementing Retrieval-Augmented Generation (RAG) pipelines 
  • Applying AI distillation and quantization techniques 
  • Work with HPC infrastructure and distributed AI computing. 
  • Implement system monitoring tools (htop, tcpdump, iostat, netstat). 
  • Troubleshoot AI system performance bottlenecks. 
  • 13th-month salary calculated based on actual working time at INNOTECH.
  • PVI Healthcare Insurance for all employees
  • PVI Healthcare Insurance for family
  • Moon cake, Tet Gift
  • Quarterly/project kickoff team-building budget.
  • Resolution laptop and monitor provided for work.
  • Performance bonus plan.
  • Employee referral bonus: 2,000,000 – 10,000,000 VND (depending on level/role).
  • Working hours: Monday to Friday.
  • Annual company trips / Football club / Climbing club / Year-end party.
  • Learning and certification support.
  • Value-oriented, international working environment with a flexible culture.

Job Requirement

  • Bachelor’s or Master’s degree in AI, Computer Science, Machine Learning or a related field. 
  • 3+ years of experience in LLM model development and optimization. 
  • Hands-on experience with distributed AI training and HPC for AI workloads. 
  • Expertise in GPU acceleration (CUDA, TensorRT, vLLM). 
  • Deep understanding of LLM architectures (GPT, Llama, Falcon, T5, Mistral). 
  • Experience in cloud AI deployment (Kubernetes, OpenStack, Ray, Triton). 
  • Strong ability to troubleshoot system errors and optimize AI workloads. 
  • English communication, reading, writing professional

Languages

  • English

    Speaking: Intermediate - Reading: Intermediate - Writing: Intermediate

Technical Skill

  • AI (Artificial Intelligence)
  • Machine Learning
  • LLM
  • OpenStack
  • Kubernetes
  • CUDA
  • GPT
  • GPU
  • HPC
  • TensorRT
  • Llama

COMPETENCES

  • Communication Skills

BUSINESS PROFILE

Innotech Vietnam strives for the creation, innovation and development of advanced solutions.

We provide a wide range of software services to meet all service requirements for customers.
 
Innotech is a well established software company in Vietnam to serve various clients in Vietnam, Japan, America, Australia, and Singapore. We translate these advanced technologies into value for our customers through our professional solutions and services business worldwide.