AI Research

    Key Responsibilities

    1. Data Engineering for Pretraining

    • Build and maintain scalable pipelines for text collection, cleaning, deduplication, filtering, and quality scoring.

    • Process large-scale Vietnamese and multilingual datasets.

    • Implement tokenization workflows, corpus sharding, mixture sampling, and dataset balancing.

    • Develop automated dataset validation and QA tools.

    2. Model Training & Optimization

    • Support distributed LLM training using DeepSpeed, Megatron-LM, FSDP, etc.

    • Optimize throughput, memory, and multi-node GPU performance.

    • Execute large-scale LLM experiments and troubleshoot issues.

    • Conduct fine-tuning, instruction tuning, and alignment.

    3. Infrastructure & Engineering

    • Work with multi-GPU/multi-node clusters (Slurm, Docker/Singularity).

    • Maintain experiment tracking systems.

    • Develop reusable tooling for logging, checkpointing, and evaluation.

    4. Evaluation & Benchmarking

    • Prepare multilingual and Vietnamese benchmark suites.

    • Implement automated evaluation pipelines.

    • Analyze results to inform improvements.

    Requirements

    • Bachelor’s/Master’s/PhD in CS/AI/ML or related fields.

    • Strong Python skills, PyTorch expertise.

    • Understanding of transformers and tokenization.

    • Experience with GPU clusters, Linux, Bash.

    • Familiarity with distributed training frameworks.

    Preferred:

    • Experience with large-scale datasets.

    • Knowledge of Vietnamese NLP.

    • Experience with MoE, long-context models, deduplication.

    • Open-source contributions.

    • Experience with quantization, distillation, compression.

    Benefits

    • Competitive salary: 13 months/year.

    • Full insurance + premium healthcare package.

    • Employee privileges across Vingroup services.

    • Large-scale projects with opportunities to lead national-level products.

    HOW TO APPLY: Please send your CV to the consultant in charge: 
    Ms. My Do Huyen
    Email: my.do@ev-search.com 
    All applications will be considered without regard to race, color, religion, sex (inclusing pregnancy and fender identity), national origion, political affiliation, sexual orientation, mariatal status, disability, genetic information, age, membership in an employee organization, parental status, military service or other nonmerit factor

    Interested in this position?

    Get in touch with us now!

    Quick Apply
    Email