LLM Engineer (SFT / RLHF / Post-Training)

Achieve Group

We’re partnering with a leading global tech platform building next-generation AI systems used at massive scale.

This team is focused on LLM post-training, reasoning, and alignment — not just prompt engineering.

If you’ve worked on fine-tuning, RLHF, or large-scale model training, this is where you level up.

💡 What You’ll Be Working On

  • LLM post-training pipelines (SFT, DPO, RLHF)
  • Improving reasoning, alignment, and model reliability
  • Training and optimising models from 7B to 100B+ parameters
  • Building agent systems / tool-use / multi-step reasoning workflows
  • Working with large-scale GPU clusters and distributed training

🎯 What We’re Looking For

  • Hands-on experience with LLM fine-tuning (SFT, LoRA, etc.)
  • Exposure to RLHF / DPO / PPO / GRPO (any is a strong plus)
  • Experience with LLaMA, Qwen, Mistral or similar models
  • Strong Python + deep learning frameworks (PyTorch, DeepSpeed, etc.)
  • Background in LLM, NLP, or applied ML systems

🌍 Why This Role

  • Remote work
  • Fast-track career development
  • Work on cutting-edge LLM systems at global scale
  • Fast-moving, high-impact environment
  • Strong exposure to next-gen AI (reasoning, agents, alignment)
  • Clear career acceleration in the AI space

This role involves collaboration with Mandarin-speaking stakeholders; proficiency in Mandarin is highly beneficial.

📩 Interested?

Connect with me and drop me a message.

Only shortlisted candidates will be contacted.

YOUR SUCCESS IS OUR ACHIEVEMENT!

All applications will be treated with the strictest confidence.

By submitting any application or résumé to us, you will be deemed to have agreed and consented to us collecting, using, retaining and disclosing your personal information to prospective employers for their consideration.

Cessation Of Collection Of Full NRIC Numbers:

In compliance with the Personal Data Protection Act and our commitment to protecting candidates’ personal data, Achieve Group will cease to collect, process or use full NRIC numbers during our screening and job application process.

Kindly ensure that the résumé you provide to us does not contain your full NRIC number or full home address.

To apply for this job please visit www.linkedin.com.
