Senior Machine Learning Engineer: Post Training & Speculative Decoding

September 5, 2025

Apply for this job

Job Description

Description

Senior Machine Learning Engineer: Post Training & Speculative Decoding

Join to apply for the Senior Machine Learning Engineer: Post Training & Speculative Decoding role at Groq

Senior Machine Learning Engineer: Post Training & Speculative Decoding

Join to apply for the Senior Machine Learning Engineer: Post Training & Speculative Decoding role at Groq

Mission: We are seeking a highly skilled Machine Learning Engineer to join our advanced model development team. This role focuses on pre-training, continued training, and post-training of models , with a particular emphasis on draft model optimization for speculative decoding and quantization-aware training (QAT) . The ideal candidate has deep experience with training methodologies, open-weight models, and performance-tuning for inference.
Responsibilities & Outcomes

  • Lead pre-training and post-training efforts for draft models tailored to speculative decoding architectures.
  • Conduct continued training and post-training of open-weight models for non-draft (standard) inference scenarios.
  • Implement and optimize quantization-aware training pipelines to enable low-precision inference with minimal accuracy loss.
  • Collaborate with model architecture, inference, and systems teams to evaluate model readiness across training and deployment stages.
  • Develop tooling and evaluation metrics for training effectiveness, draft model fidelity, and speculative hit-rate optimization.
  • Contribute to experimental designs for novel training regimes and speculative decoding strategies.

Ideal Candidates Have/are

  • 5+ years of experience in machine learning, with a strong focus on model training.
  • Proven experience with transformer-based architectures (e.g., LLaMA, Mistral, Gemma).
  • Deep understanding of speculative decoding and draft model usage.
  • Hands-on experience with quantization-aware training, including PyTorch QAT workflows or similar frameworks.
  • Familiarity with open-weight foundation models and continued/pre-training techniques.
  • Proficient in Python and ML frameworks such as PyTorch, JAX, or TensorFlow.

Preferred Qualifications

  • Experience optimizing models for fast inference and sampling in production environments.
  • Exposure to distributed training, low-level kernel optimizations, and inference-time system constraints.
  • Publications or contributions to open-source ML projects.

Attributes Of a Groqster

  • Humility – Egos are checked at the door
  • Collaborative & Team Savvy – We make up the smartest person in the room, together
  • Growth & Giver Mindset – Learn it all versus know it all, we share knowledge generously
  • Curious & Innovative – Take a creative approach to projects, problems, and design
  • Passion, Grit, & Boldness – no limit thinking, fueling informed risk taking

If this sounds like you, we’d love to hear from you!
Compensation: At Groq, a competitive base salary is part of our comprehensive compensation package, which includes equity and benefits. For this role, the base salary range is TBD, determined by your skills, qualifications, experience and internal benchmarks.

Seniority level

  • Seniority level

    Mid-Senior level

Employment type

  • Employment type

    Full-time

Job function

  • Job function

    Engineering and Information Technology

  • Industries

    Semiconductor Manufacturing

Referrals increase your chances of interviewing at Groq by 2x

Sign in to set job alerts for “Machine Learning Engineer” roles.

Frontend Software Engineer (Remote – Canada)

Toronto, Ontario, Canada $153,000.00-$244, hours ago

Toronto, Ontario, Canada $140,000.00-$170,000.00 2 weeks ago

Full Stack Software Engineer (Remote Canada)

Python and Kubernetes Software Engineer – Data, Workflows, AI/ML & Analytics

Principal Software Engineer, Data Platform

Python and Kubernetes Software Engineer – Data, AI/ML & Analytics

Python and Kubernetes Software Engineer – Data, Workflows, AI/ML & Analytics

Python and Kubernetes Software Engineer – Data, AI/ML & Analytics

Senior Software Engineer – STCE New Products

Senior Software Development Engineer in Test

Sr Machine Learning Engineer – Fintech Foundation (100% Remote – Canada)

AI Process Improvement Engineer- Black Belt

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

#J-18808-Ljbffr

Company

Groq

Location

Toronto

Country

Canada

Salary

100.000

URL

https://en-ca.whatjobs.com/coopob__cpl___291_2626831__3337?utm_source=3337&utm_medium=feed&keyword=Senior-Machine-Learning&location=Toronto&geoID=6225