Eagle3 for Llama3 - Online#
Introduction#
This document provides a step-by-step guide on how to train the EAGLE3 model for the Llama3.1-8B-Instruct model in an online manner. In online training, we generate the hidden states required by EAGLE3 draft model on the fly during training. This example is using ShareGPT dataset for training, the performance is not optimal due to the size and limited coverage of the dataset. If you look for optimal performance, we recommend you to try more diverse datasets such as Perfect-Blend. We have also included a section on training on Perfect-Blend dataset at the end of this document.
Training on Perfect-Blend dataset#
Step 1. Prepare Perfect-Blend dataset#
First of all, we should download the dataset.
python ./scripts/prepare_data.py --dataset perfectblend
Step 2. Launch Online Training#
We just need to change the --train-data-path to the path of the Perfect-Blend dataset (e.g. ./cache/dataset/perfectblend_train.jsonl), then we can launch training smoothly.