Reasoning Distillation with Huggingface Trainer

Pavarissy · November 8, 2023, 1:47pm

I want to do reasoning distillation, i.e. distill the rationales of a teacher model into a student model. I already have rationales generated by GPT-3.5, which acts as the teacher model, stored in a JSON file. However, to run the distillation with the Hugging Face Trainer, I need to convert them into a dataset that can be read with load_dataset, in a format like this:

# {
#     "data": [
#         {
#             "text": "..."
#         },
#         {
#             "text": "..."
#         },
#         {
#             "text": "..."
#         },
#         ...
#     ]
# }
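
For reference, here is a minimal sketch of writing rationales in that layout and loading them back. It assumes the rationales are already available as a list of strings; the file name `rationales.json`, the helper names, and the example strings are hypothetical. The `field="data"` argument tells the JSON loader of `datasets` that the records are nested under the `data` key.

```python
import json
import os
import tempfile


def write_rationale_file(rationales, path):
    """Write rationales in the {"data": [{"text": ...}, ...]} layout."""
    payload = {"data": [{"text": r} for r in rationales]}
    with open(path, "w", encoding="utf-8") as f:
        json.dump(payload, f, ensure_ascii=False, indent=2)
    return payload


def load_rationale_dataset(path):
    """Load the file with Hugging Face datasets (needs `pip install datasets`)."""
    # Imported lazily so the writer above works without the library installed.
    from datasets import load_dataset

    # field="data" selects the list of records stored under the "data" key.
    return load_dataset("json", data_files=path, field="data", split="train")


# Hypothetical rationales, for illustration only.
example_path = os.path.join(tempfile.gettempdir(), "rationales.json")
example_payload = write_rationale_file(
    ["Rationale for example 1.", "Rationale for example 2."], example_path
)
```

Calling `load_rationale_dataset(example_path)` should then give a dataset with a single `text` column, which can be tokenized and passed to the Trainer.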

Since I am working on a retrieval-augmented generation task, my question is: do we need to include the retrieved document in "text" along with the rationale, or can we put only the rationale there?
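
To make the two options concrete, here is a sketch that builds the `text` field either with or without the retrieved document. The prompt template, the `Context` / `Question` / `Rationale` labels, the function name, and the example record are all assumptions for illustration, not a required format. Which variant works better is a design choice. One consideration is whether the student will see retrieved passages at inference time.

```python
def build_text(question, rationale, retrieved_doc=None):
    """Format one training example; add the retrieved passage only if given."""
    parts = []
    if retrieved_doc is not None:
        parts.append(f"Context: {retrieved_doc}")
    parts.append(f"Question: {question}")
    parts.append(f"Rationale: {rationale}")
    return "\n".join(parts)


# Hypothetical record, for illustration only.
question = "Who wrote Hamlet?"
rationale = "The passage names Shakespeare as the author."
document = "Hamlet is a tragedy by William Shakespeare."

with_doc = build_text(question, rationale, retrieved_doc=document)
without_doc = build_text(question, rationale)

# Same {"data": [{"text": ...}]} layout as in the post above.
records = {"data": [{"text": with_doc}, {"text": without_doc}]}
```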
