I’m experimenting with GPT-Neo variants, and I wonder whether these models have different checkpoints? I.e., independently trained versions with different weights. If so, how can I access those?
Related topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
Availability of checkpoints earlier in training? | 0 | 214 | April 1, 2021 | |
GPT-Neo 125M Squad model? | 1 | 597 | December 17, 2022 | |
Is there any reason why GPT-Neo would behave differently (fundamentally) from GPT2? | 0 | 431 | January 15, 2023 | |
Difference between checkpoints | 0 | 400 | February 21, 2023 | |
Checkpoint vs model weight | 2 | 4839 | October 12, 2020 |