I’m experimenting with GPT-Neo variants, and I wonder whether these models have different checkpoints? I.e., independently trained versions with different weights. If so, how can I access those?
Related topics
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| Availability of checkpoints earlier in training? | 0 | 214 | April 1, 2021 | |
| Checkpoint vs model weight | 2 | 4890 | October 12, 2020 | |
| GPT-Neo 125M Squad model? | 1 | 599 | December 17, 2022 | |
| Is there any reason why GPT-Neo would behave differently (fundamentally) from GPT2? | 0 | 434 | January 15, 2023 | |
| Checkpoints - still confused | 0 | 1665 | July 30, 2022 |