Hi I wonder if anyone here tried to train a reward model for image pairs based on Bradley Terry loss? I couldn’t really find anything like this online, so I wonder how I would setup something like this?
Related topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
Help getting started and choosing AI model | 0 | 421 | January 25, 2024 | |
Wav2Vec2: how much context is available for self-attention | 0 | 255 | March 21, 2023 | |
Pre-trained Sandwich transformer model | 0 | 300 | November 5, 2020 | |
There are so many models out there, which would help perform retrieval | 0 | 182 | September 8, 2023 | |
Just starting out can't find anything | 0 | 143 | April 25, 2024 |