We've started compiling a wiki of how different models were pre-trained. Please add your knowledge there, thanks!