Zero-shot Classification With Generative Language Models

I’m trying to use Llama-2 for Zero-Shot (binary) classification. Can anyone point me towards a direction on how this is done correctly? One simple way is ofc. to compare the probability of the model predicting true vs. false. But I’m not sure if there is better approaches?

Best,
Simon