RTX 6000 Ada slower then 3090

Finally got RTX 6000 Ada and trying to compare with 3090. I downloaded 12 billion language model from Open Assistant. And I get the following results:
7.8 tokens/sec - 3090
5.3 tokens/sec - 6000 Ada
Although if i use only pytorch with simple model 6000 Ada is much faster.

My specifications:
-Intel-8600k processor
-GIGABYTE LGA1151-v2 H370 Aorus
-48GB RAM
-MSI 3090 Ventus 24gb
-PNY RTX 6000 Ada 48 gb

My software features:
python 3.9.13
pytorch 2.0
cuda 11.8