+1 I have a similar issue when I fine-tune on a GPU. Training does not take long, but prediction on the development set takes too long.