Intel OpenVINO backend

Hi! We would like to start a discussion about adding an Intel OpenVINO backend to the Transformers library.

If you have not heard of OpenVINO before, it is a library that accelerates deep learning inference (not training, only inference of pretrained models) on Intel Architecture (CPU, GPU, VPU and others). The library is distributed on PyPI and developed in the open.
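For context, this is roughly what inference looks like through OpenVINO's own Python API, with no Transformers integration on top (a minimal sketch using the openvino.runtime API; the model path and input shape are placeholder assumptions):

import numpy as np
from openvino.runtime import Core

core = Core()
model = core.read_model("model.xml")            # placeholder path to an OpenVINO IR model
compiled = core.compile_model(model, "CPU")     # compile the graph for the target Intel device

dummy_ids = np.zeros((1, 128), dtype=np.int64)  # placeholder input batch
results = compiled([dummy_ids])                 # run one synchronous inference request
logits = results[compiled.output(0)]            # first output tensor, as a NumPy array

The backend proposed below hides these steps behind the familiar from_pretrained/forward workflow.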

For GitHub-side discussion there is an open issue: Intel OpenVINO inference backend · Issue #13987 · huggingface/transformers · GitHub, and the latest proposal is here: Intel OpenVINO backend by dkurt · Pull Request #1 · dkurt/transformers · GitHub

Example (QA):

from transformers import AutoTokenizer, OVAutoModelForQuestionAnswering

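# Load the tokenizer and the quantized (INT8) SQuAD model from the Hub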
tok = AutoTokenizer.from_pretrained("dkurt/bert-large-uncased-whole-word-masking-squad-int8-0001")
model = OVAutoModelForQuestionAnswering.from_pretrained("dkurt/bert-large-uncased-whole-word-masking-squad-int8-0001")

context = """
Soon her eye fell on a little glass box that
was lying under the table: she opened it, and
found in it a very small cake, on which the
words “EAT ME” were beautifully marked in
currants. “Well, I’ll eat it,” said Alice, “and if
it makes me grow larger, I can reach the key;
and if it makes me grow smaller, I can creep
under the door; so either way I’ll get into the
garden, and I don’t care which happens!”
"""

question = "Where should Alice go?"

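# Encode question and context as one sequence, joined by the tokenizer's separator token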
input_ids = tok.encode(question + " " + tok.sep_token + " " + context, return_tensors="pt")

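# Run the forward pass; with the OV* classes this executes through OpenVINO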
outputs = model(input_ids)

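# Most probable start and end positions of the answer span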
start_pos = outputs.start_logits.argmax()
end_pos = outputs.end_logits.argmax() + 1

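# Slice the answer tokens out of the input and decode them back to text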
answer_ids = input_ids[0, start_pos:end_pos].tolist()  # plain Python ints for convert_ids_to_tokens
answer = tok.convert_tokens_to_string(tok.convert_ids_to_tokens(answer_ids))

print("Question:", question)
print("Answer:", answer)

We have opened a pull request: Intel OpenVINO backend (inference only) by dkurt · Pull Request #14203 · huggingface/transformers · GitHub. Any feedback is welcome!