I am running inference using the pipeline API, but I get the following warning, which recommends using the Dataset API. How can I do so?
transformers4/lib/python3.8/site-packages/transformers/pipelines/base.py:899:
UserWarning: You seem to be using the pipelines sequentially on GPU.
In order to maximize efficiency please use a dataset
warnings.warn()
I found the following help in the documentation, but I haven’t implemented it yet because I’m just doing a POC with 50 documents. Sharing it now in case it helps you get started!
I’m having the same issue while using T5-base-grammar-correction for grammar correction on a dataframe with a text column:
from happytransformer import HappyTextToText
from happytransformer import TTSettings
from tqdm.notebook import tqdm

tqdm.pandas()

happy_tt = HappyTextToText("T5", "./t5-base-grammar-correction")
beam_settings = TTSettings(num_beams=5, min_length=1, max_length=30)

def grammar_pipeline(text):
    # Prefix each input with the "gec:" task token the model expects
    text = "gec: " + text
    result = happy_tt.generate_text(text, args=beam_settings)
    return result.text

df['new_text'] = df['original_text'].progress_apply(grammar_pipeline)
It runs very slowly and raises the same UserWarning:
/home/.local/lib/python3.6/site-packages/transformers/pipelines/base.py:908: UserWarning: You seem to be using the pipelines sequentially on GPU. In order to maximize efficiency please use a dataset
UserWarning,
How can I implement this to utilise the GPU efficiently?
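One possible approach (a sketch, not the only option): drop the row-by-row `.apply()` and call a plain `transformers` text2text-generation pipeline over the whole column at once, which lets it batch on the GPU. This assumes the Hub copy of the model (`vennify/t5-base-grammar-correction`) — swap in your local path if you have one — and the `df` here is just a stand-in for your dataframe:

```python
import pandas as pd
import torch
from transformers import pipeline

# Stand-in for your dataframe with an 'original_text' column.
df = pd.DataFrame({"original_text": [
    "This sentences has bad grammar.",
    "He go to school every days.",
]})

# Hub copy of the model used above; replace with your local path if preferred.
pipe = pipeline(
    "text2text-generation",
    model="vennify/t5-base-grammar-correction",
    device=0 if torch.cuda.is_available() else -1,
)

# One call over the whole column lets the pipeline batch on the GPU,
# instead of one forward pass per row via .apply().
inputs = ("gec: " + df["original_text"]).tolist()
outputs = pipe(inputs, batch_size=16, num_beams=5, min_length=1, max_length=30)
df["new_text"] = [o["generated_text"] for o in outputs]
print(df["new_text"].tolist())
```

Tune `batch_size` to your GPU memory; the generation kwargs (`num_beams`, `min_length`, `max_length`) mirror the `TTSettings` from the happytransformer snippet.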