Chapter 1 questions

rajen2023 · October 28, 2023, 6:33am

I do not see chapter 10-12 though it is mentioned in this Introduction chapter that ‘Chapters 9 to 12 go beyond NLP, and explore how Transformer models can be used to tackle tasks in speech processing and computer vision’.

May I know if chapter 11-12 are under development and yet to release ?

pulkitagrawal2007 · December 12, 2023, 7:09am

Hi,

From the original architecture, decoders have two inputs- first the text prompted/generated by it and the other is the embedding
from encoders. GPT like models are classified as decoders. So when GPT models generate text, the prompt is provided by the user, but where do they get the input embedding from ?

cjonas · January 1, 2024, 6:44pm

Better Encoder-Decoders example

The example used by in the encoders/decoders video doesn’t really do much to help understand the actual inputs and outputs.

On the Encoder side, it has:

Welcome to NYC

But on the Decoder side, it just has Start of Sequence in the input and then Word_1, Word_2, etc.

This left me pretty confused as to what you would actually pass into the model. In this example, would the “Wecome to NYC” input be an inference input (prompt)? Or are the Encoder outputs evaluated during training and then embedded into the decoder?

It would be very helpful to actually provide a full end-to-end example showing a real life use-case (maybe a “completion” style model)

pornpitaksuko · February 1, 2024, 5:47pm

What optimization metrics are used to train large language models?

JacquesRoth · February 11, 2024, 8:22pm

I tried out the question answering example from Chapter 1. My context was from a Wikipedia article about 3772 characters long and asked 11 simple questions very carefully worded to match answers to be easily extracted from the context. It amazes me when it gets the right answers but only 7/11 were at least partially correct. I tried a model trained with Squad v2 and got identical results. What might I need to do to get more appropriate answers? Eventually I want to use a large set of documentation as the context from which simple questions would extract answers from.

JacquesRoth · February 11, 2024, 8:28pm

I was experimenting with the question answering code in Chapter 1 and am trying to improve the results. I am using TensorFlow and would like to perform the operations without the pipeline, perhaps by tokenizing and breaking up the context somehow I can get better results. So far I can’t get the right sequence to replace the pipeline. I have been looking for an example that would show how to do this, but have not found any. I don’t want to get into training the model at this point as I am using pretrained Squad v1/v2. Can anyone point to an example?

JacquesRoth · February 12, 2024, 12:56am

There is a video on YouTube “Inside the Question Answer Pipeline” that shows some code. But there are two variables, start_pos and end_pos that appear to be undefined. Does anyone know what they are supposed to be?

The problem is with the TenbsorFlow version of the video. The Pytorch version does not use either of these variables and creates the scores matrix using:
scores = start_probabilities[:, None] * end_probabilities[None,:]

Got this working with small contexts.

OniMenoKyo · March 1, 2024, 9:29pm

from transformers import pipeline

classifier = pipeline(“sentiment-analysis”)
classifier(“I’ve been waiting for a HuggingFace course my whole life.”)

I am trying to follow transformers-course
says i need a model but on the page there is no model or hit to its usage. the whole examples generate errors instead of the expected results. is there an update to the syntax?
thank for your insights on this

borissabo · March 15, 2024, 10:00am

Hello, in decoder models section, you mentioned GPT-2 as a decoder example model. However, the GPT-2 uses both, the encoder and decoder, as it is complete language model, right?

daysidanae · March 15, 2024, 11:54am

I’m new here ! Thanks for providing.

amarcu78 · May 3, 2024, 10:31pm

What’s a good way to search the models? for example, to find the Part Of Sentence (as recommended in the curse), I searched for pos and went through some results until I found something. But what if I wanted to find POS in Spanish for example? trying combinations and manually looking through is cumbersome and I may be missing results

mbigger · May 25, 2024, 6:23pm

Never a good sign if the very first activity in a course ends in an runtime error:

At least one of TensorFlow 2.0 or PyTorch should be installed. To install TensorFlow 2.0, …

The install and import code in the supplied notebook are incomplete:

!pip install datasets evaluate transformers[sentencepiece]
from transformers import pipeline

when running it in Amazone SageMaker Studio Lab. What exactly needs to be installed?

myHuggingFace001 · June 5, 2024, 5:44am

yes, need to install.

myHuggingFace001 · June 5, 2024, 5:48am

This is the all codes.

mbigger · June 5, 2024, 1:38pm

What is this for a nonsense reply? The supplied notebook with the listed installs produces a runtime error because of missing packages.

danbq2289 · June 11, 2024, 2:02am

I’m not part of the staff but from what I can see, in chapter 0, they recommend using Google Colab (probably because it already comes with Pytorch and Tensorflow, which are necessary for the transformers library to work)

If you use another framework, it probably doesn’t have neither Pytorch or Tensorflow, so you have to install those first (probably just running “pip install torch” would work), and then run the provided code.

RaquelFS · June 18, 2024, 4:53pm

I am trying to run the first code in Chapter 1 and I am getting this message:

ERROR: pip’s dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
datasets 2.20.0 requires pyarrow>=15.0.0, but you have pyarrow 14.0.1 which is incompatible.
datasets 2.20.0 requires requests>=2.32.2, but you have requests 2.31.0 which is incompatible.

How to correct this error?

Thanks!

iamanshuljain · June 22, 2024, 10:18am

I’m getting the following error why I run the first code snippet on Google Colab. Can someone help me identify the problem here?

ERROR: pip’s dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
cudf-cu12 24.4.1 requires pyarrow<15.0.0a0,>=14.0.1, but you have pyarrow 16.1.0 which is incompatible.
google-colab 1.0.0 requires requests==2.31.0, but you have requests 2.32.3 which is incompatible.
ibis-framework 8.0.0 requires pyarrow<16,>=2, but you have pyarrow 16.1.0 which is incompatible.

Razaneee · July 7, 2024, 1:22pm

can i use text generation based on feature-extraction?

richardprobe · July 8, 2024, 3:12pm

excited to get started!

Topic		Replies	Views
VisionEncoderDecoder/TrOCR Models	0	702	October 21, 2021
Chapter 3 questions Course	141	10206	June 8, 2025
T5 Model, T5 Encoder Model and T5 Model for Conditional Generation Beginners	1	1294	November 20, 2022
Using an encoder-decoder model for Recognizing Textual Entailment (GLUE task) Models	0	170	March 31, 2024
Which model of transformers to use if I want to do multiclassification of a pair of sentences containing a questionair 🤗Transformers	0	251	February 18, 2022

Chapter 1 questions

Better Encoder-Decoders example

Related topics