Apparently, there is a long-standing version incompatibility between Keras and TensorFlow: recent TensorFlow releases ship Keras 3 as `tf.keras`, while some libraries (including `transformers`' TF models) still run on the legacy Keras 2 API from the separate `tf-keras` package, and objects from one are not understood by the other. The fix differs for each version combination, so search for the specific versions you want to use.
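One commonly suggested workaround (a minimal sketch, not a guaranteed fix: it assumes TensorFlow 2.16+, where `tf.keras` resolves to Keras 3, and assumes the legacy package is installed via `pip install tf-keras`) is to switch TensorFlow back to legacy Keras before anything imports it:

```python
# Workaround sketch: force tf.keras to resolve to the legacy Keras 2
# implementation (the separate `tf-keras` package), so that the optimizer and
# the model's compile() come from the same Keras implementation.
import os

# Must be set BEFORE tensorflow is imported anywhere in the process; on Colab,
# restart the runtime and run this in the first cell.
os.environ["TF_USE_LEGACY_KERAS"] = "1"

import tensorflow as tf
from tensorflow.keras.optimizers import Adam  # now the legacy (tf-keras) Adam

optimizer = Adam(learning_rate=5e-5)
```

On older TensorFlow/Keras combinations this variable has no effect, which is why the fix depends on the versions involved.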
(The reproduction below is from a related GitHub issue, opened 07 Mar 2024 and closed 09 May 2024.)
```python
import tensorflow as tf
from datasets import load_dataset
from transformers import AutoTokenizer, TFAutoModelForSequenceClassification, DataCollatorWithPadding
from tensorflow.keras.optimizers import Adam
from tensorflow.keras.optimizers.schedules import PolynomialDecay
from tensorflow.keras.losses import SparseCategoricalCrossentropy


def prepare_imdb_dataset(tokenizer):
    """
    Prepares the IMDB dataset for training and validation.

    Args:
        tokenizer: The tokenizer to use for text tokenization.

    Returns:
        A tuple containing the tokenized training and validation datasets.
    """
    imdb = load_dataset("imdb")
    train_set = imdb['train'].map(lambda x: tokenizer(x['text'], truncation=True), batched=True)
    test_set = imdb['test'].map(lambda x: tokenizer(x['text'], truncation=True), batched=True)
    return train_set, test_set


tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")
model = TFAutoModelForSequenceClassification.from_pretrained("distilbert-base-uncased", num_labels=2)

train_set, test_set = prepare_imdb_dataset(tokenizer)

# Pad each batch dynamically to the longest sequence in that batch.
data_collator = DataCollatorWithPadding(tokenizer=tokenizer, return_tensors="tf")

tf_train_dataset = train_set.to_tf_dataset(
    columns=["attention_mask", "input_ids"],
    label_cols=["label"],
    shuffle=True,
    collate_fn=data_collator,
    batch_size=8,
)

tf_validation_dataset = test_set.to_tf_dataset(
    columns=["attention_mask", "input_ids"],
    label_cols=["label"],
    shuffle=False,
    collate_fn=data_collator,
    batch_size=8,
)

num_epochs = 1
# len() of a batched tf.data.Dataset is the number of batches, so this is the
# total number of optimizer steps across all epochs.
num_train_steps = len(tf_train_dataset) * num_epochs

lr_scheduler = PolynomialDecay(
    initial_learning_rate=5e-5, end_learning_rate=0.0, decay_steps=num_train_steps
)
optimizer = Adam(learning_rate=lr_scheduler)
loss = SparseCategoricalCrossentropy(from_logits=True)

model.compile(optimizer=optimizer, loss=loss, metrics=["accuracy"])  # <-- raises the ValueError below
model.fit(tf_train_dataset, validation_data=tf_validation_dataset, epochs=num_epochs)
```
```
Some weights of the PyTorch model were not used when initializing the TF 2.0 model TFDistilBertForSequenceClassification: ['vocab_layer_norm.weight', 'vocab_transform.weight', 'vocab_projector.bias', 'vocab_transform.bias', 'vocab_layer_norm.bias']
- This IS expected if you are initializing TFDistilBertForSequenceClassification from a PyTorch model trained on another task or with another architecture (e.g. initializing a TFBertForSequenceClassification model from a BertForPreTraining model).
- This IS NOT expected if you are initializing TFDistilBertForSequenceClassification from a PyTorch model that you expect to be exactly identical (e.g. initializing a TFBertForSequenceClassification model from a BertForSequenceClassification model).
Some weights or buffers of the TF 2.0 model TFDistilBertForSequenceClassification were not initialized from the PyTorch model and are newly initialized: ['pre_classifier.weight', 'pre_classifier.bias', 'classifier.weight', 'classifier.bias']
You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference.
Map: 100% 25000/25000 [00:23<00:00, 1086.84 examples/s]
Map: 100% 25000/25000 [00:20<00:00, 1304.86 examples/s]
---------------------------------------------------------------------------
ValueError Traceback (most recent call last)
<ipython-input-17-ac80246ded67> in <cell line: 55>()
53 optimizer = Adam(learning_rate=lr_scheduler)
54 loss = SparseCategoricalCrossentropy(from_logits=True)
---> 55 model.compile(optimizer=optimizer, loss=loss, metrics=["accuracy"])
56
57 model.fit(tf_train_dataset, validation_data=tf_validation_dataset, epochs=5)
(2 intermediate frames omitted)

/usr/local/lib/python3.10/dist-packages/tf_keras/src/optimizers/__init__.py in get(identifier, **kwargs)
332 )
333 else:
--> 334 raise ValueError(
335 f"Could not interpret optimizer identifier: {identifier}"
336 )
ValueError: Could not interpret optimizer identifier: <keras.src.optimizers.adam.Adam object at 0x79d9071160e0>
```
For this code:

```python
model = TFAutoModelForSequenceClassification.from_pretrained("bert-base-cased", num_labels=3)
model.compile(
    optimizer=tf.keras.optimizers.Adam(learning_rate=5e-5)
)
```

I get the error: `ValueError: Could not interpret optimizer identifier: <keras.src.optimizers.adam.Adam object at 0x7e0d28e55fc0>`

What should I do? I am using Google Colab.
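(A quick way to confirm the mismatch: the sketch below is a hypothetical diagnostic, assuming both the Keras 3 `keras` package and the legacy `tf-keras` package are installed, which the `tf_keras` path in the traceback above suggests is the case on Colab.)

```python
# Hypothetical diagnostic: show that two different Keras implementations are
# being mixed. Assumes both `keras` (v3) and `tf-keras` (legacy v2) exist in
# the environment, as the tf_keras path in the traceback suggests.
import keras      # standalone Keras 3
import tf_keras   # legacy Keras 2, used internally by transformers' TF models
import tensorflow as tf

print("keras:", keras.__version__)        # e.g. 3.x
print("tf_keras:", tf_keras.__version__)  # e.g. 2.x

# On TF 2.16+ (without TF_USE_LEGACY_KERAS), tf.keras resolves to Keras 3,
# so this optimizer is a Keras 3 object...
opt = tf.keras.optimizers.Adam(learning_rate=5e-5)
print(type(opt).__module__)  # keras.src.optimizers.adam -> matches the error

# ...while the model's compile() is implemented by tf_keras, which cannot
# interpret Keras 3 optimizer instances, hence the ValueError.
```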