I’m trying to train an unconditional diffusion model on a greyscale image dataset, using diffusers_training_example.ipynb on Google Colab connected to my local GPU. When running the ‘Let’s train!’ cell I am getting this Accelerate error. I initially tried downgrading Accelerate from 1.3.0 to 0.3.0 and 0.27.0, as some forums suggested, but this made no difference. Any advice would be great! Thank you.
There is a possibility that it is simply a bug in Accelerate…
I see, so it seems the pull request was resolved? What do I need to do to replicate that? I would have assumed I was using the latest Accelerate with the supposed fix.
Maybe try installing from source:
pip install git+https://github.com/huggingface/accelerate
But I think it’s also merged into the pip release, so maybe it’s a different error.
Hello,
It seems like you’re encountering an issue where the logging_dir argument is causing a problem in Accelerator.__init__() during your training setup. This error might be related to mismatched library versions or changes in the API.
Here are a few steps you can try to resolve the issue:
1. Ensure Version Compatibility: Since you’ve already tried downgrading Accelerate, make sure that all dependencies (such as diffusers and transformers) are compatible with the version of Accelerate you are using. Sometimes, even if Accelerate is downgraded, the version of diffusers may require a more recent version of Accelerate. You can update diffusers to the latest version with:
   pip install --upgrade diffusers
2. Check for the logging_dir Argument in the Code: The error suggests that the logging_dir argument is not expected by Accelerator.__init__(). This might be due to a change in the API or a version mismatch. Check where your code passes logging_dir to the Accelerator and whether it still needs to be included there; either remove it or move it to wherever logging is configured (see the sketch after this list). For instance:
   from accelerate import Accelerator
   accelerator = Accelerator()  # Ensure no 'logging_dir' argument here
3. Update Accelerate: Sometimes errors like this occur because of an outdated or incompatible version of the Accelerate library. Make sure you’re using the latest stable version of Accelerate. To update, run:
   pip install --upgrade accelerate
4. Check for Additional Arguments: If the logging_dir argument is still needed for logging, make sure you’re passing it to the logging setup and not directly to Accelerator.__init__(). You might need to pass it to a different component of the training pipeline (e.g., TensorBoard, wandb, or the Trainer class).
5. Restart Your Runtime: After updating or downgrading the libraries, restart your runtime in Google Colab to clear any stale state and ensure the updated versions are actually being used.
6. Consider using logging_dir with the Trainer: If you’re using Hugging Face’s Trainer or another high-level training API, the logging_dir argument might be better placed there, not directly in the Accelerator initialization.
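As far as I can tell, newer Accelerate releases moved logging_dir out of Accelerator.__init__() and into ProjectConfiguration, which is why older notebook code that passes it directly now fails. Here is a minimal sketch of the change described in steps 2 and 4, assuming the training cell constructs the Accelerator itself (the directory names below are just placeholders):
from accelerate import Accelerator
from accelerate.utils import ProjectConfiguration
# Old pattern that raises "unexpected keyword argument 'logging_dir'"
# on newer Accelerate versions:
# accelerator = Accelerator(log_with="tensorboard", logging_dir="logs")
# Newer pattern: wrap the directories in a ProjectConfiguration instead
project_config = ProjectConfiguration(project_dir="output", logging_dir="logs")  # placeholder paths
accelerator = Accelerator(log_with="tensorboard", project_config=project_config)
Alternatively, simply removing the logging_dir keyword (as in step 2) should also get past the error, though you may then need to configure the logging directory elsewhere.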
If these steps don’t resolve the issue, you might want to look further into compatibility between diffusers, accelerate, and the other training components you’re using.
Hope this helps, and let me know if you need further assistance!
Hi, thanks for the extensive options! Upgrading Accelerate or diffusers did not solve the problem. Can you expand a little on what you mean in method 2? I’m not sure I fully understand how I can check where it’s being passed using the two lines of code you provided. Also, I am using a pre-written training script, i.e. any calls to Accelerator() are made inside functions I have not edited. Thanks again for your help.
This is one part of my code; I think it will be useful for you.
from transformers import AutoModelForSequenceClassification, AutoTokenizer, get_scheduler
from datasets import load_dataset
from torch.utils.data import DataLoader
from torch.optim import AdamW
import torch
from accelerate import Accelerator
# Initialize the Accelerator
accelerator = Accelerator()
# Load dataset and tokenizer
dataset = load_dataset("imdb")
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
# Tokenize the dataset
def preprocess_function(examples):
    return tokenizer(examples["text"], padding="max_length", truncation=True)

tokenized_datasets = dataset.map(preprocess_function, batched=True)
tokenized_datasets = tokenized_datasets.remove_columns(["text"])
tokenized_datasets = tokenized_datasets.rename_column("label", "labels")  # the model's forward() expects "labels", not "label"
tokenized_datasets = tokenized_datasets.with_format("torch")
train_dataset = tokenized_datasets["train"]
test_dataset = tokenized_datasets["test"]
# DataLoaders
train_dataloader = DataLoader(train_dataset, shuffle=True, batch_size=8)
test_dataloader = DataLoader(test_dataset, batch_size=8)
# Model and optimizer
model = AutoModelForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=2)
optimizer = AdamW(model.parameters(), lr=5e-5)
# Scheduler
num_training_steps = len(train_dataloader) * 3 # 3 epochs
lr_scheduler = get_scheduler("linear", optimizer=optimizer, num_warmup_steps=0, num_training_steps=num_training_steps)
# Prepare everything for Accelerate
model, optimizer, train_dataloader, test_dataloader, lr_scheduler = accelerator.prepare(
    model, optimizer, train_dataloader, test_dataloader, lr_scheduler
)
# Training loop
num_epochs = 3
for epoch in range(num_epochs):
    model.train()
    for batch in train_dataloader:
        outputs = model(**batch)
        loss = outputs.loss
        accelerator.backward(loss)  # Backpropagation with Accelerator
        optimizer.step()
        lr_scheduler.step()
        optimizer.zero_grad()
    print(f"Epoch {epoch + 1} completed.")
# Save the model
accelerator.wait_for_everyone() # Synchronize across processes
unwrapped_model = accelerator.unwrap_model(model) # Get the original model
unwrapped_model.save_pretrained("my_model")
print("Training completed and model saved!")