PyTorch Image Models (timm)

In the VisionTransformer class, the default act_layer is None. If we do not provide it, this leads to a TypeError in Mlp, because none of the classes (Block, Mlp, or VisionTransformer) handle this case. The error message:
TypeError: 'NoneType' object is not callable
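
A minimal reproduction (assuming a timm version where the None default is passed through to Mlp unchanged):

from timm.models.vision_transformer import VisionTransformer

# With act_layer left as None, Mlp ends up calling None() and raises:
#   TypeError: 'NoneType' object is not callable
model = VisionTransformer()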


Fix:
Always set act_layer to a valid activation function (e.g., nn.GELU, nn.ReLU) when instantiating VisionTransformer.
Example:

import torch.nn as nn
from timm.models.vision_transformer import VisionTransformer

model = VisionTransformer(act_layer=nn.GELU)

If not set, you'll get TypeError: 'NoneType' object is not callable.

Solution provided by Triskel Data Deterministic AI.


Hello @mohitb1i ,

In which PyTorch version are you experiencing this error?


Machine Learning Engineer at RidgeRun.ai
Contact us: support@ridgerun.ai


I understand, but I am saying the default value of act_layer should be nn.GELU, or it should be set at instantiation, like:

block_fn(
    ...
    act_layer=act_layer or nn.GELU,
    ...
)
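
For illustration, the same fallback as a standalone snippet (make_block is a hypothetical helper, not timm code):

import torch.nn as nn

def make_block(act_layer=None):
    # Fall back to nn.GELU when no activation class is provided
    act_layer = act_layer or nn.GELU
    return act_layer()

print(make_block())         # a GELU instance
print(make_block(nn.ReLU))  # a ReLU instance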

No, it is the Vision Transformer code from Hugging Face's pytorch-image-models (timm) repository:

original repo

code of Vision Transformer


Upon reviewing the code, it appears that this behavior likely stems from the fact that the VisionTransformer class is not meant to be instantiated directly. Instead, the recommended approach is to use the timm.create_model function, which handles proper initialization of the available Vision Transformer variants. For example, calling models like vit_small_patch16_224 or vit_large_patch32_384 through timm.create_model returns a fully configured VisionTransformer instance.

However, if you choose to instantiate the VisionTransformer class directly, you are likely responsible for explicitly providing certain arguments, such as act_layer, as you noted earlier.
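
For example, a minimal sketch of that recommended path (vit_small_patch16_224 is one of the registered variants mentioned above):

import timm

# create_model looks up the registered variant and returns a fully
# configured VisionTransformer, including a valid activation layer.
model = timm.create_model('vit_small_patch16_224', pretrained=False)
print(type(model).__name__)  # VisionTransformer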


Machine Learning Engineer at RidgeRun.ai
Contact us: support@ridgerun.ai


import torch
import torch.nn as nn

class VisionTransformer(nn.Module):
    def __init__(self, act_layer=None, **kwargs):
        super().__init__()
        # Default to GELU if none provided
        if act_layer is None:
            act_layer = nn.GELU

        # Support both nn.ReLU and nn.ReLU() styles
        self.act = act_layer() if isinstance(act_layer, type) else act_layer

        # Example MLP block using the activation
        self.mlp = nn.Sequential(
            nn.Linear(768, 3072),
            self.act,
            nn.Linear(3072, 768)
        )

    def forward(self, x):
        return self.mlp(x)

Example usage:

if __name__ == "__main__":
    model = VisionTransformer()
    x = torch.randn(1, 768)
    out = model(x)
    print(out.shape)

Solution provided by Triskel Data Deterministic AI.


Thanks, it was an oversight.

