Fix LayerNorm crash when model.half() is used by OWU-4f5755 · Pull Request #2729 · openai/whisper

OWU-4f5755 · 2026-02-11T01:26:43Z

LayerNorm.forward() casts the input to fp32 but doesn't cast its own weight/bias, so calling model.half() before transcription causes:

RuntimeError: expected scalar type Float but found Half

Linear and Conv1d in the same file already guard against this by casting their weights to match the input dtype. This PR does the same for LayerNorm.

# Before — weight/bias stay fp16 after model.half()
class LayerNorm(nn.LayerNorm):
    def forward(self, x: Tensor) -> Tensor:
        return super().forward(x.float()).type(x.dtype)

# After — explicitly cast weight/bias to fp32
class LayerNorm(nn.LayerNorm):
    def forward(self, x: Tensor) -> Tensor:
        return F.layer_norm(
            x.float(),
            self.normalized_shape,
            self.weight.float() if self.weight is not None else None,
            self.bias.float() if self.bias is not None else None,
            self.eps,
        ).type(x.dtype)

Observed no overhead in the normal case (.float() on an fp32 tensor is a no-op). Tested with model.half() + fp16=True and the standard path — both work.

…as to fp32 in LayerNorm.forward(), matching the pattern already used in Linear and Conv1d. Thus, RuntimeError is prevented ('expected scalar type Float but found Half') when you call model.half() prior to transcription. Tested: model.half() + fp16=True transcription works. Standard path no.half() also works and isn't affected.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix LayerNorm crash when model.half() is used#2729

Fix LayerNorm crash when model.half() is used#2729
OWU-4f5755 wants to merge 1 commit into
openai:mainfrom
OWU-4f5755:fix/layernorm-dtype-defense

OWU-4f5755 commented Feb 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

OWU-4f5755 commented Feb 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant