This repository was archived by the owner on Sep 10, 2025. It is now read-only.

Slimming down torchchat: Replace replace_attention_with_custom_sdpa_attention() with ET's implementation #1058

Open

Labels

ExecuTorch enhancement good first issue triaged

@Jack-Khuu

Description

@Jack-Khuu

Jack-Khuu

opened

on Aug 23, 2024

🚀 The feature, motivation and pitch

First surfaced in #1057, the replace_attention_with_custom_sdpa_attention function, used when exporting models in torchchat, can be replaced with the equivalent API provided in the Excecutorch https://github.com/pytorch/executorch/blob/main/examples/models/llama2/source_transformation/sdpa.py

Task: Swap the torchchat implementation with that of ExecuTorch's. Delete the then defunct code from torchchat

Alternatives

No response

Additional context

No response

RFC (Optional)

No response

Metadata

Assignees

No one assigned

Labels

ExecuTorch enhancement good first issue triaged

Type

No type

Projects

[torchchat] Looking for Contributors

Status

No status

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Slimming down torchchat: Replace replace_attention_with_custom_sdpa_attention() with ET's implementation #1058

Description

🚀 The feature, motivation and pitch

Alternatives

Additional context

RFC (Optional)

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions