integrate SAM (segment anything) encoder with Unet #757
Conversation
hi @qubvel is there any update on this?
I've just trained a model using this branch and it worked.
@Rusteam how do you train a model? Can you give an outline? The author is not responding, so please help me train a model. I have sent you an email, please take a look.
Make sure you install this package from my fork: pip install git+https://github.com/Rusteam/segmentation_models.pytorch.git@sam
Then initialize your model as usual: create_model("SAM", "sam-vit_b", encoder_weights=None, **kwargs)
and run your training. You can pass weights="sa-1b" in kwargs if you want to fine-tune from pre-trained weights.
So far I have been able to train the model, but I can't say it's learning; I'm still struggling there. Also, I cannot fit more than 1 sample per batch on a 32 GB GPU with a 512 input size.
ccl-private
commented
May 16, 2023
@Rusteam how about this: https://github.com/tianrun-chen/SAM-Adapter-PyTorch
thanks for sharing, I'll try it if my current approach does not work. I've been able to get some learning with this transformers notebook
Hi @Rusteam, thanks a lot for your contribution and sorry for the delay, I am going to review the request and will let you know
Hey hey hey. While this solution worked, I can't say the model was able to learn on my data. We might need to use the version before my DDP adjustments, make the model handle points and boxes as inputs, or use the SAM image encoder with Unet or other architectures.
Is it a pip package? probably need to add to reqs
just added it to reqs, or should we make it optional?
Yes, I was actually thinking about just pre-trained encoder integration, did you test it?
can we use this model to train on custom data??
@qubvel It didn't work with Unet yet, but I can make it work. Which models would be essential to integrate?
@Rusteam @qubvel can we use this model to train on custom data??
that was my intention as well, but I was unable to make it learn without passing box/point prompts. However, when passing a prompt along with the input image, it does learn. We might need to integrate multiple inputs to forward() for it to work, or just use SAM's image encoder with other arches like Unet
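The "multiple inputs to forward()" idea can be sketched as a toy module (hypothetical names and layers, not SAM's actual prompt encoder):

```python
import torch
import torch.nn as nn

# Toy sketch: a segmentation head whose forward takes both an image and an
# optional box prompt, illustrating a forward() with multiple inputs.
class PromptedSegmenter(nn.Module):
    def __init__(self, channels=8):
        super().__init__()
        self.encoder = nn.Conv2d(3, channels, 3, padding=1)
        self.prompt_proj = nn.Linear(4, channels)  # embed an (x0, y0, x1, y1) box
        self.head = nn.Conv2d(channels, 1, 1)

    def forward(self, image, box=None):
        feats = self.encoder(image)
        if box is not None:
            # broadcast the prompt embedding over all spatial positions
            feats = feats + self.prompt_proj(box)[:, :, None, None]
        return self.head(feats)

model = PromptedSegmenter()
img = torch.randn(2, 3, 64, 64)
box = torch.tensor([[0.1, 0.1, 0.5, 0.5], [0.2, 0.2, 0.9, 0.9]])
mask = model(img, box)
print(mask.shape)  # torch.Size([2, 1, 64, 64])
```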
siddpiku
commented
Jul 5, 2023
The following worked for me:
- git clone the sam branch
- modify the sam.py file like below to get rid of the errors:
- change def forward(self, x: torch.Tensor) -> list[torch.Tensor]: to def forward(self, x: torch.Tensor):
- import segmentation_models_pytorch as smp (python file in the same folder as the cloned branch)
- smp.create_model("Unet", "sam-vit_b", encoder_weights="sa-1b", encoder_depth=4, decoder_channels=[256, 128, 64, 32])
- try training
What did not work: I tried fine-tuning with 2 RTX A6000 GPUs and a batch size of 2 on the ACDC data (https://www.creatis.insa-lyon.fr/Challenge/acdc/databases.html), but my Dice loss did not improve after 700 epochs. (Maybe some other setting works, but I did not have time to try it.)
@qubvel hey any updates?
Rusab
commented
Sep 6, 2023
Please add this; the library hasn't had new features in a long time.
This PR is stale because it has been open 60 days with no activity. Remove stale label or comment or this will be closed in 15 days.
csaroff
commented
Nov 17, 2023
Is this PR ready?
It's ready.
17SIM
commented
Nov 21, 2023
The current PR seems to work only with images of size 1024x1024.
Yes, same as the original SAM model
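A generic workaround for that constraint (not part of this PR) is to resize batches to the resolution the encoder expects and resize predicted masks back afterwards:

```python
import torch
import torch.nn.functional as F

# SAM's ViT encoder expects 1024x1024 inputs; resize (or pad) batches up
# front and resize the predicted masks back to the original resolution.
def to_sam_size(images, size=1024):
    return F.interpolate(images, size=(size, size),
                         mode="bilinear", align_corners=False)

batch = torch.randn(2, 3, 512, 512)
resized = to_sam_size(batch)
print(resized.shape)  # torch.Size([2, 3, 1024, 1024])
```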
This PR is stale because it has been open 60 days with no activity. Remove stale label or comment or this will be closed in 15 days.
Stinosko
commented
Jan 28, 2024
Any progress on this?
Rusab
commented
Jan 29, 2024
Why is the library dying? No new updates in a long time.
This PR is stale because it has been open 60 days with no activity. Remove stale label or comment or this will be closed in 15 days.
@qubvel can you merge this? It did work
isaaccorley
commented
Apr 9, 2024
giswqs
commented
Jan 18, 2025
A relevant PR: huggingface/transformers#32317
I think you can already do this because timm supports the SAM ViT weights, e.g.:
Unet("tu-samvit_base_patch16.sa1b")
But I'm not sure how well SAM works with U-Net instead of its own custom decoder.
isaaccorley
commented
Jan 18, 2025
Agreed, it's likely highly dependent on the prompt embeddings as well.
ogencoglu
commented
Jun 10, 2025
I don't think SAM works out of the box like this.
Closes #756
Added:
vit_h
,vit_b
andvit_l
) to encodersChanged: