Name	Name	Last commit message	Last commit date
parent directory ..
configs	configs
datasets	datasets
demo	demo
demo_video	demo_video
mask2former	mask2former
mask2former_video	mask2former_video
tools	tools
ADVANCED_USAGE.md	ADVANCED_USAGE.md
CODE_OF_CONDUCT.md	CODE_OF_CONDUCT.md
CONTRIBUTING.md	CONTRIBUTING.md
GETTING_STARTED.md	GETTING_STARTED.md
INSTALL.md	INSTALL.md
LICENSE	LICENSE
MODEL_ZOO.md	MODEL_ZOO.md
README.md	README.md
cog.yaml	cog.yaml
predict.py	predict.py
requirements.txt	requirements.txt
train_net.py	train_net.py
train_net_video.py	train_net_video.py

VW-Mask2Former

Installation

Datasets

Train

More Utilization: See Getting Started with MaskFormer.

Swin-Tiny

python ./train_net.py \
--resume --num-gpus 2 --dist-url auto \
--config-file configs/ade20k/semantic-segmentation/swin/vw/vw_maskformer2_swin_tiny_bs16_160k.yaml \
OUTPUT_DIR path/to/tiny TEST.EVAL_PERIOD 10000 MODEL.MASK_FORMER.SIZE_DIVISIBILITY 64

Swin-Tiny with Deformable Attention

python ./train_net.py \
--resume --num-gpus 2 --dist-url auto \
--config-file configs/ade20k/semantic-segmentation/swin/vw/vw_deformattn_maskformer2_swin_tiny_bs16_160k.yaml \
OUTPUT_DIR path/to/tiny TEST.EVAL_PERIOD 10000 MODEL.MASK_FORMER.SIZE_DIVISIBILITY 64

Swin-Small

python ./train_net.py \
--resume --num-gpus 4 --dist-url auto \
--config-file configs/ade20k/semantic-segmentation/swin/vw/vw_maskformer2_swin_small_bs16_160k.yaml \
OUTPUT_DIR path/to/small TEST.EVAL_PERIOD 10000 MODEL.MASK_FORMER.SIZE_DIVISIBILITY 64

Swin-Small with Deformable Attention

python ./train_net.py \
--resume --num-gpus 4 --dist-url auto \
--config-file configs/ade20k/semantic-segmentation/swin/vw/vw_deformattn_maskformer2_swin_small_bs16_160k.yaml \
OUTPUT_DIR path/to/small TEST.EVAL_PERIOD 10000 MODEL.MASK_FORMER.SIZE_DIVISIBILITY 64

Swin-Base

python ./train_net.py \
--resume --num-gpus 8 --dist-url auto \
--config-file configs/ade20k/semantic-segmentation/swin/vw/vw_maskformer2_swin_base_IN21k_384_bs16_160k_res640.yaml \
OUTPUT_DIR path/to/base TEST.EVAL_PERIOD 10000 MODEL.MASK_FORMER.SIZE_DIVISIBILITY 64

Swin-Base with Deformable Attention

python ./train_net.py \
--resume --num-gpus 8 --dist-url auto \
--config-file configs/ade20k/semantic-segmentation/swin/vw/vw_deformattn_maskformer2_swin_base_IN21k_384_bs16_160k_res640.yaml \
OUTPUT_DIR path/to/base TEST.EVAL_PERIOD 10000 MODEL.MASK_FORMER.SIZE_DIVISIBILITY 64

Swin-Large

python ./train_net.py \
--resume --num-gpus 16 --dist-url auto \
--config-file configs/ade20k/semantic-segmentation/swin/vw/vw_maskformer2_swin_large_IN21k_384_bs16_160k_res640.yaml \
OUTPUT_DIR path/to/large TEST.EVAL_PERIOD 10000 MODEL.MASK_FORMER.SIZE_DIVISIBILITY 64

Swin-Large with Deformable Attention

python ./train_net.py \
--resume --num-gpus 16 --dist-url auto \
--config-file configs/ade20k/semantic-segmentation/swin/vw/vw_deformattn_maskformer2_swin_large_IN21k_384_bs16_160k_res640.yaml \
OUTPUT_DIR path/to/large TEST.EVAL_PERIOD 10000 MODEL.MASK_FORMER.SIZE_DIVISIBILITY 64

Evaluation

python ./train_net.py \
--eval-only --num-gpus NGPUS --dist-url auto \
--config-file path/to/config \
MODEL.WEIGHTS path/to/weight TEST.AUG.ENABLED True MODEL.MASK_FORMER.SIZE_DIVISIBILITY 64

Model

Name	Backbone	crop size	lr sched	mIoU	mIoU (ms+flip)	download
VW-Mask2Former	Swin-T	512x512	160k	48.2	50.5	model
VW-Mask2Former	Swin-S	512x512	160k	52.1	53.7	model
VW-Mask2Former	Swin-B	640x640	160k	54.6	56.0	model
VW-Mask2Former	Swin-L	640x640	160k	56.5	57.8	model

Swin Transformer with Deformable Attention

Name	Backbone	crop size	lr sched	mIoU	mIoU (ms+flip)	download
VW-Mask2Former	Swin-T	512x512	160k	48.5	50.3	model
VW-Mask2Former	Swin-S	512x512	160k	52.0	53.6	model
VW-Mask2Former	Swin-B	640x640	160k	55.2	56.5	model
VW-Mask2Former	Swin-L	640x640	160k	56.9	58.3	model

Citing VW-Mask2Former

@inproceedings{yan2023multi,
  title={Multi-Scale Representations by Varing Window Attention for Semantic Segmentation},
  author={Yan, Haotian and Wu, Ming and Zhang, Chuang},
  booktitle={The Twelfth International Conference on Learning Representations},
  year={2023}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mask2Former

Mask2Former

README.md

VW-Mask2Former

Installation

Datasets

Train

Evaluation

Model

Swin Transformer with Deformable Attention

Citing VW-Mask2Former

Files

Mask2Former

Directory actions

More options

Directory actions

More options

Latest commit

History

Mask2Former

Folders and files

parent directory

README.md

VW-Mask2Former

Installation

Datasets

Train

Evaluation

Model

Swin Transformer with Deformable Attention

Citing VW-Mask2Former