File size: 2,271 Bytes
f773839
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
# MaskAdapter
<div align ="center">
<img src="./assets/logo.jpeg" width="20%">
<h1> Mask-Adapter </h1>
<h3> Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation </h3>

YongKang Li<sup>1,\*</sup>, [Tianheng Cheng](https://scholar.google.com/citations?user=PH8rJHYAAAAJ&hl=zh-CN)<sup>1,\*</sup>, [Wenyu Liu](http://eic.hust.edu.cn/professor/liuwenyu)<sup>1</sup>, [Xinggang Wang](https://xwcv.github.io/)<sup>1,πŸ“§</sup>

<sup>1</sup> Huazhong University of Science and Technology,


(\* equal contribution, πŸ“§ corresponding author)

[![arxiv paper](https://img.shields.io/badge/arXiv-Paper-red)]()
[![checkpoints](https://img.shields.io/badge/HuggingFace-πŸ€—-orange)]()
[![πŸ€— HuggingFace Demo](https://img.shields.io/badge/Mask_Adapter-πŸ€—_HF_Demo-orange)]()

</div>


<div align="center">
<img src="./assets/main_fig.png">
</div>

## Highlights

* Mask-Adapter is a simple yet remarkably effective method and can be seamlessly integrated into open-vocabulary segmentation methods, e.g., [FC-CLIP](https://github.com/bytedance/fc-clip) and [MAFT-Plus](https://github.com/jiaosiyu1999/MAFT-Plus), to tackle the existing bottlenecks.

* Mask-Adapter effectively extends to SAM without training, achieving impressive results across multiple open-vocabulary segmentation benchmarks.

## Updates
- [x] Release code
- [x] Release weights
- [x] Release demo with SAM-2πŸ‘‰ [πŸ€— Mask-Adapter]()
- [ ] Release weights training with addtional data


## Installation
Please follow [installation](INSTALL.md).

## Getting Started

See [Preparing Datasets for Mask-Adapter](datasets/README.md).Following [FC-CLIP](https://github.com/bytedance/fc-clip) and [MAFT-Plus](https://github.com/jiaosiyu1999/MAFT-Plus) to prepare datasets.

See [Getting Started with   Mask-Adapter](GETTING_STARTED.md).



## <a name="Citing Mask-Adapter"></a>Citing  Mask-Adapter

If you use Mask-Adapter in your research, please use the following BibTeX entry.

```BibTeX

```

## Acknowledgement

[Mask2Former](https://github.com/facebookresearch/Mask2Former)

[ODISE](https://github.com/NVlabs/ODISE)

[FC-CLIP](https://github.com/bytedance/fc-clip)

[MAFTP](https://github.com/jiaosiyu1999/MAFT-Plus)

[SAM](https://github.com/facebookresearch/segment-anything)
# MaskAdapter