first commit
This commit is contained in:
118
Seg_All_In_One_MMSeg/configs/deeplabv3/README.md
Normal file
118
Seg_All_In_One_MMSeg/configs/deeplabv3/README.md
Normal file
@@ -0,0 +1,118 @@
|
||||
# DeepLabV3
|
||||
|
||||
> [Rethinking atrous convolution for semantic image segmentation](https://arxiv.org/abs/1706.05587)
|
||||
|
||||
## Introduction
|
||||
|
||||
<!-- [ALGORITHM] -->
|
||||
|
||||
<a href="https://github.com/tensorflow/models/tree/master/research/deeplab">Official Repo</a>
|
||||
|
||||
<a href="https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54">Code Snippet</a>
|
||||
|
||||
## Abstract
|
||||
|
||||
<!-- [ABSTRACT] -->
|
||||
|
||||
In this work, we revisit atrous convolution, a powerful tool to explicitly adjust filter's field-of-view as well as control the resolution of feature responses computed by Deep Convolutional Neural Networks, in the application of semantic image segmentation. To handle the problem of segmenting objects at multiple scales, we design modules which employ atrous convolution in cascade or in parallel to capture multi-scale context by adopting multiple atrous rates. Furthermore, we propose to augment our previously proposed Atrous Spatial Pyramid Pooling module, which probes convolutional features at multiple scales, with image-level features encoding global context and further boost performance. We also elaborate on implementation details and share our experience on training our system. The proposed \`DeepLabv3' system significantly improves over our previous DeepLab versions without DenseCRF post-processing and attains comparable performance with other state-of-art models on the PASCAL VOC 2012 semantic image segmentation benchmark.
|
||||
|
||||
<!-- [IMAGE] -->
|
||||
|
||||
<div align=center >
|
||||
<img alt="DEEPLABv3_ResNet-D8" src="https://user-images.githubusercontent.com/61172629/209305311-87ff9e36-b7cd-46d7-8b4c-9e26e10c27d0.jpg"/>
|
||||
DEEPLABv3_ResNet-D8 model structure
|
||||
</div>
|
||||
|
||||
## Results and models
|
||||
|
||||
### Cityscapes
|
||||
|
||||
| Method | Backbone | Crop Size | Lr schd | Mem (GB) | Inf time (fps) | Device | mIoU | mIoU(ms+flip) | config | download |
|
||||
| ---------------- | --------------- | --------- | ------: | -------- | -------------- | ------ | ----: | ------------: | ------------------------------------------------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ |
|
||||
| DeepLabV3 | R-50-D8 | 512x1024 | 40000 | 6.1 | 2.57 | V100 | 79.09 | 80.45 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r50-d8_4xb2-40k_cityscapes-512x1024.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x1024_40k_cityscapes/deeplabv3_r50-d8_512x1024_40k_cityscapes_20200605_022449-acadc2f8.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x1024_40k_cityscapes/deeplabv3_r50-d8_512x1024_40k_cityscapes_20200605_022449.log.json) |
|
||||
| DeepLabV3 | R-101-D8 | 512x1024 | 40000 | 9.6 | 1.92 | V100 | 77.12 | 79.61 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101-d8_4xb2-40k_cityscapes-512x1024.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x1024_40k_cityscapes/deeplabv3_r101-d8_512x1024_40k_cityscapes_20200605_012241-7fd3f799.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x1024_40k_cityscapes/deeplabv3_r101-d8_512x1024_40k_cityscapes_20200605_012241.log.json) |
|
||||
| DeepLabV3 | R-50-D8 | 769x769 | 40000 | 6.9 | 1.11 | V100 | 78.58 | 79.89 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r50-d8_4xb2-40k_cityscapes-769x769.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_769x769_40k_cityscapes/deeplabv3_r50-d8_769x769_40k_cityscapes_20200606_113723-7eda553c.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_769x769_40k_cityscapes/deeplabv3_r50-d8_769x769_40k_cityscapes_20200606_113723.log.json) |
|
||||
| DeepLabV3 | R-101-D8 | 769x769 | 40000 | 10.9 | 0.83 | V100 | 79.27 | 80.11 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101-d8_4xb2-40k_cityscapes-769x769.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_769x769_40k_cityscapes/deeplabv3_r101-d8_769x769_40k_cityscapes_20200606_113809-c64f889f.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_769x769_40k_cityscapes/deeplabv3_r101-d8_769x769_40k_cityscapes_20200606_113809.log.json) |
|
||||
| DeepLabV3 | R-18-D8 | 512x1024 | 80000 | 1.7 | 13.78 | V100 | 76.70 | 78.27 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r18-d8_4xb2-80k_cityscapes-512x1024.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r18-d8_512x1024_80k_cityscapes/deeplabv3_r18-d8_512x1024_80k_cityscapes_20201225_021506-23dffbe2.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r18-d8_512x1024_80k_cityscapes/deeplabv3_r18-d8_512x1024_80k_cityscapes-20201225_021506.log.json) |
|
||||
| DeepLabV3 | R-50-D8 | 512x1024 | 80000 | - | - | V100 | 79.32 | 80.57 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r50-d8_4xb2-80k_cityscapes-512x1024.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x1024_80k_cityscapes/deeplabv3_r50-d8_512x1024_80k_cityscapes_20200606_113404-b92cfdd4.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x1024_80k_cityscapes/deeplabv3_r50-d8_512x1024_80k_cityscapes_20200606_113404.log.json) |
|
||||
| DeepLabV3 | R-101-D8 | 512x1024 | 80000 | - | - | V100 | 80.20 | 81.21 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101-d8_4xb2-80k_cityscapes-512x1024.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x1024_80k_cityscapes/deeplabv3_r101-d8_512x1024_80k_cityscapes_20200606_113503-9e428899.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x1024_80k_cityscapes/deeplabv3_r101-d8_512x1024_80k_cityscapes_20200606_113503.log.json) |
|
||||
| DeepLabV3 (FP16) | R-101-D8 | 512x1024 | 80000 | 5.75 | 3.86 | V100 | 80.48 | - | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101-d8_4xb2-amp-80k_cityscapes-512x1024.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_fp16_512x1024_80k_cityscapes/deeplabv3_r101-d8_fp16_512x1024_80k_cityscapes_20200717_230920-774d9cec.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_fp16_512x1024_80k_cityscapes/deeplabv3_r101-d8_fp16_512x1024_80k_cityscapes_20200717_230920.log.json) |
|
||||
| DeepLabV3 | R-18-D8 | 769x769 | 80000 | 1.9 | 5.55 | V100 | 76.60 | 78.26 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r18-d8_4xb2-80k_cityscapes-769x769.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r18-d8_769x769_80k_cityscapes/deeplabv3_r18-d8_769x769_80k_cityscapes_20201225_021506-6452126a.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r18-d8_769x769_80k_cityscapes/deeplabv3_r18-d8_769x769_80k_cityscapes-20201225_021506.log.json) |
|
||||
| DeepLabV3 | R-50-D8 | 769x769 | 80000 | - | - | V100 | 79.89 | 81.06 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r50-d8_4xb2-80k_cityscapes-769x769.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_769x769_80k_cityscapes/deeplabv3_r50-d8_769x769_80k_cityscapes_20200606_221338-788d6228.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_769x769_80k_cityscapes/deeplabv3_r50-d8_769x769_80k_cityscapes_20200606_221338.log.json) |
|
||||
| DeepLabV3 | R-101-D8 | 769x769 | 80000 | - | - | V100 | 79.67 | 80.81 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101-d8_4xb2-80k_cityscapes-769x769.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_769x769_80k_cityscapes/deeplabv3_r101-d8_769x769_80k_cityscapes_20200607_013353-60e95418.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_769x769_80k_cityscapes/deeplabv3_r101-d8_769x769_80k_cityscapes_20200607_013353.log.json) |
|
||||
| DeepLabV3 | R-101-D16-MG124 | 512x1024 | 40000 | 4.7 | 6.96 | V100 | 76.71 | 78.63 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101-d16-mg124_4xb2-40k_cityscapes-512x1024.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d16-mg124_512x1024_40k_cityscapes/deeplabv3_r101-d16-mg124_512x1024_40k_cityscapes_20200908_005644-67b0c992.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d16-mg124_512x1024_40k_cityscapes/deeplabv3_r101-d16-mg124_512x1024_40k_cityscapes-20200908_005644.log.json) |
|
||||
| DeepLabV3 | R-101-D16-MG124 | 512x1024 | 80000 | - | - | V100 | 78.36 | 79.84 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101-d16-mg124_4xb2-80k_cityscapes-512x1024.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d16-mg124_512x1024_80k_cityscapes/deeplabv3_r101-d16-mg124_512x1024_80k_cityscapes_20200908_005644-57bb8425.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d16-mg124_512x1024_80k_cityscapes/deeplabv3_r101-d16-mg124_512x1024_80k_cityscapes-20200908_005644.log.json) |
|
||||
| DeepLabV3 | R-18b-D8 | 512x1024 | 80000 | 1.6 | 13.93 | V100 | 76.26 | 77.88 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r18b-d8_4xb2-80k_cityscapes-512x1024.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r18b-d8_512x1024_80k_cityscapes/deeplabv3_r18b-d8_512x1024_80k_cityscapes_20201225_094144-46040cef.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r18b-d8_512x1024_80k_cityscapes/deeplabv3_r18b-d8_512x1024_80k_cityscapes-20201225_094144.log.json) |
|
||||
| DeepLabV3 | R-50b-D8 | 512x1024 | 80000 | 6.0 | 2.74 | V100 | 79.63 | 80.98 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r50b-d8_4xb2-80k_cityscapes-512x1024.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50b-d8_512x1024_80k_cityscapes/deeplabv3_r50b-d8_512x1024_80k_cityscapes_20201225_155148-ec368954.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50b-d8_512x1024_80k_cityscapes/deeplabv3_r50b-d8_512x1024_80k_cityscapes-20201225_155148.log.json) |
|
||||
| DeepLabV3 | R-101b-D8 | 512x1024 | 80000 | 9.5 | 1.81 | V100 | 80.01 | 81.21 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101b-d8_4xb2-80k_cityscapes-512x1024.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101b-d8_512x1024_80k_cityscapes/deeplabv3_r101b-d8_512x1024_80k_cityscapes_20201226_171821-8fd49503.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101b-d8_512x1024_80k_cityscapes/deeplabv3_r101b-d8_512x1024_80k_cityscapes-20201226_171821.log.json) |
|
||||
| DeepLabV3 | R-18b-D8 | 769x769 | 80000 | 1.8 | 5.79 | V100 | 75.63 | 77.51 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r18b-d8_4xb2-80k_cityscapes-769x769.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r18b-d8_769x769_80k_cityscapes/deeplabv3_r18b-d8_769x769_80k_cityscapes_20201225_094144-fdc985d9.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r18b-d8_769x769_80k_cityscapes/deeplabv3_r18b-d8_769x769_80k_cityscapes-20201225_094144.log.json) |
|
||||
| DeepLabV3 | R-50b-D8 | 769x769 | 80000 | 6.8 | 1.16 | V100 | 78.80 | 80.27 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r50b-d8_4xb2-80k_cityscapes-769x769.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50b-d8_769x769_80k_cityscapes/deeplabv3_r50b-d8_769x769_80k_cityscapes_20201225_155404-87fb0cf4.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50b-d8_769x769_80k_cityscapes/deeplabv3_r50b-d8_769x769_80k_cityscapes-20201225_155404.log.json) |
|
||||
| DeepLabV3 | R-101b-D8 | 769x769 | 80000 | 10.7 | 0.82 | V100 | 79.41 | 80.73 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101b-d8_4xb2-80k_cityscapes-769x769.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101b-d8_769x769_80k_cityscapes/deeplabv3_r101b-d8_769x769_80k_cityscapes_20201226_190843-9142ee57.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101b-d8_769x769_80k_cityscapes/deeplabv3_r101b-d8_769x769_80k_cityscapes-20201226_190843.log.json) |
|
||||
|
||||
### ADE20K
|
||||
|
||||
| Method | Backbone | Crop Size | Lr schd | Mem (GB) | Inf time (fps) | Device | mIoU | mIoU(ms+flip) | config | download |
|
||||
| --------- | -------- | --------- | ------: | -------- | -------------- | ------ | ----: | ------------: | -------------------------------------------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
||||
| DeepLabV3 | R-50-D8 | 512x512 | 80000 | 8.9 | 14.76 | V100 | 42.42 | 43.28 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r50-d8_4xb4-80k_ade20k-512x512.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_80k_ade20k/deeplabv3_r50-d8_512x512_80k_ade20k_20200614_185028-0bb3f844.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_80k_ade20k/deeplabv3_r50-d8_512x512_80k_ade20k_20200614_185028.log.json) |
|
||||
| DeepLabV3 | R-101-D8 | 512x512 | 80000 | 12.4 | 10.14 | V100 | 44.08 | 45.19 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101-d8_4xb4-80k_ade20k-512x512.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_80k_ade20k/deeplabv3_r101-d8_512x512_80k_ade20k_20200615_021256-d89c7fa4.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_80k_ade20k/deeplabv3_r101-d8_512x512_80k_ade20k_20200615_021256.log.json) |
|
||||
| DeepLabV3 | R-50-D8 | 512x512 | 160000 | - | - | V100 | 42.66 | 44.09 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r50-d8_4xb4-160k_ade20k-512x512.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_160k_ade20k/deeplabv3_r50-d8_512x512_160k_ade20k_20200615_123227-5d0ee427.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_160k_ade20k/deeplabv3_r50-d8_512x512_160k_ade20k_20200615_123227.log.json) |
|
||||
| DeepLabV3 | R-101-D8 | 512x512 | 160000 | - | - | V100 | 45.00 | 46.66 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101-d8_4xb4-160k_ade20k-512x512.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_160k_ade20k/deeplabv3_r101-d8_512x512_160k_ade20k_20200615_105816-b1f72b3b.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_160k_ade20k/deeplabv3_r101-d8_512x512_160k_ade20k_20200615_105816.log.json) |
|
||||
|
||||
### Pascal VOC 2012 + Aug
|
||||
|
||||
| Method | Backbone | Crop Size | Lr schd | Mem (GB) | Inf time (fps) | Device | mIoU | mIoU(ms+flip) | config | download |
|
||||
| --------- | -------- | --------- | ------: | -------- | -------------- | ------ | ----: | ------------: | --------------------------------------------------------------------------------------------------------------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
||||
| DeepLabV3 | R-50-D8 | 512x512 | 20000 | 6.1 | 13.88 | V100 | 76.17 | 77.42 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r50-d8_4xb4-20k_voc12aug-512x512.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_20k_voc12aug/deeplabv3_r50-d8_512x512_20k_voc12aug_20200617_010906-596905ef.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_20k_voc12aug/deeplabv3_r50-d8_512x512_20k_voc12aug_20200617_010906.log.json) |
|
||||
| DeepLabV3 | R-101-D8 | 512x512 | 20000 | 9.6 | 9.81 | V100 | 78.70 | 79.95 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101-d8_4xb4-20k_voc12aug-512x512.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_20k_voc12aug/deeplabv3_r101-d8_512x512_20k_voc12aug_20200617_010932-8d13832f.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_20k_voc12aug/deeplabv3_r101-d8_512x512_20k_voc12aug_20200617_010932.log.json) |
|
||||
| DeepLabV3 | R-50-D8 | 512x512 | 40000 | - | - | V100 | 77.68 | 78.78 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r50-d8_4xb4-40k_voc12aug-512x512.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_40k_voc12aug/deeplabv3_r50-d8_512x512_40k_voc12aug_20200613_161546-2ae96e7e.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_40k_voc12aug/deeplabv3_r50-d8_512x512_40k_voc12aug_20200613_161546.log.json) |
|
||||
| DeepLabV3 | R-101-D8 | 512x512 | 40000 | - | - | V100 | 77.92 | 79.18 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101-d8_4xb4-40k_voc12aug-512x512.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_40k_voc12aug/deeplabv3_r101-d8_512x512_40k_voc12aug_20200613_161432-0017d784.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_40k_voc12aug/deeplabv3_r101-d8_512x512_40k_voc12aug_20200613_161432.log.json) |
|
||||
|
||||
### Pascal Context
|
||||
|
||||
| Method | Backbone | Crop Size | Lr schd | Mem (GB) | Inf time (fps) | Device | mIoU | mIoU(ms+flip) | config | download |
|
||||
| --------- | -------- | --------- | ------: | -------- | -------------- | ------ | ----: | ------------: | --------------------------------------------------------------------------------------------------------------------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
||||
| DeepLabV3 | R-101-D8 | 480x480 | 40000 | 9.2 | 7.09 | V100 | 46.55 | 47.81 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101-d8_4xb4-40k_pascal-context-480x480.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_480x480_40k_pascal_context/deeplabv3_r101-d8_480x480_40k_pascal_context_20200911_204118-1aa27336.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_480x480_40k_pascal_context/deeplabv3_r101-d8_480x480_40k_pascal_context-20200911_204118.log.json) |
|
||||
| DeepLabV3 | R-101-D8 | 480x480 | 80000 | - | - | V100 | 46.42 | 47.53 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101-d8_4xb4-80k_pascal-context-480x480.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_480x480_80k_pascal_context/deeplabv3_r101-d8_480x480_80k_pascal_context_20200911_170155-2a21fff3.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_480x480_80k_pascal_context/deeplabv3_r101-d8_480x480_80k_pascal_context-20200911_170155.log.json) |
|
||||
|
||||
### Pascal Context 59
|
||||
|
||||
| Method | Backbone | Crop Size | Lr schd | Mem (GB) | Inf time (fps) | Device | mIoU | mIoU(ms+flip) | config | download |
|
||||
| --------- | -------- | --------- | ------: | -------- | -------------- | ------ | ----: | ------------: | ------------------------------------------------------------------------------------------------------------------------------------------ | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
||||
| DeepLabV3 | R-101-D8 | 480x480 | 40000 | - | - | V100 | 52.61 | 54.28 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101-d8_4xb4-40k_pascal-context-59-480x480.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_480x480_40k_pascal_context_59/deeplabv3_r101-d8_480x480_40k_pascal_context_59_20210416_110332-cb08ea46.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_480x480_40k_pascal_context_59/deeplabv3_r101-d8_480x480_40k_pascal_context_59-20210416_110332.log.json) |
|
||||
| DeepLabV3 | R-101-D8 | 480x480 | 80000 | - | - | V100 | 52.46 | 54.09 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101-d8_4xb4-80k_pascal-context-59-480x480.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_480x480_80k_pascal_context_59/deeplabv3_r101-d8_480x480_80k_pascal_context_59_20210416_113002-26303993.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_480x480_80k_pascal_context_59/deeplabv3_r101-d8_480x480_80k_pascal_context_59-20210416_113002.log.json) |
|
||||
|
||||
### COCO-Stuff 10k
|
||||
|
||||
| Method | Backbone | Crop Size | Lr schd | Mem (GB) | Inf time (fps) | Device | mIoU | mIoU(ms+flip) | config | download |
|
||||
| --------- | -------- | --------- | ------: | -------- | -------------- | ------ | ----: | ------------: | -------------------------------------------------------------------------------------------------------------------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
||||
| DeepLabV3 | R-50-D8 | 512x512 | 20000 | 9.6 | 10.8 | V100 | 34.66 | 36.08 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r50-d8_4xb4-20k_coco-stuff10k-512x512.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_4x4_20k_coco-stuff10k/deeplabv3_r50-d8_512x512_4x4_20k_coco-stuff10k_20210821_043025-b35f789d.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_4x4_20k_coco-stuff10k/deeplabv3_r50-d8_512x512_4x4_20k_coco-stuff10k_20210821_043025.log.json) |
|
||||
| DeepLabV3 | R-101-D8 | 512x512 | 20000 | 13.2 | 8.7 | V100 | 37.30 | 38.42 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101-d8_4xb4-20k_coco-stuff10k-512x512.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_4x4_20k_coco-stuff10k/deeplabv3_r101-d8_512x512_4x4_20k_coco-stuff10k_20210821_043025-c49752cb.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_4x4_20k_coco-stuff10k/deeplabv3_r101-d8_512x512_4x4_20k_coco-stuff10k_20210821_043025.log.json) |
|
||||
| DeepLabV3 | R-50-D8 | 512x512 | 40000 | - | - | V100 | 35.73 | 37.09 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r50-d8_4xb4-40k_coco-stuff10k-512x512.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_4x4_40k_coco-stuff10k/deeplabv3_r50-d8_512x512_4x4_40k_coco-stuff10k_20210821_043305-dc76f3ff.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_4x4_40k_coco-stuff10k/deeplabv3_r50-d8_512x512_4x4_40k_coco-stuff10k_20210821_043305.log.json) |
|
||||
| DeepLabV3 | R-101-D8 | 512x512 | 40000 | - | - | V100 | 37.81 | 38.80 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101-d8_4xb4-40k_coco-stuff10k-512x512.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_4x4_40k_coco-stuff10k/deeplabv3_r101-d8_512x512_4x4_40k_coco-stuff10k_20210821_043305-636cb433.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_4x4_40k_coco-stuff10k/deeplabv3_r101-d8_512x512_4x4_40k_coco-stuff10k_20210821_043305.log.json) |
|
||||
|
||||
### COCO-Stuff 164k
|
||||
|
||||
| Method | Backbone | Crop Size | Lr schd | Mem (GB) | Inf time (fps) | Device | mIoU | mIoU(ms+flip) | config | download |
|
||||
| --------- | -------- | --------- | ------: | -------- | -------------- | ------ | ----: | ------------: | ---------------------------------------------------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
||||
| DeepLabV3 | R-50-D8 | 512x512 | 80000 | 9.6 | 10.8 | V100 | 39.38 | 40.03 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r50-d8_4xb4-80k_coco-stuff164k-512x512.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_4x4_80k_coco-stuff164k/deeplabv3_r50-d8_512x512_4x4_80k_coco-stuff164k_20210709_163016-88675c24.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_4x4_80k_coco-stuff164k/deeplabv3_r50-d8_512x512_4x4_80k_coco-stuff164k_20210709_163016.log.json) |
|
||||
| DeepLabV3 | R-101-D8 | 512x512 | 80000 | 13.2 | 8.7 | V100 | 40.87 | 41.50 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101-d8_4xb4-80k_coco-stuff164k-512x512.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_4x4_80k_coco-stuff164k/deeplabv3_r101-d8_512x512_4x4_80k_coco-stuff164k_20210709_201252-13600dc2.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_4x4_80k_coco-stuff164k/deeplabv3_r101-d8_512x512_4x4_80k_coco-stuff164k_20210709_201252.log.json) |
|
||||
| DeepLabV3 | R-50-D8 | 512x512 | 160000 | - | - | V100 | 41.09 | 41.69 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r50-d8_4xb4-160k_coco-stuff164k-512x512.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_4x4_160k_coco-stuff164k/deeplabv3_r50-d8_512x512_4x4_160k_coco-stuff164k_20210709_163016-49f2812b.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_4x4_160k_coco-stuff164k/deeplabv3_r50-d8_512x512_4x4_160k_coco-stuff164k_20210709_163016.log.json) |
|
||||
| DeepLabV3 | R-101-D8 | 512x512 | 160000 | - | - | V100 | 41.82 | 42.49 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101-d8_4xb4-160k_coco-stuff164k-512x512.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_4x4_160k_coco-stuff164k/deeplabv3_r101-d8_512x512_4x4_160k_coco-stuff164k_20210709_155402-f035acfd.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_4x4_160k_coco-stuff164k/deeplabv3_r101-d8_512x512_4x4_160k_coco-stuff164k_20210709_155402.log.json) |
|
||||
| DeepLabV3 | R-50-D8 | 512x512 | 320000 | - | - | V100 | 41.37 | 42.22 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r50-d8_4xb4-320k_coco-stuff164k-512x512.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_4x4_320k_coco-stuff164k/deeplabv3_r50-d8_512x512_4x4_320k_coco-stuff164k_20210709_155403-51b21115.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_4x4_320k_coco-stuff164k/deeplabv3_r50-d8_512x512_4x4_320k_coco-stuff164k_20210709_155403.log.json) |
|
||||
| DeepLabV3 | R-101-D8 | 512x512 | 320000 | - | - | V100 | 42.61 | 43.42 | [config](https://github.com/open-mmlab/mmsegmentation/blob/main/configs/deeplabv3/deeplabv3_r101-d8_4xb4-320k_coco-stuff164k-512x512.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_4x4_320k_coco-stuff164k/deeplabv3_r101-d8_512x512_4x4_320k_coco-stuff164k_20210709_155402-3cbca14d.pth) \| [log](https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_4x4_320k_coco-stuff164k/deeplabv3_r101-d8_512x512_4x4_320k_coco-stuff164k_20210709_155402.log.json) |
|
||||
|
||||
Note:
|
||||
|
||||
- `D-8` here corresponding to the output stride 8 setting for DeepLab series.
|
||||
- `FP16` means Mixed Precision (FP16) is adopted in training.
|
||||
|
||||
## Citation
|
||||
|
||||
```bibtext
|
||||
@article{chen2017rethinking,
|
||||
title={Rethinking atrous convolution for semantic image segmentation},
|
||||
author={Chen, Liang-Chieh and Papandreou, George and Schroff, Florian and Adam, Hartwig},
|
||||
journal={arXiv preprint arXiv:1706.05587},
|
||||
year={2017}
|
||||
}
|
||||
```
|
||||
@@ -0,0 +1,11 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb2-40k_cityscapes-512x1024.py'
|
||||
model = dict(
|
||||
pretrained='open-mmlab://resnet101_v1c',
|
||||
backbone=dict(
|
||||
depth=101,
|
||||
dilations=(1, 1, 1, 2),
|
||||
strides=(1, 2, 2, 1),
|
||||
multi_grid=(1, 2, 4)),
|
||||
decode_head=dict(
|
||||
dilations=(1, 6, 12, 18),
|
||||
sampler=dict(type='OHEMPixelSampler', min_kept=100000)))
|
||||
@@ -0,0 +1,11 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb2-80k_cityscapes-512x1024.py'
|
||||
model = dict(
|
||||
pretrained='open-mmlab://resnet101_v1c',
|
||||
backbone=dict(
|
||||
depth=101,
|
||||
dilations=(1, 1, 1, 2),
|
||||
strides=(1, 2, 2, 1),
|
||||
multi_grid=(1, 2, 4)),
|
||||
decode_head=dict(
|
||||
dilations=(1, 6, 12, 18),
|
||||
sampler=dict(type='OHEMPixelSampler', min_kept=100000)))
|
||||
@@ -0,0 +1,2 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb2-40k_cityscapes-512x1024.py'
|
||||
model = dict(pretrained='open-mmlab://resnet101_v1c', backbone=dict(depth=101))
|
||||
@@ -0,0 +1,2 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb2-40k_cityscapes-769x769.py'
|
||||
model = dict(pretrained='open-mmlab://resnet101_v1c', backbone=dict(depth=101))
|
||||
@@ -0,0 +1,2 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb2-80k_cityscapes-512x1024.py'
|
||||
model = dict(pretrained='open-mmlab://resnet101_v1c', backbone=dict(depth=101))
|
||||
@@ -0,0 +1,2 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb2-80k_cityscapes-769x769.py'
|
||||
model = dict(pretrained='open-mmlab://resnet101_v1c', backbone=dict(depth=101))
|
||||
@@ -0,0 +1,7 @@
|
||||
_base_ = './deeplabv3_r101-d8_4xb2-40k_cityscapes-512x1024.py'
|
||||
optimizer = dict(type='SGD', lr=0.01, momentum=0.9, weight_decay=0.0005)
|
||||
optim_wrapper = dict(
|
||||
_delete_=True,
|
||||
type='AmpOptimWrapper',
|
||||
optimizer=optimizer,
|
||||
loss_scale=512.)
|
||||
@@ -0,0 +1,2 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb4-160k_ade20k-512x512.py'
|
||||
model = dict(pretrained='open-mmlab://resnet101_v1c', backbone=dict(depth=101))
|
||||
@@ -0,0 +1,2 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb4-160k_coco-stuff164k-512x512.py'
|
||||
model = dict(pretrained='open-mmlab://resnet101_v1c', backbone=dict(depth=101))
|
||||
@@ -0,0 +1,2 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb4-20k_coco-stuff10k-512x512.py'
|
||||
model = dict(pretrained='open-mmlab://resnet101_v1c', backbone=dict(depth=101))
|
||||
@@ -0,0 +1,2 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb4-20k_voc12aug-512x512.py'
|
||||
model = dict(pretrained='open-mmlab://resnet101_v1c', backbone=dict(depth=101))
|
||||
@@ -0,0 +1,2 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb4-320k_coco-stuff164k-512x512.py'
|
||||
model = dict(pretrained='open-mmlab://resnet101_v1c', backbone=dict(depth=101))
|
||||
@@ -0,0 +1,2 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb4-40k_coco-stuff10k-512x512.py'
|
||||
model = dict(pretrained='open-mmlab://resnet101_v1c', backbone=dict(depth=101))
|
||||
@@ -0,0 +1,2 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb4-40k_pascal-context-480x480.py'
|
||||
model = dict(pretrained='open-mmlab://resnet101_v1c', backbone=dict(depth=101))
|
||||
@@ -0,0 +1,2 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb4-40k_pascal-context-59-480x480.py'
|
||||
model = dict(pretrained='open-mmlab://resnet101_v1c', backbone=dict(depth=101))
|
||||
@@ -0,0 +1,2 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb4-40k_voc12aug-512x512.py'
|
||||
model = dict(pretrained='open-mmlab://resnet101_v1c', backbone=dict(depth=101))
|
||||
@@ -0,0 +1,2 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb4-80k_ade20k-512x512.py'
|
||||
model = dict(pretrained='open-mmlab://resnet101_v1c', backbone=dict(depth=101))
|
||||
@@ -0,0 +1,2 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb4-80k_coco-stuff164k-512x512.py'
|
||||
model = dict(pretrained='open-mmlab://resnet101_v1c', backbone=dict(depth=101))
|
||||
@@ -0,0 +1,2 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb4-80k_pascal-context-480x480.py'
|
||||
model = dict(pretrained='open-mmlab://resnet101_v1c', backbone=dict(depth=101))
|
||||
@@ -0,0 +1,2 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb4-40k_pascal-context-59-480x480.py'
|
||||
model = dict(pretrained='open-mmlab://resnet101_v1c', backbone=dict(depth=101))
|
||||
@@ -0,0 +1,4 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb2-80k_cityscapes-512x1024.py'
|
||||
model = dict(
|
||||
pretrained='torchvision://resnet101',
|
||||
backbone=dict(type='ResNet', depth=101))
|
||||
@@ -0,0 +1,4 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb2-80k_cityscapes-769x769.py'
|
||||
model = dict(
|
||||
pretrained='torchvision://resnet101',
|
||||
backbone=dict(type='ResNet', depth=101))
|
||||
@@ -0,0 +1,9 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb2-80k_cityscapes-512x1024.py'
|
||||
model = dict(
|
||||
pretrained='open-mmlab://resnet18_v1c',
|
||||
backbone=dict(depth=18),
|
||||
decode_head=dict(
|
||||
in_channels=512,
|
||||
channels=128,
|
||||
),
|
||||
auxiliary_head=dict(in_channels=256, channels=64))
|
||||
@@ -0,0 +1,9 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb2-80k_cityscapes-769x769.py'
|
||||
model = dict(
|
||||
pretrained='open-mmlab://resnet18_v1c',
|
||||
backbone=dict(depth=18),
|
||||
decode_head=dict(
|
||||
in_channels=512,
|
||||
channels=128,
|
||||
),
|
||||
auxiliary_head=dict(in_channels=256, channels=64))
|
||||
@@ -0,0 +1,9 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb2-80k_cityscapes-512x1024.py'
|
||||
model = dict(
|
||||
pretrained='torchvision://resnet18',
|
||||
backbone=dict(type='ResNet', depth=18),
|
||||
decode_head=dict(
|
||||
in_channels=512,
|
||||
channels=128,
|
||||
),
|
||||
auxiliary_head=dict(in_channels=256, channels=64))
|
||||
@@ -0,0 +1,9 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb2-80k_cityscapes-769x769.py'
|
||||
model = dict(
|
||||
pretrained='torchvision://resnet18',
|
||||
backbone=dict(type='ResNet', depth=18),
|
||||
decode_head=dict(
|
||||
in_channels=512,
|
||||
channels=128,
|
||||
),
|
||||
auxiliary_head=dict(in_channels=256, channels=64))
|
||||
@@ -0,0 +1,7 @@
|
||||
_base_ = [
|
||||
'../_base_/models/deeplabv3_r50-d8.py', '../_base_/datasets/cityscapes.py',
|
||||
'../_base_/default_runtime.py', '../_base_/schedules/schedule_40k.py'
|
||||
]
|
||||
crop_size = (512, 1024)
|
||||
data_preprocessor = dict(size=crop_size)
|
||||
model = dict(data_preprocessor=data_preprocessor)
|
||||
@@ -0,0 +1,12 @@
|
||||
_base_ = [
|
||||
'../_base_/models/deeplabv3_r50-d8.py',
|
||||
'../_base_/datasets/cityscapes_769x769.py', '../_base_/default_runtime.py',
|
||||
'../_base_/schedules/schedule_40k.py'
|
||||
]
|
||||
crop_size = (769, 769)
|
||||
data_preprocessor = dict(size=crop_size)
|
||||
model = dict(
|
||||
data_preprocessor=data_preprocessor,
|
||||
decode_head=dict(align_corners=True),
|
||||
auxiliary_head=dict(align_corners=True),
|
||||
test_cfg=dict(mode='slide', crop_size=(769, 769), stride=(513, 513)))
|
||||
@@ -0,0 +1,7 @@
|
||||
_base_ = [
|
||||
'../_base_/models/deeplabv3_r50-d8.py', '../_base_/datasets/cityscapes.py',
|
||||
'../_base_/default_runtime.py', '../_base_/schedules/schedule_80k.py'
|
||||
]
|
||||
crop_size = (512, 1024)
|
||||
data_preprocessor = dict(size=crop_size)
|
||||
model = dict(data_preprocessor=data_preprocessor)
|
||||
@@ -0,0 +1,12 @@
|
||||
_base_ = [
|
||||
'../_base_/models/deeplabv3_r50-d8.py',
|
||||
'../_base_/datasets/cityscapes_769x769.py', '../_base_/default_runtime.py',
|
||||
'../_base_/schedules/schedule_80k.py'
|
||||
]
|
||||
crop_size = (769, 769)
|
||||
data_preprocessor = dict(size=crop_size)
|
||||
model = dict(
|
||||
data_preprocessor=data_preprocessor,
|
||||
decode_head=dict(align_corners=True),
|
||||
auxiliary_head=dict(align_corners=True),
|
||||
test_cfg=dict(mode='slide', crop_size=(769, 769), stride=(513, 513)))
|
||||
@@ -0,0 +1,10 @@
|
||||
_base_ = [
|
||||
'../_base_/models/deeplabv3_r50-d8.py', '../_base_/datasets/ade20k.py',
|
||||
'../_base_/default_runtime.py', '../_base_/schedules/schedule_160k.py'
|
||||
]
|
||||
crop_size = (512, 512)
|
||||
data_preprocessor = dict(size=crop_size)
|
||||
model = dict(
|
||||
data_preprocessor=data_preprocessor,
|
||||
decode_head=dict(num_classes=150),
|
||||
auxiliary_head=dict(num_classes=150))
|
||||
@@ -0,0 +1,11 @@
|
||||
_base_ = [
|
||||
'../_base_/models/deeplabv3_r50-d8.py',
|
||||
'../_base_/datasets/coco-stuff164k.py', '../_base_/default_runtime.py',
|
||||
'../_base_/schedules/schedule_160k.py'
|
||||
]
|
||||
crop_size = (512, 512)
|
||||
data_preprocessor = dict(size=crop_size)
|
||||
model = dict(
|
||||
data_preprocessor=data_preprocessor,
|
||||
decode_head=dict(num_classes=171),
|
||||
auxiliary_head=dict(num_classes=171))
|
||||
@@ -0,0 +1,11 @@
|
||||
_base_ = [
|
||||
'../_base_/models/deeplabv3_r50-d8.py',
|
||||
'../_base_/datasets/coco-stuff10k.py', '../_base_/default_runtime.py',
|
||||
'../_base_/schedules/schedule_20k.py'
|
||||
]
|
||||
crop_size = (512, 512)
|
||||
data_preprocessor = dict(size=crop_size)
|
||||
model = dict(
|
||||
data_preprocessor=data_preprocessor,
|
||||
decode_head=dict(num_classes=171),
|
||||
auxiliary_head=dict(num_classes=171))
|
||||
@@ -0,0 +1,11 @@
|
||||
_base_ = [
|
||||
'../_base_/models/deeplabv3_r50-d8.py',
|
||||
'../_base_/datasets/pascal_voc12_aug.py', '../_base_/default_runtime.py',
|
||||
'../_base_/schedules/schedule_20k.py'
|
||||
]
|
||||
crop_size = (512, 512)
|
||||
data_preprocessor = dict(size=crop_size)
|
||||
model = dict(
|
||||
data_preprocessor=data_preprocessor,
|
||||
decode_head=dict(num_classes=21),
|
||||
auxiliary_head=dict(num_classes=21))
|
||||
@@ -0,0 +1,11 @@
|
||||
_base_ = [
|
||||
'../_base_/models/deeplabv3_r50-d8.py',
|
||||
'../_base_/datasets/coco-stuff164k.py', '../_base_/default_runtime.py',
|
||||
'../_base_/schedules/schedule_320k.py'
|
||||
]
|
||||
crop_size = (512, 512)
|
||||
data_preprocessor = dict(size=crop_size)
|
||||
model = dict(
|
||||
data_preprocessor=data_preprocessor,
|
||||
decode_head=dict(num_classes=171),
|
||||
auxiliary_head=dict(num_classes=171))
|
||||
@@ -0,0 +1,11 @@
|
||||
_base_ = [
|
||||
'../_base_/models/deeplabv3_r50-d8.py',
|
||||
'../_base_/datasets/coco-stuff10k.py', '../_base_/default_runtime.py',
|
||||
'../_base_/schedules/schedule_40k.py'
|
||||
]
|
||||
crop_size = (512, 512)
|
||||
data_preprocessor = dict(size=crop_size)
|
||||
model = dict(
|
||||
data_preprocessor=data_preprocessor,
|
||||
decode_head=dict(num_classes=171),
|
||||
auxiliary_head=dict(num_classes=171))
|
||||
@@ -0,0 +1,14 @@
|
||||
_base_ = [
|
||||
'../_base_/models/deeplabv3_r50-d8.py',
|
||||
'../_base_/datasets/pascal_context.py', '../_base_/default_runtime.py',
|
||||
'../_base_/schedules/schedule_40k.py'
|
||||
]
|
||||
crop_size = (480, 480)
|
||||
data_preprocessor = dict(size=crop_size)
|
||||
model = dict(
|
||||
data_preprocessor=data_preprocessor,
|
||||
decode_head=dict(num_classes=60),
|
||||
auxiliary_head=dict(num_classes=60),
|
||||
test_cfg=dict(mode='slide', crop_size=(480, 480), stride=(320, 320)))
|
||||
optimizer = dict(type='SGD', lr=0.004, momentum=0.9, weight_decay=0.0001)
|
||||
optim_wrapper = dict(type='OptimWrapper', optimizer=optimizer)
|
||||
@@ -0,0 +1,14 @@
|
||||
_base_ = [
|
||||
'../_base_/models/deeplabv3_r50-d8.py',
|
||||
'../_base_/datasets/pascal_context_59.py', '../_base_/default_runtime.py',
|
||||
'../_base_/schedules/schedule_40k.py'
|
||||
]
|
||||
crop_size = (480, 480)
|
||||
data_preprocessor = dict(size=crop_size)
|
||||
model = dict(
|
||||
data_preprocessor=data_preprocessor,
|
||||
decode_head=dict(num_classes=59),
|
||||
auxiliary_head=dict(num_classes=59),
|
||||
test_cfg=dict(mode='slide', crop_size=(480, 480), stride=(320, 320)))
|
||||
optimizer = dict(type='SGD', lr=0.004, momentum=0.9, weight_decay=0.0001)
|
||||
optim_wrapper = dict(type='OptimWrapper', optimizer=optimizer)
|
||||
@@ -0,0 +1,11 @@
|
||||
_base_ = [
|
||||
'../_base_/models/deeplabv3_r50-d8.py',
|
||||
'../_base_/datasets/pascal_voc12_aug.py', '../_base_/default_runtime.py',
|
||||
'../_base_/schedules/schedule_40k.py'
|
||||
]
|
||||
crop_size = (512, 512)
|
||||
data_preprocessor = dict(size=crop_size)
|
||||
model = dict(
|
||||
data_preprocessor=data_preprocessor,
|
||||
decode_head=dict(num_classes=21),
|
||||
auxiliary_head=dict(num_classes=21))
|
||||
@@ -0,0 +1,10 @@
|
||||
_base_ = [
|
||||
'../_base_/models/deeplabv3_r50-d8.py', '../_base_/datasets/ade20k.py',
|
||||
'../_base_/default_runtime.py', '../_base_/schedules/schedule_80k.py'
|
||||
]
|
||||
crop_size = (512, 512)
|
||||
data_preprocessor = dict(size=crop_size)
|
||||
model = dict(
|
||||
data_preprocessor=data_preprocessor,
|
||||
decode_head=dict(num_classes=150),
|
||||
auxiliary_head=dict(num_classes=150))
|
||||
@@ -0,0 +1,11 @@
|
||||
_base_ = [
|
||||
'../_base_/models/deeplabv3_r50-d8.py',
|
||||
'../_base_/datasets/coco-stuff164k.py', '../_base_/default_runtime.py',
|
||||
'../_base_/schedules/schedule_80k.py'
|
||||
]
|
||||
crop_size = (512, 512)
|
||||
data_preprocessor = dict(size=crop_size)
|
||||
model = dict(
|
||||
data_preprocessor=data_preprocessor,
|
||||
decode_head=dict(num_classes=171),
|
||||
auxiliary_head=dict(num_classes=171))
|
||||
@@ -0,0 +1,14 @@
|
||||
_base_ = [
|
||||
'../_base_/models/deeplabv3_r50-d8.py',
|
||||
'../_base_/datasets/pascal_context.py', '../_base_/default_runtime.py',
|
||||
'../_base_/schedules/schedule_80k.py'
|
||||
]
|
||||
crop_size = (480, 480)
|
||||
data_preprocessor = dict(size=crop_size)
|
||||
model = dict(
|
||||
data_preprocessor=data_preprocessor,
|
||||
decode_head=dict(num_classes=60),
|
||||
auxiliary_head=dict(num_classes=60),
|
||||
test_cfg=dict(mode='slide', crop_size=(480, 480), stride=(320, 320)))
|
||||
optimizer = dict(type='SGD', lr=0.004, momentum=0.9, weight_decay=0.0001)
|
||||
optim_wrapper = dict(type='OptimWrapper', optimizer=optimizer)
|
||||
@@ -0,0 +1,14 @@
|
||||
_base_ = [
|
||||
'../_base_/models/deeplabv3_r50-d8.py',
|
||||
'../_base_/datasets/pascal_context_59.py', '../_base_/default_runtime.py',
|
||||
'../_base_/schedules/schedule_80k.py'
|
||||
]
|
||||
crop_size = (480, 480)
|
||||
data_preprocessor = dict(size=crop_size)
|
||||
model = dict(
|
||||
data_preprocessor=data_preprocessor,
|
||||
decode_head=dict(num_classes=59),
|
||||
auxiliary_head=dict(num_classes=59),
|
||||
test_cfg=dict(mode='slide', crop_size=(480, 480), stride=(320, 320)))
|
||||
optimizer = dict(type='SGD', lr=0.004, momentum=0.9, weight_decay=0.0001)
|
||||
optim_wrapper = dict(type='OptimWrapper', optimizer=optimizer)
|
||||
@@ -0,0 +1,2 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb2-80k_cityscapes-512x1024.py'
|
||||
model = dict(pretrained='torchvision://resnet50', backbone=dict(type='ResNet'))
|
||||
@@ -0,0 +1,2 @@
|
||||
_base_ = './deeplabv3_r50-d8_4xb2-80k_cityscapes-769x769.py'
|
||||
model = dict(pretrained='torchvision://resnet50', backbone=dict(type='ResNet'))
|
||||
985
Seg_All_In_One_MMSeg/configs/deeplabv3/metafile.yaml
Normal file
985
Seg_All_In_One_MMSeg/configs/deeplabv3/metafile.yaml
Normal file
@@ -0,0 +1,985 @@
|
||||
Collections:
|
||||
- Name: DeepLabV3
|
||||
License: Apache License 2.0
|
||||
Metadata:
|
||||
Training Data:
|
||||
- Cityscapes
|
||||
- ADE20K
|
||||
- Pascal VOC 2012 + Aug
|
||||
- Pascal Context
|
||||
- Pascal Context 59
|
||||
- COCO-Stuff 10k
|
||||
- COCO-Stuff 164k
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
README: configs/deeplabv3/README.md
|
||||
Frameworks:
|
||||
- PyTorch
|
||||
Models:
|
||||
- Name: deeplabv3_r50-d8_4xb2-40k_cityscapes-512x1024
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Cityscapes
|
||||
Metrics:
|
||||
mIoU: 79.09
|
||||
mIoU(ms+flip): 80.45
|
||||
Config: configs/deeplabv3/deeplabv3_r50-d8_4xb2-40k_cityscapes-512x1024.py
|
||||
Metadata:
|
||||
Training Data: Cityscapes
|
||||
Batch Size: 8
|
||||
Architecture:
|
||||
- R-50-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 6.1
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x1024_40k_cityscapes/deeplabv3_r50-d8_512x1024_40k_cityscapes_20200605_022449-acadc2f8.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x1024_40k_cityscapes/deeplabv3_r50-d8_512x1024_40k_cityscapes_20200605_022449.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101-d8_4xb2-40k_cityscapes-512x1024
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Cityscapes
|
||||
Metrics:
|
||||
mIoU: 77.12
|
||||
mIoU(ms+flip): 79.61
|
||||
Config: configs/deeplabv3/deeplabv3_r101-d8_4xb2-40k_cityscapes-512x1024.py
|
||||
Metadata:
|
||||
Training Data: Cityscapes
|
||||
Batch Size: 8
|
||||
Architecture:
|
||||
- R-101-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 9.6
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x1024_40k_cityscapes/deeplabv3_r101-d8_512x1024_40k_cityscapes_20200605_012241-7fd3f799.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x1024_40k_cityscapes/deeplabv3_r101-d8_512x1024_40k_cityscapes_20200605_012241.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r50-d8_4xb2-40k_cityscapes-769x769
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Cityscapes
|
||||
Metrics:
|
||||
mIoU: 78.58
|
||||
mIoU(ms+flip): 79.89
|
||||
Config: configs/deeplabv3/deeplabv3_r50-d8_4xb2-40k_cityscapes-769x769.py
|
||||
Metadata:
|
||||
Training Data: Cityscapes
|
||||
Batch Size: 8
|
||||
Architecture:
|
||||
- R-50-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 6.9
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_769x769_40k_cityscapes/deeplabv3_r50-d8_769x769_40k_cityscapes_20200606_113723-7eda553c.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_769x769_40k_cityscapes/deeplabv3_r50-d8_769x769_40k_cityscapes_20200606_113723.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101-d8_4xb2-40k_cityscapes-769x769
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Cityscapes
|
||||
Metrics:
|
||||
mIoU: 79.27
|
||||
mIoU(ms+flip): 80.11
|
||||
Config: configs/deeplabv3/deeplabv3_r101-d8_4xb2-40k_cityscapes-769x769.py
|
||||
Metadata:
|
||||
Training Data: Cityscapes
|
||||
Batch Size: 8
|
||||
Architecture:
|
||||
- R-101-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 10.9
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_769x769_40k_cityscapes/deeplabv3_r101-d8_769x769_40k_cityscapes_20200606_113809-c64f889f.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_769x769_40k_cityscapes/deeplabv3_r101-d8_769x769_40k_cityscapes_20200606_113809.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r18-d8_4xb2-80k_cityscapes-512x1024
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Cityscapes
|
||||
Metrics:
|
||||
mIoU: 76.7
|
||||
mIoU(ms+flip): 78.27
|
||||
Config: configs/deeplabv3/deeplabv3_r18-d8_4xb2-80k_cityscapes-512x1024.py
|
||||
Metadata:
|
||||
Training Data: Cityscapes
|
||||
Batch Size: 8
|
||||
Architecture:
|
||||
- R-18-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 1.7
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r18-d8_512x1024_80k_cityscapes/deeplabv3_r18-d8_512x1024_80k_cityscapes_20201225_021506-23dffbe2.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r18-d8_512x1024_80k_cityscapes/deeplabv3_r18-d8_512x1024_80k_cityscapes-20201225_021506.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r50-d8_4xb2-80k_cityscapes-512x1024
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Cityscapes
|
||||
Metrics:
|
||||
mIoU: 79.32
|
||||
mIoU(ms+flip): 80.57
|
||||
Config: configs/deeplabv3/deeplabv3_r50-d8_4xb2-80k_cityscapes-512x1024.py
|
||||
Metadata:
|
||||
Training Data: Cityscapes
|
||||
Batch Size: 8
|
||||
Architecture:
|
||||
- R-50-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x1024_80k_cityscapes/deeplabv3_r50-d8_512x1024_80k_cityscapes_20200606_113404-b92cfdd4.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x1024_80k_cityscapes/deeplabv3_r50-d8_512x1024_80k_cityscapes_20200606_113404.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101-d8_4xb2-80k_cityscapes-512x1024
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Cityscapes
|
||||
Metrics:
|
||||
mIoU: 80.2
|
||||
mIoU(ms+flip): 81.21
|
||||
Config: configs/deeplabv3/deeplabv3_r101-d8_4xb2-80k_cityscapes-512x1024.py
|
||||
Metadata:
|
||||
Training Data: Cityscapes
|
||||
Batch Size: 8
|
||||
Architecture:
|
||||
- R-101-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x1024_80k_cityscapes/deeplabv3_r101-d8_512x1024_80k_cityscapes_20200606_113503-9e428899.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x1024_80k_cityscapes/deeplabv3_r101-d8_512x1024_80k_cityscapes_20200606_113503.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101-d8_4xb2-amp-80k_cityscapes-512x1024
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Cityscapes
|
||||
Metrics:
|
||||
mIoU: 80.48
|
||||
Config: configs/deeplabv3/deeplabv3_r101-d8_4xb2-amp-80k_cityscapes-512x1024.py
|
||||
Metadata:
|
||||
Training Data: Cityscapes
|
||||
Batch Size: 8
|
||||
Architecture:
|
||||
- R-101-D8
|
||||
- DeepLabV3
|
||||
- (FP16)
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 5.75
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_fp16_512x1024_80k_cityscapes/deeplabv3_r101-d8_fp16_512x1024_80k_cityscapes_20200717_230920-774d9cec.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_fp16_512x1024_80k_cityscapes/deeplabv3_r101-d8_fp16_512x1024_80k_cityscapes_20200717_230920.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r18-d8_4xb2-80k_cityscapes-769x769
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Cityscapes
|
||||
Metrics:
|
||||
mIoU: 76.6
|
||||
mIoU(ms+flip): 78.26
|
||||
Config: configs/deeplabv3/deeplabv3_r18-d8_4xb2-80k_cityscapes-769x769.py
|
||||
Metadata:
|
||||
Training Data: Cityscapes
|
||||
Batch Size: 8
|
||||
Architecture:
|
||||
- R-18-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 1.9
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r18-d8_769x769_80k_cityscapes/deeplabv3_r18-d8_769x769_80k_cityscapes_20201225_021506-6452126a.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r18-d8_769x769_80k_cityscapes/deeplabv3_r18-d8_769x769_80k_cityscapes-20201225_021506.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r50-d8_4xb2-80k_cityscapes-769x769
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Cityscapes
|
||||
Metrics:
|
||||
mIoU: 79.89
|
||||
mIoU(ms+flip): 81.06
|
||||
Config: configs/deeplabv3/deeplabv3_r50-d8_4xb2-80k_cityscapes-769x769.py
|
||||
Metadata:
|
||||
Training Data: Cityscapes
|
||||
Batch Size: 8
|
||||
Architecture:
|
||||
- R-50-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_769x769_80k_cityscapes/deeplabv3_r50-d8_769x769_80k_cityscapes_20200606_221338-788d6228.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_769x769_80k_cityscapes/deeplabv3_r50-d8_769x769_80k_cityscapes_20200606_221338.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101-d8_4xb2-80k_cityscapes-769x769
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Cityscapes
|
||||
Metrics:
|
||||
mIoU: 79.67
|
||||
mIoU(ms+flip): 80.81
|
||||
Config: configs/deeplabv3/deeplabv3_r101-d8_4xb2-80k_cityscapes-769x769.py
|
||||
Metadata:
|
||||
Training Data: Cityscapes
|
||||
Batch Size: 8
|
||||
Architecture:
|
||||
- R-101-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_769x769_80k_cityscapes/deeplabv3_r101-d8_769x769_80k_cityscapes_20200607_013353-60e95418.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_769x769_80k_cityscapes/deeplabv3_r101-d8_769x769_80k_cityscapes_20200607_013353.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101-d16-mg124_4xb2-40k_cityscapes-512x1024
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Cityscapes
|
||||
Metrics:
|
||||
mIoU: 76.71
|
||||
mIoU(ms+flip): 78.63
|
||||
Config: configs/deeplabv3/deeplabv3_r101-d16-mg124_4xb2-40k_cityscapes-512x1024.py
|
||||
Metadata:
|
||||
Training Data: Cityscapes
|
||||
Batch Size: 8
|
||||
Architecture:
|
||||
- R-101-D16-MG124
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 4.7
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d16-mg124_512x1024_40k_cityscapes/deeplabv3_r101-d16-mg124_512x1024_40k_cityscapes_20200908_005644-67b0c992.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d16-mg124_512x1024_40k_cityscapes/deeplabv3_r101-d16-mg124_512x1024_40k_cityscapes-20200908_005644.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101-d16-mg124_4xb2-80k_cityscapes-512x1024
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Cityscapes
|
||||
Metrics:
|
||||
mIoU: 78.36
|
||||
mIoU(ms+flip): 79.84
|
||||
Config: configs/deeplabv3/deeplabv3_r101-d16-mg124_4xb2-80k_cityscapes-512x1024.py
|
||||
Metadata:
|
||||
Training Data: Cityscapes
|
||||
Batch Size: 8
|
||||
Architecture:
|
||||
- R-101-D16-MG124
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d16-mg124_512x1024_80k_cityscapes/deeplabv3_r101-d16-mg124_512x1024_80k_cityscapes_20200908_005644-57bb8425.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d16-mg124_512x1024_80k_cityscapes/deeplabv3_r101-d16-mg124_512x1024_80k_cityscapes-20200908_005644.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r18b-d8_4xb2-80k_cityscapes-512x1024
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Cityscapes
|
||||
Metrics:
|
||||
mIoU: 76.26
|
||||
mIoU(ms+flip): 77.88
|
||||
Config: configs/deeplabv3/deeplabv3_r18b-d8_4xb2-80k_cityscapes-512x1024.py
|
||||
Metadata:
|
||||
Training Data: Cityscapes
|
||||
Batch Size: 8
|
||||
Architecture:
|
||||
- R-18b-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 1.6
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r18b-d8_512x1024_80k_cityscapes/deeplabv3_r18b-d8_512x1024_80k_cityscapes_20201225_094144-46040cef.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r18b-d8_512x1024_80k_cityscapes/deeplabv3_r18b-d8_512x1024_80k_cityscapes-20201225_094144.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r50b-d8_4xb2-80k_cityscapes-512x1024
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Cityscapes
|
||||
Metrics:
|
||||
mIoU: 79.63
|
||||
mIoU(ms+flip): 80.98
|
||||
Config: configs/deeplabv3/deeplabv3_r50b-d8_4xb2-80k_cityscapes-512x1024.py
|
||||
Metadata:
|
||||
Training Data: Cityscapes
|
||||
Batch Size: 8
|
||||
Architecture:
|
||||
- R-50b-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 6.0
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50b-d8_512x1024_80k_cityscapes/deeplabv3_r50b-d8_512x1024_80k_cityscapes_20201225_155148-ec368954.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50b-d8_512x1024_80k_cityscapes/deeplabv3_r50b-d8_512x1024_80k_cityscapes-20201225_155148.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101b-d8_4xb2-80k_cityscapes-512x1024
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Cityscapes
|
||||
Metrics:
|
||||
mIoU: 80.01
|
||||
mIoU(ms+flip): 81.21
|
||||
Config: configs/deeplabv3/deeplabv3_r101b-d8_4xb2-80k_cityscapes-512x1024.py
|
||||
Metadata:
|
||||
Training Data: Cityscapes
|
||||
Batch Size: 8
|
||||
Architecture:
|
||||
- R-101b-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 9.5
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101b-d8_512x1024_80k_cityscapes/deeplabv3_r101b-d8_512x1024_80k_cityscapes_20201226_171821-8fd49503.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101b-d8_512x1024_80k_cityscapes/deeplabv3_r101b-d8_512x1024_80k_cityscapes-20201226_171821.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r18b-d8_4xb2-80k_cityscapes-769x769
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Cityscapes
|
||||
Metrics:
|
||||
mIoU: 75.63
|
||||
mIoU(ms+flip): 77.51
|
||||
Config: configs/deeplabv3/deeplabv3_r18b-d8_4xb2-80k_cityscapes-769x769.py
|
||||
Metadata:
|
||||
Training Data: Cityscapes
|
||||
Batch Size: 8
|
||||
Architecture:
|
||||
- R-18b-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 1.8
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r18b-d8_769x769_80k_cityscapes/deeplabv3_r18b-d8_769x769_80k_cityscapes_20201225_094144-fdc985d9.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r18b-d8_769x769_80k_cityscapes/deeplabv3_r18b-d8_769x769_80k_cityscapes-20201225_094144.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r50b-d8_4xb2-80k_cityscapes-769x769
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Cityscapes
|
||||
Metrics:
|
||||
mIoU: 78.8
|
||||
mIoU(ms+flip): 80.27
|
||||
Config: configs/deeplabv3/deeplabv3_r50b-d8_4xb2-80k_cityscapes-769x769.py
|
||||
Metadata:
|
||||
Training Data: Cityscapes
|
||||
Batch Size: 8
|
||||
Architecture:
|
||||
- R-50b-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 6.8
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50b-d8_769x769_80k_cityscapes/deeplabv3_r50b-d8_769x769_80k_cityscapes_20201225_155404-87fb0cf4.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50b-d8_769x769_80k_cityscapes/deeplabv3_r50b-d8_769x769_80k_cityscapes-20201225_155404.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101b-d8_4xb2-80k_cityscapes-769x769
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Cityscapes
|
||||
Metrics:
|
||||
mIoU: 79.41
|
||||
mIoU(ms+flip): 80.73
|
||||
Config: configs/deeplabv3/deeplabv3_r101b-d8_4xb2-80k_cityscapes-769x769.py
|
||||
Metadata:
|
||||
Training Data: Cityscapes
|
||||
Batch Size: 8
|
||||
Architecture:
|
||||
- R-101b-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 10.7
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101b-d8_769x769_80k_cityscapes/deeplabv3_r101b-d8_769x769_80k_cityscapes_20201226_190843-9142ee57.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101b-d8_769x769_80k_cityscapes/deeplabv3_r101b-d8_769x769_80k_cityscapes-20201226_190843.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r50-d8_4xb4-80k_ade20k-512x512
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: ADE20K
|
||||
Metrics:
|
||||
mIoU: 42.42
|
||||
mIoU(ms+flip): 43.28
|
||||
Config: configs/deeplabv3/deeplabv3_r50-d8_4xb4-80k_ade20k-512x512.py
|
||||
Metadata:
|
||||
Training Data: ADE20K
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-50-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 8.9
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_80k_ade20k/deeplabv3_r50-d8_512x512_80k_ade20k_20200614_185028-0bb3f844.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_80k_ade20k/deeplabv3_r50-d8_512x512_80k_ade20k_20200614_185028.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101-d8_4xb4-80k_ade20k-512x512
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: ADE20K
|
||||
Metrics:
|
||||
mIoU: 44.08
|
||||
mIoU(ms+flip): 45.19
|
||||
Config: configs/deeplabv3/deeplabv3_r101-d8_4xb4-80k_ade20k-512x512.py
|
||||
Metadata:
|
||||
Training Data: ADE20K
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-101-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 12.4
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_80k_ade20k/deeplabv3_r101-d8_512x512_80k_ade20k_20200615_021256-d89c7fa4.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_80k_ade20k/deeplabv3_r101-d8_512x512_80k_ade20k_20200615_021256.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r50-d8_4xb4-160k_ade20k-512x512
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: ADE20K
|
||||
Metrics:
|
||||
mIoU: 42.66
|
||||
mIoU(ms+flip): 44.09
|
||||
Config: configs/deeplabv3/deeplabv3_r50-d8_4xb4-160k_ade20k-512x512.py
|
||||
Metadata:
|
||||
Training Data: ADE20K
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-50-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_160k_ade20k/deeplabv3_r50-d8_512x512_160k_ade20k_20200615_123227-5d0ee427.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_160k_ade20k/deeplabv3_r50-d8_512x512_160k_ade20k_20200615_123227.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101-d8_4xb4-160k_ade20k-512x512
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: ADE20K
|
||||
Metrics:
|
||||
mIoU: 45.0
|
||||
mIoU(ms+flip): 46.66
|
||||
Config: configs/deeplabv3/deeplabv3_r101-d8_4xb4-160k_ade20k-512x512.py
|
||||
Metadata:
|
||||
Training Data: ADE20K
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-101-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_160k_ade20k/deeplabv3_r101-d8_512x512_160k_ade20k_20200615_105816-b1f72b3b.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_160k_ade20k/deeplabv3_r101-d8_512x512_160k_ade20k_20200615_105816.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r50-d8_4xb4-20k_voc12aug-512x512
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Pascal VOC 2012 + Aug
|
||||
Metrics:
|
||||
mIoU: 76.17
|
||||
mIoU(ms+flip): 77.42
|
||||
Config: configs/deeplabv3/deeplabv3_r50-d8_4xb4-20k_voc12aug-512x512.py
|
||||
Metadata:
|
||||
Training Data: Pascal VOC 2012 + Aug
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-50-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 6.1
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_20k_voc12aug/deeplabv3_r50-d8_512x512_20k_voc12aug_20200617_010906-596905ef.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_20k_voc12aug/deeplabv3_r50-d8_512x512_20k_voc12aug_20200617_010906.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101-d8_4xb4-20k_voc12aug-512x512
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Pascal VOC 2012 + Aug
|
||||
Metrics:
|
||||
mIoU: 78.7
|
||||
mIoU(ms+flip): 79.95
|
||||
Config: configs/deeplabv3/deeplabv3_r101-d8_4xb4-20k_voc12aug-512x512.py
|
||||
Metadata:
|
||||
Training Data: Pascal VOC 2012 + Aug
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-101-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 9.6
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_20k_voc12aug/deeplabv3_r101-d8_512x512_20k_voc12aug_20200617_010932-8d13832f.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_20k_voc12aug/deeplabv3_r101-d8_512x512_20k_voc12aug_20200617_010932.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r50-d8_4xb4-40k_voc12aug-512x512
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Pascal VOC 2012 + Aug
|
||||
Metrics:
|
||||
mIoU: 77.68
|
||||
mIoU(ms+flip): 78.78
|
||||
Config: configs/deeplabv3/deeplabv3_r50-d8_4xb4-40k_voc12aug-512x512.py
|
||||
Metadata:
|
||||
Training Data: Pascal VOC 2012 + Aug
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-50-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_40k_voc12aug/deeplabv3_r50-d8_512x512_40k_voc12aug_20200613_161546-2ae96e7e.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_40k_voc12aug/deeplabv3_r50-d8_512x512_40k_voc12aug_20200613_161546.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101-d8_4xb4-40k_voc12aug-512x512
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Pascal VOC 2012 + Aug
|
||||
Metrics:
|
||||
mIoU: 77.92
|
||||
mIoU(ms+flip): 79.18
|
||||
Config: configs/deeplabv3/deeplabv3_r101-d8_4xb4-40k_voc12aug-512x512.py
|
||||
Metadata:
|
||||
Training Data: Pascal VOC 2012 + Aug
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-101-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_40k_voc12aug/deeplabv3_r101-d8_512x512_40k_voc12aug_20200613_161432-0017d784.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_40k_voc12aug/deeplabv3_r101-d8_512x512_40k_voc12aug_20200613_161432.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101-d8_4xb4-40k_pascal-context-480x480
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Pascal Context
|
||||
Metrics:
|
||||
mIoU: 46.55
|
||||
mIoU(ms+flip): 47.81
|
||||
Config: configs/deeplabv3/deeplabv3_r101-d8_4xb4-40k_pascal-context-480x480.py
|
||||
Metadata:
|
||||
Training Data: Pascal Context
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-101-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 9.2
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_480x480_40k_pascal_context/deeplabv3_r101-d8_480x480_40k_pascal_context_20200911_204118-1aa27336.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_480x480_40k_pascal_context/deeplabv3_r101-d8_480x480_40k_pascal_context-20200911_204118.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101-d8_4xb4-80k_pascal-context-480x480
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Pascal Context
|
||||
Metrics:
|
||||
mIoU: 46.42
|
||||
mIoU(ms+flip): 47.53
|
||||
Config: configs/deeplabv3/deeplabv3_r101-d8_4xb4-80k_pascal-context-480x480.py
|
||||
Metadata:
|
||||
Training Data: Pascal Context
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-101-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_480x480_80k_pascal_context/deeplabv3_r101-d8_480x480_80k_pascal_context_20200911_170155-2a21fff3.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_480x480_80k_pascal_context/deeplabv3_r101-d8_480x480_80k_pascal_context-20200911_170155.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101-d8_4xb4-40k_pascal-context-59-480x480
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Pascal Context 59
|
||||
Metrics:
|
||||
mIoU: 52.61
|
||||
mIoU(ms+flip): 54.28
|
||||
Config: configs/deeplabv3/deeplabv3_r101-d8_4xb4-40k_pascal-context-59-480x480.py
|
||||
Metadata:
|
||||
Training Data: Pascal Context 59
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-101-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_480x480_40k_pascal_context_59/deeplabv3_r101-d8_480x480_40k_pascal_context_59_20210416_110332-cb08ea46.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_480x480_40k_pascal_context_59/deeplabv3_r101-d8_480x480_40k_pascal_context_59-20210416_110332.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101-d8_4xb4-80k_pascal-context-59-480x480
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: Pascal Context 59
|
||||
Metrics:
|
||||
mIoU: 52.46
|
||||
mIoU(ms+flip): 54.09
|
||||
Config: configs/deeplabv3/deeplabv3_r101-d8_4xb4-80k_pascal-context-59-480x480.py
|
||||
Metadata:
|
||||
Training Data: Pascal Context 59
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-101-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_480x480_80k_pascal_context_59/deeplabv3_r101-d8_480x480_80k_pascal_context_59_20210416_113002-26303993.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_480x480_80k_pascal_context_59/deeplabv3_r101-d8_480x480_80k_pascal_context_59-20210416_113002.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r50-d8_4xb4-20k_coco-stuff10k-512x512
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: COCO-Stuff 10k
|
||||
Metrics:
|
||||
mIoU: 34.66
|
||||
mIoU(ms+flip): 36.08
|
||||
Config: configs/deeplabv3/deeplabv3_r50-d8_4xb4-20k_coco-stuff10k-512x512.py
|
||||
Metadata:
|
||||
Training Data: COCO-Stuff 10k
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-50-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 9.6
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_4x4_20k_coco-stuff10k/deeplabv3_r50-d8_512x512_4x4_20k_coco-stuff10k_20210821_043025-b35f789d.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_4x4_20k_coco-stuff10k/deeplabv3_r50-d8_512x512_4x4_20k_coco-stuff10k_20210821_043025.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101-d8_4xb4-20k_coco-stuff10k-512x512
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: COCO-Stuff 10k
|
||||
Metrics:
|
||||
mIoU: 37.3
|
||||
mIoU(ms+flip): 38.42
|
||||
Config: configs/deeplabv3/deeplabv3_r101-d8_4xb4-20k_coco-stuff10k-512x512.py
|
||||
Metadata:
|
||||
Training Data: COCO-Stuff 10k
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-101-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 13.2
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_4x4_20k_coco-stuff10k/deeplabv3_r101-d8_512x512_4x4_20k_coco-stuff10k_20210821_043025-c49752cb.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_4x4_20k_coco-stuff10k/deeplabv3_r101-d8_512x512_4x4_20k_coco-stuff10k_20210821_043025.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r50-d8_4xb4-40k_coco-stuff10k-512x512
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: COCO-Stuff 10k
|
||||
Metrics:
|
||||
mIoU: 35.73
|
||||
mIoU(ms+flip): 37.09
|
||||
Config: configs/deeplabv3/deeplabv3_r50-d8_4xb4-40k_coco-stuff10k-512x512.py
|
||||
Metadata:
|
||||
Training Data: COCO-Stuff 10k
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-50-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_4x4_40k_coco-stuff10k/deeplabv3_r50-d8_512x512_4x4_40k_coco-stuff10k_20210821_043305-dc76f3ff.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_4x4_40k_coco-stuff10k/deeplabv3_r50-d8_512x512_4x4_40k_coco-stuff10k_20210821_043305.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101-d8_4xb4-40k_coco-stuff10k-512x512
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: COCO-Stuff 10k
|
||||
Metrics:
|
||||
mIoU: 37.81
|
||||
mIoU(ms+flip): 38.8
|
||||
Config: configs/deeplabv3/deeplabv3_r101-d8_4xb4-40k_coco-stuff10k-512x512.py
|
||||
Metadata:
|
||||
Training Data: COCO-Stuff 10k
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-101-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_4x4_40k_coco-stuff10k/deeplabv3_r101-d8_512x512_4x4_40k_coco-stuff10k_20210821_043305-636cb433.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_4x4_40k_coco-stuff10k/deeplabv3_r101-d8_512x512_4x4_40k_coco-stuff10k_20210821_043305.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r50-d8_4xb4-80k_coco-stuff164k-512x512
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: COCO-Stuff 164k
|
||||
Metrics:
|
||||
mIoU: 39.38
|
||||
mIoU(ms+flip): 40.03
|
||||
Config: configs/deeplabv3/deeplabv3_r50-d8_4xb4-80k_coco-stuff164k-512x512.py
|
||||
Metadata:
|
||||
Training Data: COCO-Stuff 164k
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-50-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 9.6
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_4x4_80k_coco-stuff164k/deeplabv3_r50-d8_512x512_4x4_80k_coco-stuff164k_20210709_163016-88675c24.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_4x4_80k_coco-stuff164k/deeplabv3_r50-d8_512x512_4x4_80k_coco-stuff164k_20210709_163016.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101-d8_4xb4-80k_coco-stuff164k-512x512
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: COCO-Stuff 164k
|
||||
Metrics:
|
||||
mIoU: 40.87
|
||||
mIoU(ms+flip): 41.5
|
||||
Config: configs/deeplabv3/deeplabv3_r101-d8_4xb4-80k_coco-stuff164k-512x512.py
|
||||
Metadata:
|
||||
Training Data: COCO-Stuff 164k
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-101-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Memory (GB): 13.2
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_4x4_80k_coco-stuff164k/deeplabv3_r101-d8_512x512_4x4_80k_coco-stuff164k_20210709_201252-13600dc2.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_4x4_80k_coco-stuff164k/deeplabv3_r101-d8_512x512_4x4_80k_coco-stuff164k_20210709_201252.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r50-d8_4xb4-160k_coco-stuff164k-512x512
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: COCO-Stuff 164k
|
||||
Metrics:
|
||||
mIoU: 41.09
|
||||
mIoU(ms+flip): 41.69
|
||||
Config: configs/deeplabv3/deeplabv3_r50-d8_4xb4-160k_coco-stuff164k-512x512.py
|
||||
Metadata:
|
||||
Training Data: COCO-Stuff 164k
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-50-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_4x4_160k_coco-stuff164k/deeplabv3_r50-d8_512x512_4x4_160k_coco-stuff164k_20210709_163016-49f2812b.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_4x4_160k_coco-stuff164k/deeplabv3_r50-d8_512x512_4x4_160k_coco-stuff164k_20210709_163016.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101-d8_4xb4-160k_coco-stuff164k-512x512
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: COCO-Stuff 164k
|
||||
Metrics:
|
||||
mIoU: 41.82
|
||||
mIoU(ms+flip): 42.49
|
||||
Config: configs/deeplabv3/deeplabv3_r101-d8_4xb4-160k_coco-stuff164k-512x512.py
|
||||
Metadata:
|
||||
Training Data: COCO-Stuff 164k
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-101-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_4x4_160k_coco-stuff164k/deeplabv3_r101-d8_512x512_4x4_160k_coco-stuff164k_20210709_155402-f035acfd.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_4x4_160k_coco-stuff164k/deeplabv3_r101-d8_512x512_4x4_160k_coco-stuff164k_20210709_155402.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r50-d8_4xb4-320k_coco-stuff164k-512x512
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: COCO-Stuff 164k
|
||||
Metrics:
|
||||
mIoU: 41.37
|
||||
mIoU(ms+flip): 42.22
|
||||
Config: configs/deeplabv3/deeplabv3_r50-d8_4xb4-320k_coco-stuff164k-512x512.py
|
||||
Metadata:
|
||||
Training Data: COCO-Stuff 164k
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-50-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_4x4_320k_coco-stuff164k/deeplabv3_r50-d8_512x512_4x4_320k_coco-stuff164k_20210709_155403-51b21115.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r50-d8_512x512_4x4_320k_coco-stuff164k/deeplabv3_r50-d8_512x512_4x4_320k_coco-stuff164k_20210709_155403.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
- Name: deeplabv3_r101-d8_4xb4-320k_coco-stuff164k-512x512
|
||||
In Collection: DeepLabV3
|
||||
Results:
|
||||
Task: Semantic Segmentation
|
||||
Dataset: COCO-Stuff 164k
|
||||
Metrics:
|
||||
mIoU: 42.61
|
||||
mIoU(ms+flip): 43.42
|
||||
Config: configs/deeplabv3/deeplabv3_r101-d8_4xb4-320k_coco-stuff164k-512x512.py
|
||||
Metadata:
|
||||
Training Data: COCO-Stuff 164k
|
||||
Batch Size: 16
|
||||
Architecture:
|
||||
- R-101-D8
|
||||
- DeepLabV3
|
||||
Training Resources: 4x V100 GPUS
|
||||
Weights: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_4x4_320k_coco-stuff164k/deeplabv3_r101-d8_512x512_4x4_320k_coco-stuff164k_20210709_155402-3cbca14d.pth
|
||||
Training log: https://download.openmmlab.com/mmsegmentation/v0.5/deeplabv3/deeplabv3_r101-d8_512x512_4x4_320k_coco-stuff164k/deeplabv3_r101-d8_512x512_4x4_320k_coco-stuff164k_20210709_155402.log.json
|
||||
Paper:
|
||||
Title: Rethinking atrous convolution for semantic image segmentation
|
||||
URL: https://arxiv.org/abs/1706.05587
|
||||
Code: https://github.com/open-mmlab/mmsegmentation/blob/v0.17.0/mmseg/models/decode_heads/aspp_head.py#L54
|
||||
Framework: PyTorch
|
||||
@@ -0,0 +1,125 @@
|
||||
_base_ = [
|
||||
'../_base_/models/deeplabv3_r50-d8.py',
|
||||
'../_base_/datasets/my_dataset_model.py',
|
||||
'../_base_/default_runtime.py',
|
||||
'../_base_/schedules/schedule_40k_check_4000.py',
|
||||
]
|
||||
|
||||
norm_cfg = dict(
|
||||
type='BN',
|
||||
)
|
||||
|
||||
crop_size = (769, 769)
|
||||
|
||||
data_preprocessor = dict(
|
||||
size=(769, 769),
|
||||
mean=[
|
||||
94.94709810464303,
|
||||
61.72942233949928,
|
||||
75.93763705236906,
|
||||
],
|
||||
std=[
|
||||
44.005506081132594,
|
||||
42.69595666984776,
|
||||
44.99354156225523,
|
||||
],
|
||||
bgr_to_rgb=False,
|
||||
)
|
||||
|
||||
model = dict(
|
||||
pretrained='./My_Local_Model/open_mmlab/resnet50_v1c.pth',
|
||||
backbone=dict(
|
||||
depth=50,
|
||||
type='ResNetV1c',
|
||||
dilations=(1, 1, 1, 2),
|
||||
strides=(1, 2, 2, 1),
|
||||
multi_grid=(1, 2, 4),
|
||||
),
|
||||
data_preprocessor=dict(
|
||||
size=(769, 769),
|
||||
mean=[
|
||||
94.94709810464303,
|
||||
61.72942233949928,
|
||||
75.93763705236906,
|
||||
],
|
||||
std=[
|
||||
44.005506081132594,
|
||||
42.69595666984776,
|
||||
44.99354156225523,
|
||||
],
|
||||
bgr_to_rgb=False,
|
||||
),
|
||||
decode_head=dict(
|
||||
sampler=dict(
|
||||
type='OHEMPixelSampler',
|
||||
thresh=0.7,
|
||||
min_kept=10000,
|
||||
),
|
||||
dilations=(1, 6, 12, 18),
|
||||
in_channels=2048,
|
||||
channels=512,
|
||||
num_classes=36,
|
||||
loss_decode=dict(
|
||||
type='DiceLoss',
|
||||
use_sigmoid=False,
|
||||
loss_weight=1.0,
|
||||
),
|
||||
align_corners=True,
|
||||
norm_cfg=dict(
|
||||
type='BN',
|
||||
),
|
||||
),
|
||||
auxiliary_head=dict(
|
||||
in_channels=1024,
|
||||
channels=256,
|
||||
num_classes=36,
|
||||
loss_decode=dict(
|
||||
type='DiceLoss',
|
||||
use_sigmoid=False,
|
||||
loss_weight=1.0,
|
||||
),
|
||||
align_corners=True,
|
||||
norm_cfg=dict(
|
||||
type='BN',
|
||||
),
|
||||
),
|
||||
)
|
||||
|
||||
test_cfg = dict(
|
||||
mode='slide',
|
||||
crop_size=(769, 769),
|
||||
stride=(512, 512),
|
||||
)
|
||||
|
||||
optim_wrapper = dict(
|
||||
type='OptimWrapper',
|
||||
_delete_=True,
|
||||
optimizer=dict(
|
||||
type='AdamW',
|
||||
lr=0.0001,
|
||||
weight_decay=0.0005,
|
||||
),
|
||||
clip_grad=dict(
|
||||
max_norm=1,
|
||||
norm_type=2,
|
||||
),
|
||||
)
|
||||
|
||||
param_scheduler = [
|
||||
dict(
|
||||
type='LinearLR',
|
||||
start_factor=1e-06,
|
||||
by_epoch=False,
|
||||
begin=0,
|
||||
end=1500,
|
||||
),
|
||||
dict(
|
||||
type='PolyLR',
|
||||
power=0.9,
|
||||
begin=1500,
|
||||
end=40000,
|
||||
eta_min=1e-05,
|
||||
by_epoch=False,
|
||||
),
|
||||
]
|
||||
|
||||
@@ -0,0 +1,20 @@
|
||||
_base_ = ['../_base_/models/deeplabv3_r50-d8.py', '../_base_/datasets/publicdataset_cholecseg8k.py', '../_base_/default_runtime.py', '../_base_/schedules/schedule_300e_val1_check10.py']
|
||||
|
||||
norm_cfg = {'type': 'BN'}
|
||||
|
||||
crop_size = (512, 512)
|
||||
|
||||
data_preprocessor = {'size': (512, 512), 'mean': [85.65740418979115, 53.99282220050495, 46.074045888534535], 'std': [72.24589167201978, 56.76979155397199, 49.056637115061775], 'bgr_to_rgb': False}
|
||||
|
||||
model = {'pretrained': './My_Local_Model/open_mmlab/resnet18_v1c.pth', 'backbone': {'depth': 18, 'type': 'ResNetV1c'}, 'data_preprocessor': {'size': (512, 512), 'mean': [85.65740418979115, 53.99282220050495, 46.074045888534535], 'std': [72.24589167201978, 56.76979155397199, 49.056637115061775], 'bgr_to_rgb': False}, 'decode_head': {'in_channels': 512, 'channels': 128, 'num_classes': 13, 'loss_decode': {'type': 'DiceLoss', 'use_sigmoid': False, 'loss_weight': 1.0}, 'align_corners': False, 'norm_cfg': {'type': 'BN'}}, 'auxiliary_head': {'in_channels': 256, 'channels': 64, 'num_classes': 13, 'loss_decode': {'type': 'DiceLoss', 'use_sigmoid': False, 'loss_weight': 1.0}, 'align_corners': False, 'norm_cfg': {'type': 'BN'}}}
|
||||
|
||||
test_cfg = {'crop_size': (512, 512)}
|
||||
|
||||
optim_wrapper = {'type': 'OptimWrapper', '_delete_': True, 'optimizer': {'type': 'AdamW', 'lr': 0.0001, 'weight_decay': 0.0005}, 'clip_grad': {'max_norm': 1, 'norm_type': 2}}
|
||||
|
||||
param_scheduler = [{'type': 'LinearLR', 'start_factor': 1e-06, 'by_epoch': True, 'begin': 0, 'end': 10}, {'type': 'PolyLR', 'power': 0.9, 'begin': 10, 'end': 300, 'eta_min': 1e-05, 'by_epoch': True}]
|
||||
|
||||
vis_backends = [{'type': 'LocalVisBackend'}, {'type': 'TensorboardVisBackend'}, {'type': 'WandbVisBackend', 'init_kwargs': {'project': 'Seg_MMSeg_Test', 'name': 'my_deeplabv3_r50_r18_Pre_g1-300e_val1_check10_publicdataset_cholecseg8k-512x512-no_testslide'}}]
|
||||
|
||||
visualizer = {'name': 'visualizer', 'type': 'SegLocalVisualizer', 'vis_backends': [{'type': 'LocalVisBackend'}, {'type': 'TensorboardVisBackend'}, {'type': 'WandbVisBackend', 'init_kwargs': {'project': 'Seg_MMSeg_Test', 'name': 'my_deeplabv3_r50_r18_Pre_g1-300e_val1_check10_publicdataset_cholecseg8k-512x512-no_testslide'}}]}
|
||||
|
||||
Reference in New Issue
Block a user