## Changelog ### v2.11. (01/4/2021) **Highlights** - Support new method: [Localization Distillation for Object Detection](https://arxiv.org/pdf/2102.12252.pdf) - Support Pytorch2ONNX with batch inference and dynamic shape **New Features** - Support localization distillation for object detection (#4758) - Support Pytorch2ONNX with batch inference and dynamic shape for Faster-RCNN and mainstream one-stage detectors (#4796) - Add Deformable Conv2d TensorRT plugin (#858) **Improvements** - Support batch inference in head of RetinaNet (#4699) - Add batch dimension in second stage of Faster-RCNN (#4785) - Support batch inference in bbox coder (#4721) - Add check for `ann_ids` in `COCODataset` to ensure it is unique (#4789) - support for showing the FPN results (#4716) - support dynamic shape for grid_anchor (#4684) - Support automatic statistical evaluation results and export them to EXCEL (#4693) - Move pycocotools version check to when it is used (#4880) **Bug Fixes** - Fix a bug of TridentNet when doing the batch inference (#4717) - Fix a bug of Pytorch2ONNX in FASF (#4735) - Fix a bug when show the image with float type (#4732) ### v2.10.0 (01/03/2021) #### Highlights - Support new methods: [FPG](https://arxiv.org/abs/2004.03580) - Support ONNX2TensorRT for SSD, FSAF, FCOS, YOLOv3, and Faster R-CNN. #### New Features - Support ONNX2TensorRT for SSD, FSAF, FCOS, YOLOv3, and Faster R-CNN (#4569) - Support [Feature Pyramid Grids (FPG)](https://arxiv.org/abs/2004.03580) (#4645) - Support video demo (#4420) - Add seed option for sampler (#4665) - Support to customize type of runner (#4570, #4669) - Support synchronizing BN buffer in `EvalHook` (#4582) - Add script for GIF demo (#4573) #### Bug Fixes - Fix ConfigDict AttributeError and add Colab link (#4643) - Avoid crash in empty gt training of GFL head (#4631) - Fix `iou_thrs` bug in RPN evaluation (#4581) - Fix syntax error of config when upgrading model version (#4584) #### Improvements - Refactor unit test file structures (#4600) - Refactor nms config (#4636) - Get loading pipeline by checking the class directly rather than through config strings (#4619) - Add doctests for mask target generation and mask structures (#4614) - Use deep copy when copying pipeline arguments (#4621) - Update documentations (#4642, #4650, #4620, #4630) - Remove redundant code calling `import_modules_from_strings` (#4601) - Clean deprecated FP16 API (#4571) - Check whether `CLASSES` is correctly initialized in the intialization of `XMLDataset` (#4555) - Support batch inference in the inference API (#4462, #4526) - Clean deprecated warning and fix 'meta' error (#4695) ### v2.9.0 (01/02/2021) #### Highlights - Support new methods: [SCNet](https://arxiv.org/abs/2012.10150), [Sparse R-CNN](https://arxiv.org/abs/2011.12450) - Move `train_cfg` and `test_cfg` into model in configs - Support to visualize results based on prediction quality #### New Features - Support [SCNet](https://arxiv.org/abs/2012.10150) (#4356) - Support [Sparse R-CNN](https://arxiv.org/abs/2011.12450) (#4219) - Support evaluate mAP by multiple IoUs (#4398) - Support concatenate dataset for testing (#4452) - Support to visualize results based on prediction quality (#4441) - Add ONNX simplify option to Pytorch2ONNX script (#4468) - Add hook for checking compatibility of class numbers in heads and datasets (#4508) #### Bug Fixes - Fix CPU inference bug of Cascade RPN (#4410) - Fix NMS error of CornerNet when there is no prediction box (#4409) - Fix TypeError in CornerNet inference (#4411) - Fix bug of PAA when training with background images (#4391) - Fix the error that the window data is not destroyed when `out_file is not None` and `show==False` (#4442) - Fix order of NMS `score_factor` that will decrease the performance of YOLOv3 (#4473) - Fix bug in HTC TTA when the number of detection boxes is 0 (#4516) - Fix resize error in mask data structures (#4520) #### Improvements - Allow to customize classes in LVIS dataset (#4382) - Add tutorials for building new models with existing datasets (#4396) - Add CPU compatibility information in documentation (#4405) - Add documentation of deprecated `ImageToTensor` for batch inference (#4408) - Add more details in documentation for customizing dataset (#4430) - Switch `imshow_det_bboxes` visualization backend from OpenCV to Matplotlib (#4389) - Deprecate `ImageToTensor` in `image_demo.py` (#4400) - Move train_cfg/test_cfg into model (#4347, #4489) - Update docstring for `reg_decoded_bbox` option in bbox heads (#4467) - Update dataset information in documentation (#4525) - Release pre-trained R50 and R101 PAA detectors with multi-scale 3x training schedules (#4495) - Add guidance for speed benchmark (#4537) ### v2.8.0 (04/01/2021) #### Highlights - Support new methods: [Cascade RPN](https://arxiv.org/abs/1909.06720), [TridentNet](https://arxiv.org/abs/1901.01892) #### New Features - Support [Cascade RPN](https://arxiv.org/abs/1909.06720) (#1900) - Support [TridentNet](https://arxiv.org/abs/1901.01892) (#3313) #### Bug Fixes - Fix bug of show result in async_benchmark (#4367) - Fix scale factor in MaskTestMixin (#4366) - Fix but when returning indices in `multiclass_nms` (#4362) - Fix bug of empirical attention in resnext backbone error (#4300) - Fix bug of `img_norm_cfg` in FCOS-HRNet models with updated performance and models (#4250) - Fix invalid checkpoint and log in Mask R-CNN models on Cityscapes dataset (#4287) - Fix bug in distributed sampler when dataset is too small (#4257) - Fix bug of 'PAFPN has no attribute extra_convs_on_inputs' (#4235) #### Improvements - Update model url from aws to aliyun (#4349) - Update ATSS for PyTorch 1.6+ (#4359) - Update script to install ruby in pre-commit installation (#4360) - Delete deprecated `mmdet.ops` (#4325) - Refactor hungarian assigner for more general usage in Sparse R-CNN (#4259) - Handle scipy import in DETR to reduce package dependencies (#4339) - Update documentation of usages for config options after MMCV (1.2.3) supports overriding list in config (#4326) - Update pre-train models of faster rcnn trained on COCO subsets (#4307) - Avoid zero or too small value for beta in Dynamic R-CNN (#4303) - Add doccumentation for Pytorch2ONNX (#4271) - Add deprecated warning FPN arguments (#4264) - Support returning indices of kept bboxes when using nms (#4251) - Update type and device requirements when creating tensors `GFLHead` (#4210) - Update device requirements when creating tensors in `CrossEntropyLoss` (#4224) ### v2.7.0 (30/11/2020) - Support new method: [DETR](https://arxiv.org/abs/2005.12872), [ResNest](https://arxiv.org/abs/2004.08955), Faster R-CNN DC5. - Support YOLO, Mask R-CNN, and Cascade R-CNN models exportable to ONNX. #### New Features - Support [DETR](https://arxiv.org/abs/2005.12872) (#4201, #4206) - Support to link the best checkpoint in training (#3773) - Support to override config through options in inference.py (#4175) - Support YOLO, Mask R-CNN, and Cascade R-CNN models exportable to ONNX (#4087, #4083) - Support [ResNeSt](https://arxiv.org/abs/2004.08955) backbone (#2959) - Support unclip border bbox regression (#4076) - Add tpfp func in evaluating AP (#4069) - Support mixed precision training of SSD detector with other backbones (#4081) - Add Faster R-CNN DC5 models (#4043) #### Bug Fixes - Fix bug of `gpu_id` in distributed training mode (#4163) - Support Albumentations with version higher than 0.5 (#4032) - Fix num_classes bug in faster rcnn config (#4088) - Update code in docs/2_new_data_model.md (#4041) #### Improvements - Ensure DCN offset to have similar type as features in VFNet (#4198) - Add config links in README files of models (#4190) - Add tutorials for loss conventions (#3818) - Add solution to installation issues in 30-series GPUs (#4176) - Update docker version in get_started.md (#4145) - Add model statistics and polish some titles in configs README (#4140) - Clamp neg probability in FreeAnchor (#4082) - Speed up expanding large images (#4089) - Fix Pytorch 1.7 incompatibility issues (#4103) - Update trouble shooting page to resolve segmentation fault (#4055) - Update aLRP-Loss in project page (#4078) - Clean duplicated `reduce_mean` function (#4056) - Refactor Q&A (#4045) ### v2.6.0 (1/11/2020) - Support new method: [VarifocalNet](https://arxiv.org/abs/2008.13367). - Refactored documentation with more tutorials. #### New Features - Support GIoU calculation in `BboxOverlaps2D`, and re-implement `giou_loss` using `bbox_overlaps` (#3936) - Support random sampling in CPU mode (#3948) - Support VarifocalNet (#3666, #4024) #### Bug Fixes - Fix SABL validating bug in Cascade R-CNN (#3913) - Avoid division by zero in PAA head when num_pos=0 (#3938) - Fix temporary directory bug of multi-node testing error (#4034, #4017) - Fix `--show-dir` option in test script (#4025) - Fix GA-RetinaNet r50 model url (#3983) - Update code in docs and fix broken urls (#3947) #### Improvements - Refactor pytorch2onnx API into `mmdet.core.export` and use `generate_inputs_and_wrap_model` for pytorch2onnx (#3857, #3912) - Update RPN upgrade scripts for v2.5.0 compatibility (#3986) - Use mmcv `tensor2imgs` (#4010) - Update test robustness (#4000) - Update trouble shooting page (#3994) - Accelerate PAA training speed (#3985) - Support batch_size > 1 in validation (#3966) - Use RoIAlign implemented in MMCV for inference in CPU mode (#3930) - Documentation refactoring (#4031) ### v2.5.0 (5/10/2020) #### Highlights - Support new methods: [YOLACT](https://arxiv.org/abs/1904.02689), [CentripetalNet](https://arxiv.org/abs/2003.09119). - Add more documentations for easier and more clear usage. #### Backwards Incompatible Changes **FP16 related methods are imported from mmcv instead of mmdet. (#3766, #3822)** Mixed precision training utils in `mmdet.core.fp16` are moved to `mmcv.runner`, including `force_fp32`, `auto_fp16`, `wrap_fp16_model`, and `Fp16OptimizerHook`. A deprecation warning will be raised if users attempt to import those methods from `mmdet.core.fp16`, and will be finally removed in V2.10.0. **[0, N-1] represents foreground classes and N indicates background classes for all models. (#3221)** Before v2.5.0, the background label for RPN is 0, and N for other heads. Now the behavior is consistent for all models. Thus `self.background_labels` in `dense_heads` is removed and all heads use `self.num_classes` to indicate the class index of background labels. This change has no effect on the pre-trained models in the v2.x model zoo, but will affect the training of all models with RPN heads. Two-stage detectors whose RPN head uses softmax will be affected because the order of categories is changed. **Only call `get_subset_by_classes` when `test_mode=True` and `self.filter_empty_gt=True` (#3695)** Function `get_subset_by_classes` in dataset is refactored and only filters out images when `test_mode=True` and `self.filter_empty_gt=True`. In the original implementation, `get_subset_by_classes` is not related to the flag `self.filter_empty_gt` and will only be called when the classes is set during initialization no matter `test_mode` is `True` or `False`. This brings ambiguous behavior and potential bugs in many cases. After v2.5.0, if `filter_empty_gt=False`, no matter whether the classes are specified in a dataset, the dataset will use all the images in the annotations. If `filter_empty_gt=True` and `test_mode=True`, no matter whether the classes are specified, the dataset will call ``get_subset_by_classes` to check the images and filter out images containing no GT boxes. Therefore, the users should be responsible for the data filtering/cleaning process for the test dataset. #### New Features - Test time augmentation for single stage detectors (#3844, #3638) - Support to show the name of experiments during training (#3764) - Add `Shear`, `Rotate`, `Translate` Augmentation (#3656, #3619, #3687) - Add image-only transformations including `Constrast`, `Equalize`, `Color`, and `Brightness`. (#3643) - Support [YOLACT](https://arxiv.org/abs/1904.02689) (#3456) - Support [CentripetalNet](https://arxiv.org/abs/2003.09119) (#3390) - Support PyTorch 1.6 in docker (#3905) #### Bug Fixes - Fix the bug of training ATSS when there is no ground truth boxes (#3702) - Fix the bug of using Focal Loss when there is `num_pos` is 0 (#3702) - Fix the label index mapping in dataset browser (#3708) - Fix Mask R-CNN training stuck problem when ther is no positive rois (#3713) - Fix the bug of `self.rpn_head.test_cfg` in `RPNTestMixin` by using `self.rpn_head` in rpn head (#3808) - Fix deprecated `Conv2d` from mmcv.ops (#3791) - Fix device bug in RepPoints (#3836) - Fix SABL validating bug (#3849) - Use `https://download.openmmlab.com/mmcv/dist/index.html` for installing MMCV (#3840) - Fix nonzero in NMS for PyTorch 1.6.0 (#3867) - Fix the API change bug of PAA (#3883) - Fix typo in bbox_flip (#3886) - Fix cv2 import error of ligGL.so.1 in Dockerfile (#3891) #### Improvements - Change to use `mmcv.utils.collect_env` for collecting environment information to avoid duplicate codes (#3779) - Update checkpoint file names to v2.0 models in documentation (#3795) - Update tutorials for changing runtime settings (#3778), modifing loss (#3777) - Improve the function of `simple_test_bboxes` in SABL (#3853) - Convert mask to bool before using it as img's index for robustness and speedup (#3870) - Improve documentation of modules and dataset customization (#3821) ### v2.4.0 (5/9/2020) **Highlights** - Fix lots of issues/bugs and reorganize the trouble shooting page - Support new methods [SABL](https://arxiv.org/abs/1912.04260), [YOLOv3](https://arxiv.org/abs/1804.02767), and [PAA Assign](https://arxiv.org/abs/2007.08103) - Support Batch Inference - Start to publish `mmdet` package to PyPI since v2.3.0 - Switch model zoo to download.openmmlab.com **Backwards Incompatible Changes** - Support Batch Inference (#3564, #3686, #3705): Since v2.4.0, MMDetection could inference model with multiple images in a single GPU. This change influences all the test APIs in MMDetection and downstream codebases. To help the users migrate their code, we use `replace_ImageToTensor` (#3686) to convert legacy test data pipelines during dataset initialization. - Support RandomFlip with horizontal/vertical/diagonal direction (#3608): Since v2.4.0, MMDetection supports horizontal/vertical/diagonal flip in the data augmentation. This influences bounding box, mask, and image transformations in data augmentation process and the process that will map those data back to the original format. - Migrate to use `mmlvis` and `mmpycocotools` for COCO and LVIS dataset (#3727). The APIs are fully compatible with the original `lvis` and `pycocotools`. Users need to uninstall the existing pycocotools and lvis packages in their environment first and install `mmlvis` & `mmpycocotools`. **Bug Fixes** - Fix default mean/std for onnx (#3491) - Fix coco evaluation and add metric items (#3497) - Fix typo for install.md (#3516) - Fix atss when sampler per gpu is 1 (#3528) - Fix import of fuse_conv_bn (#3529) - Fix bug of gaussian_target, update unittest of heatmap (#3543) - Fixed VOC2012 evaluate (#3553) - Fix scale factor bug of rescale (#3566) - Fix with_xxx_attributes in base detector (#3567) - Fix boxes scaling when number is 0 (#3575) - Fix rfp check when neck config is a list (#3591) - Fix import of fuse conv bn in benchmark.py (#3606) - Fix webcam demo (#3634) - Fix typo and itemize issues in tutorial (#3658) - Fix error in distributed training when some levels of FPN are not assigned with bounding boxes (#3670) - Fix the width and height orders of stride in valid flag generation (#3685) - Fix weight initialization bug in Res2Net DCN (#3714) - Fix bug in OHEMSampler (#3677) **New Features** - Support Cutout augmentation (#3521) - Support evaluation on multiple datasets through ConcatDataset (#3522) - Support [PAA assign](https://arxiv.org/abs/2007.08103) #(3547) - Support eval metric with pickle results (#3607) - Support [YOLOv3](https://arxiv.org/abs/1804.02767) (#3083) - Support [SABL](https://arxiv.org/abs/1912.04260) (#3603) - Support to publish to Pypi in github-action (#3510) - Support custom imports (#3641) **Improvements** - Refactor common issues in documentation (#3530) - Add pytorch 1.6 to CI config (#3532) - Add config to runner meta (#3534) - Add eval-option flag for testing (#3537) - Add init_eval to evaluation hook (#3550) - Add include_bkg in ClassBalancedDataset (#3577) - Using config's loading in inference_detector (#3611) - Add ATSS ResNet-101 models in model zoo (#3639) - Update urls to download.openmmlab.com (#3665) - Support non-mask training for CocoDataset (#3711) ### v2.3.0 (5/8/2020) **Highlights** - The CUDA/C++ operators have been moved to `mmcv.ops`. For backward compatibility `mmdet.ops` is kept as warppers of `mmcv.ops`. - Support new methods [CornerNet](https://arxiv.org/abs/1808.01244), [DIOU](https://arxiv.org/abs/1911.08287)/[CIOU](https://arxiv.org/abs/2005.03572) loss, and new dataset: [LVIS V1](https://arxiv.org/abs/1908.03195) - Provide more detailed colab training tutorials and more complete documentation. - Support to convert RetinaNet from Pytorch to ONNX. **Bug Fixes** - Fix the model initialization bug of DetectoRS (#3187) - Fix the bug of module names in NASFCOSHead (#3205) - Fix the filename bug in publish_model.py (#3237) - Fix the dimensionality bug when `inside_flags.any()` is `False` in dense heads (#3242) - Fix the bug of forgetting to pass flip directions in `MultiScaleFlipAug` (#3262) - Fixed the bug caused by default value of `stem_channels` (#3333) - Fix the bug of model checkpoint loading for CPU inference (#3318, #3316) - Fix topk bug when box number is smaller than the expected topk number in ATSSAssigner (#3361) - Fix the gt priority bug in center_region_assigner.py (#3208) - Fix NaN issue of iou calculation in iou_loss.py (#3394) - Fix the bug that `iou_thrs` is not actually used during evaluation in coco.py (#3407) - Fix test-time augmentation of RepPoints (#3435) - Fix runtimeError caused by incontiguous tensor in Res2Net+DCN (#3412) **New Features** - Support [CornerNet](https://arxiv.org/abs/1808.01244) (#3036) - Support [DIOU](https://arxiv.org/abs/1911.08287)/[CIOU](https://arxiv.org/abs/2005.03572) loss (#3151) - Support [LVIS V1](https://arxiv.org/abs/1908.03195) dataset (#) - Support customized hooks in training (#3395) - Support fp16 training of generalized focal loss (#3410) - Support to convert RetinaNet from Pytorch to ONNX (#3075) **Improvements** - Support to process ignore boxes in ATSS assigner (#3082) - Allow to crop images without ground truth in `RandomCrop` (#3153) - Enable the the `Accuracy` module to set threshold (#3155) - Refactoring unit tests (#3206) - Unify the training settings of `to_float32` and `norm_cfg` in RegNets configs (#3210) - Add colab training tutorials for beginners (#3213, #3273) - Move CUDA/C++ operators into `mmcv.ops` and keep `mmdet.ops` as warppers for backward compatibility (#3232)(#3457) - Update installation scripts in documentation (#3290) and dockerfile (#3320) - Support to set image resize backend (#3392) - Remove git hash in version file (#3466) - Check mmcv version to force version compatibility (#3460) ### v2.2.0 (1/7/2020) **Highlights** - Support new methods: [DetectoRS](https://arxiv.org/abs/2006.02334), [PointRend](https://arxiv.org/abs/1912.08193), [Generalized Focal Loss](https://arxiv.org/abs/2006.04388), [Dynamic R-CNN](https://arxiv.org/abs/2004.06002) **Bug Fixes** - Fix FreeAnchor when no gt in image (#3176) - Clean up deprecated usage of `register_module()` (#3092, #3161) - Fix pretrain bug in NAS FCOS (#3145) - Fix `num_classes` in SSD (#3142) - Fix FCOS warmup (#3119) - Fix `rstrip` in `tools/publish_model.py` - Fix `flip_ratio` default value in RandomFLip pipeline (#3106) - Fix cityscapes eval with ms_rcnn (#3112) - Fix RPN softmax (#3056) - Fix filename of LVIS@v0.5 (#2998) - Fix nan loss by filtering out-of-frame gt_bboxes in COCO (#2999) - Fix bug in FSAF (#3018) - Add FocalLoss `num_classes` check (#2964) - Fix PISA Loss when there are no gts (#2992) - Avoid nan in `iou_calculator` (#2975) - Prevent possible bugs in loading and transforms caused by shallow copy (#2967) **New Features** - Add DetectoRS (#3064) - Support Generalize Focal Loss (#3097) - Support PointRend (#2752) - Support Dynamic R-CNN (#3040) - Add DeepFashion dataset (#2968) - Implement FCOS training tricks (#2935) - Use BaseDenseHead as base class for anchor-base heads (#2963) - Add `with_cp` for BasicBlock (#2891) - Add `stem_channels` argument for ResNet (#2954) **Improvements** - Add anchor free base head (#2867) - Migrate to github action (#3137) - Add docstring for datasets, pipelines, core modules and methods (#3130, #3125, #3120) - Add VOC benchmark (#3060) - Add `concat` mode in GRoI (#3098) - Remove cmd arg `autorescale-lr` (#3080) - Use `len(data['img_metas'])` to indicate `num_samples` (#3073, #3053) - Switch to EpochBasedRunner (#2976) ### v2.1.0 (8/6/2020) **Highlights** - Support new backbones: [RegNetX](https://arxiv.org/abs/2003.13678), [Res2Net](https://arxiv.org/abs/1904.01169) - Support new methods: [NASFCOS](https://arxiv.org/abs/1906.04423), [PISA](https://arxiv.org/abs/1904.04821), [GRoIE](https://arxiv.org/abs/2004.13665) - Support new dataset: [LVIS](https://arxiv.org/abs/1908.03195) **Bug Fixes** - Change the CLI argument `--validate` to `--no-validate` to enable validation after training epochs by default. (#2651) - Add missing cython to docker file (#2713) - Fix bug in nms cpu implementation (#2754) - Fix bug when showing mask results (#2763) - Fix gcc requirement (#2806) - Fix bug in async test (#2820) - Fix mask encoding-decoding bugs in test API (#2824) - Fix bug in test time augmentation (#2858, #2921, #2944) - Fix a typo in comment of apis/train (#2877) - Fix the bug of returning None when no gt bboxes are in the original image in `RandomCrop`. Fix the bug that misses to handle `gt_bboxes_ignore`, `gt_label_ignore`, and `gt_masks_ignore` in `RandomCrop`, `MinIoURandomCrop` and `Expand` modules. (#2810) - Fix bug of `base_channels` of regnet (#2917) - Fix the bug of logger when loading pre-trained weights in base detector (#2936) **New Features** - Add IoU models (#2666) - Add colab demo for inference - Support class agnostic nms (#2553) - Add benchmark gathering scripts for development only (#2676) - Add mmdet-based project links (#2736, #2767, #2895) - Add config dump in training (#2779) - Add ClassBalancedDataset (#2721) - Add res2net backbone (#2237) - Support RegNetX models (#2710) - Use `mmcv.FileClient` to support different storage backends (#2712) - Add ClassBalancedDataset (#2721) - Code Release: Prime Sample Attention in Object Detection (CVPR 2020) (#2626) - Implement NASFCOS (#2682) - Add class weight in CrossEntropyLoss (#2797) - Support LVIS dataset (#2088) - Support GRoIE (#2584) **Improvements** - Allow different x and y strides in anchor heads. (#2629) - Make FSAF loss more robust to no gt (#2680) - Compute pure inference time instead (#2657) and update inference speed (#2730) - Avoided the possibility that a patch with 0 area is cropped. (#2704) - Add warnings when deprecated `imgs_per_gpu` is used. (#2700) - Add a mask rcnn example for config (#2645) - Update model zoo (#2762, #2866, #2876, #2879, #2831) - Add `ori_filename` to img_metas and use it in test show-dir (#2612) - Use `img_fields` to handle multiple images during image transform (#2800) - Add upsample_cfg support in FPN (#2787) - Add `['img']` as default `img_fields` for back compatibility (#2809) - Rename the pretrained model from `open-mmlab://resnet50_caffe` and `open-mmlab://resnet50_caffe_bgr` to `open-mmlab://detectron/resnet50_caffe` and `open-mmlab://detectron2/resnet50_caffe`. (#2832) - Added sleep(2) in test.py to reduce hanging problem (#2847) - Support `c10::half` in CARAFE (#2890) - Improve documentations (#2918, #2714) - Use optimizer constructor in mmcv and clean the original implementation in `mmdet.core.optimizer` (#2947) ### v2.0.0 (6/5/2020) In this release, we made lots of major refactoring and modifications. 1. **Faster speed**. We optimize the training and inference speed for common models, achieving up to 30% speedup for training and 25% for inference. Please refer to [model zoo](model_zoo.md#comparison-with-detectron2) for details. 2. **Higher performance**. We change some default hyperparameters with no additional cost, which leads to a gain of performance for most models. Please refer to [compatibility](compatibility.md#training-hyperparameters) for details. 3. **More documentation and tutorials**. We add a bunch of documentation and tutorials to help users get started more smoothly. Read it [here](https://mmdetection.readthedocs.io/en/latest/). 4. **Support PyTorch 1.5**. The support for 1.1 and 1.2 is dropped, and we switch to some new APIs. 5. **Better configuration system**. Inheritance is supported to reduce the redundancy of configs. 6. **Better modular design**. Towards the goal of simplicity and flexibility, we simplify some encapsulation while add more other configurable modules like BBoxCoder, IoUCalculator, OptimizerConstructor, RoIHead. Target computation is also included in heads and the call hierarchy is simpler. 7. Support new methods: [FSAF](https://arxiv.org/abs/1903.00621) and PAFPN (part of [PAFPN](https://arxiv.org/abs/1803.01534)). **Breaking Changes** Models training with MMDetection 1.x are not fully compatible with 2.0, please refer to the [compatibility doc](compatibility.md) for the details and how to migrate to the new version. **Improvements** - Unify cuda and cpp API for custom ops. (#2277) - New config files with inheritance. (#2216) - Encapsulate the second stage into RoI heads. (#1999) - Refactor GCNet/EmpericalAttention into plugins. (#2345) - Set low quality match as an option in IoU-based bbox assigners. (#2375) - Change the codebase's coordinate system. (#2380) - Refactor the category order in heads. 0 means the first positive class instead of background now. (#2374) - Add bbox sampler and assigner registry. (#2419) - Speed up the inference of RPN. (#2420) - Add `train_cfg` and `test_cfg` as class members in all anchor heads. (#2422) - Merge target computation methods into heads. (#2429) - Add bbox coder to support different bbox encoding and losses. (#2480) - Unify the API for regression loss. (#2156) - Refactor Anchor Generator. (#2474) - Make `lr` an optional argument for optimizers. (#2509) - Migrate to modules and methods in MMCV. (#2502, #2511, #2569, #2572) - Support PyTorch 1.5. (#2524) - Drop the support for Python 3.5 and use F-string in the codebase. (#2531) **Bug Fixes** - Fix the scale factors for resized images without keep the aspect ratio. (#2039) - Check if max_num > 0 before slicing in NMS. (#2486) - Fix Deformable RoIPool when there is no instance. (#2490) - Fix the default value of assigned labels. (#2536) - Fix the evaluation of Cityscapes. (#2578) **New Features** - Add deep_stem and avg_down option to ResNet, i.e., support ResNetV1d. (#2252) - Add L1 loss. (#2376) - Support both polygon and bitmap for instance masks. (#2353, #2540) - Support CPU mode for inference. (#2385) - Add optimizer constructor for complicated configuration of optimizers. (#2397, #2488) - Implement PAFPN. (#2392) - Support empty tensor input for some modules. (#2280) - Support for custom dataset classes without overriding it. (#2408, #2443) - Support to train subsets of coco dataset. (#2340) - Add iou_calculator to potentially support more IoU calculation methods. (2405) - Support class wise mean AP (was removed in the last version). (#2459) - Add option to save the testing result images. (#2414) - Support MomentumUpdaterHook. (#2571) - Add a demo to inference a single image. (#2605) ### v1.1.0 (24/2/2020) **Highlights** - Dataset evaluation is rewritten with a unified api, which is used by both evaluation hooks and test scripts. - Support new methods: [CARAFE](https://arxiv.org/abs/1905.02188). **Breaking Changes** - The new MMDDP inherits from the official DDP, thus the `__init__` api is changed to be the same as official DDP. - The `mask_head` field in HTC config files is modified. - The evaluation and testing script is updated. - In all transforms, instance masks are stored as a numpy array shaped (n, h, w) instead of a list of (h, w) arrays, where n is the number of instances. **Bug Fixes** - Fix IOU assigners when ignore_iof_thr > 0 and there is no pred boxes. (#2135) - Fix mAP evaluation when there are no ignored boxes. (#2116) - Fix the empty RoI input for Deformable RoI Pooling. (#2099) - Fix the dataset settings for multiple workflows. (#2103) - Fix the warning related to `torch.uint8` in PyTorch 1.4. (#2105) - Fix the inference demo on devices other than gpu:0. (#2098) - Fix Dockerfile. (#2097) - Fix the bug that `pad_val` is unused in Pad transform. (#2093) - Fix the albumentation transform when there is no ground truth bbox. (#2032) **Improvements** - Use torch instead of numpy for random sampling. (#2094) - Migrate to the new MMDDP implementation in MMCV v0.3. (#2090) - Add meta information in logs. (#2086) - Rewrite Soft NMS with pytorch extension and remove cython as a dependency. (#2056) - Rewrite dataset evaluation. (#2042, #2087, #2114, #2128) - Use numpy array for masks in transforms. (#2030) **New Features** - Implement "CARAFE: Content-Aware ReAssembly of FEatures". (#1583) - Add `worker_init_fn()` in data_loader when seed is set. (#2066, #2111) - Add logging utils. (#2035) ### v1.0.0 (30/1/2020) This release mainly improves the code quality and add more docstrings. **Highlights** - Documentation is online now: https://mmdetection.readthedocs.io. - Support new models: [ATSS](https://arxiv.org/abs/1912.02424). - DCN is now available with the api `build_conv_layer` and `ConvModule` like the normal conv layer. - A tool to collect environment information is available for trouble shooting. **Bug Fixes** - Fix the incompatibility of the latest numpy and pycocotools. (#2024) - Fix the case when distributed package is unavailable, e.g., on Windows. (#1985) - Fix the dimension issue for `refine_bboxes()`. (#1962) - Fix the typo when `seg_prefix` is a list. (#1906) - Add segmentation map cropping to RandomCrop. (#1880) - Fix the return value of `ga_shape_target_single()`. (#1853) - Fix the loaded shape of empty proposals. (#1819) - Fix the mask data type when using albumentation. (#1818) **Improvements** - Enhance AssignResult and SamplingResult. (#1995) - Add ability to overwrite existing module in Registry. (#1982) - Reorganize requirements and make albumentations and imagecorruptions optional. (#1969) - Check NaN in `SSDHead`. (#1935) - Encapsulate the DCN in ResNe(X)t into a ConvModule & Conv_layers. (#1894) - Refactoring for mAP evaluation and support multiprocessing and logging. (#1889) - Init the root logger before constructing Runner to log more information. (#1865) - Split `SegResizeFlipPadRescale` into different existing transforms. (#1852) - Move `init_dist()` to MMCV. (#1851) - Documentation and docstring improvements. (#1971, #1938, #1869, #1838) - Fix the color of the same class for mask visualization. (#1834) - Remove the option `keep_all_stages` in HTC and Cascade R-CNN. (#1806) **New Features** - Add two test-time options `crop_mask` and `rle_mask_encode` for mask heads. (#2013) - Support loading grayscale images as single channel. (#1975) - Implement "Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection". (#1872) - Add sphinx generated docs. (#1859, #1864) - Add GN support for flops computation. (#1850) - Collect env info for trouble shooting. (#1812) ### v1.0rc1 (13/12/2019) The RC1 release mainly focuses on improving the user experience, and fixing bugs. **Highlights** - Support new models: [FoveaBox](https://arxiv.org/abs/1904.03797), [RepPoints](https://arxiv.org/abs/1904.11490) and [FreeAnchor](https://arxiv.org/abs/1909.02466). - Add a Dockerfile. - Add a jupyter notebook demo and a webcam demo. - Setup the code style and CI. - Add lots of docstrings and unit tests. - Fix lots of bugs. **Breaking Changes** - There was a bug for computing COCO-style mAP w.r.t different scales (AP_s, AP_m, AP_l), introduced by #621. (#1679) **Bug Fixes** - Fix a sampling interval bug in Libra R-CNN. (#1800) - Fix the learning rate in SSD300 WIDER FACE. (#1781) - Fix the scaling issue when `keep_ratio=False`. (#1730) - Fix typos. (#1721, #1492, #1242, #1108, #1107) - Fix the shuffle argument in `build_dataloader`. (#1693) - Clip the proposal when computing mask targets. (#1688) - Fix the "index out of range" bug for samplers in some corner cases. (#1610, #1404) - Fix the NMS issue on devices other than GPU:0. (#1603) - Fix SSD Head and GHM Loss on CPU. (#1578) - Fix the OOM error when there are too many gt bboxes. (#1575) - Fix the wrong keyword argument `nms_cfg` in HTC. (#1573) - Process masks and semantic segmentation in Expand and MinIoUCrop transforms. (#1550, #1361) - Fix a scale bug in the Non Local op. (#1528) - Fix a bug in transforms when `gt_bboxes_ignore` is None. (#1498) - Fix a bug when `img_prefix` is None. (#1497) - Pass the device argument to `grid_anchors` and `valid_flags`. (#1478) - Fix the data pipeline for test_robustness. (#1476) - Fix the argument type of deformable pooling. (#1390) - Fix the coco_eval when there are only two classes. (#1376) - Fix a bug in Modulated DeformableConv when deformable_group>1. (#1359) - Fix the mask cropping in RandomCrop. (#1333) - Fix zero outputs in DeformConv when not running on cuda:0. (#1326) - Fix the type issue in Expand. (#1288) - Fix the inference API. (#1255) - Fix the inplace operation in Expand. (#1249) - Fix the from-scratch training config. (#1196) - Fix inplace add in RoIExtractor which cause an error in PyTorch 1.2. (#1160) - Fix FCOS when input images has no positive sample. (#1136) - Fix recursive imports. (#1099) **Improvements** - Print the config file and mmdet version in the log. (#1721) - Lint the code before compiling in travis CI. (#1715) - Add a probability argument for the `Expand` transform. (#1651) - Update the PyTorch and CUDA version in the docker file. (#1615) - Raise a warning when specifying `--validate` in non-distributed training. (#1624, #1651) - Beautify the mAP printing. (#1614) - Add pre-commit hook. (#1536) - Add the argument `in_channels` to backbones. (#1475) - Add lots of docstrings and unit tests, thanks to [@Erotemic](https://github.com/Erotemic). (#1603, #1517, #1506, #1505, #1491, #1479, #1477, #1475, #1474) - Add support for multi-node distributed test when there is no shared storage. (#1399) - Optimize Dockerfile to reduce the image size. (#1306) - Update new results of HRNet. (#1284, #1182) - Add an argument `no_norm_on_lateral` in FPN. (#1240) - Test the compiling in CI. (#1235) - Move docs to a separate folder. (#1233) - Add a jupyter notebook demo. (#1158) - Support different type of dataset for training. (#1133) - Use int64_t instead of long in cuda kernels. (#1131) - Support unsquare RoIs for bbox and mask heads. (#1128) - Manually add type promotion to make compatible to PyTorch 1.2. (#1114) - Allowing validation dataset for computing validation loss. (#1093) - Use `.scalar_type()` instead of `.type()` to suppress some warnings. (#1070) **New Features** - Add an option `--with_ap` to compute the AP for each class. (#1549) - Implement "FreeAnchor: Learning to Match Anchors for Visual Object Detection". (#1391) - Support [Albumentations](https://github.com/albumentations-team/albumentations) for augmentations in the data pipeline. (#1354) - Implement "FoveaBox: Beyond Anchor-based Object Detector". (#1339) - Support horizontal and vertical flipping. (#1273, #1115) - Implement "RepPoints: Point Set Representation for Object Detection". (#1265) - Add test-time augmentation to HTC and Cascade R-CNN. (#1251) - Add a COCO result analysis tool. (#1228) - Add Dockerfile. (#1168) - Add a webcam demo. (#1155, #1150) - Add FLOPs counter. (#1127) - Allow arbitrary layer order for ConvModule. (#1078) ### v1.0rc0 (27/07/2019) - Implement lots of new methods and components (Mixed Precision Training, HTC, Libra R-CNN, Guided Anchoring, Empirical Attention, Mask Scoring R-CNN, Grid R-CNN (Plus), GHM, GCNet, FCOS, HRNet, Weight Standardization, etc.). Thank all collaborators! - Support two additional datasets: WIDER FACE and Cityscapes. - Refactoring for loss APIs and make it more flexible to adopt different losses and related hyper-parameters. - Speed up multi-gpu testing. - Integrate all compiling and installing in a single script. ### v0.6.0 (14/04/2019) - Up to 30% speedup compared to the model zoo. - Support both PyTorch stable and nightly version. - Replace NMS and SigmoidFocalLoss with Pytorch CUDA extensions. ### v0.6rc0(06/02/2019) - Migrate to PyTorch 1.0. ### v0.5.7 (06/02/2019) - Add support for Deformable ConvNet v2. (Many thanks to the authors and [@chengdazhi](https://github.com/chengdazhi)) - This is the last release based on PyTorch 0.4.1. ### v0.5.6 (17/01/2019) - Add support for Group Normalization. - Unify RPNHead and single stage heads (RetinaHead, SSDHead) with AnchorHead. ### v0.5.5 (22/12/2018) - Add SSD for COCO and PASCAL VOC. - Add ResNeXt backbones and detection models. - Refactoring for Samplers/Assigners and add OHEM. - Add VOC dataset and evaluation scripts. ### v0.5.4 (27/11/2018) - Add SingleStageDetector and RetinaNet. ### v0.5.3 (26/11/2018) - Add Cascade R-CNN and Cascade Mask R-CNN. - Add support for Soft-NMS in config files. ### v0.5.2 (21/10/2018) - Add support for custom datasets. - Add a script to convert PASCAL VOC annotations to the expected format. ### v0.5.1 (20/10/2018) - Add BBoxAssigner and BBoxSampler, the `train_cfg` field in config files are restructured. - `ConvFCRoIHead` / `SharedFCRoIHead` are renamed to `ConvFCBBoxHead` / `SharedFCBBoxHead` for consistency.