Postprocessor is function which processes prediction and/or annotation data after model infer and before metric calculation. For correct work postprocessors require specific representation format. (e. g. clip boxes postprocessor expects detection annotation and detection prediction for processing).
In case when you use complicated representation located in representation container, you can add options annotation_source and prediction_source in configuration file,
if you want process only specific representations, another way postprocessor will be used for all suitable representations. annotation_source and prediction_source should contain
comma separated list of annotation identifiers and output layer names respectively.
Every postprocessor has parameters available for configuration.
Accuracy Checker supports following set of postprocessors:
cast_to_int- casting detection bounding box coordinates given in floating point format to integer. Supported representations:DetectionAnotation,DetectionPrediction,TextDetectionAnnotation,TextDetectionPrediction.round_policy- method for rounding:nearest,greater,lower,nearest_to_zero.
clip_boxes- clipping detection bounding box sizes. Supported representations:DetectionAnotation,DetectionPrediction.dst_widthanddst_height- destination width and height for box clipping respectively. You can also usesizeinstead in case when destination sizes are equal. If not provided, image size will be used.apply_to- option which determines target boxes for processing (annotationfor ground truth boxes andpredictionfor detection results,allfor both).bboxes_normalizedis flag which says that target bounding boxes are in normalized format.
normalize_boxes- normalizing ground truth detection bounding boxes (cast to range [0, 1]). Supported representations:DetectionAnotation,DetectionPrediction,ActionDetectionAnotation,ActionDetectionPredictioncorrect_yolo_v2_boxes- resizing detection prediction bbox coordinates using specific for Yolo v2 approach. Supported representations:DetectionAnotation,DetectionPrediction.dst_widthanddst_height- destination width and height respectively. You can also usesizeinstead in case when destination sizes are equal.
resize_prediction_boxes- resizing normalized detection prediction boxes according to image size. Supported representations:DetectionAnotation,DetectionPrediction.rescaleif this option enabled, rescaling boxes on input size will be performed before multiplying on original input size. (Optional, defaultFalse).
faster_rcnn_postprocessing_resize- resizing normalized detection prediction boxes according to the original image size before preprocessing steps. Supported representations:DetectionAnotation,DetectionPrediction. At the moment works in the following cases only:- the preprocessing steps contains only one operation changing input image size, and the operation is
resize - the preprocessing steps contains only two operations changing input image size, and the operations are
resizeand thenpadding.
rescaleif this option enabled, rescaling boxes on input size will be performed before applying postprocessing (Optional, defaultFalse).
- the preprocessing steps contains only one operation changing input image size, and the operation is
nms- non-maximum suppression. Supported representations:DetectionAnotation,DetectionPrediction,ActionDetectionAnnotation,ActionDetectionPrediction.overlap- overlap threshold for merging detections.use_min_area- boolean value to determine whether to use minimum area of two bounding boxes as base area to calculate overlap.
diou_nms- distance-IoU non-maximum suppression. Supported representations:DetectionAnotation,DetectionPrediction,ActionDetectionAnnotation,ActionDetectionPrediction.overlap- overlap threshold for merging detections.
soft_nms- soft non-maximum suppression. Supported representations:DetectionAnotation,DetectionPrediction,ActionDetectionAnnotation,ActionDetectionPrediction.keep_top_k- the maximal number of detections which should be kept.sigma- sigma-value for updated detection score calculation.min_score- break point.
filter- filtering data using different parameters. Supported representations:DetectionAnotation,DetectionPrediction.apply_to- determines target boxes for processing (annotationfor ground truth boxes andpredictionfor detection results,allfor both).remove_filtered- removing filtered data. Annotations support ignoring filtered data without removing as default, in other cases filtered data will be removed automatically.- Supported parameters for filtering:
labels,min_confidence,height_range,width_range,is_empty,min_visibility,aspect_ratio,area_ratio,area_range. Filtering byheight_range,width_rangeare also available forTextDetectionAnnotation,TextDetectionPrediction,area_range- forPoseEstimationAnnotation,PoseEstimationPredictionandTextDetectionAnnotation,TextDetectionPrediction.
normalize_landmarks_points- normalizing ground truth landmarks points. Supported representations:FacialLandmarksAnnotation,FacialLandmarksPrediction.use_annotation_rect- allows to use size of rectangle saved in annotation metadata for point scaling instead source image size.encode_segmentation_mask- encoding segmentation label image as segmentation mask. Supported representations:SegmentationAnotation,SegmentationPrediction.apply_to- determines target masks for processing (annotationfor ground truth andpredictionfor detection results,allfor both). Note: this postprocessing requires specific dataset meta:segmentation_colorsfor annotations andprediction_to_gt_labelsfor predictions.
resize_segmentation_mask- resizing segmentation mask. Supported representations:SegmentationAnotation,SegmentationPrediction.dst_widthanddst_height- destination width and height for resize respectively. You can also usesizeinstead in case when destination sizes are equal. For resizing to final input size without padding, you can useto_dst_image_sizeflag with valueTrue. If any of these parameters are not specified, original image size will be used as default.apply_to- determines target masks for processing (annotationfor ground truth andpredictionfor detection results,allfor both).
extend_segmentation_mask- extending annotation segmentation mask to predicted mask size making border filled by specific value. Supported representations:SegmentationAnotation,SegmentationPrediction.filling_label- value for filling border (default 255).pad_type- padding space location. Supported:center,left_top,right_bottom(Default iscenter).
zoom_segmentation_mask- zooming segmentation mask. Supported representations:SegmentationAnotation,SegmentationPrediction.zoom- size for zoom operation.
crop_segmentation_mask- cropping 2-d, 3-d annotation mask. Supported representations:BrainTumorSegmentationAnnotation,BrainTumorSegmentationPrediction,SegmentationAnnotation,SegmentationPrediction.dst_width,dst_heightanddst_volumeare destination width, height and volume for cropped 3D-image respectively. You can also usesizeinstead in case when destination sizes are equal for all three dimensions.
crop_or_pad-segmentation_mask- performs central cropping if original mask size greater then destination size and padding in case, when source size lower than destination. Padding filling value is 0, realization - right-bottom.dst_widthanddst_heightare destination width and height for keypoints resizing respectively. You can also usesizeinstead in case when destination sizes are equal. Supported representations:SegmentationAnotation,SegmentationPrediction.
clip_segmentation_mask- clipping segmentation mask values. Supported representations:BrainTumorSegmentationAnnotation,BrainTumorSegmentationPrediction.min_value- lower bound of range.max_value- upper bound of range.
segmentation_prediction_resample- resamples output prediction in two steps: 1) resizes it to bounding box size; 2) extends to annotation size. Supported representations:BrainTumorSegmentationAnnotation,BrainTumorSegmentationPrediction. For correct bounding box size must be set via tagboxes_fileinbrats_numpyconverter orcrop_bratspreprocessor.make_argmax- applies argmax operation to prediction mask after resampling (by defaultFalse). Must be specified only one optionmake_argmax.
transform_brats_prediction- transforms prediction fromWT-TC-ETformat toNCR/NET-ED-ET. Sequentially fills one-channel mask with specifiedvaluesfor elements passing the threshold (threshold is0.5) from each prediction channel in specifiedorder.order- specifies filling order for channelsvalues- specifies values for each channel according to new order
remove_brats_prediction_padding- process output prediction in two steps: 1) remove padding; 2) extends to annotation size. Supported representations:BrainTumorSegmentationAnnotation,BrainTumorSegmentationPrediction.make_argmax- applies argmax operation to prediction mask (by defaultFalse).
extract_answers_tokens- extract predicted sequence of tokens from annotation text. Supported representations:QuestionAnsweringAnnotation,QuestionAnsweringPrediction.max_answer- maximum answer length (Optional, default value is 30).n_best_size- total number of n-best prediction size for the answer (Optional, default value is 20).
bidaf_extract_answers_tokens- extract predicted sequence of tokens from annotation text. Supported representations:QuestionAnsweringBiDAFAnnotation,QuestionAnsweringPrediction.translate_3d_poses- translating 3D poses. Supported representations:PoseEstimation3dAnnotation,PoseEstimation3dPrediction. Shifts 3D coordinates of each predicted poses on corresponding translation vector.resize_super_resolution- resizing super resolution predicted image. Supported representations:SuperResolutionAnotation,SuperResolutionPrediction.dst_widthanddst_height- destination width and height for resizing respectively. You can also usesizeinstead in case when destination sizes are equal. If any of these parameters are not specified, gt high resolution image size will be used as default.target- select target image for resize (predictionorannotation)
resize_style_transfer- resizing style transfer predicted image. Supported representations:StyleTransferAnotation,StyleTransferPrediction.dst_widthanddst_height- destination width and height for resizing respectively.
crop_image- cropping image. Supported representations:ImageInpaintingAnnotation,ImageInpaintingPrediction,ImageProcessingAnnotation,ImageProcessingPrediction.corner_crop_image- corner crop of the image. Supported representations:ImageInpaintingAnnotation,ImageInpaintingPrediction,ImageProcessingAnnotation,ImageProcessingPrediction.dst_widthanddst_heightare destination width and heightcorner_typeis type of the corner crop. Options are:top-lefttop-rightbottom-leftbottom-rightDefault choice istop-left
resize- resizing image or segmentation mask. Supported representations:SegmentationAnotation,SegmentationPrediction,StyleTransferAnotation,StyleTransferPrediction,SuperResolutionAnotation,SuperResolutionPrediction,ImageProcessingAnnotation,ImageProcessingPrediction,SalientRegionAnnotation,SalientRegionPrediction.dst_widthanddst_height- destination width and height for resize respectively. You can also usesizeinstead in case when destination sizes are equal. If any of these parameters are not specified, image size will be used as default.apply_to- determines target masks for processing (annotationfor ground truth andpredictionfor detection results,allfor both).resize_realization- parameter specifies functionality of which library will be used for resize:opencvorpillow(defaultpillowis used).opencvlibrary is supported for following representations:StyleTransferAnnotation,StyleTransferPrediction,SuperResolutionAnnotation,SuperResolutionPrediction,ImageProcessingAnnotation,ImageProcessingPrediction,ImageInpaintingAnnotation,ImageInpaintingPrediction.
rgb_to_gray- converts reference data stored in RGB format to gray scale. Supported representations:SuperResolutionAnnotation,SuperResolutionPrediction,ImageProcessingAnnotation,ImageProcessingPrediction,StyleTransferAnnotation,StyleTransferPrediction.bgr_to_gray- converts reference data stored in BGR format to gray scale. Supported representations:SuperResolutionAnnotation,SuperResolutionPrediction,ImageProcessingAnnotation,ImageProcessingPrediction,StyleTransferAnnotation,StyleTransferPrediction.remove_repeats- removes repeated predicted tokens. Supported representations:MachineTranslationPrediction,MachineTranslationAnnotation.to_lower_case- convert tokens to lower case. Supported representations:MachineTranslationPrediction,MachineTranslationAnnotation.sr_image_recovery- restores input in YCrCb format and converts prediction to BRG or RGB format, using prediction gray channel and input Cr and Cb channels. Supported representation:SuperResolutionPrediction.target_color- target color space for super resolution image -bgrandrgbare supported. (Optional, defaultrgb).size- size of model input for recovering YCrCb image.dst_widthanddst_height- width and height of model input respectively for recovering YCrCb image.
colorization_recovery- restores BGR image from Colorization models results represented as AB-channels in LAB color space. Supported representations:ImageProcessingAnnotation,ImageProcessingPrediction.argmax_segmentation_mask- translates categorical annotation segmentation mask to numerical. Supported representations:SegmentationAnnotation,SegmentationPrediction.shift_labels- shifts predicted detection labels. Supported representation:DetectionPrediction.offset- value for shift.
normalize_salience_map- converts reference salience map from [0, 255] to [0, 1] range. Supported representations:SalientRegionAnnotation,SalientRegionPrediction.min_max_normalization- normalize regression data into [0, 1] with given min and max values. Supported representation:FeatureRegressionAnnotation,RegressionAnnotation,RegressionPredictionmin- minimal value in range, optional, default 0.max- maximal value in range.apply_to- determines target masks for processing (annotationfor ground truth andpredictionfor detection results,allfor both).