detectPeopleACF

Detect people using aggregate channel features (ACF)

detectPeopleACF will be removed in a future release. Use peopleDetectorACF instead.

Syntax

bboxes = detectPeopleACF(I)

[bboxes,scores]
= detectPeopleACF(I)

[___] = detectPeopleACF(I,roi)

[___] = detectPeopleACF(Name,Value)

Description

bboxes = detectPeopleACF(I) returns a matrix, bboxes, that contains the locations of detected upright people in the input image, I. The locations are represented as bounding boxes. The function uses the aggregate channel features (ACF) algorithm.

example

[bboxes,scores] = detectPeopleACF(I) also returns the detection scores for each bounding box.

[___] = detectPeopleACF(I,roi) detects people within the rectangular search region specified by roi, using either of the previous syntaxes.

[___] = detectPeopleACF(Name,Value) uses additional options specified by one or more Name,Value pair arguments. Unspecified properties have default values.

Code Generation Support:
Supports Code Generation: No
Supports MATLAB Function block: No
Code Generation Support, Usage Notes, and Limitations

Examples

collapse all

Detect People Using Aggregated Channel Features

Open Live Script

Read an image.

I = imread('visionteam1.jpg');

Detect people in the image and store results as bounding boxes and score.

[bboxes,scores] = detectPeopleACF(I);

Annotate the detected upright people in the image.

I = insertObjectAnnotation(I,'rectangle',bboxes,scores);

Display the results with annotation.

figure
imshow(I)
title('Detected people and detection scores')

Figure contains an axes object. The hidden axes object with title Detected people and detection scores contains an object of type image.

Input Arguments

collapse all

`I` — Input image
truecolor image

Input image, specified as a truecolor image. The image must be real and nonsparse.

Data Types: uint8 | uint16 | int16 | double | single

`roi` — Rectangular search region
four-element vector

Rectangular search region, specified as a four-element vector, [x,y,width,height]. The roi must be fully contained in I.

Name-Value Arguments

collapse all

Specify optional pairs of arguments as Name1=Value1,...,NameN=ValueN, where Name is the argument name and Value is the corresponding value. Name-value arguments must appear after other arguments, but the order of the pairs does not matter.

Before R2021a, use commas to separate each name and value, and enclose Name in quotes.

Example: 'Threshold',-1

`Model` — ACF classification model
`'inria-100x41'` (default) | `'caltech-50x21'`

ACF classification model, specified as the comma-separated pair consisting of 'Model' and either 'inria-100x41' or 'caltech-50x21'. The 'inria-100x41' model was trained using the INRIA Person dataset. The 'caltech-50x21' model was trained using the Caltech Pedestrian dataset.

`NumScaleLevels` — Number of scale levels per octave
`8` (default) | integer

Number of scale levels per octave, specified as the comma-separated pair consisting of 'NumScaleLevels', and an integer. Each octave is a power-of-two downscaling of the image. Increase this number to detect people at finer scale increments. Recommended values are in the range [4,8].

`WindowStride` — Window stride for sliding window
`4` (default) | integer

Window stride for sliding window, specified as the comma-separated pair consisting of 'WindowStride', and an integer. Set this value to the amount you want to move the window, in the x and y directions. The sliding window scans the images for object detection. The function uses the same stride for the x and y directions.

`SelectStrongest` — Select strongest bounding box
`true` (default) | `false`

Select strongest bounding box, specified as the comma-separated pair consisting of 'SelectStrongest' and either true or false. The process, often referred to as nonmaximum suppression, eliminates overlapping bounding boxes based on their scores. Set this property to true to use the selectStrongestBbox function to select the strongest bounding box. Set this property to false, to perform a custom selection operation. Setting this property to false returns detected bounding boxes.

`MinSize` — Minimum region size
two-element vector [height width] | `[50 21]` | `[100 41]`

Minimum region size in pixels, specified as the comma-separated pair consisting of 'MinSize', and a two-element vector [height width]. You can set this property to [50 21] for the 'caltech-50x21' model or [100 41] for the 'inria-100x41' model. You can reduce computation time by setting this value to the known minimum region size for detecting a person. By default, MinSize is set to the smallest region size possible to detect an upright person for the classification model selected.

`MaxSize` — Maximum region size
`size`(`I`) (default) | two-element vector [height width]

Maximum region size in pixels, specified as the comma-separated pair consisting of 'MaxSize', and a two-element vector, [height width]. You can reduce computation time by setting this value to the known region size for detecting a person. If you do not set this value, by default the function determines the height and width of the image using the size of I.

`Threshold` — Classification accuracy threshold
`–1` (default) | numeric value

Classification accuracy threshold, specified as the comma-separated pair consisting of 'Threshold' and a numerical value. Typical values are in the range [–1,1]. During multiscale object detection, the threshold value controls the person or nonperson classification accuracy and speed. Increase this threshold to speed up the performance at the risk of missing true detections.

Output Arguments

collapse all

`bboxes` — Locations of detected people
M-by-4 matrix

Locations of people detected using the aggregate channel features (ACF) algorithm, returned as an M-by-4 matrix. The locations are represented as bounding boxes. Each row in bboxes contains a four-element vector, [x,y,width,height]. This vector specifies the upper-left corner and size of a bounding box, in pixels, for a detected person.

`scores` — Confidence value
M-by-1 vector

Confidence value for the detections, returned as an M-by-1 vector. The vector contains a value for each bounding box in bboxes. The score for each detection is the output of a soft-cascade classifier. The range of score values is [-inf inf]. Greater scores indicate a higher confidence in the detection.

References

[1] Dollar, P., R. Appel, S. Belongie, and P. Perona. "Fast feature pyramids for object detection." Pattern Analysis and Machine Intelligence, IEEE Transactions. Vol. 36, Issue 8, 2014, pp. 1532–1545.

[2] Dollar, C. Wojeck, B. Shiele, and P. Perona. "Pedestrian detection: An evaluation of the state of the art." Pattern Analysis and Machine Intelligence, IEEE Transactions.Vol. 34, Issue 4, 2012, pp. 743–761.

[3] Dollar, C., Wojeck, B. Shiele, and P. Perona. "Pedestrian detection: A benchmark." IEEE Conference on Computer Vision and Pattern Recognition. 2009.

Version History

Introduced in R2016a

detectPeopleACF

Syntax

Description

Examples

Detect People Using Aggregated Channel Features

Input Arguments

`I` — Input image
truecolor image

`roi` — Rectangular search region
four-element vector

Name-Value Arguments

`Model` — ACF classification model
`'inria-100x41'` (default) | `'caltech-50x21'`

`NumScaleLevels` — Number of scale levels per octave
`8` (default) | integer

`WindowStride` — Window stride for sliding window
`4` (default) | integer

`SelectStrongest` — Select strongest bounding box
`true` (default) | `false`

`MinSize` — Minimum region size
two-element vector [height width] | `[50 21]` | `[100 41]`

`MaxSize` — Maximum region size
`size`(`I`) (default) | two-element vector [height width]

`Threshold` — Classification accuracy threshold
`–1` (default) | numeric value

Output Arguments

`bboxes` — Locations of detected people
M-by-4 matrix

`scores` — Confidence value
M-by-1 vector

References

Version History

See Also

Objects

Functions

Topics

detectPeopleACF

Syntax

Description

Examples

Detect People Using Aggregated Channel Features

Input Arguments

I — Input image truecolor image

roi — Rectangular search region four-element vector

Name-Value Arguments

Model — ACF classification model 'inria-100x41' (default) | 'caltech-50x21'

NumScaleLevels — Number of scale levels per octave 8 (default) | integer

WindowStride — Window stride for sliding window 4 (default) | integer

SelectStrongest — Select strongest bounding box true (default) | false

MinSize — Minimum region size two-element vector [height width] | [50 21] | [100 41]

MaxSize — Maximum region size size(I) (default) | two-element vector [height width]

Threshold — Classification accuracy threshold –1 (default) | numeric value

Output Arguments

bboxes — Locations of detected people M-by-4 matrix

scores — Confidence value M-by-1 vector

References

Version History

See Also

Objects

Functions

Topics

`I` — Input image
truecolor image

`roi` — Rectangular search region
four-element vector

`Model` — ACF classification model
`'inria-100x41'` (default) | `'caltech-50x21'`

`NumScaleLevels` — Number of scale levels per octave
`8` (default) | integer

`WindowStride` — Window stride for sliding window
`4` (default) | integer

`SelectStrongest` — Select strongest bounding box
`true` (default) | `false`

`MinSize` — Minimum region size
two-element vector [height width] | `[50 21]` | `[100 41]`

`MaxSize` — Maximum region size
`size`(`I`) (default) | two-element vector [height width]

`Threshold` — Classification accuracy threshold
`–1` (default) | numeric value

`bboxes` — Locations of detected people
M-by-4 matrix

`scores` — Confidence value
M-by-1 vector