LanguageEnglish Deutsch Français Español Italiano Polski Русский 正体字 日本語 Português
Identification and Recognition Tutorial -

Identification and Recognition Tutorial

Subscribe to e-news:

Network Video News

Enter your E-mail:


Other e-newsletter options

Search


Required resolution


The traditional way of defining requirements for resolution of an analog CCTV system has been by specifying what percentage of the full screen the observed object occupies. Different surveillance objectives require different percentages.

For example, detecting the presence of a person in a scene could require that the person occupies 10% of the view. Recognizing a known person, however, could require that the person occupies 50%, and further identifying that person could require 120% or more.

Examples from Stockholm Subway using AXIS 225FD Network Camera

   
  20%    40%    140%
 Too small
for recognition
   Face turned away
from camera
   Face partly turned away
from camera

With today’s wide range of available resolutions, it is practical to translate the percentage requirements to pixel resolution in order to compare and specify cameras as well as for viewing and recording solutions. In Europe, the percentage requirements are based on a 400 TVL (Television Lines) analog image. Table 1 shows the equivalent of required resolutions in pixels/m and the number of pixels a 16 cm wide face will cover for the different surveillance objectives.

Surveillance objective Body representation Approximate linear resolution  Face width
Identification 120% 250 pixels/m 40 pixels
Recognition 50% 100 pixels/m 17 pixels
Detection 10% 20 pixels/m 3 pixels
Table 1: Typical CCTV requirements for identification, recognition or detection

You should note that these numbers are minimum resolutions, and that if conditions are less than ideal, you will need to compensate with better resolution. Today, stronger identification is often requested. For example, recommendations from SKL, the Swedish National Laboratory of Forensic Science, suggest that resolution for identification purposes should start at 500 pixels/m. This means that a 16 cm wide face would be represented by 80 pixels or more.

Other criteria are valid for objects such as license plates, where typical recommendations are that the height of letters should be represented by 15 pixels (corresponding to about 200 pixels/m) to ensure legibility.

It is also important to take legal and regulatory requirements into account when determining the resolution needed in order to be able to use camera footage as evidence in court.

Finding a camera to match resolution requirements

The resolution of a captured scene is determined by the camera resolution and the size of the scene. For example, if you are using a camera that delivers 4CIF (704 x 576 pixels) resolution, you can cover a scene that is, at most, 1.4 m wide, if the linear resolution is 500 pixels/m or more. You will need to select a camera and lens that will allow the field of view to match the scene size at the desired distance between the camera and the scene.

Camera model Focal length Horizontal
resolution
Maximum
distance
Maximum
scene width
AXIS P1346 4 – 10 mm  2048 pixels 8 m  3.8 m 
AXIS P3344 12mm 3.3 – 12 mm  1280 pixels 8 m  2.5 m 
AXIS Q1755 5.1 – 51 mm  1920 pixels 40 m  3.7 m 
AXIS Q6032-E  3.4 – 119 mm  704 pixels 46 m  1.4 m 
AXIS Q6034 4.7 – 84,6 mm  1280 pixels 45 m  2.5 m 
Table 2: Maximum distance for identification (500 pixels/m horizontal linear resolution,
80 pixels face width) for some Axis cameras 

 

Axis lens calculators and the AXIS Object Recognition Tool are useful tools that help finding a suitable camera and focal length. For advanced users, a pixel and distance calculator spreadsheet is also available.

Higher camera resolution means fewer cameras and better overview

Since the maximum size of a scene covered at a given resolution only depends on the camera resolution, cameras with higher resolution can cover larger areas. For example, if your 7 m wide scene requires five cameras delivering 4CIF resolution, these can be replaced by two cameras at 1080p HDTV resolution (1920 x 1080 pixels). Also, a camera with higher resolution can be used to give a better overview, by covering a larger scene while maintaining the required linear resolution.

Resolution example
Cameras with higher resolution can cover larger areas

Depth of field

The larger the depth of field is, the larger the area where persons or objects are in focus. With a large depth of field, your chances of identification increase. Depth of field is determined by the iris opening, the focal length and the distance to the camera.

The depth of field increases with smaller iris openings. This means that good lighting conditions can help increase depth of field. The P-Iris feature of some Axis cameras will adjust the iris to optimize depth of field for different lighting conditions. You can learn more about the P-iris from the P-Iris white paper:

Using shorter focal lengths will also increase depth of field. Using cameras with higher resolutions will let you capture the scene using shorter focal lengths, while maintaining resolution requirements.

Distortion

Most lenses exhibit distortion. Often this is in the form of barrel distortion. Barrel distortion is caused by lens magnification being smaller on the edges of the field-of-view compared to the center of the image. The effect is that objects that are near the edge appear closer to the center compared to an undistorted image. Objects of the same size will cover fewer pixels when they are near the edge, compared to what they would cover if they were closer to the center. This means that objects that are near the edge of the field-of-view need to be closer to the camera in order to fulfill requirements on minimum resolution.

The effect of barrel distortion is often much more pronounced at short focal lengths, making wide angle lenses less suited for identification purposes.

Next topic: Illumination

axisFlexSubContent
Axis is the world’s leading expert in network video
 |  Sites
 |  Contact