Craig Bennett


2006

Visual Surveillance and Video Annotation and Description
Khurshid Ahmad | Craig Bennett | Tim Oliver
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

The effectiveness of CCTV surveillance networks is in part determined by their ability to perceive possible threats. Our traditional means for determining a level of threat has been to manually observe a situation through the network and take action as appropriate. The increasing scale of such surveillance networks has, however, made such an approach untenable, leading us to look for a means by which these processes may be automated. Here we investigate the language used by security experts in an attempt to look for patterns in the way in which they describe events as observed through a CCTV camera. It is suggested that natural-language-based descriptions of events may provide the basis for an index, which may prove an important component of future automated surveillance systems.