An image sensing device collects the speech and image of a person to be
sensed and carries out recognition of the collected speech. When the
image sensing device determines as the recognition result that the speech
represents a predetermined sentence, the image sensing device performs
recognition on an acquired image. When the image sensing device
determines as a recognition result that the image is a human face showing
a predetermined facial expression, the image sensing device records the
image and audio information.