A video camera obtains a capture scope obtained through finder optics of a
single-lens reflex camera and indicated by a view frame mask. A PC
detects the position corresponding to the capture scope from the captured
image captured by the image pickup device of the single-lens reflex
camera, generates the information designating the position, and stores
the information in the single-lens reflex camera. The system controller
of the single-lens reflex camera extracts a part of the area in the
captured image captured by the image pickup device, and records the image
of the area in a memory card.