A robot system that includes a robot and a remote station. The remote
station may be a personal computer coupled to the robot through a
broadband network. A user at the remote station may receive both video
and audio from a camera and microphone of the robot, respectively. The
remote station may include a graphical user interface that can be
selected to generate an alert input to the robot. The user initially
establishes a voice communication between the remote station and the
robot. To obtain video access the user may select a graphical icon to
generate and transmit the alert input to the robot. The caller recipient
at the robot may grant the request for video of themselves by inputting a
response into the robot. This procedure allows someone at the robot to
control the initiation of a video-conference with the user of the remote
station.