GRAPHICAL MENU ITEMS FOR A USER INTERFACE MENU OF A VIDEO
TELECONFERENCING SYSTEM
CROSS-REFERENCE TO RELATED APPLICATION
This application claims the benefit of U.S. Provisional Patent Application Serial No. 60/105,540, "User Interface for Video Teleconferencing", by Timothy J. Maroney, Ly-Huong Thi Pham, Mark Rolston, and Ruth Kaufman, filed October 24, 1998, which subject matter is incorporated herein by reference.
BACKGROUND OF THE INVENTION
1. Field of the Invention
The invention relates generally to user interface menus, for example, user interface menus for operating video teleconferencing systems. More particularly, the invention relates to user interface menus with graphics for simplifying a user's operation of a video teleconferencing system.
2. Description of the Related Art
As the result of continuous advances in technology, video teleconferencing is becoming an increasingly popular means of communication. The development and standardization of advanced coding/decoding and compression decompression schemes has facilitated the communication of ever larger amounts of information over communications links of fixed size. Technological advances in the communication links themselves and increases in the sheer number of links have further increased the effective size of communications links which are available to carry video teleconferencing information.
Advances in the basic components used in video teleconferencing systems, such as computers, cameras, displays, and microphones, have resulted in better quality components available at lower prices.
These advances translate to more powerful video teleconferencing systems available at lower prices. Video and audio quality has improved, as has the capability to combine the basic video teleconferencing functionality with other functionalities, such as presentation software,
desktop publishing applications, and networking. As a result, video teleconferencing systems have progressed from being expensive novelty systems which were infrequently used, to moderately priced systems which were more often used but still located in a dedicated facility and shared by many users, to relatively inexpensive systems which one day may be as ubiquitous as the telephone is today.
The typical video teleconferencing system, by its very nature, includes a large number of components and thus presents a large number of options to the user. For example, a system typically will include cameras and displays to support a bidirectional video link, and microphones and loudspeakers to support a bidirectional audio link. These components typically are manipulated by the user of the video teleconferencing system. For example, the user may adjust the position, zoom and picture quality of the various cameras; select between different image sources to provide images to the various displays, as well as adjusting the displays themselves; and also adjust volume and stereo effect for the microphones and loudspeakers. The user typically will also have additional options for setting up the video teleconferencing system, such as setting defaults for the system, maintaining an address book, and options for setting up actual video teleconferencing calls. The user typically manipulates the various options via a user interface.
However, until video teleconferencing systems become ubiquitous, an individual may use a video teleconferencing system only on an infrequent basis. For this reason, users of video teleconferencing systems may not be familiar with the options available to them and may not have the time to learn a complex user interface. Infrequent use also makes the user more likely to forget how to manipulate the user interface. This can make the use of video teleconferencing systems seem arcane or frustrating. In addition, once video teleconferencing systems become ubiquitous, the user interface should be simple enough for everyone to use, preferably without extensive training.
Thus, there is a need for a user interface for video teleconferencing systems which is easy and intuitive to use but still capable of handling the complexities of a video teleconferencing system. There is a further need for video teleconferencing systems which intelligently simplify the options available to the user, thus inherently reducing the complexity of the corresponding user interface.
SUMMARY OF THE INVENTION
In accordance with the present invention, a video teleconferencing system includes a display device, and a user interface menu suitable for display on the display device. The user interface menu includes a plurality of menu items, and each menu item corresponds to one of a plurality of options for a user of the video teleconferencing system. At least one menu item includes a graphic which is representative of the option corresponding to that menu item. In another aspect of the invention, each of the options includes forming a communications link between the video teleconferencing system and any one of a plurality of other video teleconferencing systems. At least one menu item includes a graphic which is representative of the remote video teleconferencing system corresponding to that menu item. The graphic may be an image captured by the remote video teleconferencing system.
Alternately, the graphic may be provided as part of the caller i.d. information for the remote video teleconferencing system.
In another aspect of the invention, an image capture device, such as a camera, captures images and provides these images to the video teleconferencing system. The image capture device can be positioned in a plurality of positions. Each of the options includes positioning the image capture device in one of a plurality of preset positions. At least one menu item includes an image captured by the image capture device when positioned in the preset position corresponding to the menu item. In one alternative, a position is automatically added as a preset position, including displaying a menu item corresponding to that position, responsive to the image capture device remaining in that position for longer than a predetermined length of time.
In another aspect of the invention, any of a plurality of image sources is capable of providing images to the video teleconferencing system. Each of the options includes one of the image sources acting as a source of images for the video teleconferencing system. At least one menu item includes an image provided by the image source corresponding to that menu item.
The present invention is particularly advantageous because the use of graphics, including images, to represent various options results in a simpler, more intuitive user interface menu simply because graphics often are more intuitive than other representations, such as text. For example, in the case of preset positions for an image capture device, the preset positions could be represented by arbitrary labels such as preset position A, preset position B, etc., but this is non-intuitive and most users would have difficulty associating these labels with the
actual preset positions and would also have difficulty later remembering which label corresponded to which preset position. In contrast, representing a preset position by an image captured by the image capture device while in the preset position in some sense allows the user to effectively preview the preset position, which is a more intuitive and useful representation. In another aspect of the invention, the video teleconferencing system includes a control unit for coordinating the actions of the various components, including for displaying the user interface menu on the display device, forming communications links with other video teleconferencing systems, and receiving graphics from various sources for display on the display device. In yet another aspect, a computer program product includes a computer readable storage medium for storing computer instructions for executing these actions.
BRIEF DESCRIPTION OF THE DRAWINGS The invention has other advantages and features which will be more readily apparent from the following detailed description of the invention and the appended claims, when taken in conjunction with the accompanying drawing, in which: FIG. 1 is a diagram of a video teleconferencing system 100 suitable for use with the present invention;
FIG. 2 is a diagram of one embodiment of a user interface menu 200 in accordance with the present invention;
FIG. 3 is a flow diagram illustrating a method 300 for generating user interface menu 200;
FIG. 4 is a diagram of a preferred embodiment of a user interface menu 400 for forming a communications link between video teleconferencing systems;
FIG. 5 is a diagram of a preferred embodiment of a user interface menu 500 for selecting a preset position for an image capture device; FIG. 6 is a flow diagram illustrating a method 600 for generating user interface menu
600;
FIG. 7 is a diagram of a preferred embodiment of a user interface menu 700 for selecting an image source for a video teleconferencing system.
Due to the requirement of rendering the FIGS, in black and white line art only (e.g., no gray scale), many of the graphics and images in the FIGS, may be depicted by black and white line art representations rather than the actual graphics and/or images. For example,
photorealistic images which in true life may be color with shading will be represented in the FIGS, by black and white line art representations.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS FIG. 1 is a diagram of a video teleconferencing system 100 suitable for use with the present invention. The video teleconferencing system 100 includes a control unit 110, display monitor 102, video camera 104, microphone 106, and loudspeaker 108. The control unit 110 is coupled to each of the other devices and controls the video teleconferencing system 100, including the various devices. Control unit 110 also includes electronic circuits to encode/decode and/or compress/decompress audio and image (including video) data, and to send and receive this data over communications links to other video teleconferencing systems (not shown). The user interacts with control unit 110 via a remote control, keyboard, joystick, touchpad, touch screen, graphics tablet, and/or any number of other types of input devices.
In a preferred embodiment, the control unit 110 includes a processor 120 and storage medium 122 containing software code to be executed on the processor. In a preferred embodiment, the processor 120 is an Intel Pentium microprocessor and the storage medium 122 is memory for the microprocessor. In alternate embodiments, the control unit 110 may be based on more of an embedded architecture, in which the storage medium 122 might be, for example, flash memory or programmable logic. The software code implements control of the video teleconferencing system 100, with the processor typically interfacing to the various other devices through auxiliary electronics and/or driver software in control unit 110. The processor and software also control the formation of communications links with other video teleconferencing systems, preferably using ISDN, Internet, standard telephone, or ATM protocols.
Video teleconferencing system 100 is exemplary only. Other video teleconferencing systems can contain combinations of devices other than the one shown in FIG. 1. For example, any number of display devices 102, cameras 104, microphones 106 and loudspeakers 108 may be used. Alternately, not all of the devices shown in FIG. 1 need be used, and/or devices other than those shown in FIG. 1 may be used. For example, devices other than video camera 104 may be used to capture images; examples of such image capture devices include electronic still cameras and document readers. Furthermore, image sources other image capture devices may also be used to provide images to video teleconferencing system 100. Examples would include VCRs, DVD players, and computer applications which can produce
images such as PowerPoint, Illustrator, or QuickTime video. As one final example, the functionality of the devices described above may be apportioned or combined differently than in the above examples. For example, a sophisticated graphics tablet may have a display surface which receives graphics from control unit 110 and also a touch screen whereby the user can send commands to the control unit 110 by touching the various graphics displayed. This type of device functions as both a display device and an input device.
Video teleconferencing system 100 is typically used for video teleconferencing. System 100 achieves this by forming a communications link between itself and other video teleconferencing systems. The users of the various video teleconferencing systems 100 then communicate with each other using the various devices coupled to the video teleconferencing systems. For purposes of clarity, the term "local" video teleconferencing system will be used to refer to the video teleconferencing system 100 which provides the frame of reference for this description. In other words, the user interface menus described below will be used for interfacing to the "local" video teleconferencing system. The term "remote" video teleconferencing system, then, will be used to refer to the video teleconferencing systems with which the local video teleconferencing system is communicating. The terms "local" and "remote" are used solely to establish a frame of reference for purposes of clarity and are not meant to imply any physical proximity. For example, the "remote" video teleconferencing system need not be separated from the "local" video teleconferencing system by some minimum physical distance nor need it be situated in a remote physical location. The local and remote video teleconferencing systems will sometimes be referred to by the equivalent descriptors "here" and "there," respectively.
The typical video teleconferencing system, by its very nature, includes a large number of devices and thus presents a large number of options to the user. For example, the user may adjust the position, zoom and picture quality of the various cameras 104; select between different image sources to provide images to the various displays 102, as well as adjusting the displays themselves; and also adjust volume and stereo effect for the microphones 106 and loudspeakers 108. The user typically will also have additional options for setting up the video teleconferencing system, such as setting defaults for the system, maintaining an address book, and options for setting up communications links with other video teleconferencing systems. The user manipulates these various options by interacting with control unit 110 using a user interface and an input device.
FIG. 2 is a diagram of one embodiment of a user interface in accordance with the present invention. In this embodiment, the user interface is a user interface menu 200 displayed on display device 102. The user interface menu 200 includes a number of menu items 202A-202N (collectively, menu items 202). Each menu item 202 corresponds to one of the options available to the user of the local video teleconferencing system. The user selects between the available options by activating the corresponding menu item 202, for example by clicking on it or by touching it (e.g., in the case of touch screens). At least one menu item 202B further includes a graphic 204B which represents the option corresponding to menu item 202B. The graphic, for example, could be an icon, a logo, an image, a photograph, a video, or any other type of graphic element. The term "graphic," however, is specifically intended to exclude elements which are solely textual. The use of graphics is beneficial because, in many instances, the visual identification of options is easier, quicker, and more intuitive, as will be illustrated by the examples below.
FIG. 3 is a flow diagram illustrating a method 300 for generating user interface menu 200. In method 300, control unit 110 displays 302 the menu items 202 on display device 102. Control unit 110 also receives 304 graphic 204B and displays 306 it as part of menu item 202A. Graphic 204B may be received from the same or different sources as the remainder of menu 200. For example, the various elements for menu items 202 and graphic 204B may be received by a processor separate from the one in control unit 110 and assembled by that processor into menu 200, which is then sent to control unit 110. Control unit 110 then displays the previously composed menu 200 on display 102. Alternately, control unit 110 may receive the various menu items 202 from one source and the graphic 204B from another, with control unit 110 assembling the various elements into menu 200 and displaying the resulting menu on display 102. FIG. 4 is a diagram of a preferred embodiment of a user interface menu 400 which includes menu items 402A-402F. In this embodiment, each menu item 402 corresponds to the option of forming a communications link between the local video teleconferencing system and any one of a plurality of remote video teleconferencing systems. For example, menu item 402A represents the option of forming a communications link with the video teleconferencing system associated with the individual Ann Bass. Similarly, menu item 402B corresponds to the option of forming a communications link between the local video teleconferencing system and the video teleconferencing system associated with individual Saul Bass. When the user
activates one of the menu items 402, the local video teleconferencing system initiates the formation of a communications link with the selected remote video teleconferencing system. Slider 410 allows the user to scroll through menu 400.
In this particular embodiment, each menu item 402 includes a corresponding graphics area 403 for displaying a graphic representative of the video teleconferencing system corresponding to the menu item. Furthermore, the graphic areas 403 are occupied either by graphic icons 406A, 406B, and 406D; or by images 408C, 408E, and 408F. A graphic icon 406 is used when a representative image 408 is not available. Once an image 408 becomes available, the graphic icon 406 is replaced by the image 408. For example, in menu item 402A, if an image representative of the Ann Bass video teleconferencing system were to become available, graphic icon 406A would be replaced by it. The use of graphics 406 and 408 in menu 400 is beneficial because the graphics 406 and 408, particularly images 408, are a visual indication and reminder of the various remote video teleconferencing systems.
Images 408 representing the remote video teleconferencing systems may be generated in a number of ways. In one approach, image 408 is substantially similar to images which may be captured by the remote video teleconferencing system. These images may be either simulated or actually captured. In a preferred embodiment, the remote video teleconferencing system includes a camera and image 408 is an image captured by that camera. For example, in menu item 402C, the remote video teleconferencing system is located in Neville Brody's office and includes a camera which has a view of his office. Image 408C is an image of Neville Brody's office captured by that camera. The image may be selected by the user of the local video teleconferencing system when connected to the remote video teleconferencing system in Neville Brody's office. For example, when the local video teleconferencing system is connected to the remote video teleconferencing system in Neville Brody's office, the user of the local video teleconferencing system might move the camera in Neville Brody's office to a particular position, have the camera capture an image from that position, and then instruct the local video teleconferencing system to store and use that image as image 408C. Alternately, the image 408C may be selected by the user of Neville Brody's video teleconferencing system. For example, Neville Brody might move the camera in his office to a particular position, have the camera capture an image from that position, and then instruct his video teleconferencing system to store and provide that image as image 408C. In either event, the image is received 304 by control unit 110 and displayed 306 as part of menu item 402C.
In an alternate approach, the graphics in graphics area 403 are received 304 as part of the caller i.d. information for the remote video teleconferencing system. In one approach, a central repository stores caller i.d. information for all video teleconferencing systems and provides this information as part of the process of establishing communications links between video teleconferencing systems. In an alternate approach, the caller i.d. information is distributed among the video teleconferencing systems. For example, if the local video teleconferencing system were to connect to the Ann Bass video teleconferencing system, the Ann Bass video teleconferencing system may send its caller i.d. information to the local video teleconferencing system as part of the handshake process when first establishing the communications link. In any event, the control unit 110 displays 306 the received graphic in graphics area 403 A.
The caller i.d. graphics can be formed in various ways. For example, Ann Bass could have her camera capture an image and store this image as her caller i.d. Alternately, Ann Bass could store a computer file with her caller i.d. graphic in her control unit 110. The Ann Bass video teleconferencing system would then supply this graphic as part of the caller i.d. handshake.
Throughout this description, the term "caller i.d. information" is meant to include any type of information pertaining to a video teleconferencing system and is not meant to be limited to the caller i.d. information provided in the telephone system (i.e., identity and phone number of caller). For example, the caller i.d. information for a video teleconferencing system may contain a graphic or an image representing the video teleconferencing system, such as an image captured by a camera connected to the video teleconferencing system or a company logo.
In an alternate embodiment, one or more of the menu items 402 is highlighted to emphasize that menu item. The term "highlight" as used here means simply that the menu item is emphasized. The method of emphasis may or may not be based on color highlighting. For example, the highlight may be achieved by placing a cursor or other graphic indicator in close proximity to the highlighted menu item, or by moving the highlighted menu item to a position of prominence such as the head of the menu. Alternately, the highlighted menu item might be emphasized via the use of animation or it might be rendered such that it appears to move above or below the plane of the screen. Of course, the menu item could also be color highlighted.
In one embodiment, the highlighted menu item corresponds to the remote video teleconferencing system with which the local video teleconferencing system last formed the communications link. For example, if the local video teleconferencing system last formed a communications link with the Ned Flanders video teleconferencing system, then menu item 402E would be highlighted. This would facilitate a reconnection with the Ned Flanders video teleconferencing system since the user would not need to search for menu item 402E. In an alternative embodiment, the highlighted menu item is the menu item which was last highlighted by the user. For example, if the user highlighted menu item 402F and then left menu 400, when he returned, menu 400 would "remember" the user's last choice and menu item 402F would still be highlighted.
FIG. 5 is a diagram of a preferred embodiment of user interface menu 500. In this example, menu 500 does not occupy the entire display so as not to interfere with the rest of the video teleconference. Rather, it occupies a bottom portion of the display and is side scrolling via right and left arrows 510A-510B. Menu 500 includes menu items 502, each of which corresponds to a different preset position for an image capture device (a video camera in this example) which can capture and provide images to the local video teleconferencing system. The camera itself may be connected either to the local or to the remote video teleconferencing system. Hence, if the user were to activate menu item 502A, the camera would move to the preset position represented by that menu item. In this example, each of the menu items 502 includes an image 504 which is substantially similar to the image which would be captured by the camera if it were positioned in the preset position corresponding to that menu item. Furthermore, the images 504 preferably are actual images captured by the camera. Thus, for example, if menu item 502A corresponds to the camera being zoomed out and looking straight ahead, then image 504A would be an image from this view. Similarly, if menu item 502B corresponds to the camera at medium zoom and looking down and to the right, then image 504B would be an image from this view.
The use of representative images 504 is advantageous because it is a simple and intuitive representation of the different preset positions. For example, a commonly used alternative is to represent each preset position by a label, such as preset position A, preset position B, and so on. However, this is non-intuitive because the label does not provide any information about the actual preset position. As a result, users have difficulty remembering
which label corresponds to which preset positions. In contrast, in the described approach, each menu item 502 provides the user with images which presumably are similar to those which would be captured by the camera if it were returned to the preset position. Hence, the user can more easily determine which view corresponds to which menu item. The images 504 can be manually generated. In this approach, the user would move the camera to a position which the user desired as a preset position. The user would then command the control unit 110 to retain this position as a preset position. The control unit 110 would then generate and display a corresponding menu item 502 and image 504.
FIG. 6 is a flow diagram illustrating an alternate method 600 for generating images 504. In this approach, the menu items 502 and corresponding images 504 are generated automatically in response to the camera remaining in a stationary position for longer than a predetermined length of time. When this happens, control unit 110 receives 602 an image captured by the camera in this position. Control unit 110 then displays 604, 606 a menu item corresponding to this position, including the image 504 received from the camera. The predetermined length of time preferably is adjustable by the user. Method 600 is advantageous because the user does not have to consciously create preset positions during a video teleconference. Rather, the video teleconferencing system generates the preset positions automatically and they are represented in menu 500 by actual captured images 504. Heuristically, if the camera has stayed in a certain position for a long period of time, then that position likely is a good preset position to retain.
In order to prevent new menu items from appearing every time the camera is adjusted even slightly, a threshold approach preferably is used. In this approach, if the camera is moved only a few degrees from a previously retained preset position (i.e., the movement falls within a threshold), a new preset position is not be created. Rather, the nearest preset position is replaced by the new position. This updates the previous preset position and prevents the proliferation of nearly identical preset positions. A preferred threshold allows approximately six inches of camera movement on a subject at twenty feet distance from the camera.
FIG. 7 is a diagram of the preferred embodiment of a user interface menu 700. Like the example of menu 500, menu 700 occupies a bottom portion of the display and is side scrolling via right and left arrows 710A-710B. In this embodiment, video teleconferencing system 100 may receive images from any of a plurality of different image sources. Menu 700 includes menu items 702, each of which corresponds to one of the image sources providing
images to the local video teleconferencing system. For example, menu item 702A corresponds to the SmartTrak device providing images to the video teleconferencing system; menu item 702B corresponds to the PC (or applications running on a PC) providing images, and so forth. In a preferred embodiment, each of these menu items includes a graphic 704 representative of the source. For example, menu item 702C includes a pictorial representation 704C of a document reader. The graphics 704 preferably include images representing the various image sources. For example, the image might be an image of the corresponding source, such as a digital photograph of a document reader for menu item 702C. Alternately, the image might represent the images which would be provided by the various sources, such as a simulated document for menu item 702C. In a preferred embodiment, the image is an actual image provided by the image source, such as an actual document image captured by the document reader for menu item 702C. In another aspect, the sources are represented by graphic icons, such as 702A-702C, when no image is available. These icons are replaced as images become available. Although the invention has been described in considerable detail with reference to certain preferred embodiments thereof, other embodiments will be apparent. For example, graphic representations can be used advantageously for menus other than the ones described above. In addition, all of the menus described above are based on a rectilinear layout; other layouts will be apparent. As a final example, the functionality of the control unit 110 may be implemented in forms other than software code running on a processor. For example, some or all of the functionality may be implemented in dedicated hardware. Therefore, the scope of the appended claims should not be limited to the description of the preferred embodiments contained herein.