
WO2011038465A1 - Object tracking for artificial vision - Google Patents


Info

Publication number
WO2011038465A1
WO2011038465A1 PCT/AU2010/001290 AU2010001290W WO2011038465A1 WO 2011038465 A1 WO2011038465 A1 WO 2011038465A1 AU 2010001290 W AU2010001290 W AU 2010001290W WO 2011038465 A1 WO2011038465 A1 WO 2011038465A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
vision
enhancement apparatus
scene
vision enhancement
Prior art date
Application number
PCT/AU2010/001290
Other languages
French (fr)
Inventor
Nick Barnes
Chunhua Shen
Original Assignee
National Ict Australia Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from AU2009904788A
Application filed by National Ict Australia Limited
Priority to US13/498,667 (US9697746B2)
Priority to AU2010302959A (AU2010302959B2)
Priority to EP10819750.0A (EP2482760B1)
Publication of WO2011038465A1
Priority to US15/609,953 (US10062303B2)

Classifications

    • G PHYSICS
    • G09 EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09B EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B21/00 Teaching, or communicating with, the blind, deaf or mute
    • G09B21/001 Teaching or communicating with blind persons
    • G09B21/008 Teaching or communicating with blind persons using visual presentation of the information for the partially sighted
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/00 2D [Two Dimensional] image generation
    • G06T11/60 Editing figures and text; Combining figures or text
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/20 Image preprocessing
    • A HUMAN NECESSITIES
    • A61 MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61F FILTERS IMPLANTABLE INTO BLOOD VESSELS; PROSTHESES; DEVICES PROVIDING PATENCY TO, OR PREVENTING COLLAPSING OF, TUBULAR STRUCTURES OF THE BODY, e.g. STENTS; ORTHOPAEDIC, NURSING OR CONTRACEPTIVE DEVICES; FOMENTATION; TREATMENT OR PROTECTION OF EYES OR EARS; BANDAGES, DRESSINGS OR ABSORBENT PADS; FIRST-AID KITS
    • A61F9/00 Methods or devices for treatment of the eyes; Devices for putting-in contact lenses; Devices to correct squinting; Apparatus to guide the blind; Protective devices for the eyes, carried on the body or in the hand
    • A61F9/08 Devices or methods enabling eye-patients to replace direct visual perception by another kind of perception
    • A HUMAN NECESSITIES
    • A61 MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61N ELECTROTHERAPY; MAGNETOTHERAPY; RADIATION THERAPY; ULTRASOUND THERAPY
    • A61N1/00 Electrotherapy; Circuits therefor
    • A61N1/18 Applying electric currents by contact electrodes
    • A61N1/32 Applying electric currents by contact electrodes alternating or intermittent currents
    • A61N1/36 Applying electric currents by contact electrodes alternating or intermittent currents for stimulation
    • A61N1/36046 Applying electric currents by contact electrodes alternating or intermittent currents for stimulation of the eye
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/002 Special television systems not provided for by H04N7/007 - H04N7/18

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Business, Economics & Management (AREA)
  • Educational Administration (AREA)
  • Educational Technology (AREA)
  • Multimedia (AREA)
  • User Interface Of Digital Computer (AREA)
  • Image Processing (AREA)
  • Position Input By Displaying (AREA)

Abstract

This invention concerns the tracking of objects in video data for artificial vision; for instance for a bionic eye. More particularly, the invention concerns a vision enhancement apparatus for a vision-impaired user. In other aspects, the invention concerns a method for enhancing vision and software to perform the method. The image processor operates to process video data representing images of a scene; to automatically detect and track a user selected object, such as a face, in the images; and to automatically modify the video data, by reserving a user selected area of the displayed images for displaying the tracked object as a separate video tile within the scene. The separate video tile remains in the selected area despite movement of the camera relative to the scene, or movement of the user relative to the object or the scene.

Description

Title
Object Tracking for Artificial Vision
Technical Field
This invention concerns the tracking of objects in video data for artificial vision; for instance for a bionic eye. More particularly, the invention concerns a vision enhancement apparatus for a vision-impaired user. In other aspects, the invention concerns a method for enhancing vision and software to perform the method.
Background Art
Following the enormous success achieved by bionic ears in returning hearing to profoundly deaf people, interest has now turned to developing a bionic eye that will return sight to the blind.
The eye operates by focussing light onto the retina, which in turn passes signals to the brain, via the optic nerve, where they are processed to provide sight. Partial or total blindness can be caused by damage or malfunction of any of the elements in the chain from eye to brain. However, many common forms of blindness result from damage to the retina; for instance, macular degeneration leads to deterioration in the cells of the fovea, causing a blind spot in the centre of the field of vision but leaving the peripheral vision intact. The idea behind the bionic eye is to artificially stimulate the retina using an electrode array implanted on top of it. Other visual aids, such as vision spectacles, stimulate the retina using light but control the images presented before the user.
Disclosure of the Invention
In a first aspect the invention is a vision enhancement apparatus for a vision-impaired user, comprising an image processor arranged to:
Process high resolution video data representing images of a scene.
Automatically detect and track a user selected object, such as a face, in the images.
Automatically modify the video data, by reserving a user selected area of the displayed images for displaying the tracked object as a separate video tile within the scene. The separate video tile remains in the selected area despite movement of the camera relative to the scene, or movement of the user relative to the object or the scene.
This invention allows important elements of the scene, such as the face of a friend, to be located in a part of the displayed image where it is visible to the vision-impaired user. For instance, if the user is suffering from macular degeneration the face may appear in their peripheral vision, where they can learn to see it. The face will remain in that part of the user's field of vision despite movement of the user or the object. The image of the face may also be maintained in a chosen orientation.
The user may have a high resolution video camera mounted on their head so that it films the scene in front of them. In the situation where the user is among a group of people they may turn their head in the direction of each other person as they speak. The object detection feature will detect faces in the images of the scene and may process the images to identify the person. The apparatus may announce the newly identified person to the user and then track their movements within the scene. While they remain in the scene their face may be continually tracked so the image of it is continually updated.
The apparatus may permit the user to select one or more objects to be displayed in respective reserved areas of the displayed images. This allows the user to select which people's faces are displayed in which part of their field of vision. It may alternatively allow the user to switch between the detected faces, for instance when the user is speaking to different people. The face may be enlarged, or otherwise manipulated, to increase the user's perception of it, particularly of the facial expressions. Having high resolution images is very useful for subsequent image manipulation. A face or other object may be enlarged to the highest resolution of the image. In general the apparatus may use default levels of enlargement for objects of different sizes, and this default may be overridden by user selection; for instance, the user may override a face that has been presented in order to look at a single eye, or to move from one eye to the other.
The apparatus may provide the modified video data to a visual display, such as vision spectacles or a hand-held display, or to a retina, cortical or optic nerve implant that electrically stimulates nerves. It may also be able to provide video and still images offline for later viewing by the user. The entire apparatus could be worn by the user, either on the head as a cap, or distributed about the body.
In another aspect the invention is a method for enhancing vision, comprising the steps of:
Processing video data representing images of a scene.
Automatically detecting and tracking a user selected object in the images.
Automatically modifying the video data, by reserving a user selected area of the displayed images for displaying the tracked object in a separate video tile within the scene.
In a further aspect the invention is software to perform the method.
Brief Description of the Drawings
An example of the invention will now be described with reference to the accompanying drawing, Fig. 1, which is a diagram that shows the system and explains the method of the invention.
Best Modes of the Invention
The visually impaired user 10 wears an inconspicuous high resolution camera 12 and a pair of vision spectacles that provide a stereoscopic image of the scene 14 in front of the camera. Since the camera is worn by the user their movements cause the scene in front of the camera to change. In the scene shown there are two other people 20 and 22; friends of the user. As the three converse the user may move their head to face each of the friends 20 and 22 as they talk. Each of the three may also move around in the scene as they wish.
The camera 12 records the scene in front of it, and a processor 30 in the camera processes the recorded video data in real time. In Fig. 1 a single monoscopic image is shown being processed at four different instants of time, 40A, 40B, 40C and 40D. The first thing the processor 30 does is to detect the face of friend 20, at time 40A. Face detection algorithms work in different ways; this one applies a model pre-trained using statistical machine learning techniques and then draws a box 50 around the expected area of the face. The processor is able to track the face in the video data from frame to frame despite movement of the face in the image resulting from relative movement between the camera and the face.
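As an illustration only, frame-to-frame tracking of a detected box can be sketched by matching the previous box against the detections in the next frame. The sketch below uses intersection-over-union (IoU) matching, which is a swapped-in technique chosen for brevity and is not stated in this document; the box convention, the function names and the 0.3 threshold are all assumptions.

```python
from typing import List, Optional, Tuple

# Hypothetical box convention: (x, y, width, height) in pixels.
Box = Tuple[int, int, int, int]


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    overlap_w = max(0, min(ax + aw, bx + bw) - max(ax, bx))
    overlap_h = max(0, min(ay + ah, by + bh) - max(ay, by))
    inter = overlap_w * overlap_h
    union = aw * ah + bw * bh - inter
    return inter / union if union else 0.0


def track(previous: Box, detections: List[Box], min_iou: float = 0.3) -> Optional[Box]:
    """Return the detection that best overlaps the previous box.

    Returns None when no detection overlaps enough, i.e. the object
    is treated as lost in this frame. The threshold is an assumption.
    """
    best = max(detections, key=lambda d: iou(previous, d), default=None)
    if best is None or iou(previous, best) < min_iou:
        return None
    return best
```

Calling `track` once per frame with the box returned for the previous frame follows the face as it moves; a `None` result corresponds to the face leaving the scene.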
The user 10 is able to choose which face, or faces, the processor will track depending upon how many faces the user is able to see. As one of the friends leaves or another arrives the user is able to change their selection, for instance by pressing a button. Also, as people arrive at the scene, or leave, the processor may recognise that fact as the face is detected, and when detection ceases; these events can be announced to the user by means of an ear-mounted microphone. The processor may consult a list of names associated with particular faces that have been previously recognised, and use this list to announce that a person has arrived or left. The processor also modifies the video data by reserving an area 60 for the display of the chosen face. In this example the user suffers from macular degeneration and cannot see the central part of the image 70, but is able to see in their peripheral vision. As a result the processor in this case reserves the bottom left corner for the display of the chosen face in a separate video tile 60. The user is able to see this video tile, and therefore the face, in their peripheral vision.
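The reservation of a fixed area for the tracked face can be illustrated with a minimal sketch. It assumes a frame held as rows of grey-level pixels and a tile of the same size as the tracked box; the names `composite_tile` and `bottom_left_origin` and the data layout are hypothetical, not taken from this document.

```python
from typing import List, Tuple

# Hypothetical stand-ins: a frame is a list of pixel rows,
# a box is (x, y, width, height) in pixels.
Frame = List[List[int]]
Box = Tuple[int, int, int, int]


def bottom_left_origin(frame: Frame, box: Box) -> Tuple[int, int]:
    """Top-left corner of a tile of the box's size placed in the bottom left."""
    return (0, len(frame) - box[3])


def composite_tile(frame: Frame, box: Box, tile_origin: Tuple[int, int]) -> Frame:
    """Copy the tracked region into the reserved tile and return a new frame."""
    x, y, w, h = box
    tx, ty = tile_origin
    height, width = len(frame), len(frame[0])
    if not (0 <= tx and tx + w <= width and 0 <= ty and ty + h <= height):
        raise ValueError("tile does not fit inside the frame")
    # Read the tracked region first so an overlapping tile cannot corrupt it.
    patch = [row[x:x + w] for row in frame[y:y + h]]
    out = [row[:] for row in frame]  # leave the input frame untouched
    for dy, patch_row in enumerate(patch):
        out[ty + dy][tx:tx + len(patch_row)] = patch_row
    return out
```

Because the tile origin is fixed while the box is updated every frame, the face stays in the same part of the displayed image however the camera or the person moves.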
The face may be digitally modified so that it appears frontal to the user, even if it is turned to one side; this may assist recognition. This allows the user, for instance, to select the face of the friend they are talking with, and to see the changing expression on that face while they talk. When they start talking to someone else they can select the new face to be displayed in the separate video tile 60. The processor may also enhance the user's vision of the face, for instance, by enlarging it or zooming in on part of it, such as the eyes. The enhancement is preserved as the face is tracked.
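Enlargement of the tracked region can be sketched, under simplifying assumptions, as an integer-factor upscale. Nearest-neighbour replication is used here only because it is the simplest option; this document does not name an interpolation method, and the function name and pixel layout are hypothetical.

```python
from typing import List


def enlarge(patch: List[List[int]], factor: int) -> List[List[int]]:
    """Upscale a patch by an integer factor using nearest-neighbour replication."""
    if factor < 1:
        raise ValueError("factor must be a positive integer")
    return [
        [pixel for pixel in row for _ in range(factor)]  # repeat each pixel horizontally
        for row in patch
        for _ in range(factor)  # repeat each row vertically
    ]
```

Applying the same factor to every tracked frame keeps the enhancement in place as the face moves, as the paragraph above describes.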
Although the invention has been described with reference to a particular example it should be appreciated that it could be practised in many other ways. For instance, instead of vision spectacles, the user may be fitted with a retina implant that electrically stimulates nerves. Also the entire apparatus could be worn by the user, either on the head as a cap, or distributed about the body.
It should also be understood that the user can interact with the system in any conceivable way, for instance pressing a button, using gestures such as pointing or head movements, or a mouth activated pressure sensor. In addition winking, blinking or eye tracking could be used, or any combination of these things.
For instance, head movements could be used to bring an object to a 'hotspot' such as the centre of the field of view where the object can be enlarged by blinking. Alternatively, eye movements could be used in concert with head movements. In another alternative the user's hands may be recognised and used to point and gesture to select and enlarge objects. Such interactions could also be used to control pan or tilt of the camera.
Depending on the user's ability to see and the training they receive they may be able to perceive many different objects, or sets of objects, in reserved areas of the image. The objects themselves could be processed in many ways to make them more readily visible. As well as enlarging, the objects could for instance be coloured or distorted to increase the user's perception of them. This can be useful for hand to eye co-ordination, for instance to help the user manipulate door handles, cups, cutlery and the like. In this case the object may be enlarged while maintaining video tracking of it, so that the user has continuous video feedback as they manipulate it. Alternatively, the user may wish to enlarge their hands rather than the object. In this case, if the hand is touching or grasping something the computer may segment the object to facilitate the user interactively recognising it.
Other types of object may be enlarged but kept static, presenting the user with a still image. This could be useful, for instance, to select an object from a set. It is also useful when an object is marked with writing, for instance on the packaging of products in a shop. This enables the user to read the label even when the object is no longer in front of the camera. The text may be rendered to the user in a typeface and size that makes it easier to read. Alternatively, image to voice software may be used to read the label to the user. Similarly the text may be presented to the user via a braille tablet. The apparatus may be operated by the user to switch back and forth between these two modes of operation, so that after reading the label the user can find an item of the product and take it from the shelf. Moving on, the user may then read another label. There are many other situations where this functionality may be useful, for instance to identify the number of an approaching bus.
Machine learning may also be used to automate some of these sequences of commands. It could also be used to provide alerts to the user, for instance on detecting the presence of particular people or objects itemised on a watch list. The alert may be given in any convenient way, for instance using sound, a tactile transducer or light. It may result in automatic actions such as enlarging the object.
When an object is lost from sight, for instance because a person has left the scene, an alert may be provided to the user. This could result in automatic zoom-out so that the remaining scene can be reviewed by the user.
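The arrival, departure and watch-list alerts described above can be sketched by comparing the set of names recognised in consecutive frames. This is a minimal illustration under assumptions: the function name, the message wording and the example names are hypothetical, and the recognition step that produces the names is taken as given.

```python
from typing import Iterable, List, Set


def scene_events(previous: Set[str], current: Set[str],
                 watch_list: Iterable[str] = ()) -> List[str]:
    """Describe who arrived or left between two frames.

    Arrivals of names on the watch list are flagged as alerts.
    Messages are sorted by name so the output order is deterministic.
    """
    watched = set(watch_list)
    events = []
    for name in sorted(current - previous):
        prefix = "ALERT: " if name in watched else ""
        events.append(f"{prefix}{name} has arrived")
    for name in sorted(previous - current):
        events.append(f"{name} has left")
    return events
```

Each returned message could then be routed to whichever output the user prefers, such as sound, a tactile transducer or light; a "has left" message is the point at which an automatic zoom-out might be triggered.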
A mode of operation may automatically provide information about the distance from the user to objects in the scene, for instance by providing a number associated with the object, colour-coding or by announcement to the user. Other types of information may also be provided. For instance, when a building is recognized the user may be informed of the location within the building they require and directions to it. If a bus is identified the user may be reminded about its route and frequency.
It should also be appreciated that the scene being viewed is not limited at all. It could be anything from a landscape to photographs or even movies.

Claims

CLAIMS:
1. A vision enhancement apparatus for a vision-impaired user, comprising an image processor arranged to:
process high resolution video data representing images of a scene and display the resulting images to the user;
automatically detect and track a user selected object in the images; and
automatically modify the video data, by reserving a user selected area of the displayed images for displaying the tracked object as a separate video tile within the scene.
2. A vision enhancement apparatus according to claim 1, wherein the object is identified within a first box and the separate video tile is a second box, the same size and shape as the first box.
3. A vision enhancement apparatus according to claim 2, wherein the object in the first box is displayed in the separate video tile.
4. A vision enhancement apparatus according to claim 3, wherein the user selects the location for the separate video tile such that it is located in a part of the displayed images where it is visible to the vision-impaired user.
5. A vision enhancement apparatus according to claim 4, wherein in the event the user is suffering from macular degeneration the separate video tile is located in their peripheral vision.
6. A vision enhancement apparatus according to claim 3, wherein the separate video tile will remain in that part of the user's field of vision despite movement of the user, the camera or the tracked object.
7. A vision enhancement apparatus according to claim 1, wherein the user has a high resolution video camera mounted on their head so that it films the scene in front of them.
8. A vision enhancement apparatus according to claim 7, wherein the processor automatically detects faces in the scene and processes the images to identify the faces.
9. A vision enhancement apparatus according to claim 8, wherein the apparatus announces newly identified faces to the user and then tracks their movements within the scene.
10. A vision enhancement apparatus according to claim 9, wherein while a face remains in the scene it is continually tracked so the image of it is continually updated to show its changing facial expressions in the separate video tile.
11. A vision enhancement apparatus according to claim 3, wherein the user is able to select more than one object to be displayed in respective separate video tiles of the displayed images.
12. A vision enhancement apparatus according to claim 11, wherein the user selects which objects are displayed and which video tile each is displayed in within the displayed images.
13. A vision enhancement apparatus according to claim 2, wherein the user is able to switch between the detected objects, to select the one which is displayed in the video tile.
14. A vision enhancement apparatus according to claim 3, wherein the object is enlarged, or otherwise manipulated, to increase the user's perception of it.
15. A vision enhancement apparatus according to claim 3, wherein the modified video data is provided to a visual display, being vision spectacles, a retina, cortical or optic nerve implant.
16. A vision enhancement apparatus according to claim 1, wherein the user selected object is the user's hands.
17. A vision enhancement apparatus according to claim 16, wherein the user selected object is in the user's hands and the image processor operates to segment the object, to enable the user to interactively recognise it.
18. A vision enhancement apparatus according to claim 1, wherein the user selected object is text or an object with text on it.
19. A vision enhancement apparatus according to claim 18, wherein the user is able to selectively switch between a still image of the text and a video image of the scene including the object.
20. A vision enhancement apparatus according to claim 1, wherein the apparatus automatically obtains and presents information to the user about the selected object.
21. A vision enhancement apparatus according to claim 1, wherein the entire apparatus is distributed about the body.
22. A method for enhancing vision, comprising the steps of:
processing high resolution video data representing images of a scene and displaying the resulting images to the user;
automatically detecting and tracking a user selected object in the images; and
automatically modifying the video data, by reserving a user selected area of the displayed images for displaying the tracked object as a separate video tile within the scene.
23. Software to perform the method according to claim 22.
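The steps of claim 22 can be illustrated with a minimal sketch. It is not the applicant's implementation: the function names `locate`, `crop_and_scale` and `composite` are hypothetical, frames are assumed to be small grayscale grids, and the "tracker" is a toy stand-in that simply finds pixels of a given value. A real system would substitute a proper detector and tracker. The sketch only shows the idea of reserving a fixed area of the displayed image and filling it with an enlarged view of the tracked object, wherever that object moves in the scene.

```python
from typing import List, Optional, Tuple

Frame = List[List[int]]            # grayscale image as rows of pixel values
Box = Tuple[int, int, int, int]    # (top, left, height, width)


def locate(frame: Frame, target: int) -> Optional[Box]:
    """Toy stand-in for a tracker: bounding box of pixels equal to `target`."""
    hits = [(r, c) for r, row in enumerate(frame)
            for c, v in enumerate(row) if v == target]
    if not hits:
        return None
    rows = [r for r, _ in hits]
    cols = [c for _, c in hits]
    top, left = min(rows), min(cols)
    return top, left, max(rows) - top + 1, max(cols) - left + 1


def crop_and_scale(frame: Frame, box: Box, out_h: int, out_w: int) -> Frame:
    """Crop `box` from the frame and resize it by nearest-neighbour sampling."""
    top, left, h, w = box
    return [[frame[top + (r * h) // out_h][left + (c * w) // out_w]
             for c in range(out_w)]
            for r in range(out_h)]


def composite(frame: Frame, target: int, tile: Box) -> Frame:
    """Return a copy of `frame` with the tracked object shown in `tile`."""
    out = [row[:] for row in frame]
    box = locate(frame, target)
    if box is None:                # object not in the scene: frame unchanged
        return out
    t_top, t_left, t_h, t_w = tile
    patch = crop_and_scale(frame, box, t_h, t_w)
    for r in range(t_h):
        out[t_top + r][t_left:t_left + t_w] = patch[r]
    return out
```

Calling `composite(frame, target, tile)` on each incoming frame with the same `tile` keeps the object's view in the same part of the displayed image even as the object's bounding box changes from frame to frame, which is the behaviour described in claim 6.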
PCT/AU2010/001290 2009-09-30 2010-09-30 Object tracking for artificial vision WO2011038465A1 (en)

Priority Applications (4)

Application Number Publication Number Priority Date Filing Date Title
US13/498,667 US9697746B2 (en) 2009-09-30 2010-09-30 Object tracking for artificial vision
AU2010302959A AU2010302959B2 (en) 2009-09-30 2010-09-30 Object tracking for artificial vision
EP10819750.0A EP2482760B1 (en) 2009-09-30 2010-09-30 Object tracking for artificial vision
US15/609,953 US10062303B2 (en) 2009-09-30 2017-05-31 Object tracking for artificial vision

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
AU2009904788 2009-09-30
AU2009904788A AU2009904788A0 (en) 2009-09-30 Object Tracking for Artificial Vision

Related Child Applications (2)

Application Number Relation Publication Number Priority Date Filing Date Title
US13/498,667 A-371-Of-International US9697746B2 (en) 2009-09-30 2010-09-30 Object tracking for artificial vision
US15/609,953 Continuation US10062303B2 (en) 2009-09-30 2017-05-31 Object tracking for artificial vision

Publications (1)

Publication Number Publication Date
WO2011038465A1 (en) 2011-04-07

Family

ID=43825443

Family Applications (1)

Application Number Publication Number Priority Date Filing Date Title
PCT/AU2010/001290 WO2011038465A1 (en) 2009-09-30 2010-09-30 Object tracking for artificial vision

Country Status (4)

Country Link
US (2) US9697746B2 (en)
EP (1) EP2482760B1 (en)
AU (1) AU2010302959B2 (en)
WO (1) WO2011038465A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130035742A1 (en) * 2011-08-05 2013-02-07 Neil Hamilton Talbot Face Detection, Tracking, and Recognition for a Visual Prosthesis
US9526896B2 (en) 2014-08-12 2016-12-27 Second Sight Medical Products, Inc. Pattern detection and location indication for a visual prosthesis
US10022544B2 (en) 2013-07-22 2018-07-17 National Ict Australia Limited Vision enhancement apparatus for a vision impaired user
CN111445527A (en) * 2019-12-31 2020-07-24 深圳硅基仿生科技有限公司 Method for detecting bar-grid vision of retina stimulator

Families Citing this family (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8121361B2 (en) 2006-05-19 2012-02-21 The Queen's Medical Center Motion tracking system for real time adaptive imaging and spectroscopy
US9697746B2 (en) * 2009-09-30 2017-07-04 National Ict Australia Limited Object tracking for artificial vision
EP2747641A4 (en) 2011-08-26 2015-04-01 Kineticor Inc Methods, systems, and devices for intra-scan motion correction
US9305365B2 (en) 2013-01-24 2016-04-05 Kineticor, Inc. Systems, devices, and methods for tracking moving targets
US10327708B2 (en) 2013-01-24 2019-06-25 Kineticor, Inc. Systems, devices, and methods for tracking and compensating for patient motion during a medical imaging scan
US9717461B2 (en) 2013-01-24 2017-08-01 Kineticor, Inc. Systems, devices, and methods for tracking and compensating for patient motion during a medical imaging scan
US9180053B2 (en) 2013-01-29 2015-11-10 Xerox Corporation Central vision impairment compensation
CN109008972A (en) 2013-02-01 2018-12-18 凯内蒂科尔股份有限公司 The motion tracking system of real-time adaptive motion compensation in biomedical imaging
CN106572810A (en) 2014-03-24 2017-04-19 凯内蒂科尔股份有限公司 Systems, methods, and devices for removing prospective motion correction from medical imaging scans
EP3188660A4 (en) 2014-07-23 2018-05-16 Kineticor, Inc. Systems, devices, and methods for tracking and compensating for patient motion during a medical imaging scan
US11372479B2 (en) * 2014-11-10 2022-06-28 Irisvision, Inc. Multi-modal vision enhancement system
US11144119B2 (en) * 2015-05-01 2021-10-12 Irisvision, Inc. Methods and systems for generating a magnification region in output video images
US11546527B2 (en) 2018-07-05 2023-01-03 Irisvision, Inc. Methods and apparatuses for compensating for retinitis pigmentosa
AU2015346507A1 (en) * 2014-11-10 2017-06-29 Irisvision, Inc. Methods and apparatus for vision enhancement
US20220171456A1 (en) * 2014-11-10 2022-06-02 Irisvision, Inc. Method and System for Remote Clinician Management of Head-Mounted Vision Assist Devices
US10149958B1 (en) * 2015-07-17 2018-12-11 Bao Tran Systems and methods for computer assisted operation
US9943247B2 (en) 2015-07-28 2018-04-17 The University Of Hawai'i Systems, devices, and methods for detecting false movements for motion correction during a medical imaging scan
WO2017091479A1 (en) 2015-11-23 2017-06-01 Kineticor, Inc. Systems, devices, and methods for tracking and compensating for patient motion during a medical imaging scan
US10665019B2 (en) * 2016-03-24 2020-05-26 Qualcomm Incorporated Spatial relationships for integration of visual images of physical environment into virtual reality
US10242427B2 (en) * 2016-07-29 2019-03-26 Massachusetts Institute Of Technology Deviation magnification: revealing departures from ideal geometries
CA3091176A1 (en) 2018-02-13 2019-08-22 Frank Werblin Methods and apparatus for contrast sensitivity compensation
AU2019352954B2 (en) 2018-10-01 2022-03-10 Biovisics Medical, Inc. System and methods for controlled electrical modulation for vision therapy
WO2020112980A2 (en) 2018-11-30 2020-06-04 Biovisics Medical, Llc Head worn apparatuses for vision therapy
EP3952979A1 (en) 2019-04-10 2022-02-16 Biovisics Medical, Inc. Systems and interfaces for ocular therapy
US11511112B2 (en) 2019-06-14 2022-11-29 Biovisics Medical, Inc. Wearable medical device
WO2021011255A1 (en) 2019-07-12 2021-01-21 Biovisics Medical, Inc. Ocular therapy modes and systems
EP4132799A4 (en) * 2020-04-07 2024-04-10 Entrust Corporation Laser textured identification document surfaces

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5777715A (en) * 1997-01-21 1998-07-07 Allen Vision Systems, Inc. Low vision rehabilitation system
US5912980A (en) * 1995-07-13 1999-06-15 Hunke; H. Martin Target acquisition and tracking
US20020008758A1 (en) * 2000-03-10 2002-01-24 Broemmelsiek Raymond M. Method and apparatus for video surveillance with defined zones
US20060056056A1 (en) * 2004-07-19 2006-03-16 Grandeye Ltd. Automatically expanding the zoom capability of a wide-angle video camera
US20060251382A1 (en) 2005-05-09 2006-11-09 Microsoft Corporation System and method for automatic video editing using object recognition
US7272306B2 (en) * 2002-10-22 2007-09-18 Xiaolin Zhang Bionic automatic vision and line of sight control system and method
US20070291104A1 (en) * 2006-06-07 2007-12-20 Wavetronex, Inc. Systems and methods of capturing high-resolution images of objects
US20080060034A1 (en) * 2006-02-13 2008-03-06 Geoffrey Egnal System and method to combine multiple video streams
US20080174659A1 (en) * 2007-01-18 2008-07-24 Mcdowall Ian Wide field of view display device and method
US20080316427A1 (en) * 2005-11-15 2008-12-25 Carl Zeiss Vision Australia Holdings Limited Ophthalmic Lens Simulation System and Method

Family Cites Families (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7788008B2 (en) * 1995-06-07 2010-08-31 Automotive Technologies International, Inc. Eye monitoring system and method for vehicular occupants
GB2272126A (en) * 1992-10-28 1994-05-04 Quantel Ltd Image processing system in which a high resolution image is stored as a plurality of low resolution images.
US5689641A (en) 1993-10-01 1997-11-18 Vicor, Inc. Multimedia collaboration system arrangement for routing compressed AV signal through a participant site without decompressing the AV signal
US6127990A (en) * 1995-11-28 2000-10-03 Vega Vista, Inc. Wearable display and methods for controlling same
DE19704197A1 (en) * 1997-02-05 1998-08-06 Zeiss Carl Jena Gmbh Arrangement for determining refractive power and/or other visual functions
CA2233047C (en) * 1998-02-02 2000-09-26 Steve Mann Wearable camera system with viewfinder means
JP3792920B2 (en) * 1998-12-25 2006-07-05 株式会社東海理化電機製作所 Touch operation input device
US6680745B2 (en) * 2000-11-10 2004-01-20 Perceptive Network Technologies, Inc. Videoconferencing method with tracking of face and dynamic bandwidth allocation
JP4143305B2 (en) * 2001-01-30 2008-09-03 日本電気株式会社 Robot device, verification environment determination method, and verification environment determination program
DE10103922A1 (en) * 2001-01-30 2002-08-01 Physoptics Opto Electronic Gmb Interactive data viewing and operating system
US7194148B2 (en) * 2001-09-07 2007-03-20 Yavitz Edward Q Technique for providing simulated vision
JP2006502421A (en) * 2001-11-06 2006-01-19 キーオティ Image projection device
US7453506B2 (en) * 2003-08-25 2008-11-18 Fujifilm Corporation Digital camera having a specified portion preview section
JP2005250772A (en) 2004-03-03 2005-09-15 Fuji Photo Film Co Ltd Object identification apparatus and method, learning method and program
JP2005277726A (en) 2004-03-24 2005-10-06 Toshiba Corp Image monitoring device
EP1842141A1 (en) * 2005-01-20 2007-10-10 Koninklijke Philips Electronics N.V. Multimedia presentation creation
JP4449782B2 (en) 2005-02-25 2010-04-14 ソニー株式会社 Imaging apparatus and image distribution method
EP1875298A1 (en) * 2005-04-03 2008-01-09 Ben-Gurion University of the Negev Research and Development Authority Low vision aid device
GB2441434B (en) * 2006-08-29 2010-06-23 David Charles Dewhurst Audiotactile vision substitution system
DE102006060045A1 (en) * 2006-12-19 2008-06-26 Imi Intelligent Medical Implants Ag Visual aid with three-dimensional image capture
GB0702150D0 (en) * 2007-02-05 2007-03-14 Amegoworld Ltd A Communication Network and Devices
US20090153654A1 (en) * 2007-12-18 2009-06-18 Enge Amy D Video customized to include person-of-interest
US9697746B2 (en) * 2009-09-30 2017-07-04 National Ict Australia Limited Object tracking for artificial vision
US9262668B2 (en) * 2010-05-21 2016-02-16 Honeywell International Inc. Distant face recognition system
US9526896B2 (en) * 2014-08-12 2016-12-27 Second Sight Medical Products, Inc. Pattern detection and location indication for a visual prosthesis

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5912980A (en) * 1995-07-13 1999-06-15 Hunke; H. Martin Target acquisition and tracking
US5777715A (en) * 1997-01-21 1998-07-07 Allen Vision Systems, Inc. Low vision rehabilitation system
US20020008758A1 (en) * 2000-03-10 2002-01-24 Broemmelsiek Raymond M. Method and apparatus for video surveillance with defined zones
US7272306B2 (en) * 2002-10-22 2007-09-18 Xiaolin Zhang Bionic automatic vision and line of sight control system and method
US20060056056A1 (en) * 2004-07-19 2006-03-16 Grandeye Ltd. Automatically expanding the zoom capability of a wide-angle video camera
US20060251382A1 (en) 2005-05-09 2006-11-09 Microsoft Corporation System and method for automatic video editing using object recognition
US20080316427A1 (en) * 2005-11-15 2008-12-25 Carl Zeiss Vision Australia Holdings Limited Ophthalmic Lens Simulation System and Method
US20080060034A1 (en) * 2006-02-13 2008-03-06 Geoffrey Egnal System and method to combine multiple video streams
US20070291104A1 (en) * 2006-06-07 2007-12-20 Wavetronex, Inc. Systems and methods of capturing high-resolution images of objects
US20080174659A1 (en) * 2007-01-18 2008-07-24 Mcdowall Ian Wide field of view display device and method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
DANKERS A. ET AL.: "MAP ZDF Segmentation and Tracking using active Stereo Vision: hand Tracking Case Study", COMPUTER VISION AND IMAGE UNDERSTANDING, vol. 108, no. 1- 2, 2007, pages 74 - 86, XP008155830 *
See also references of EP2482760A4

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130035742A1 (en) * 2011-08-05 2013-02-07 Neil Hamilton Talbot Face Detection, Tracking, and Recognition for a Visual Prosthesis
US9569657B2 (en) * 2011-08-05 2017-02-14 Second Sight Medical Products, Inc. Face detection, tracking, and recognition for a visual prosthesis
US20170132455A1 (en) * 2011-08-05 2017-05-11 Second Sight Medical Products, Inc. Face Detection Tracking and Recognition for a Visual Prosthesis
US10586096B2 (en) * 2011-08-05 2020-03-10 Second Sight Medical Products, Inc. Face detection tracking and recognition for a visual prosthesis
US11393247B2 (en) 2011-08-05 2022-07-19 Second Sight Medical Products, Inc. Face detection tracking and recognition for a visual prosthesis
US10022544B2 (en) 2013-07-22 2018-07-17 National Ict Australia Limited Vision enhancement apparatus for a vision impaired user
US9526896B2 (en) 2014-08-12 2016-12-27 Second Sight Medical Products, Inc. Pattern detection and location indication for a visual prosthesis
US9842248B2 (en) 2014-08-12 2017-12-12 Second Sight Medical Products, Inc. Pattern detection and location in a processed image
US11235151B2 (en) 2014-08-12 2022-02-01 Second Sight Medical Products, Inc Pattern detection and location in a processed image
CN111445527A (en) * 2019-12-31 2020-07-24 深圳硅基仿生科技有限公司 Method for detecting bar-grid vision of retina stimulator
CN111445527B (en) * 2019-12-31 2021-09-07 深圳硅基仿生科技有限公司 Method for detecting bar-grid vision of retina stimulator

Also Published As

Publication number Publication date
AU2010302959B2 (en) 2015-11-12
US10062303B2 (en) 2018-08-28
EP2482760A4 (en) 2014-10-22
AU2010302959A1 (en) 2012-04-26
EP2482760A1 (en) 2012-08-08
US9697746B2 (en) 2017-07-04
EP2482760B1 (en) 2020-03-25
US20120212594A1 (en) 2012-08-23
US20170263153A1 (en) 2017-09-14

Similar Documents

Publication Publication Date Title
US10062303B2 (en) Object tracking for artificial vision
CN108519676B (en) Head-wearing type vision-aiding device
US20180189567A1 (en) Method and device for visually impaired assistance
TWI625549B (en) Electronic contact lenses, image system comprising the same, method of providing content and non-transitory computer readable medium
EP3117290B1 (en) Interactive information display
JP7355006B2 (en) Information processing device, information processing method, and recording medium
CN112601509A (en) Hybrid see-through augmented reality system and method for low-vision users
US12079943B2 (en) Neutral avatars
WO2017168998A1 (en) Information processing device, information processing method, and program
CN109376621A (en) A kind of sample data generation method, device and robot
CN112506336A (en) Head mounted display with haptic output
WO2010142455A2 (en) Method for determining the position of an object in an image, for determining an attitude of a persons face and method for controlling an input device based on the detection of attitude or eye gaze
EP2261772A1 (en) Method for controlling an input device based on the detection of attitude or eye gaze
JP2020151012A (en) Communication system, and control method of communication system
EP2919094A1 (en) Interactive information display
EP2261857A1 (en) Method for determining the position of an object in an image, for determining an attitude of a persons face and method for controlling an input device based on the detection of attitude or eye gaze
JP6555665B2 (en) Communication apparatus and method
EP3699735A1 (en) Method, computer program and head mountable arrangement for assisting a subject to acquire spatial information about an environment
US8307295B2 (en) Method for controlling a computer generated or physical character based on visual focus
Brabyn et al. Aids for people who are blind or visually impaired
US20240267494A1 (en) Intraocular system that includes an implant with an image formation device
JP2011067479A (en) Image auralization apparatus
Gonda et al. Tactile Vision-Development of a Wearable Prosthesis for the Blind
KR20230170026A (en) Assisting the visually impaired with AR using force feedback
CN113672084A (en) AR display picture adjusting method and system

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application
Ref document number: 10819750; Country of ref document: EP; Kind code of ref document: A1

NENP Non-entry into the national phase
Ref country code: DE

WWE WIPO information: entry into national phase
Ref document number: 2010302959; Country of ref document: AU

ENP Entry into the national phase
Ref document number: 2010302959; Country of ref document: AU; Date of ref document: 20100930; Kind code of ref document: A

WWE WIPO information: entry into national phase
Ref document number: 2010819750; Country of ref document: EP

WWE WIPO information: entry into national phase
Ref document number: 13498667; Country of ref document: US