Orcam
|
JERUSALEM — Liat Negrin, an Israeli who has been visually impaired since childhood, walked into a grocery store here recently, picked up a can of vegetables and easily read its label using a simple and unobtrusive camera attached to her glasses.
Ms. Negrin, who has coloboma, a birth defect that perforates a structure
of the eye and afflicts about 1 in 10,000 people, is an employee at
OrCam, an Israeli start-up that has developed a camera-based system
intended to give the visually impaired the ability to both “read” easily
and move freely.
Until now reading aids for the visually impaired and the blind have been
cumbersome devices that recognize text in restricted environments, or,
more recently, have been software applications on smartphones that have
limited capabilities.
In contrast, the OrCam device is a small camera worn in the style of
Google Glass, connected by a thin cable to a portable computer designed
to fit in the wearer’s pocket. The system clips on to the wearer’s
glasses with a small magnet and uses a bone-conduction speaker to offer
clear speech as it reads aloud the words or object pointed to by the
user.
The system is designed to both recognize and speak “text in the wild,” a
term used to describe newspaper articles as well as bus numbers, and
objects as diverse as landmarks, traffic lights and the faces of
friends.
It currently recognizes English-language text and beginning this week will be sold through the company’s Web site
for $2,500, about the cost of a midrange hearing aid. It is the only
product, so far, of the privately held company, which is part of the
high-tech boom in Israel.
The device is quite different from other technology that has been
developed to give some vision to people who are blind, like the artificial retina
system called Argus II, made by Second Sight Medical Products. That
system, which was approved by the Food and Drug Administration in
February, allows visual signals to bypass a damaged retina and be
transmitted to the brain.
The OrCam device is also drastically different from Google Glass, which
also offers the wearer a camera but is designed for people with normal
vision and has limited visual recognition and local computing power.
OrCam was founded several years ago by Amnon Shashua,
a well-known researcher who is a computer science professor at Hebrew
University here. It is based on computer vision algorithms that he has
pioneered with another faculty member, Shai Shalev-Shwartz, and one of his former graduate students, Yonatan Wexler.
“What is remarkable is that the device learns from the user to recognize a new product,” said Tomaso Poggio,
a computer scientist at M.I.T. who is a computer vision expert and with
whom Dr. Shashua studied as a graduate student. “This is more complex
than it appears, and, as an expert, I find it really impressive.”
The advance is the result of both rapidly improving computing processing
power that can now be carried comfortably in a wearer’s pocket and the
computer vision algorithm developed by the scientists.
On a broader technology level, the OrCam system is representative of a
wide range of rapid improvements being made in the field of artificial
intelligence, in particular with vision systems for manufacturing as
well as fields like autonomous motor vehicles. (Dr. Shashua previously
founded Mobileye, a corporation that supplies camera technology
to the automobile industry that can recognize objects like pedestrians
and bicyclists and can keep a car in a lane on a freeway.)
Speech recognition is now routinely used by tens of millions of people
on both iPhones and Android smartphones. Moreover, natural language
processing is making it possible for computer systems to “read”
documents, which is having a significant impact in the legal field,
among others.
There are now at least six competing approaches in the field of computer
vision. For example, researchers at Google and elsewhere have begun
using what are known as “deep learning” techniques that attempt to mimic
biological vision systems. However, they require vast computing
resources for accurate recognition.
In contrast, the OrCam technique, which was described in a technical paper
in 2011 by the Hebrew University researchers, offers a reasonable
trade-off between recognition accuracy and speed. The technique, known
as Shareboost, is distinguished by the fact that as the number of
objects it needs to recognize grows, the system minimizes the amount of
additional computer power required.
“The challenges are huge,” said Dr. Wexler, a co-author of the paper and
vice president of research and development at OrCam. “People who have
low vision will continue to have low vision, but we want to harness
computer science to help them.”
Additionally the OrCam system is designed to have a minimal control
system, or user interface. To recognize an object or text, the wearer
simply points at it with his or her finger, and the device then
interprets the scene.
The system recognizes a pre-stored set of objects and allows the user to
add to its library — for example, text on a label or billboard, or a
stop light or street sign — by simply waving his or her hand, or the
object, in the camera’s field of view.
One of the key challenges, Dr. Shashua said, was allowing quick optical
character recognition in a variety of lighting conditions as well as on
flexible surfaces.
“The professional optical character readers today will work very well
when the image is good, but we have additional challenges — we must read
text on flexible surfaces like a hand-held newspaper,” he said.
Although the system is usable by the blind, OrCam is initially planning
to sell the device to people in the United States who are visually
impaired, which means that their vision cannot be adequately corrected
with glasses.
In the United States, 21.2 million people over the age of 18 have some
kind of visual impairment, including age-related conditions, diseases
and birth defects, according to the 2011 National Health Survey by the
U.S. National Center for Health Statistics. OrCam said that worldwide
there were 342 million adults with significant visual impairment, and
that 52 million of them had middle-class incomes.
No comments:
Post a Comment