People struggling to follow a conversation in noisy situations could soon be helped by artificial intelligence, after a technological breakthrough that is claimed to have solved the "cocktail party problem".
The phenomenon describes how people can filter out background noises, such as the chatter of a party, to focus on one particular sound or speaker. Scientists have long puzzled over how the human brain is able to do this, leading Tech Crunch to call it "one of the greatest barriers to voice technologies reaching a level of understanding comparable to humans".
Voice technologies, added the website, are a growing market expected to reach $26.8 billion (£20.4 billion) by next year. However, they are not being designed to confront the "messiness" or "cacophony" of real life, in particular the background and ambient noise that "muddies" the signals they receive. The only way to combat this, said Tech Crunch, is to find a way to make voice tech as good as the human auditory system.
It is not only scientists who have been battling background noise: a growing number of people are struggling with the cocktail party problem, reported the i news site. In particular, people born between 1997 and 2012, so-called Generation Z, find it hard to hear conversations in noisy places, it added, citing a survey that found 11.5% of this group "always" experience the condition, compared with only 8% of 25- to 34-year-olds and 7.4% of over-55s.
The researchers believed a greater use of headphones by the younger respondents was the "key reason" for the difference.
AI's day in court
As well as causing difficulties in social situations, the cocktail party problem also has legal implications, said the BBC. Technology's inability to filter out background noise can affect audio evidence in legal cases, if listeners cannot be completely certain who is talking and what is being said.
Electrical engineer Keith McElveen, founder and chief technology officer of US company Wave Sciences, told the broadcaster it was "one of the classic hard problems in acoustics".
McElveen originally became interested in the problem when working for the US government investigating a possible war crime. "Some of the evidence included recordings with a bunch of voices all talking at once – and that's when I learned what the 'cocktail party problem' was," he said.
The issue was that sounds bounced around the room and made isolating a particular noise "mathematically horrible to solve". He hit upon the idea of using AI to "pinpoint and screen out" background voices and ambient noises based on where they originated in the room.
It took researchers at Wave Sciences 10 years of testing to "finally" create an AI system that could analyse how sound bounces around the room before it reaches an ear or a mic. The result is similar to a camera focussing on a subject and blurring out the rest of the image.
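The idea of separating sounds by where they originate can be illustrated with a classic, non-AI technique called delay-and-sum beamforming. The sketch below is a toy example only and is not Wave Sciences' system: the four microphones, the whole-sample delays and the two sine-wave "voices" are all assumptions chosen so that the effect is easy to see, and it ignores the room echoes that make the real problem so hard.

```python
import math

def delay(signal, n):
    """Delay a signal by n whole samples, padding the start with zeros."""
    return [0.0] * n + signal[:len(signal) - n]

def advance(signal, n):
    """Advance a signal by n whole samples, padding the end with zeros."""
    return signal[n:] + [0.0] * n

def delay_and_sum(channels, steering_delays):
    """Undo each microphone's delay for the chosen source, then average."""
    aligned = [advance(ch, d) for ch, d in zip(channels, steering_delays)]
    return [sum(samples) / len(aligned) for samples in zip(*aligned)]

def rms_error(a, b):
    """Root-mean-square difference between two equal-length signals."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)) / len(a))

N = 400
# Hypothetical "voices": a slow sine (the speaker we want) and a fast one.
target = [math.sin(2 * math.pi * 5 * t / N) for t in range(N)]
interferer = [math.sin(2 * math.pi * 50 * t / N) for t in range(N)]

# Assumed arrival delays (in samples) at four microphones. Each source
# sits in a different place, so its pattern of delays is different.
# The values are picked so the interferer cancels cleanly in this demo.
target_delays = [0, 1, 2, 3]
interferer_delays = [0, 3, 6, 9]

# Every microphone hears both sources mixed together.
mics = [
    [s + v for s, v in zip(delay(target, dt), delay(interferer, di))]
    for dt, di in zip(target_delays, interferer_delays)
]

# "Focus" on the target: its copies line up and reinforce each other,
# while the interferer's copies stay out of step and largely cancel.
focused = delay_and_sum(mics, target_delays)

print("error, single mic:", rms_error(mics[0], target))
print("error, focused   :", rms_error(focused, target))
```

Steering with `interferer_delays` instead would focus on the other source, which is the rough equivalent of the camera in the analogy changing its subject.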
The technology was put to the test in a US court case, turning an audio recording into a "pivotal piece of evidence", and is now being used by the military. Future uses could include smart speakers and hearing aid devices, added the BBC.