Google AI can differentiate individual voices in a crowd

By Digit NewsDesk | Published on 14 Apr 2018
HIGHLIGHTS

Google developers trained a neural network to identify the voices of individual people speaking by themselves and then constructed virtual ‘parties’ with background noise to teach the neural network to isolate the multiple voices into individual audio tracks.

Google AI can differentiate individual voices in a crowd

#IBMCodePatterns, a developer’s best friend.

#IBMCodePatterns provide complete solutions to problems that developers face every day. They leverage multiple technologies, products, or services to solve issues across multiple industries.

Click here to know more

Advertisements

Google’s AI is inching closer to acting like humans. The developers at Google have developed a solution that uses deep learning to pick out human voices from a crowd by look at people’s faces while they’re talking.

Essentially, the developers trained a neural network to identify the voices of individual people speaking by themselves and then constructed virtual ‘parties’ with background noise to teach the neural network to isolate the multiple voices into individual audio tracks.

Following the method, the AI can recognise the voice of one person by just focusing on their face. It works even if the face is obscured with their hands or a microphone. Google demonstrated the results through a clip on YouTube and the results are uncannily accurate.

You might soon be able to see the AI at work. Google is “exploring opportunities” to use the feature in video chat apps like Hangouts and Duo which could help the AI to learn even more. It could also help users understand people better in a crowded room. Furthermore, it could help in speech enhancement and camera-linked hearing aids that can boost the sound of whoever’s speaking in front of you.

There is also a case of potential privacy violation as the feature can be used to publicly eavesdrop on people. Google could bypass that by allowing the feature to work only for people who give their consent.

logo
Digit NewsDesk

The guy who answered the question 'What are you doing?' with 'Nothing'.

Digit caters to the largest community of tech buyers, users and enthusiasts in India. The all new Digit in continues the legacy of Thinkdigit.com as one of the largest portals in India committed to technology users and buyers. Digit is also one of the most trusted names when it comes to technology reviews and buying advice and is home to the Digit Test Lab, India's most proficient center for testing and reviewing technology products.

We are about leadership-the 9.9 kind! Building a leading media company out of India.And,grooming new leaders for this promising industry.

DMCA.com Protection Status