Skip to main content
. Author manuscript; available in PMC: 2019 Jan 10.
Published in final edited form as: IEEE Spectr. 2017 Feb 28;54(3):32–37. doi: 10.1109/MSPEC.2017.7864754

graphic file with name nihms959954f2.jpg

Clean Speech: To separate speech from noise, a machine learning program breaks a noisy speech sample into a collection of elements called time-frequency units. Next, it analyzes these units to extract 85 features known to distinguish speech from other sounds. Then, the program feeds the features into a deep neural network trained to classify the units as speech or not based on past experience with similar samples. Lastly, the program applies a digital filter that tosses out all the nonspeech units to leave only separated speech.