ios - Analyse recorded audio file with Swift for speech to text
I'm able to record audio with Swift on iOS and play the recorded audio file. I'm asking whether it is possible to check the recorded audio file for background noise and volume/decibel level, so that I can decide whether it is good enough for a speech to text framework. The framework itself is not the problem; I have researched the available ones.
I'm curious whether I can analyse the recorded audio file with AVFoundation, the Accelerate framework, or some other framework to check if the audio is good/clear enough to be processed by a speech to text framework.
I don't have a lot of audio knowledge, but I've researched a bit and found out that I can get peak and average decibel values while recording. But what about background noise?
Any information would be helpful for analysing a recorded audio file with Swift.
SNR estimation is a pretty well developed domain. You need to implement a voice activity detector to separate noise from speech, then separately compute the noise energy and the signal energy, and calculate the ratio. It goes beyond simple math though; you need to understand statistics to implement a reasonable algorithm like WADA SNR, implemented here.
You will not be able to find an implementation of it in Swift. Such software is usually implemented in C or MATLAB, so you will have to port an implementation.
Noise estimation is a minor problem compared to speech recognition, which involves more advanced algorithms. You had better consider an existing package for speech recognition in Swift, like TLSphinx or OpenEars.
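To make the "separate noise from speech, compute both energies, take the ratio" idea concrete, here is a rough sketch in plain Swift. It is not WADA SNR and not a real voice activity detector: it simply assumes that the quietest frames of the recording contain only noise. The function names (framePowers, estimateSNR) and the parameters (frameLength of 400 samples, roughly 25 ms at an assumed 16 kHz sample rate, and noiseFraction of 0.2) are made up for illustration, and the input is assumed to be mono Float samples in the range -1...1.

```swift
import Foundation

/// Mean power (energy per sample) of each non-overlapping frame.
/// A trailing partial frame is ignored.
func framePowers(_ samples: [Float], frameLength: Int) -> [Float] {
    guard frameLength > 0 else { return [] }
    var powers: [Float] = []
    var start = 0
    while start + frameLength <= samples.count {
        let frame = samples[start..<(start + frameLength)]
        let sumSquares = frame.reduce(Float(0)) { $0 + $1 * $1 }
        powers.append(sumSquares / Float(frameLength))
        start += frameLength
    }
    return powers
}

/// Crude SNR estimate in dB.
/// Assumption (not from the original answer): the quietest `noiseFraction`
/// of frames are noise only, and all other frames are speech plus noise.
/// Returns nil when there are too few frames or the noise floor is zero.
func estimateSNR(_ samples: [Float],
                 frameLength: Int = 400,
                 noiseFraction: Float = 0.2) -> Float? {
    let powers = framePowers(samples, frameLength: frameLength).sorted()
    guard powers.count >= 2 else { return nil }

    // Keep at least one frame on each side of the split.
    let wanted = Int(Float(powers.count) * noiseFraction)
    let noiseCount = max(1, min(powers.count - 1, wanted))

    let noisePower = powers[..<noiseCount].reduce(0, +) / Float(noiseCount)
    guard noisePower > 0 else { return nil } // digital silence: ratio undefined

    let speechFrames = powers[noiseCount...]
    let speechPlusNoise = speechFrames.reduce(0, +) / Float(speechFrames.count)

    // Subtract the noise floor to approximate the speech-only power.
    let signalPower = max(speechPlusNoise - noisePower, Float.leastNormalMagnitude)
    return 10 * log10(signalPower / noisePower)
}
```

On iOS the samples could come from reading the recording with AVAudioFile into an AVAudioPCMBuffer and copying its floatChannelData into an array. Whatever threshold you then apply to the result (for example "reject below some number of dB") is something you would have to tune against the speech to text framework you pick.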