I'm looking for an api that I can get when I receive the audio, try to recognize the text. Does anyone know of any opensource api for this? The opposite (receiving a text and generating an audio has several). The intention is to install on the local server (linux) and use together with PHP. If so, what would it be?
The first objective is to use an anti-captcha tool, in general, today there are many websites that, besides having the image, have the audio option, once I have the audio, I can send it to an api and submit the captcha. thus easier to consume services, such as consultation and validation of CPF, CNPJ and etc ...
Today I can use CURL to normally capture HTML, make anti-captcha for images come up against many issues where the algorithm is not always efficient to break the captcha and still need to develop algorithms for different images.
In the searches I did I found a lot to generate audio from text, but the opposite generate text from audio I only found in closed applications where I would need to use the application along with shortcut key to solve the question. As it will be running on a web server I did not find a good solution.
I found GoogleSpeech, it falls on link , however, I did not implement, because when I started reading I saw that it would work by the shortcut key and the microphone. This would undoubtedly be one of the worst possible implementations possible if it did not have an easier way to use it. "When you want to turn on the Google2Ubuntu voice recognition system, press the keyboard shortcut that you have set.When you press the keyboard shortcut ..."