Modify Vosk API Speech to Text Program

Closed Posted 2 years ago Paid on delivery

Hi,

Background:

Vosk API is an offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node.

My program:

I have a speech-to-text GUI program built on the Vosk API that transcribes spoken words to text at the mouse cursor's location. It has several features I would like to modify and several more I would like to implement. Currently it uses an add-on called "Fastpunct" to punctuate sentences automatically; however, this causes a huge delay before the text is output, so I am looking for a different solution. The program also has a "commands" feature: whenever you say the spoken form of a punctuation mark, such as "Question key", it is translated to "?". Many of the commands work only intermittently or not at all. I would like to speak with someone about ways to fix this, whether by modifying the existing model or by creating a separate one just for the command feature. Furthermore, I have much more future work and many ideas that need to be implemented, which we can discuss.
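
To make the "commands" feature concrete, here is a minimal sketch of the behaviour I have in mind, written as plain text post-processing on the recognized transcript. It is not the program's actual code: the function name `apply_commands` and every table entry other than "Question key" are assumptions for illustration only, and it assumes the recognizer has already returned the spoken phrase as ordinary words.

```python
import re

# Hypothetical command table: spoken phrase -> punctuation mark.
# Only "question key" appears in the description above; the other
# entries are illustrative assumptions.
COMMANDS = {
    "question key": "?",
    "period key": ".",
    "comma key": ",",
    "exclamation key": "!",
}

# Try longer phrases first so a long command is never shadowed by a
# shorter one. The leading \s* swallows the space before the command
# so the mark attaches to the preceding word.
_PATTERN = re.compile(
    r"\s*\b("
    + "|".join(re.escape(p) for p in sorted(COMMANDS, key=len, reverse=True))
    + r")\b",
    flags=re.IGNORECASE,
)


def apply_commands(text: str) -> str:
    """Replace spoken command phrases in a transcript with punctuation."""
    return _PATTERN.sub(lambda m: COMMANDS[m.group(1).lower()], text)


if __name__ == "__main__":
    print(apply_commands("hello comma key how are you question key"))
```

A lookup like this only helps when the recognizer outputs the command words reliably, which is exactly the part that currently fails, so it would complement rather than replace any work on the model itself.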

Python Java C++ Programming C Programming Machine Learning (ML)

Project ID: #30194656

About the project

7 proposals Remote project Active 2 years ago

7 freelancers are bidding on average $191 for this job

Devrits

Hi Hiring manager I am Natural Language Processing, Speech Recognition and TTS Expert I'm a Masters Student in Natural Language Processing with extensive experience in Deep Learning, NLP, Speech Recognition and Text- More

$250 USD in 2 days
(10 Reviews)
5.4
kevinlee1238

Hi, Sir I am very interested in your project. I have the experience for your project. I think that we can discuss the project in chat. Best regards

$200 USD in 15 days
(16 Reviews)
5.1
Tonyspychenko

Hi, thanks for your job posting. Punctuation is more of an NLP task. The Vosk API has a look-ahead function with which I can implement what you need. Please contact me. Thanks, Anton.

$500 USD in 7 days
(9 Reviews)
4.6
Mikhailpopov0724

Hello I can modify vosk API speech to text program perfectly per your requirements. I am a professional web developer with rich 8+years of experience in which I have built many websites so far. I have extensive experie More

$50 USD in 7 days
(1 Review)
3.2
tracygearth

I have C/C++ Java etc 8 years experience and have used many API in different commercial applications. Look forward to Anuj

$50 USD in 1 day
(3 Reviews)
2.6
Manpreetsweden

❤️❤️❤️❤️❤️ DL Developer ❤️❤️❤️❤️❤️ Hello. Nice to meet you. I have read your job carefully and I am interested in this. I have plenty of experiences with Python libraries(keras, tensorflow, ...). I wish we will discuss More

$150 USD in 5 days
(2 Reviews)
2.1
OleksandLitkina

Hello, I'm very interested in your job as a speech processing engineer, who has many R&D experiences in LVSR(large vocabulary speech recognition) with Kaldi, deep speech2, deep speech, google API, IBM Watson, pocketsph More

$140 USD in 7 days
(2 Reviews)
1.4