Definitely! As a new user you get 20 minutes completely free, after registering your email. Then you can try the whole system based on your own files.
Only when you need more than those 20 minutes, you have to buy additional credits. Tip: start with shorter recordings, then you can try out many different ones.
We are a relatively new service with the best price/quality ratio. Most transcription services have developed their own speech engine and models at high cost.
VoiceToScript is based upon the powerful speech models of Google, Amazon (Alexa) and MicroSoft (Azure). These all have their specific strengths and we always select the best fit for your recording.
In other words, you get the highest quality for the lowest price. Now and in the future!
We support over
40 spoken languages
, with an accuracy up to 95%!
If you don't know in which language or dialect is spoken on your recording, you can select the 'Auto detect language' option.
British english (en-gb)
Us english (en-us)
Australian english (en-au)
Indian english (en-in)
Irish english (en-ie)
Scottish english (en-ab)
Welsh english (en-wl)
Dutch - belgium (nl-be)
French belgium (fr-be)
German - austria (de-at)
German - swiss (de-ch)
Canadian french (fr-ca)
Portuguese - brazil (pt-br)
Us spanish (es-us)
Indian hindi (hi-in)
Modern standard arabic (ar-sa)
Gulf arabic (ar-ae)
Chinese mandarin - mainland (zh-cn)
You can upload any Audio or Video file. The format doesn't matter, as long as it contains sound.
So whether it is an mp3, .mp4, .avi, .aac, .m4a, .wma, .wav, .flac, .avi or any other format,'
VoiceToScript analyzes the file and checks if there is an audio stream in it. So you don't have to worry about that, we'll do it for you.
You can upload files up to 2 GB (2000 MB), with max. 2 hours of audio. Whether you can actually upload these also depends on the upload speed of your own internet connection.
To be able to upload the 2 GB, you need to have an upload speed of at least 5 MB/sec, otherwise the upload will be aborted after about 15 minutes.
E.g. if you have an upload speed of 1 MB/sec, you will be able to upload a maximum file of 400MB.
Yes, because we do not keep your files on our server, you will have to upload them again.
We deliberately do not store them with us, because we put security and confidentiality first: after all, they are your files!
Transcribing is the conversion of spoken words to text. Typically based on the audio recordings a transcript is made afterwards.
This is often used by journalists to work out recorded interviews or scientists/students to record these for research purposes.
With the improved transcription services it is increasingly used to create 'spoken' reports or to transcribe meetings automatically.
We only support non-verbatim transcriptions.
This means that the recording is transcribed word-for-word.
This means that stuttering, intonation, interjections or repetitions are not included. With a verbatim transcription the latter is included.
We use the best transcription engines currently available, namely those from Google, MicroSoft, IBM and Amazon. They deliver a very high quality up to 95%, but they are not perfect.
It is a fully automatic process, where the quality of the supplied recording determines to a large extent the quality of the end result.
It is mainly about how clear the speech is and whether there are annoying background noises.
It is therefore always necessary to check the delivered texts against the original recording and make corrections where necessary!
The file is send to you by email and consists of a number of textblocks. For each block the time is also given, so you can quickly find the fragment in the original audio file.
Some of the words may be highlighted, which indicates these were more difficult to hear and understand by the system. .
This helps to identify the areas where you could pay extra attention.
For interviews there is a new block for every speaker change (max. 2 speakers).
The file is easily editable with standard editors like MicroSoft Word.
The subtitle file you receive by email is a .srt file (SubRip format). This contains both your spoken text and the exact time codes of when each line of text should be shown in your video.
The structure of this file is explained on this website .
Here you can also find out how to add it to your video.
Because the accuracy of the automatically generated subtitles depends on several factors, it is important to check the file and if necessary correct it before you start using it.
You can easily edit the file with any standard text editor, such as WordPad on MicroSoft Windows PCs.
If you are logged in, you will see '$Credits' at the top of the tab. If you click on this tab you will see the prices and the possibility to buy credits.
You can pay with PayPal, Credit Card and others. Immediately after your payment you will receive a VAT invoice by email.
We round up the time in whole minutes. For a recording of e.g. 3 minutes and 15 seconds, 4 minutes will be deducted from your credits.
Audio and Video
Upload your recording. And in minutes get the text by email.
Speech --> text
Turn speech automatically into text and edit it with Word.