autoEdit 2 User Manual
1.0.13
1.0.13
  • Introduction
  • Support the project
  • Installing
    • Installing on Mac OSX
    • Installing on Linux
    • Installing on Windows
  • Transcriptions
    • Editing Text
    • Shortcuts
  • Paperediting
  • Opening EDL in video editing software
  • Setup: STT APIs
    • Setup: STT APIs - IBM
    • Setup: STT APIs - Speechmatics
    • Setup: STT APIs - AssemblyAI
    • Setup: STT APIs - Rev
    • Setup: STT APIs - Gentle
  • uninstalling
  • Developer Options for Export
  • autoEdit - Adobe Panel
    • Install
    • Import media
    • Transcription to source monitor
    • Transcription export selections
    • Paper-edit to sequence
Powered by GitBook
On this page
  • Overview
  • IBM Watson STT Service
  • Speechmatics STT Service
  • AssemblyAI STT Service
  • Gentle Open Source STT
  • Pocketsphinx Open Source STT
  • Rev Transcription service

Setup: STT APIs

PreviousOpening EDL in video editing softwareNextSetup: STT APIs - IBM

Last updated 6 years ago

There are three options for speech to text APIs that you can use with this system.

Check them out individually for extra setup instruction.

  1. (Open Source, needs a separate app for setup)

  2. Pocketsphinx (Open Source, integrate inside of autoEdit, no extra setup needed)

Overview

IBM Watson STT Service

Pros:

  • 16 hours a month included in service. 0.02 cent a minute after that .

  • Speed. In autoEdit, always takes 5 minutes to transcribe any duration of audio or video.

  • Generally pretty accurate (my opinion, judge for yourself)

  • Supports a number of languages . Including distinction between British and American English.

Cons:

  • Need to provide card details for pay as you go fee.

  • is in the cloud so no offline support.

Speechmatics STT Service

  • 1 hour free credit with new account

  • Easy to setup credentials

  • Generally pretty accurate (my opinion, judge for yourself)

AssemblyAI STT Service

  • Free tier: 5 hours free per month

  • Competitive pricing at $0.0003 per second

  • Easy to setup credentials

  • Generally very accurate (my opinion, judge for yourself)

  • For now only support for English but more coming soon

Gentle Open Source STT

Pros:

  • Free as in free speech as well as in free beer.

  • Working locally on your machine. No internet connection needed because of that, good for sensitive material.

Cons:

  • Not as accurate as IBM one (in my opinion, but decide for yourself).

  • Only support US english STT.

  • In autoEdit, at the moment not as fast as IBM one, takes a little longer then the length of the media. (eg 27 min takes 30 min to transcribe).

Pocketsphinx Open Source STT

Pros:

  • Free as in free speech as well as in free beer.

  • working locally on your machine. no internet connection needed because of that, good for sensitive material.

  • .

Cons:

  • Not as accurate as IBM or Gentle.

  • Only support US english STT.

  • in autoEdit, at the moment not as fast as IBM one, takes a little longer then the length of the media. (eg 27 min takes 30 min to transcribe).

Pocketsphinx does not require extra setup to use.

Rev Transcription service

Pros:

  • In theory more accurate then automated transcriptions

  • word level timestamps

Cons:

  • Price more expensive then automated transcriptions

  • Turnaround slower then automated transcriptions.

.

Including support for "accent agnostic global english".

Open source and .

.

originally extracted from project.

Sign up to the , follow on and/or to keep up to date with the latest releases. Say hi at , always curious to hear what autoEdit is helping you with.

it's free and open source. Free as in free speech as well as in free beer. . Support will go towards fixing bugs, adding features, provide support for users etc...

IBM Watson STT
Speechmatics
AssemblyAI
Gentle STT
Rev - Transcriptions service
see pricing
see here
Check out for extra setup instructions for IBM
28 languages, see full list
github repo
I made a node module to work with the API
Check out for extra setup instructions for Gentle
Open source module in autoEdit
Videogrep
Transcription service from real humans
mailing list
twitter
facebook
pietro@autoEdit.io
autoEdit.io
Help support the autoEdit project to keep it that way