# Transcription & Media Processing

This section covers [the JSON spec for a transcription in autoEdit](https://autoedit.gitbook.io/documentation/master-1.0.13-1/transcriptions/transcriptions-intro/transcriptions), as well as a few backend modules:

* The transcriber module
  * Video to audio
  * STT SDKs
    * IBM
    * Speechmatics
    * Rev
    * pocketsphinx
    * Gentle STT
* metadata Reader
* Video preview converter

And client-side components:

* Hypertranscript

In the current implementation, the `transcriber module`, `metadata Reader` and `Video preview converter` have been combined into an `interactive_transcription_generator` module that, given an audio or video file, generates those outputs.
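
As a rough illustration of what combining the three steps into one module could look like, here is a minimal sketch. It is not taken from the autoEdit codebase: the function name `generateInteractiveTranscription`, the `steps` object, its three step functions and the shape of the returned object are all hypothetical. It also assumes the three steps can run independently of each other. The steps are passed in as plain functions so the example runs without ffmpeg or an STT service.

```javascript
// Hypothetical sketch: none of these names come from the autoEdit codebase.
// The three steps are injected as functions so the example is self-contained.
async function generateInteractiveTranscription(mediaPath, steps) {
  const { transcribe, readMetadata, convertPreview } = steps;

  // Assumption: the steps do not depend on each other's results,
  // so they are started together and awaited as a group.
  const [transcription, metadata, previewPath] = await Promise.all([
    transcribe(mediaPath),
    readMetadata(mediaPath),
    convertPreview(mediaPath),
  ]);

  return { transcription, metadata, previewPath };
}

// Stub steps standing in for the transcriber, the metadata reader and the
// video preview converter. The returned values are made up for illustration.
const stubSteps = {
  transcribe: async (file) => ({ source: file, text: 'hello world' }),
  readMetadata: async (file) => ({ source: file, durationSeconds: 12 }),
  convertPreview: async (file) => `${file}.preview.webm`,
};

generateInteractiveTranscription('interview.mov', stubSteps)
  .then((outputs) => console.log(outputs));
```

Passing the steps in as arguments keeps the combined module easy to exercise in isolation: any STT provider from the list above could be swapped in behind the `transcribe` function without changing the caller.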

## Early sketch

![img/sketches/NewMedia.png](https://lh4.googleusercontent.com/gY-_kHqQm9HVGGDcbloSHTZ3rQt-762MSdc8v3_IPPJytEmXWQMeiWMvrQCAo15Wn-rCyNTUgbS7o6aDaHDybe4vtcghsEUo0denKe0OBr6diNfjrlN5v9eIw0zceQWEE9rvy6dc)
