# Transcriber

## Component/part description

This is the module responsible for generating the transcription, generally using a third party service or API such as IBM Watson one.

{% embed url="<https://trello.com/c/7jSJ7Bc1>" %}

It is composed of 2 main components

* [**Audio converter**](https://autoedit.gitbook.io/documentation/master-1.0.13-1/transcriptions/transcriptions-intro/transcriber/audio-to-video) Convert audio or video to audio specs for stt API

{% embed url="<https://trello.com/c/1fLY5mLQ>" %}

* [**STT sdk**](https://autoedit.gitbook.io/documentation/master-1.0.13-1/transcriptions/transcriptions-intro/transcriber/stt-api) audio to STT API/Service, to receive time-coded transcription.

With Extra:

* **Speaker diarization** can either happen at the STT API level or as a separate module to be interpolated with the transcription.

{% embed url="<https://trello.com/c/jFRiJJRV>" %}

And optional:

* **Srt parsing**. Allow srt as input. In case transcription comes from elsewhere. Can use module [srtParserComposer to refactor](https://github.com/pietrop/srtParserComposer)

{% embed url="<https://trello.com/c/lZQYawmy>" %}

* **Plain text as input**, if you already have the transcription, use something like Gentle to re-align and generate transcription json.

## Related projects

It was Initially prototyped as a standalone app to test quality of speech to text. see [Transcriber](https://github.com/pietrop/Transcriber).

## Implementations Options considered

NA

## Current implementation

[See component](https://github.com/OpenNewsLabs/autoEdit_2/tree/master/lib/interactive_transcription_generator/transcriber)

## What needs refactoring

Perhaps look into compositor pattern to bring together the components of this module.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://autoedit.gitbook.io/documentation/master-1.0.13-1/transcriptions/transcriptions-intro/transcriber.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
