autoEdit 2 Documentation
1.0.13

Transcription json


Last updated 6 years ago

Component/part description

In this section we are looking at the Transcription JSON schema used in autoEdit.

Related projects.

Connected to this is defining a schema for transcription, so that all components that work with it have a defined interface/specification, e.g. the BBC Transcript model.

Gentle - json

See appendix for JSON example.

IBM - Json

See appendix for JSON example, as well as their STT API reference and documentation.

Pocketsphinx - plain text

See appendix for an example of Pocketsphinx plain text.

Implementation options considered

Other

An array of word objects. To represent lines, this could also be a nested array of word objects.

The word object has, at a minimum, a start time, an end time, and a text attribute.
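As a rough illustration of this option, a minimal sketch follows. The attribute names `startTime`, `endTime` and `text`, the sample values, and the helper functions are assumptions made for this example, not names taken from autoEdit.

```javascript
// Flat option: one array of word objects.
// Attribute names and values are hypothetical; times are in seconds.
const words = [
  { startTime: 0.0, endTime: 0.4, text: "Hello" },
  { startTime: 0.4, endTime: 0.9, text: "world" },
  { startTime: 1.2, endTime: 1.5, text: "Second" },
  { startTime: 1.5, endTime: 1.9, text: "line" }
];

// Nested option: each inner array of word objects represents one line.
const lines = [
  [words[0], words[1]],
  [words[2], words[3]]
];

// Join the text of every word in a line into a single string.
function lineToText(line) {
  return line.map((word) => word.text).join(" ");
}

// Turn the nested representation back into a flat array of words.
function flattenLines(nestedLines) {
  return nestedLines.reduce((all, line) => all.concat(line), []);
}
```

With the nested form, `lineToText(lines[0])` gives the text of the first line, and `flattenLines(lines)` recovers the flat array when line boundaries are not needed.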

BBC Transcription Model

The bbc/transcript-model was also considered. However, the blocker for now is that changing the data structure for transcription at this stage would involve a pretty substantial refactor.

Current implementation

Transcription domain

At a high level, the autoEdit JSON transcription schema models the objects present in a transcription.

In this representation:

  • Transcription

    • Paragraphs ← speaker

      • Lines

        • Words

Speakers are associated with paragraphs. Paragraphs are treated as sections of lines.

A list of speakers can also be kept separately, similar to how the IBM Watson STT API returns the results of speaker diarization.
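The hierarchy above could be sketched roughly as follows. This is an illustration only: the attribute names (`speakers`, `paragraphs`, `lines`, `words`, `startTime`, `endTime`), the sample values, and the `wordsWithSpeaker` helper are assumptions for this example and may differ from the actual autoEdit schema in the appendix.

```javascript
// Hypothetical shape: transcription > paragraphs (with speaker) > lines > words.
const transcription = {
  // Optional separate list of speakers.
  speakers: ["Speaker 1", "Speaker 2"],
  paragraphs: [
    {
      id: 0,
      speaker: "Speaker 1",
      lines: [
        {
          id: 0,
          words: [
            { id: 0, text: "Hello", startTime: 0.0, endTime: 0.4 },
            { id: 1, text: "there", startTime: 0.4, endTime: 0.8 }
          ]
        }
      ]
    },
    {
      id: 1,
      speaker: "Speaker 2",
      lines: [
        {
          id: 1,
          words: [{ id: 2, text: "Hi", startTime: 1.0, endTime: 1.3 }]
        }
      ]
    }
  ]
};

// Walk the hierarchy and return a flat list of words,
// each tagged with the speaker of the paragraph it belongs to.
function wordsWithSpeaker(t) {
  const result = [];
  for (const paragraph of t.paragraphs) {
    for (const line of paragraph.lines) {
      for (const word of line.words) {
        result.push({ speaker: paragraph.speaker, text: word.text });
      }
    }
  }
  return result;
}
```

Because the speaker lives on the paragraph, every word inherits it by position in the hierarchy rather than carrying its own speaker attribute.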

What needs refactoring

The names of the paragraph and line attributes are ambiguous. They should be lines and words instead(?).

replace array with hash

This is a bigger refactoring, but instead of an array data structure, a hash/dictionary could be used.

This way the id is the key, and the key-value methods available in JS can be used. Lookup speed would improve(?), and an array of values could easily be obtained using a JS method.
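A minimal sketch of this idea, assuming each item carries a unique `id` attribute; the `indexById` helper and the sample data are hypothetical and not part of autoEdit:

```javascript
// Current style: an array of objects, each with an id.
const wordsArray = [
  { id: 0, text: "Hello" },
  { id: 1, text: "world" }
];

// Build a plain object (hash/dictionary) keyed by id.
function indexById(items) {
  const byId = {};
  for (const item of items) {
    byId[item.id] = item;
  }
  return byId;
}

const wordsById = indexById(wordsArray);

// Array lookup has to scan the elements until the id matches.
const viaArray = wordsArray.find((word) => word.id === 1);

// Hash lookup goes straight to the key.
const viaHash = wordsById[1];

// Object.values() returns the values as an array again.
const backToArray = Object.values(wordsById);
```

One thing to keep in mind with this design: for integer-like keys, JavaScript objects iterate in ascending numeric order, so `Object.values()` only preserves transcript order if ids increase with position.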

Also considered making use of the bbc/transcript-editor React component, built on top of Draft.js, to improve the word correction feature.

BBC Transcript model
See appendix for json
See appendix for json
stt api reference
See appendix for example
bbc/transcript-model
bbc/transcript-editor
See Appendix for autoEdit json schema example
Transcription modelling diagram