• TextAV Audio and Video Components (ongoing)
  • Guidelines for this board
  • Introduction
  • How to add a new library/Modules/components
  • Trello --> Github --> gitbook (programmatically)
  • Plan Roadmap taxonomy
  • github README template
  • Media processing
  • ffmpeg and electron - example boilerplate
  • convert video to audio [Open Source]
  • Generate list of ffmpeg supported file formats [Open Source]
  • Detect silence [Open Source]
  • Youtube Video downloader module (?) [Open Source]
  • Module: Open source STT //Gentle refactor [Open Source]
  • cc extraction // OCR of captions [Open Source]
  • Module: Video format converter [Open Source]
  • Module: Video metadata reader [Open Source]
  • Banpass filter module
  • Tesseract - OCR
  • Transcriptions - utils
  • Transcriber module
  • Sample material for testing STT services [Open Source / CC]
  • Create word accurate time codes from line accurate time-coded transcript (eg srt)
  • Language codes ISO-639-1 Code
  • Module: Timecode conversion [Open Source]
  • UI Utilities for timecode representation
  • Sanitise string for file path
  • Transcription STT Sdk
  • Web Speech API
  • Pocket Sphinx STT [Open Source]
  • IBM Watson STT [Proprietary]
  • Google Cloud Speech API [Proprietary]
  • Microsoft Bing STT [Proprietary]
  • Baidu STT SDK [Proprietary]
  • Speechmatics STT SDK [Proprietary]
  • Spoken Data STT SDKs [Proprietary]
  • Gentle (Server) STT node SDK [Open Source]
  • Temi.com/rev.com [Proprietary]
  • Latvian Kaldi [open source]
  • Mod9
  • Movi - arduino component, offline
  • deepgram
  • Mozilla deep speech
  • AWS Transcriber
  • Transcription UI
  • Transcription text editor with Draft.js Editor [Open Source]
  • Overtyper
  • Alignement
  • Alignement
  • Module: to align partially scripted speeches
  • Captions
  • Module: captions composer (with text pre-segmentation) [Open Source]
  • Module: Captions burner [Open Source]
  • Srt parser composer // Pietro [Open Source]
  • TTML Parser // Gary, Brightcove [Open Source]
  • Annotations
  • Annotation model atjson
  • Paper-editing & remixing UI
  • Front end component: video preview of JSON Edl
  • Cognitive insights
  • LIUM Speaker Diarization BBC - [Open Source]
  • Module: open source summarization module [Open Source]
  • Module: punctuation and capitalisation. [Open Source]
  • Translation SDK
  • Deep L - Translation SDK node
  • Export & remix & video editing
  • Parse EDL (plain text) to JSON [Open Source]
  • Module: Post to facebook [Open Source]
  • EDL composer from JSON EDL [Open Source]
  • Module: Post to Twitter Video [Open Source]
  • edit video EDL (JSON) - ffmpeg-remix (super fast video editing of mp4 videos) // Laurian [Open Source]
  • EDL Json to XML FCP7 (compatible with premiere) [Open Source]
  • Popcorn Js // Mozilla/Internet Archive [Open Source]
  • unsorted
  • NWJS boilerplate
  • QCTool
  • VRecord
  • Electron travis CI automated build: OSX, Linux, Windows
Powered by GitBook

Tesseract - OCR

Tesseract - OCR

tutorial by @corbin74 https://pietropassarelli.gitbooks.io/textav/content/unconference-projects/ai-pipeline/i-learned-what-tesseract-can-do-and-so-can-you.html

https://github.com/tesseract-ocr/tesseract

also node module

https://www.npmjs.com/package/node-tesseract


Link to trello card: Tesseract - OCR

Labels

Open Source, Node module,

results matching ""

    No results matching ""