In One Sentence

Video Breakdown was released as a Garage project back in September 2016 and was renamed Video Indexer while still in its beta phase.

Video Indexer is part of Microsoft Cognitive Services.

Detailed Description

Sharing videos is a feature of Microsoft Stream (formerly Office 365 Video, which we of course also describe: https://www.skilllocation.com/microsoft-stream/).

Video Indexer focuses primarily on indexing videos, for example transcription (generating subtitles in real time) and even real-time translation of that transcript.

Key Functions / Features

First, videos are uploaded by the "Contributor". Once the videos have been indexed, you can search for a term. The term is found wherever it occurs: in the video image, as a person's name, in the transcript text (audio indexing), in the description, on a PowerPoint slide, and more. Each hit carries a timestamp that lets you jump directly to that point in the video.
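The search behavior described above can be illustrated with a minimal sketch. It assumes a hand-written, hypothetical index (SAMPLE_INDEX) that maps an insight type to timed text entries; this is not the real Video Indexer output format, and the names Hit, search_index, and format_timestamp are invented for illustration only.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    source: str           # insight type that matched, e.g. "transcript" or "ocr"
    text: str             # the matching text
    start_seconds: float  # position in the video where the match occurs


# Hypothetical sample data; NOT the real Video Indexer output format.
SAMPLE_INDEX = {
    "transcript": [(12.0, "Welcome to the overview"), (95.5, "Now let us look at pricing")],
    "ocr": [(40.0, "Pricing Calculator")],
    "faces": [(5.0, "Jane Doe")],
}


def search_index(index, term):
    """Return all entries containing the term (case-insensitive), ordered by time."""
    needle = term.casefold()
    hits = [
        Hit(source, text, start)
        for source, entries in index.items()
        for start, text in entries
        if needle in text.casefold()
    ]
    return sorted(hits, key=lambda hit: hit.start_seconds)


def format_timestamp(seconds):
    """Format a position in seconds as HH:MM:SS (fractions are truncated)."""
    whole = int(seconds)
    return f"{whole // 3600:02d}:{whole % 3600 // 60:02d}:{whole % 60:02d}"


if __name__ == "__main__":
    for hit in search_index(SAMPLE_INDEX, "pricing"):
        print(format_timestamp(hit.start_seconds), hit.source, hit.text)
```

Because every hit keeps its start time, a player could seek straight to the matching position, which is the "jump directly to that point" behavior described above.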

 

Video Indexer is a cloud service that enables you to extract the following insights from your videos using artificial intelligence technologies:

  • Audio Transcription: Video Indexer has speech-to-text functionality, which enables customers to get a transcript of the spoken words. Supported languages include English, Spanish, French, German, Italian, Chinese (Simplified), Portuguese (Brazilian), Japanese and Russian (with many more to come in the future).
  • Face tracking and identification: Face technologies enable detection of faces in a video. The detected faces are matched against a celebrity database to evaluate which celebrities are present in the video. Customers can also label faces that do not match a celebrity. Video Indexer builds a face model based on those labels and can recognize those faces in videos submitted in the future.
  • Speaker indexing: Video Indexer has the ability to map and understand which speaker spoke which words and when.
  • Visual text recognition: With this technology, the Video Indexer service extracts text that is displayed in the videos.
  • Voice activity detection: This enables Video Indexer to separate background noise and voice activity.
  • Scene detection: Video Indexer has the ability to perform visual analysis on the video to determine when a scene changes in a video.
  • Keyframe extraction: Video Indexer automatically detects keyframes in a video.
  • Sentiment analysis: Video Indexer performs sentiment analysis on the text extracted using speech-to-text and optical character recognition, and provides that information in the form of positive, negative, or neutral sentiments, along with timecodes.
  • Translation: Video Indexer has the ability to translate the audio transcript from one language to another. The following languages are supported: English, Spanish, French, German, Italian, Chinese-Simplified, Portuguese-Brazilian, Japanese, and Russian. Once translated, the user can even get captioning in the video player in other languages.
  • Visual content moderation: This technology enables detection of adult and/or racy material present in the video and can be used for content filtering.
  • Keyword extraction: Video Indexer extracts keywords based on the transcript of the spoken words and the text recognized by the visual text recognizer.
  • Annotation: Video Indexer annotates the video based on a pre-defined model of 2000 objects.

Once Video Indexer is done processing and analyzing, you can review, curate, and publish the video insights.
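The transcription and translation features above mention captions in the video player. As a rough sketch of how timed transcript lines could be turned into a caption track, the following converts (start, end, text) tuples into the WebVTT caption format. The input tuples and the function names to_vtt_time and build_webvtt are assumptions made for illustration; the source does not say how Video Indexer itself produces or exports captions.

```python
def to_vtt_time(seconds):
    """Format seconds as a WebVTT timestamp (HH:MM:SS.mmm)."""
    millis = round(seconds * 1000)
    hours, rest = divmod(millis, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def build_webvtt(lines):
    """Build a WebVTT document from (start_seconds, end_seconds, text) tuples."""
    parts = ["WEBVTT", ""]  # file header followed by a blank line
    for start, end, text in lines:
        parts.append(f"{to_vtt_time(start)} --> {to_vtt_time(end)}")
        parts.append(text)
        parts.append("")  # a blank line terminates each cue
    return "\n".join(parts)


if __name__ == "__main__":
    # Hypothetical translated transcript lines, not real service output.
    translated = [
        (0.0, 2.5, "Willkommen zur Übersicht"),
        (2.5, 6.0, "Schauen wir uns die Preise an"),
    ]
    print(build_webvtt(translated))
```

One caption file per target language would let a player offer the same video with subtitles in several languages.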

 

Another Garage product is available at http://dictate.ms; it integrates transcription into Word and PowerPoint as a separate toolbar.

Notes and Links

https://www.videoindexer.ai

https://docs.microsoft.com/de-de/azure/cognitive-services/video-indexer/video-indexer-overview

Last modified: July 14, 2017 by Carola Pantenburg