Add Indexes and Transcripts on the Resource Detail Page

To add Indexes or Transcripts to a Resource, go to the Resource Detail page, click the appropriate tab on the right sidebar, and then click the three stacked dots. You'll see options there, including "Upload another transcript" and "Request new transcript." This page discusses what happens when you select "Upload another transcript." If you select "Request new transcript," you are given options to send video and audio content to an automated transcription service.

...

Click on "Upload Transcript/Index". The transcript or index is now loaded in the tab.


...

Add Indexes and Transcripts as Bulk Imports

...

How to Format Indexes for Import

...

  • Simple WebVTT Indexes

    • WebVTT indexes can be very simple. At a minimum, Aviary requires that a WebVTT index contain a header and at least one cue/segment with:

      • segment title (this can be a number or a text title for the segment)

      • timecode (this must be in the format HH:MM:SS.###, for example 00:00:00.100 --> 00:00:07.342)

      • segment synopsis (simple text)

  • Hierarchical WebVTT Indexes

    • WebVTT indexes can describe hierarchy or structure among the segments. In the WebVTT documentation, this feature is described as nested cues. To express a parent-child hierarchical arrangement of Aviary index segments, use nested cues where the timecode of the parent fully contains the timecode of each of the children. Thus, if child segment #1 is 00:00:00.000 --> 00:00:44.000, and child segment #2 is 00:00:44.000 --> 00:01:19.000, then the parent of #1 and #2 should be listed before them with a timecode that fully contains both of the children's timecodes, such as 00:00:00.000 --> 00:01:24.000.

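The index rules above can be sketched in Python. This is a minimal illustration, not Aviary's own tooling: the `Cue` class, the `render_index` helper, and the segment titles and synopses are hypothetical, and only the timecodes come from the example in the text. It writes a parent cue before its two children and checks that the parent's time range contains each child's.

```python
from __future__ import annotations

from dataclasses import dataclass


def to_seconds(timecode: str) -> float:
    """Convert an HH:MM:SS.### timecode to seconds."""
    hours, minutes, seconds = timecode.split(":")
    return int(hours) * 3600 + int(minutes) * 60 + float(seconds)


@dataclass
class Cue:
    title: str     # segment title (a number or text)
    start: str     # HH:MM:SS.###
    end: str       # HH:MM:SS.###
    synopsis: str  # segment synopsis (simple text)

    def contains(self, other: Cue) -> bool:
        """True if this cue's time range fully contains the other cue's."""
        return (to_seconds(self.start) <= to_seconds(other.start)
                and to_seconds(other.end) <= to_seconds(self.end))


def render_index(cues: list[Cue]) -> str:
    """Render a header followed by blank-line-separated cues."""
    blocks = ["WEBVTT"]
    for cue in cues:
        blocks.append(f"{cue.title}\n{cue.start} --> {cue.end}\n{cue.synopsis}")
    return "\n\n".join(blocks) + "\n"


# Hypothetical titles and synopses; timecodes follow the example in the text.
parent = Cue("Early life", "00:00:00.000", "00:01:24.000", "Parent segment.")
children = [
    Cue("Childhood", "00:00:00.000", "00:00:44.000", "Child segment #1."),
    Cue("School", "00:00:44.000", "00:01:19.000", "Child segment #2."),
]

# The parent must fully contain every child and be listed before them.
assert all(parent.contains(child) for child in children)
index_text = render_index([parent, *children])
print(index_text)
```

Running a containment check like this before upload is one way to catch a parent cue whose end time falls short of its last child.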

...

How to Format Transcripts for Import

OHMS XML Formatting for Transcripts

Transcripts can be imported into Aviary formatted as OHMS XML. When an OHMS XML file is imported as a transcript, the resource-level descriptive metadata elements are ignored and only the transcript data is imported. For more information about OHMS and OHMS XML, see: https://www.oralhistoryonline.org/documentation/ See the OHMS XML xsd schema here: https://www.avpreserve.com/nunncenter/ohms/ohms.xsd

WebVTT Formatting for Transcripts

Transcripts can be imported into Aviary formatted as WebVTT, the Web Video Text Tracks format (https://www.w3.org/TR/webvtt1/). A WebVTT file is a container file for chunks of data that are time-aligned with an audiovisual resource. The WebVTT file starts with a header and then contains a series of data blocks. Each data block that has a start and/or end time is called a WebVTT cue (or segment). Aviary requires that a WebVTT file contain a header and at least one cue/segment with:

      • segment title (this can be a number or a text title for the segment)

      • timecode (this must be in the format HH:MM:SS.###, for example 00:00:00.100 --> 00:00:07.342)

      • segment synopsis (simple text)
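A quick pre-upload check of these requirements might look like the following Python sketch. It is not Aviary's validator: the `check_webvtt` function is hypothetical, and it assumes each cue is written as three or more lines (title, timecode, text) separated from other blocks by blank lines. It would also flag valid WebVTT blocks such as NOTE or STYLE, so treat the result as a hint rather than a verdict.

```python
import re

# One timecode line: HH:MM:SS.### --> HH:MM:SS.###
TIMING = re.compile(r"^\d{2}:\d{2}:\d{2}\.\d{3} --> \d{2}:\d{2}:\d{2}\.\d{3}$")


def check_webvtt(text: str) -> list:
    """Return a list of problems found; an empty list means the checks passed."""
    blocks = [b for b in re.split(r"\n\s*\n", text.strip()) if b.strip()]
    if not blocks or not blocks[0].startswith("WEBVTT"):
        return ["file does not start with a WEBVTT header"]
    problems = []
    cues = blocks[1:]
    if not cues:
        problems.append("no cues found after the header")
    for number, block in enumerate(cues, start=1):
        lines = block.splitlines()
        if len(lines) < 3:
            problems.append(f"cue {number}: expected title, timecode, and text")
        elif not TIMING.match(lines[1].strip()):
            problems.append(f"cue {number}: bad timecode line {lines[1]!r}")
    return problems


sample = "WEBVTT\n\n1\n00:00:00.100 --> 00:00:07.342\nHello and welcome.\n"
print(check_webvtt(sample))
```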

Plain Text Files Formatting of Transcripts

...

  • Time stamps must be formatted as [HH:MM:SS].
  • Time stamps can be placed anywhere in the text within the document.
  • If no valid time stamps are provided, the full text will be uploaded with the time stamp [00:00:00].
  • Aviary will identify Speaker names if they are in all caps followed by a colon, e.g., SUSAN:, or JEREMIAH:.
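To preview how a plain text file might be segmented under these rules, here is a rough Python sketch. It only approximates the rules listed above and is not Aviary's parser: the `split_transcript` function and its regular expressions are hypothetical, and assigning any text that precedes the first time stamp to [00:00:00] is an assumption.

```python
import re

TIMESTAMP = re.compile(r"\[(\d{2}):(\d{2}):(\d{2})\]")
# An all-caps name followed by a colon, e.g. "SUSAN:" (assumed pattern).
SPEAKER = re.compile(r"^\s*([A-Z][A-Z .'-]*):\s*")


def split_transcript(text: str) -> list:
    """Split text into segments, one per [HH:MM:SS] time stamp."""
    matches = list(TIMESTAMP.finditer(text))
    if not matches:
        # No valid time stamps: the full text becomes one segment at 00:00:00.
        pieces = [("00:00:00", text)]
    else:
        pieces = []
        leading = text[:matches[0].start()]
        if leading.strip():
            # Assumption: text before the first stamp starts at 00:00:00.
            pieces.append(("00:00:00", leading))
        for current, following in zip(matches, matches[1:] + [None]):
            end = following.start() if following else len(text)
            pieces.append((":".join(current.groups()), text[current.end():end]))
    segments = []
    for stamp, body in pieces:
        speaker = SPEAKER.match(body)
        segments.append({
            "time": stamp,
            "speaker": speaker.group(1) if speaker else None,
            "text": (body[speaker.end():] if speaker else body).strip(),
        })
    return segments


example = "[00:00:05] SUSAN: Hello there. [00:00:12] JEREMIAH: Hi, Susan."
for segment in split_transcript(example):
    print(segment)
```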

A Sample Plain Text Document is available here.

* Note: Aviary can also accept an Aviary TEXT transcript export as a valid transcript import format. 

...

  • .docx files that match our current txt transcript formatting rules can be imported.
  • .doc files that match our current txt transcript formatting rules can be imported. 
  • Additionally, when time codes do not have brackets, e.g., 00:00:00, Aviary will still import them correctly.

Time codes can be formatted in any of the following ways:

  • [HH:MM:SS]
  • [MM:SS]
  • HH:MM:SS
  • MM:SS
  • HH:MM:SS:MS
  • HH:MM:SS.MS
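As an illustration of how these six formats relate, the following Python sketch converts each of them to seconds. The `timecode_to_seconds` function is hypothetical and does not reflect how Aviary parses time codes internally; in particular, it assumes the MS field is a count of milliseconds (so 500 means half a second).

```python
def timecode_to_seconds(raw: str) -> float:
    """Convert any of the six listed time code formats to seconds."""
    body = raw.strip().strip("[]")
    # HH:MM:SS.MS keeps milliseconds after a dot ...
    main, _, millis = body.partition(".")
    parts = main.split(":")
    # ... while HH:MM:SS:MS keeps them as a fourth colon-separated field.
    if not millis and len(parts) == 4:
        millis = parts.pop()
    if len(parts) == 2:
        parts.insert(0, "0")  # MM:SS has no hours field
    fields = parts + ([millis] if millis else [])
    if len(parts) != 3 or not all(field.isdigit() for field in fields):
        raise ValueError(f"unrecognized time code: {raw!r}")
    hours, minutes, seconds = (int(part) for part in parts)
    # Assumption: MS is a millisecond count, e.g. 500 -> 0.5 seconds.
    fraction = int(millis) / 1000 if millis else 0.0
    return float(hours * 3600 + minutes * 60 + seconds + fraction)


for code in ["[00:01:30]", "[01:30]", "00:01:30", "01:30",
             "00:01:30:500", "00:01:30.500"]:
    print(code, timecode_to_seconds(code))
```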

Time stamps can be placed anywhere in the text within the document.

If no valid time stamps are provided, the full text will be uploaded with the time stamp [00:00:00].

Aviary will also identify speaker names that are not in all caps, e.g., Nouman: or Nouman T.: or N. Tayyab:

Aviary will ignore any headers or footers in the file.
