medkit.core.audio.document#

Classes:

AudioDocument(audio[, anns, metadata, uid])

Document holding audio annotations.

class AudioDocument(audio, anns=None, metadata=None, uid=None)[source]#

Document holding audio annotations.

Variables
  • uid (str) – Unique identifier of the document.

  • audio – Audio buffer containing the entire signal of the document.

  • anns (AudioAnnotationContainer) – Annotations of the document. Stored in an AudioAnnotationContainer but can be passed as a list at init.

  • metadata (Dict[str, Any]) – Document metadata.

  • raw_segment (Segment) – Auto-generated segment containing the full unprocessed document audio.

Attributes:

RAW_LABEL

Label to be used for raw segment

RAW_LABEL: ClassVar[str] = 'RAW_AUDIO'#

Label to be used for raw segment