Identify speakers whenever it’s unclear. Given how much of the audio plays over unrelated video clips, it was crucial to identify who was speaking when it wasn’t obvious. Making it clear when the audio shifted to the narrator was also critical context, as was describing the everyday people who were speaking.

Describe the music. There’s almost nothing less helpful than simply seeing the phrase “[music playing].” We decided the captions should describe the music whenever it changed tonally, from its beginning as stark, vaguely electronic notes to moments when a choir hums somberly. So much time and artistry goes into finding the right tracks, and it makes sense to capture those choices in your captions.

Aim for brevity. The on-screen text of top search queries from 2020 is essential to the video’s creative concept. When the on-screen text matched the language of the audio, we didn’t need to repeat those search terms in the captions. We also found that three lines of text at a time was a reasonable limit for captions, so as not to overwhelm viewers. Unpacking the experience involves thinking just as much about what to leave out of your captions as what to include.