describing the structure of the content in terms of video sgements, frmae, images, etc.
describing the objects, events, notions from the real word captured by the content