Multimedia Semantics — Bridging the Semantic Gap

In “Content-Based Image Retrieval at the End of the Early Years Found in: IEEE Transactions on Pattern Analysis and Machine Intelligence Arnold Smeulders , Marcel Worring , Simone Santini , Amarnath Gupta , Ramesh Jain December 2000” the semantic gap was defined as:

The semantic gap is the lack of coincidence between the information that one can extract from the visual data and the interpretation that the same data have for a user in a given situation. A linguistic description is almost always contextual, whereas an image may live by itself.

This semantic gap exists in all media. Even in text. There are all these techniques but are they solving the problem. The two popular approaches in this direction are:

Semantic Web tools (Ontologies, RDF, XML) help in creating relationships among ‘symbolic data’, and

Concept detection (and other media processing) that deal with ‘signal data’.

A deeper look at these two, however shows that they work on two different sides of the gap rather than doing anything really to bridge the gap. Semantic tools are all about symbols while the models developed at that level do not allow going across the gap to apply the models there to signals. And concept detection approaches at signal level have deep roots in implicit rather than explicit models.

Leave a Reply