Content Based Retrieval

Content Wrap up



The purpose of this exercise is to find ways to query objects accurately using only the objects themselves, without relying on tags. This requires complex sorting and searching over a database that can be very large. We need to choose the right "similarity" features to query these objects accurately. Unfortunately, the right similarity choices can change depending on what you need from each database. Also, as the number of features grows (that is, as the dimensionality of the feature vectors increases), results get poorer. Most of these techniques are from the 1980s, but there are newer types of indexing that can handle higher dimensions with greater accuracy (X-trees, R*-trees).
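
One way to see why results get poorer as dimensionality grows is to measure how much closer the nearest object is than the farthest one. The sketch below is only an illustration, not a method from these notes: it assumes objects are already reduced to numeric feature vectors, uses Euclidean distance as the similarity measure, and draws uniformly random vectors in place of real data. The function names (euclidean, nearest, relative_contrast) are made up for this example, and the query is a plain linear scan rather than an X-tree or R*-tree index.

```python
import math
import random


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def nearest(query, database):
    """Linear-scan query: return the index of the vector closest to query."""
    return min(range(len(database)), key=lambda i: euclidean(query, database[i]))


def relative_contrast(dim, n_points=500, seed=0):
    """Return (farthest - nearest) / nearest for one random query.

    The database is n_points uniform random vectors in the unit cube of
    the given dimension (an assumption standing in for real features).
    """
    rng = random.Random(seed)
    database = [[rng.random() for _ in range(dim)] for _ in range(n_points)]
    query = [rng.random() for _ in range(dim)]
    dists = [euclidean(query, v) for v in database]
    return (max(dists) - min(dists)) / min(dists)


if __name__ == "__main__":
    # Print the contrast for a few dimensionalities.
    for dim in (2, 10, 100, 1000):
        print(dim, round(relative_contrast(dim), 3))
```

On random data like this, the contrast should shrink as the dimension rises, meaning the "most similar" object is barely more similar than any other. Real feature vectors are usually less uniform than this, so the effect may be milder in practice, but it suggests why the choice and number of features matters as much as the index structure.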