VOCABULARIES
Read more
An image description algorithm used in computer vision has been programmed, whose approach consists of placing a regular grid of interest points on the image and selecting a region of pixels around each of its nodes, for which a descriptor is calculated that takes into account the gradients of gray found. The descriptors of the entire collection of images can be grouped based on their similarity, and each resulting group will determine what we call a "visual word." The total of "visual words" in the image collection generates a visual vocabulary. The method is called Bag-of-Words because it represents an image as a disordered collection of local visual features. Considering the frequency with which each "visual word" occurs in each image, we apply pLSA (Probabilistic Latent Semantic Analysis), a statistical model that will automatically classify images according to their formal category.