The clustering problem has been extensively studied over the last 50 years; however, it still has the attention of researchers. This paper presents a topological basis of a pseudometric-based clustering model which takes into account the local and global topological properties of the data to be clustered, as per the definition of homogeneity measurement. The proposed approach takes into account the homogeneity effect produced when a new particle is added to a group. The additional element can be accumulated in the group if its local homogeneity is not altered and, therefore, it is not necessary to carry out tests in another group. A new group needs to be generated if the threshold of the local homogeneity of the group exceeds. Theoretical results, their implementation, and their application to the problem of Content Based Image Retrieval (CBIR) are presented. The tests were performed using three image databases widely used in the literature, which are “Vogel and Shiele,” “Oliva and Torralba,” and “L. Fei- Fei, R. Fergus and P. Perona.” The results are presented and compared with the most competitive methods available in the literature.
1. Introduction
Nowadays, the large amount of data on the Internet requires grouping or clustering to obtain the relevant information from them. The typical clustering algorithms are extensively used in the areas of data sciences, data mining, and pattern recognition for grouping the information having common characteristics or defining the optimal number of groups.
Literature reports many clustering paradigms among which the most important can be categorized into Partitional Clustering [1–5] and its variants [6, 7], Hierarchical Clustering [1, 3, 8–12], Density-based Clustering [1, 3, 13–16], Grid-based Clustering [1, 3, 17–20], Spectral Clustering [1, 3, 21–24], and Gravitational Clustering [25–29]. The literature review on clustering is given in the in the next section.
In this research paper, a new clustering theory inspired of the thermodynamics principle of energy and based on the topological paradigm is presented. The proposal works in accordance with the homogeneity effect that occurs when a group receives a new particle. A test in another group can be avoided if the new element does not alter the local homogeneity of the group in which it is added. On the contrary, when the threshold value of the group is surpassed, a new group must be generated. The process continues until all the elements are assigned groups. Subsequently, the proposal is applied to the Content-Based Image Retrieval (CBIR) technique in databases of natural scenery images.
The CBIR technique provides a support-oriented tool for image understanding where the aim is to associate and recognize an image only by its content. Research and development in CBIR technique enabled to make its place in the market in the form of co-marketing products such as QBIC-IBM (http://www.qbic.almaden.ibm.com/), VisualSEEL (http://www.ee.columbia.edu/ln/dvmm/researchProjects/MultimediaIndexing/VisualSEEk/VisualSEEk.htm), and MARS (https://www.ideals.illinois.edu/handle/2142/25947). The increasing attention given and search for efficient methods have made it an active research area [30–33].
Regardless of the purpose of the users, each type of image has its own specific problems in terms of recognition and classification. Therefore, CBIR based techniques are designed to focus on one type of image with the natural scenery being one of the most complicated due to the mixture of irregular shapes and colors.
Our novel approach was tested and validated using three databases widely used in CBIR literature, which are described below:
Vogel and Shiele (VS) [34]: 700 images classified as 144 coast, 103 forest, 179 mountain, 131 prairie, 111 river/lake, and 32 sky/cloud.
Oliva and Torralba (OT) [35]: 1472 images classified as 360 coast, 328 forest, 374 mountain, and 410 prairie.
L. Fei–Fei, R. Fergus, and P. Perona (FP) [36]: 373 images classified as 128 Bonsai, 60 Joshua Tree, 85 Sun Flower, 64 Lotus, and 36 Water Lily.
We will refer to these image databases as VS, OT, and FP (Caltech-101).
Our results are compared with those of [34, 37–39]. We would like to point out that our proposal achieves a recognition percentage of 100%.
The system is implemented using the LabVIEW programming language (Laboratory Virtual Instrument Engineering Workbench) [40]. Due to the graphical ease of programming and the facility to interface input-output devices, this aspect is considered important because commercial computer-based devices can use this software. Visualizing in the future to implement an autonomous system of recognition of natural scenes mounted on a car, which will be managed by LabView as autonomous system [41–43]. The fact of recognizing natural scenarios in the navigation of an autonomous car or possibly a drone, with a 100% certainty, this proposed system will be an important element in the safety of autonomous vehicles.
The rest of the paper is organized as follows. In Section 2, related works are discussed. The proposed approach is presented in Section 3. A detailed description of the topology-based theory is given in Section 4. Section 5 presents the system methodology. Experimental results are given in Section 6. Finally, conclusions and future works are given in Section 7.
2. State-of-the-Art
Clustering algorithms are suitable for identifying and separating large databases into classes and they are among the major pattern recognition techniques. A clustering algorithm is expected to divide the set of features into the subsets which optimizes the intra-subset similarity and inter-subset dissimilarity, where a similarity measure is defined beforehand.
An extensive use of clustering algorithms is made in areas of data science and data mining, where the objective is to group information that has common characteristics, as well as, to define the optimal number of groups.
There are survey works giving a landscape on clustering techiques [44–50], where, most of the works divide clustering algorithms on the basis of their paradigms. Thus, the most important clustering paradigms reported in the literature are: Partitional Clustering, Hierarchical Clustering, Density-based Clustering, Spectral Clustering, and Gravitational Clustering, which are summarized in Table 1. Despite the large number of clustering algorithms, one does not work 100% for all types of clustering problems. The algorithms reported in the literature have been developed and adopted for different data to be clustered. The fundamental problem in the previously proposed algorithms reported in the literature is the overlooking of the topology of the data, both locally and globally, which limits them to work only for the data they analyze.
Comparing existing clustering paradigms.
Related works
Clustering paradigm
Works
Limitations
Partitional
[1–7]
(i) Only suitable for hyper-spherical separation. (ii) Optimization problem (non-guaranteed minimum value).
Hierarchical
[1, 3, 8–12]
(i) Suitable for small databases. (ii) Problem to find optimal number of groups.
Density-Based
[1, 3, 13–16]
(i) Main disadvantage is defining the priori density function. (ii) Suitable for databases following a given density function. (iii) Computationally heavy.
Spectral
[1, 3, 21–24]
(i) Frequency domain transformation. (ii) Preserving time-frequency correspondence. (iii) Optimization problem (non-guaranteed minimum value). (iv) Computationally heavy.
Gravitational
[25–29]
(i) Suitable for small databases. (ii) Only suitable for hyper-spherical separation. (iii) Optimization problem (non-guaranteed minimum value). (iv) Computationally heavy.
This proposal proposes a homogeneity measure that takes into account the local and global topological properties of the data to be clustered (see Sections 3 and 4).
Our proposal is an attempt to overcome the main limitations of typical clustering algorithms. With the topological model based on pseudometric, unlike partitional and gravitational paradigms, we no longer have the need to establish prior knowledge of the data, thus, it is not necessary to define any density-based function and the use of working only with the hyper-spherical separation functions is avoided. Therefore, based on Algebra of sets, in addition to not occupying excessive memory as the density based, spectral, and gravitational paradigms do, the calculations and results are faster. This research papers aims to compare the clustering and classification to databases of natural sceneries through CBIR methodology of different paradigms in terms of performance. The advantages and disadvantages of different clustering algorithms are beyond the scope of this research paper.
Concerning the most competitive CBIR works leading the same natural databases and scenarios are
Serrano [39]: used k-means algorithm (partitional paradigm) as clustering technique and k-nearest neighbor K-NN classifier.
Bosch [37]: used Probabilistic Latent Semantic Analysis (PLSA) which is based on a density function (density-based and partitional paradigms). Authors used K-NN and support vector machine SVM classifiers.
Lazebnik [38]: used multiresolution grouping methodology based on histograms (partitional paradigm) and a SVM classifier.
Vogel [34]: used grid segmentation (partitional paradigm), K-NN, and SVM classifiers.
Authors take into account eleven classes: coast, forest, mountain, prairie, schooner, bonsai, lotus flower, aquatic flower, sky, cloud, and Joshua’s tree. The classification performance of the authors does not reach 100% (about 90%). Throughout our clustering method, 100% is reached.
3. Proposal
The traditional models categorize the images of different natural sceneries (forest, desert, mountain, river, etc.) by recognizing the necessary and sufficient characteristics of that particular image or images. However, there is little consensus on what these characteristics should be.
The clustering process reflects the way visual information is used, associating common elements between different images. Therefore, the hypothesis is the way to associate and create the categories of natural sceneries using mathematical functions which measure association and the decision is made on the basis of that assessment. The approach developed in this research paper compares the categorization patterns and decides whether or not the images belong to the same category.
Therefore, the following model of human thought is considered. The way in which different objects are grouped by a person is by evaluating the attributes and properties of the object in a given class. This is done by generating association values which are based on one function; the affinity function which measures how close one image is to another. The individual generates values and decides the convenience of associating an object to a group and thus form partitions.
The principle of homogeneity states that, for a closed system, internal homogeneity maximum is reduced to a minimum value in equilibrium. Following this line of thought, if we associate the ensembles with a homogeneity function, which is obtained in principle with the affinity function, a group of images will be formed by common elements if they are kept below their homogeneity level.
A subset of a group may be designated, which we will call representatives. A good way of associating an object to a group is when there is affinity between that new object and the group of representatives, that is, when the object added to the group keeps its homogeneity level low, by maintaining the qualities of that group and therefore the association of new objects will be clearer. In such case, it is sought that even with the least affinity that is on the object, it remains below the level of homogeneity. If it turns out that the object has no relation to this grouping, then this leads to a rejection, and a new group is created in which the elements are labelled as ‘other’ because they have no similar properties to each other or to the created groups.
4. Topological-Based Clustering Theory Definition 1.
Let X be a nonempty set. The power set 2X is the set of all subsets of X. The Cartesian product X×Y of sets X and Y is the set of all ordered pairs (x,y) with x∈X and y∈Y [51].
A partition P(X) of a set X is a set of nonempty subsets of X such that every element x∈X is in exactly one of these subsets. Equivalently, a family of sets P(X)={Pi∣i∈I} is a partition of X if and only if the following conditions are satisfied:
For each i∈I, Pi⊂X.
For i,j∈I and i≠j implies Pi∩Pj=∅.
⋃i∈IPi=X.
Definition 2.
A pseudometric of a set X is a function ρ:X×X→[0,∞) satisfying the conditions defined by (1), for all x, y, and z∈X.(1)ρx,y=ρy,xρx,y≤ρx,z+ρz,yρx,x=0
The pair (X,ρ) is called a pseudometric space.
Besides, for all x∈X and λ≥0, an open-ball function of radius λ and around x is defined as Bρ(x,λ)={x∈X:ρ(x,k)<λ}.
Definition 3.
Let (X,ρ) be a pseudometric space and A be a subset of X. If an element r∈A satisfies the condition (2), then r is called representative of A, denoted by R(A).(2)∑y∈A\rρr,y≤∑y∈A\xρx,y
For all x≠y, x,y∈X.
Definition 4.
Let (X,ρ) be a pseudometric space and A be a subset of X with R(A)={x∈A:xisrepresentativeofA}; then the distance between A and x is defined as(3)ρA,x=minρr,x:x∈X
Also, a homogeneity function H(A), with H:2X→R, is defined as(4)HA=1A-1∑x≠rρx,rwhere r∈R(A).
It is noted that the calculation of the homogeneity of a set A is independent of the choice of the representative of A. By definition, if r1 and r2 are representative of A, then, the following condition must be satisfied:(5)∑y∈A\r1ρr1,y=∑y∈A\r2ρr2,y
A point p is related to A if H(A⋃{p})≤H(A).
Definition 5.
Let (X,ρ) be a pseudometric space and δ≥0. A partition P(X)={Pi:i∈I} of X is called a δ-partition if H(Pi≤δ) for each i∈I.
Definition 6.
Let (X,ρ) be a pseudometric space, A be a subset of X, and R(A) be the set of representatives of A. The set (6) given below is called a lambda-cover of A.(6)starA,λ=⋃r∈RABρr,λ
Lemma 7.
Let (X,ρ) be a pseudometric space. For each finite set A∈2X, the set of representatives R(A) is a nonempty set.
Proof.
For the set A={ai:i=1,…,n}, considers(7)Di=∑y∈A/aiρai,y
take D∗=min{D(ai):i=1,…,n}. There is an i∗ such that D∗=Di. For definition of ai∗ it follows that(8)∑y∈A\ai∗ρai∗,y≤∑y∈A\xρx,y
for each x∈A. This proves that ai∗ is a representative of A.
Example 8.
Consider the sets A,B, and C in R2 with the Euclidean metric (see Figure 1). The elements of the set A marked with a dot are distributed in two concentric circles with radii 1, and 2. The representative of the set is the element in the center of the circle marked with a star. Consider the subsets B and C as removing a circle from A (see Figure 1). Also, it can be noted that the homogeneity of set A, B, and C is 1.5, 2, and 1, respectively.
This example shows that the homogeneity is not invariant under subsets, for B⊆A does not imply H(B)≤H(A).
The following result shows one way to preserve the homogeneity.
Example of homogeneity and subsets.
Proposition 9.
Let (X,ρ) be a pseudometric space, A∈2X, and R(A) and H(A) be the set of representatives and homogeneity of A, respectively. The λ-star cover st(A,λ) with λ=H(A) satisfies H(st(A,λ))≤H(A).
Proof.
Consider r∈R(A) and p∈Bρ(r,H(A))\A; then ρ(r,p)≤H(A). Let A1=A∪{p}.(9)HA1=1A1-1∑y∈A1\rρy,r=1A1-1∑y∈A\rρy,r+1A1-1ρp,r≤1A∑y∈A\rρy,r+1AHA=A-1AA-1∑y∈A\rρy,r+1AHA=A-1AHA+1AHA=HA
It can construct the succession {Aj:j=1,…,n} which satisfied H(Am)≤H(An) for n≤m and that Aj→st(A,λ). Therefore H(st(A,λ))≤H(A).
Theorem 10.
Let (X,ρ) be a pseudometric space and δ≥0. There exists a partition P(X)=Q∪{Pi:i∈I} of X, where {Pi:i∈I} is δ-partition and the element Q are not related to Pj for each j∈I.
Proof.
It will construct a family of sets {Fi}i=1k={Qk∪{P1,P2,…,Pk}}i=1k which satisfies the following properties for each i,j=1,…,n: (10)iPi∩Pj=0∀i≠jiiQk∩Pj=0∀j=1,…,kiiiHPi≤δ
Take x1,x2∈X. If ρ(x1,x2)<δ then P1={x1,x2}. Otherwise, define Q1={x1,x2}. It can assume without loss of generality that there exists n such that condition ρ(xm,xn)<δ is satisfied for m≤n. Let P1={xm,xn} and Q1={x1,x2,…,xm-1,xm+1,…,xn-1}; then take P1=st(P1,δ) and consider F1=Q1∩{P1}. By Proposition 9 the following inequalities hold:(11)HP1=HstP1,δ≤HP1<δTherefore condition P1=st(P1,δ) is satisfied. The properties (i) and (ii) are fully satisfied.
Suppose that, for k∈N, a succession of families of sets Fk=Qk∪{P1,P2,…,Pk} has been constructed satisfying the conditions (i)-(iii).
Let Y=X\(⋃k=1KFk) and take y∈Y.
Case I (y is not related to any Pi). Define Qk=Bρ(y,δ)∩Qk.
If Qk=∅ then Qk+1=Qk∪{y}, and Fk+1=Qk+1∪{P1,P2,…,Pk}. By definition of Qk the property Qk+1∩Pj=∅, therefore property (ii) is satisfied. The property (i) and (ii) are preserved due to the family of sets {P1,P2,…,Pk} is the same in Fk+1 and Fk.
If Qk≠∅ then Pk+1=Qk, Qk+1=Qk\Qk, and Fk+1=Qk∪{P1,P2,…,Pk}.
Note Qk+1⊆Qk and Pk+1⊆X\(⋃j=1KPj). Properties (i) and (ii) are easily verified. y is a Pk+1 the condition (iii) is satisfied.
Case II (there exists i≤k such that y is related to Pi). Consider the set of indexes j={j∈N:yisrelatedtoPj} and let i∗=min{ρ(Pi,y):i∈I}. Defines Fk+1=Qk∪{P1,P2,…,Pi∗∪{y},…,Pk}. Since y is related to Pi∗ the inequality H(Pi∗∪{y})≤H(Pi∗)≤δ implies property (iii). Note that y∈X\(Fk). Therefore properties (i) and (ii) are satisfied.
By induction, there exists a decomposition of the set X of the form P(X)=Q∪{Pi:i∈I}, where the family of sets {Pi:i∈I} satisfies with H(Pi)≤δ for each j∈I. The theorem is proved.
Observe that the theorem indicates an algorithm which allows to create partitions of a set from a pseudometric and a parameter δ. This partition consists of a family of sets {Pi:i∈I} whose elements are clusters. Elements of the set Q do not belong to any class.
5. Methodology
In this section, we describe each stage of the proposed methodology (applying our topological clustering theory based on a pseudometric) for image retrieval of natural scenes from a database. As shown in Figure 2, the flow diagram for the whole proposal, there is a training phase and a testing phase explained as follows.
Flow diagram for the training and testing stage.
5.1. Training Stage
This stage is divided into four main phases: Image database, Histogram estimation, Calculate the distance, and Add the object into the suitable cluster, see Figure 2 upper line, which are described below.
5.1.1. Image Database
To develop a CBIR system, it is necessary to use databases of images for the training of the system. In this phase, standard and free-use databases such as Vogel and Shiele (VS) [34], Oliva and Torralba (OT) [35], and L. Fei–Fei, R. Fergus, and P. Perona (FP) [36] can be used.
The classes used in both the training and the experimentation phases are shown in Figure 3.
Categories for image organization and retrieval for each of the three databases: (a) FP, (b) OT, and (c) VS respectively.
5.1.2. Histogram Estimation
The histogram provides a summary of the distribution of pixel values in an image. The color histogram of an image is relatively invariant with respect to translation and rotation on the axes. The comparison of the colors contained in two images can be made using histograms. The color histograms are suitable for this investigation [52–54]. A histogram represents the number of occurrences of the values of a data set (see the following):(12)HiA=∑j=1nCijwherecij=1ifai=xi0otherwise
A good practice is to perform normalization. It is usually represented as a density probability function. This preprocessing task is carried out to bring the values to an interval regardless of the number of elements they have.(13)H∗A=HA∑j=1nHiA
In this part, the values of the image are still expressed in RGB color space. We also use HSI (Hue, Saturation, and Intensity) values. The first step for the conversion is to normalize the RGB values, as(14)r=RR+G+B;g=GR+G+B;b=BR+G+B
After normalization, the HSI components are obtained by(15)h=cos-10.5r-gr-br-g2+r-bg-b1/2∀h∈o,π,b≤gh=2π-cos-10.5r-gr-br-g2+r-bg-b1/2∀h∈π,2π,b>gs=1-3×minr,g,b∀s∈0,1i=R+G+B3×255∀i∈0,1
For convenience, for neutral color values such as black, white, and gray, where values for R, G, and B are equal, H = 0. Similarly, the H, S, and I values are converted into the following ranges:(16)H=h×180πS=s×100I=i×255
5.1.3. Calculating the Distance
Consider the set Φ of images. There is an association of an image P to the corresponding histogram H(P). For two images P1 and P2, the following relationship is defined:(17)ρ:Φ2⟶RρP1,P2=dHP1,HP2
In this case the function ρ:Φ2→R is a pseudometric.
Take P1, P2, and P3 images in Φ and H(P1), H(P2), and H(P3) with their corresponding histograms. The properties of Definition 2 are fulfilled:(18)iρP1,P1=dHP1,HP1=0iiρP1,P2=dHP1,HP2=dHP2,HP1=ρP2,P1iiiρP1,P2=dHP1,HP2=dHP2,HP1≤≤dHP1,HP3+dHP3,HP2=ρP1,P3+ρP3,P2
A pseudometric has been built in the space of the images. Theorem 10 will allow creating clusters and perform image recovery. For this study, we consider using the following distance functions [55].
The calculation of the distances must be modified in order to eliminate the indeterminate forms. If the direct substitution produces an indeterminate form then these values are discarded.
5.1.4. Add the Object into the Suitable Cluster
Having the representatives of each cluster, the homogeneity of the element to be added to the representatives is calculated. Through Theorem 10 and Proposition 9 of Section 4, homogeneities are calculated for each representative. Then, the subject image is assigned to the cluster where homogeneity is not altered.
5.2. Testing Stage
This stage is divided into five main tasks and proceeds as depicted in Figure 2 bottom line. A query image is presented to the system, and the same two first processing steps used for learning are applied to it: histogram estimation and Calculate the Distance: query to representatives. At this point, theorem one is applied measuring through a defined pseudometric. By Proposition 9 of Section 4, homogeneities are calculated for each cluster defined in learning phase. Then, the query image is assigned to the cluster where homogeneity is not altered. Therefore, the n-images of the cluster are recovered.
6. Experimental Results
In order to test the whole performance of this proposal, 3 databases were concatenated and then divided by classes. The division was done in eleven classes (See Table 2) because natural sceneries are one of the most complicated due to the textures, the mixture of irregular shapes, and color variety.
Eleven classes for learning and testing tasks.
Class
Number of images
coast
504
forest
431
mountain
553
prairie
541
schooner
350
bonsai
128
lotus flower
64
aquatic flower
36
sky
32
cloud
32
Joshua’s tree
60
Total
2,731
All tests were carried out under “Cross-validation” method, which consists of random selection of half images set apart from database of each class for training task, and the other half part for testing task. This procedure is carried out during 20 iterations and, finally, an average value is obtained of retrieval percentages.
The system was implemented using the LabVIEW programming language, which is a computer systems engineering software that facilitates testing, measurement, and provides control with quick access to hardware and data information. The programs developed with LabVIEW are called Virtual Instruments, or VIs, and their origin came from instrument control, although today it has expanded widely not only to control all types of electronics but also to the embedded programming, communications, mathematics, etc. Its main characteristic is the ease of use for professional programmers. Relatively complex programs can be made and the facility to interface input-output devices. Visualizing in the future is to implement an autonomous system of recognition of natural scenes mounted on a car or a drone, which will be managed by LabView as autonomous system [41–43]. The fact of recognizing natural scenarios in the navigation of an autonomous car or possibly a drone, with a 100% certainty, contributes to the safety of autonomous vehicles.
The developed LabVIEW application screenshot is shown in Figure 4 where the query image, the main feature of the query image which is the histogram, and the retrieved image can be seen at the top left corner, at the lower left corner, and at the center right, respectively. The most related image file can be seen at the right bottom of the figure. Also, the histogram of the query image and the histogram of the most similar images can be seen.
LabVIEW Vision Assistant.
Several subroutines were made to integrate two main modules: training and testing tasks. Only the latter has the image display for performance and processing speed issues. For each image of the selected database, the histogram is computed and sent to a ∗.tdms file.
The value of parameter δ is experimentally estimated as an intermediate value between the representatives. For each class an average of the attributes was taken and it was defined as the representative. In order to determine the parameter δ, only the representatives were taken. Therefore, the parameter δ=0.5μ was defined. From now, this parameter is used for the whole clustering algorithm. It was observed that a value of parameter δ smaller restricts very much the classes of objects and created many classes that apparently were different however the classes contain common elements. A higher value of parameter δ significantly increased the size of the set, adding elements that do not have common characteristics.
As there are eleven possible classes of natural scenarios, only 3 examples of query and retrieval images are presented: Bonzai (See Figure 5(a)), Prairie (See Figure 5(b)), and Forest (See Figure 5(c)). The query image (red label in each sub figure) is presented and the system was configured to present the 5 most similar images. Note how, in the three cases of query images, the results of the 5 images are very similar to the query one. As the query image is included in the training database, consequently, in the search result, the first image found is equal to the searched one.
Experimental results for classes clustering.
Experimental results for Bonzai cluster
Experimental results for Prairie cluster
Experimental results for Forest cluster
In order to evaluate the performance of the system created, precision and recall formulas are used [56]:(22)Precision=NumberofrelevantimagesretrievedTotalnumberofimagesretrievedRecall=NumberofrelevantimagesretrievedNumberofrelevantimagesinthedatabase
In the experiment, the precision vs. recovery graph is used for a query image. Graphs are made with the three distances in RGB and HSI color formats; averages were considered for each measure used [57].
The precision-recall analysis for the mountain, bonsai, and forest scenarios is presented in Figures 6, 7, and 8. It can be seen that the best results are obtained with the distance function “Intersection” for the RGB color model (dark blue line). Besides, the worst result was obtained for the “correlation” distance function in the HSI color model (green line).
CBIR Effectiveness (Mountains, average).
CBIR Effectiveness (Bonzai, average).
CBIR Effectiveness (Forest, average).
Applying our proposal to the CBIR problem, a query image is always retrieved in the correct scenario. If the query image belongs to the training database, then it is found first. Otherwise, it is in the right scenario within the first 4 retrieved images (as an example, see Figure 5 for bonsai, prairie, and forest classes).
6.1. Comparison to Previous Classification Results: Natural Scene Classification
The comparison between the proposed method in this research paper and other competitive methods reported in the literature is focused on natural scenarios. Therefore, in the comparison, eleven classes are taken into account, coast, forest, mountain, prairie, schooner, bonsai, lotus flower, aquatic flower, sky, cloud, and Joshua’s tree. The most competitive results leading to the same natural databases and scenarios are Serrano [39], Bosch [37], Lazebnik [38], and Vogel [34].
As shown in Table 3 and Figure 9 where authors used the same database, the proposed method improves the results in all cases and for the all classes. Clarifying that in Table 3 and Figure 9, the zero values of the other authors indicate that they did not work with the respective image class.
Comparison values against the most competitive methods.
Our proposal
Serrano
A. Bosh
Lazebnik
Vogel
Coast
100
94
88
82
80
Forest
100
90
100
95
95
Mountain
100
80
89
88
90
Prairie
100
92
78
70
92
Schooner
100
0
0
79
0
Bonsai
100
92
0
92
0
Lotus flower
100
64
0
70
0
Aquatic flower
100
90
0
92
0
Skies
100
86
0
0
94
Clouds
100
86
0
0
94
Joshua’s tree
100
90
0
88
0
Classification performance of our proposal against the most competitive methods focused on natural scenarios.
Remarking that Bosch [37] has the same performance for the forest class as our model, but Bosch has worked only with four classes. Besides that, Vogel [34] has percentages above 90% in four classes: forest, prairie, skies, and clouds classes; the proposed method improves the result about 6%, i.e., from 94% (mean value for the four best classification) to 100%. Finally, compared with the two authors (Lazebnik [38] and Serrano [39]), our results are far better.
Finally, comparing with articles [30, 58–60] that use other image data bases (Corel A, Corel B, and Caltech) which include some natural scenarios: costs, flowers, and mountains. It can be seen in Table 4 that this proposal is superior.
Comparison values against the most competitive methods using others DB.
Our proposal
Mehmood-2018
Nouman
Mehmood-2016
Yousuf
Coast
100
74.58
60.39
68.23
75
Lotus flower
100
85.13
88.37
84.97
98.3
Mountain
100
82.43
61.88
87.43
83.8
We consider that we have obtained better results than the previous published results since the proposed clustering algorithm, based on topology theory, takes into account both the local and the global topology of the data to be clustered.
7. Conclusions
The proposed theory of a pseudometric-based clustering model and its application in Content-Based Image Retrieval worked as it was expected. The developed clustering method based on topology theory, has successfully operated for the clustering of natural scenery. The proposal based on a topological pseudometric, provides a paradigm within the literature of clustering algorithms.
This proposal takes into account the local and global topological properties of the data to be clustered, in a definition of homogeneity measurement.
The proposal tries to overcome the main limitations of typical clustering algorithms: (1) no longer have the need to establish an a prior knowledge of the data, thus, it is not necessary to define any density-based function, (2) there is no need to work only with hyper-spherical separation functions (partitional and gravitational paradigms). Finally, (3) as our proposal is based on Algebra of sets, calculations and results are faster and without excessive memory consumption (as density-based, spectral and gravitational paradigms do).
Using the same image databases, the comparison with the most competitive works, the proposed method improves the results in all cases and for all the classes.
A query image is always retrieved in the correct scenario. If the query image belongs to the training database, then it is found first. Otherwise, it is in the right scenario within the first 4 retrieved images. As a consequence, the whole system has an efficiency of 100% for eleven natural sceneries.
The application was deployed using the LabVIEW vision assistant. This aspect is considered important because commercial computer-based devices can use this software. The observed processing time was linear with respect to the number of elements in the database. There was a considerable reduction in the training and testing times.
Theorem 10 is not limited to images; besides in any space that defines a pseudometric, this algorithm can be used. The results obtained are very promising and future work includes the application of these functions in other sets and applications.
As future work, the authors propose developing a CBIR system in FPGA hardware or mobile device, such as a camera device to acquire the images to process in real time using this methodology. Finally, the authors have shown a real-world experiment, in which this function obtained very good results.
Data Availability
The data used to support the findings of this study are available from the corresponding author upon request.
Conflicts of Interest
The authors declare that they have no conflicts of interest.
Authors’ Contributions
I. Osuna-Galán, Y. Pérez-Pimentel, Carlos Aviles-Cruz, and Juan Villegas-Cortez contributed equally to this work.
XuR.WunschD.20081John Wiley and SonsXiongH.WuJ.ChenJ.K-means clustering versus validation measures: a data-distribution perspective200939231833110.1109/TSMCB.2008.20045592-s2.0-64049085167TheodoridisS.KoutroumbasK.20081Academic PressJainA. K.Data clustering: 50 years beyond K-means20103186516662-s2.0-7795036934510.1016/j.patrec.2009.09.011BelhaouariS. B.AhmedS.MansourS.Optimized K-means algorithm201420141450648010.1155/2014/5064802-s2.0-84934922035ArthurD.VassilvitskiiS.K-means: the advantages of careful seedingProceedings of the In Proceedings of the eightteenth annual ACM-SIAM symposium on Discrete algorithms, SODA 072007Philadelphia, USA10271035XuY.QuW.LiZ.MinG.LiK.LiuZ.Efficient k-means++ approximation with MapReduce20142512313531442-s2.0-8491061541110.1109/TPDS.2014.2306193NazariZ.KangD.AsharifM. R.SungY.OgawaS.A new hierarchical clustering algorithmProceedings of the International Conference on Intelligent Informatics and Biomedical Sciences, ICIIBMS 2015November 2015Japan1481522-s2.0-84966660708LiuF.WeiY.RenM.HouX.LiuY.An agglomerative hierarchical clustering algorithm based on global distance measurementProceedings of the 7th International Conference on Information Technology in Medicine and Education (ITME)2015363367ArslanO.GuralnikD.KoditschekD.Coordinated robot navigation via hierarchical clustering20163235237110.1109/TRO.2016.2524018RashediE.MirzaeiA.RahmatiM.Optimized aggregation function in hierarchical clustering combination20162022812912-s2.0-8496095051110.3233/IDA-160805YaoW.DumitruC. O.LoffeldO.DatcuM.Semi-supervised Hierarchical Clustering for Semantic SAR Image Annotation201695199320082-s2.0-8498176332210.1109/JSTARS.2016.2537548RosF.GuillaumeS.DENDIS: A new density-based sampling for clustering algorithm2016563493592-s2.0-8496270056210.1016/j.eswa.2016.03.008JahirabadkarS.KulkarniP.Algorithm to determine ε-distance parameter in density based clustering2014416293929462-s2.0-8489054921710.1016/j.eswa.2013.10.025YokoyamaS.Bogardi-MeszolyA.IshikawaH.Ebscan, An entanglement-based algorithm for discovering dense regions in large geo-social data streams with noise2015ChaconJ. E.A population background for nonparametric density-based clustering201530451853210.1214/15-STS526MR3432839AgrawalR.GehrkeJ.GunopulosD.RaghavanP.Automatic subspace clustering of high dimensional data for data mining applications1998272941052-s2.0-0032090765ZhaoQ.ShiY.LiuQ.FräntiP.A grid-growing clustering algorithm for geo-spatial data20155377842-s2.0-8491988318810.1016/j.patrec.2014.09.017RashadM. A.El-DeebH.FakhrM. W.Document classification using enhanced grid based clustering algorithm20153122072152-s2.0-8491937199510.1007/978-3-319-06764-3_27SajanaT.Sheela RaniC. M.NarayanaK. V.A survey on clustering techniques for big data mining2016931122-s2.0-84958184626NgA. Y.JordanM. I.WeissY.On spectral clustering: Analysis and algorithm2001849856YangY.MaZ.YangY.NieF.ShenH. T.Multitask spectral clustering by exploring intertask correlation2014455106910802-s2.0-85027955587InkayaT.A parameter-free similarity graph for spectral clustering20154224948994982-s2.0-8494233323610.1016/j.eswa.2015.07.074ShangR.ZhangZ.JiaoL.WangW.YangS.Global discriminative-based nonnegative spectral clustering2016551721822-s2.0-84969361508WrightW. E.Gravitational clustering1977931511662-s2.0-001754451410.1016/0031-3203(77)90013-9RashediE.Nezamabadi-PourH.A stochastic gravitational approach to feature based color image segmentation2013264132213322-s2.0-8487521927810.1016/j.engappai.2012.10.002DowlatshahiM. B.Nezamabadi-PourH.GGSA: a grouping gravitational search algorithm for data clustering20143611412110.1016/j.engappai.2014.07.0162-s2.0-84908152477KumarV.ChhabraJ. K.KumarD.Automatic cluster evolution using gravitational search algorithm and its application on image segmentation201429931032-s2.0-8489407368910.1016/j.engappai.2013.11.008NikbakhtH.MirvaziriH.A new algorithm for data clustering based on gravitational search algorithm and genetic operatorsProceedings of the 2015 International Symposium on Artificial Intelligence and Signal Processing, AISP 2015March 2015Iran2222272-s2.0-84938801770MehmoodZ.AnwarS. M.AliN.HabibH. A.RashidM.A Novel image retrieval based on a combination of local and global histograms of visual words2016201612821725010.1155/2016/82172502-s2.0-84984707520ZhouW.LiH.TianQ.Recent advance in content-based image retrieval: A literature survey2017YousufM.MehmoodZ.HabibH. A.et alA novel technique based on visual words fusion analysis of sparse features for effective content-based image retrieval2018201813213439510.1155/2018/2134395VarmaA.KaurD. K.Survey on content based image retrieval20187471476VogelJ.SchieleB.Semantic modeling of natural scenes for content-based image retrieval20077221331572-s2.0-3384624957810.1007/s11263-006-8614-1OlivaA.TorralbaA.Modeling the shape of the scene: a holistic representation on the spatial envelope200142314517510.1023/A:10111396317242-s2.0-0035328421Fei-FeiL.FergusR.PeronaP.Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categoriesProceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW '04)July 2004597010.1109/CVPR.2004.3832-s2.0-84932617705BoschA.ZissermanA.MuñozX.Scene classification using a hybrid generative/discriminative approach20083047127272-s2.0-4244916231810.1109/TPAMI.2007.7071618276975LazebnikS.SchmidC.PonceJ.Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories1Proceedings of the Society Conference on Computer Vision and Pattern Recognition (CVPR06)200612Serrano-TalamantesJ. F.Avilés-CruzC.Villegas-CortezJ.Sossa-AzuelaJ. H.Self organizing natural scene image retrieval2013407239824092-s2.0-8487317281910.1016/j.eswa.2012.10.064National-Instruments2018 http://www.ni.com/en-us/shop/labview.htmlBegumR.HalseS. V.The smart car parking system using gsm and labview2018913514210.29055/jcms/740BlaifiS.MoulahoumS.ColakI.MerroucheW.Monitoring and enhanced dynamic modeling of battery by genetic algorithm using labview applied in photovoltaic system201810010211038AlamA.JafferyZ. A.A vision-based system for traffic light detection20181333343SimK.GopalkrishnanV.ZimekA.CongG.A survey on enhanced subspace clustering20132623323972-s2.0-8487240643210.1007/s10618-012-0258-xZbl1270.68260XuD.TianY.A comprehensive survey of clustering algorithms2015216519310.1007/s40745-015-0040-1DinlerD.TuralM. K.2016Cham, SwitzerlandSpringer International PublishingKhandareA. S.AnandandA.VishwakarmaH.AkasheS.Clustering algorithms: Experiment and improvements2017SingaporeSpringer263271PatibandlaR. S. M. L.VeeranjaneyuluN.BhatejaV.Coello CoelloC. A.SatapathyS. C.PattnaikP. K.Survey on clustering algorithms for unstructured data2018SingaporeSpringerOsmanM. M. A.Syed-YusofS. K.Abd MalikN. N. N.ZubairS.A survey of clustering algorithms for cognitive radio ad hoc networks20182414511475BindraK.MishraA.Suryakant.RayK.SharmaT. K.RawatS.SainiR. K.BandyopadhyayA.Effective data clustering algorithms2019SingaporeSpringerEngelkingR.1989Springer International PublishingMR862623AfifiA. J.AshourW. M.Image retrieval based on content using color feature201220121124828510.5402/2012/248285LiuG.-H.YangJ.-Y.Content-based image retrieval using color difference histogram20134618819810.1016/j.patcog.2012.06.001StrickerM. A.OrengoM.NiblackW.JainR. C.Similarity of color images2420Proceedings of SPIEZhangQ.CanosaR. L.PS.A comparison of histogram distance metrics for content-based image retrieval201490279MakhoulJ.KubalaF.SchwartzR.WeischedelR.Performance measures for information extractionProceedings of the DARPA Broadcast News Workshop1999249252MosbahM.BouchehamB.RochaÁ.CorreiaA. M.AdeliH.ReisL. P.CostanzoS.Matching measures in the context of cbir: A comparative study in terms of effectiveness and efficiencyProceedings of the Recent Advances in Information Systems and Technologies2017Cham, SwitzerlandSpringer International Publishing245258MehmoodZ.RashidM.RehmanA.SabaT.DawoodH.DawoodH.Effect of complementary visual words versus complementary features on clustering for effective content-based image search20183555421543410.3233/JIFS-171137AliN.BajwaK. B.SablatnigR.ChatzichristofisS. A.IqbalZ.RashidM.HabibH. A.RubinD. L.A Novel Image Retrieval Based on Visual Words Integration of SIFT and SURF201611612010.1371/journal.pone.0157428YousufM.MehmoodZ.HabibH. A.A novel technique based on visual words fusion analysis of sparse features for effective content-based image retrieval2018201813213439510.1155/2018/2134395