Publication:
MIGR: A Categorical Data Clustering Algorithm Based on Information Gain in Rough Set Theory

cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.department#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.orcid#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.orcid#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtual.orcid#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtualsource.department142ba771-8521-430f-b204-96929b0c24b7
cris.virtualsource.department58394252-8bb4-4eb1-a6bc-719df15e00aa
cris.virtualsource.department7c30ceba-f950-46b2-b5cd-848dafe70bb9
cris.virtualsource.orcid142ba771-8521-430f-b204-96929b0c24b7
cris.virtualsource.orcid58394252-8bb4-4eb1-a6bc-719df15e00aa
cris.virtualsource.orcid7c30ceba-f950-46b2-b5cd-848dafe70bb9
dc.contributor.authorSaddam Raheem
dc.contributor.authorShadi Al Shehabi
dc.contributor.authorAmaal Mohi Nassief
dc.date.accessioned2024-07-11T08:38:56Z
dc.date.available2024-07-11T08:38:56Z
dc.date.issued2022-10
dc.description.abstract<jats:p> Clustering techniques are used to split data into clusters where each cluster contains elements that look more similar to elements in the same cluster than elements in other clusters. Some of these techniques are capable of handling clustering process uncertainty, while other techniques may have stability issues. In this paper, a novel method, called Minimum Information Gain Roughness (MIGR), is proposed to select the clustering attribute based on information entropy with rough set theory. To evaluate its performance, three benchmark UCI datasets are chosen to be clustered by using MIGR. Then, the resulting clusters are compared to those which are resulted from applying Min-Min-Rough (MMR) and information-theoretic dependency roughness (ITDR) algorithms. Both last-mentioned techniques were already compared with a variety of clustering algorithms like k-modes, fuzzy centroids, and fuzzy k-modes. The Global purity, the overall purity, and F-measure are considered here as performance measures to compare the quality of the resulting clusters. The experimental results show that the MIGR algorithm outperforms both MMR and ITDR algorithms for clustering categorical data. </jats:p>
dc.identifier.doi10.1142/S0218488522500210
dc.identifier.urihttps://acikarsiv.thk.edu.tr/handle/123456789/1999
dc.publisherWorld Scientific Pub Co Pte Ltd
dc.relation.ispartofInternational Journal of Uncertainty, Fuzziness and Knowledge-Based Systems
dc.relation.issn0218-4885
dc.titleMIGR: A Categorical Data Clustering Algorithm Based on Information Gain in Rough Set Theory
dc.typejournal-article
dspace.entity.typePublication
oaire.citation.issue5
oaire.citation.volume30

Files

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description:

Collections