• العربية
    • English
  • English 
    • العربية
    • English
  • Login
Home
Publisher PoliciesTerms of InterestHelp Videos
Submit Thesis
IntroductionIUGSpace Policies
JavaScript is disabled for your browser. Some features of this site may not work without it.
View Item 
  •   Home
  • Faculty of Engineering
  • PhD and MSc Theses- Faculty of Engineering
  • View Item
  •   Home
  • Faculty of Engineering
  • PhD and MSc Theses- Faculty of Engineering
  • View Item

Please use this identifier to cite or link to this item:

http://hdl.handle.net/20.500.12358/18774
TitleEfficient Data Clustering Algorithms
Untitled
Abstract

Data Clustering is one of the most important issues in data mining and machine learning. Clustering is a task of discovering homogenous groups of the studied objects. Recently, many researchers have a significant interest in developing clustering algorithms. The most problem in clustering is that we do not have prior information knowledge about the given dataset. Moreover, the choice of input parameters such as the number of clusters, number of nearest neighbors and other factors in these algorithms make the clustering more challengeable topic. Thus any incorrect choice of these parameters yields bad clustering results. Furthermore, these algorithms suffer from unsatisfactory accuracy when the dataset contains clusters with different complex shapes, densities, sizes, noise and outliers. In this thesis, we propose a new approach for unsupervised clustering task. Our approach consists of three phases of operations. In the first phase we use the most widely used clustering technique which is Kmeans algorithm for its simplicity and speed in practice. We benefit just from one run of Kmeans, despites its accuracy, to discover and analyze the given dataset by catching preliminary clusters to insure closely grouping sets. The second phase takes these initial groups for processing them in a parallel fashion using shrinking based on the convex hull of the initial groups. From the second phase we obtain a set of sub-clusters of the given dataset. Hence, the third phase considers these sub-clusters for merging process based on the Delaunay triangulation. This new algorithm is named as Kmeans-Based Convex Hull Triangulation clustering algorithm (KBCHT). We present experiments that provide the strength of our new algorithm in discovering clusters with different non-convex shapes, sizes, densities, noise and outliers even though the bad initial conditions used in its first phase. These experiments show the superiority of our proposed algorithm when comparing with most competing algorithms.

Authors
Abubaker, Mohammed Bassam Mohammed
Supervisors
Hamad, Hatem Mahmoud
Typeرسالة ماجستير
Date2011
LanguageEnglish
Publisherthe islamic university
Citation
License
Collections
  • PhD and MSc Theses- Faculty of Engineering [641]
Files in this item
file_1.pdf3.010Mb
Thumbnail

The institutional repository of the Islamic University of Gaza was established as part of the ROMOR project that has been co-funded with support from the European Commission under the ERASMUS + European programme. This publication reflects the views only of the author, and the Commission cannot be held responsible for any use which may be made of the information contained therein.

Contact Us | Send Feedback
 

 

Browse

All of IUGSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsSupervisorsThis CollectionBy Issue DateAuthorsTitlesSubjectsSupervisors

My Account

LoginRegister

Statistics

View Usage Statistics

The institutional repository of the Islamic University of Gaza was established as part of the ROMOR project that has been co-funded with support from the European Commission under the ERASMUS + European programme. This publication reflects the views only of the author, and the Commission cannot be held responsible for any use which may be made of the information contained therein.

Contact Us | Send Feedback