An algorithm for clustering using L1-norm based on hyperbolic smoothing technique
- Authors: Bagirov, Adil , Mohebi, Ehsan
- Date: 2016
- Type: Text , Journal article
- Relation: Computational Intelligence Vol. 32, no. 3 (2016), p. 439-457
- Relation: http://purl.org/au-research/grants/arc/DP140103213
- Full Text: false
- Reviewed:
- Description: Cluster analysis deals with the problem of organization of a collection of objects into clusters based on a similarity measure, which can be defined using various distance functions. The use of different similarity measures allows one to find different cluster structures in a data set. In this article, an algorithm is developed to solve clustering problems where the similarity measure is defined using the L1-norm. The algorithm is designed using the nonsmooth optimization approach to the clustering problem. Smoothing techniques are applied to smooth both the clustering function and the L1-norm. The algorithm computes clusters sequentially and finds global or near global solutions to the clustering problem. Results of numerical experiments using 12 real-world data sets are reported, and the proposed algorithm is compared with two other clustering algorithms. ©2015 Wiley Periodicals, Inc.
Constrained self organizing maps for data clusters visualization
- Authors: Mohebi, Ehsan , Bagirov, Adil
- Date: 2016
- Type: Text , Journal article
- Relation: Neural Processing Letters Vol. 43, no. 3 (2016), p. 849-869
- Full Text: false
- Reviewed:
- Description: High dimensional data visualization is one of the main tasks in the field of data mining and pattern recognition. The self organizing maps (SOM) is one of the topology visualizing tool that contains a set of neurons that gradually adapt to input data space by competitive learning and form clusters. The topology preservation of the SOM strongly depends on the learning process. Due to this limitation one cannot guarantee the convergence of the SOM in data sets with clusters of arbitrary shape. In this paper, we introduce Constrained SOM (CSOM), the new version of the SOM by modifying the learning algorithm. The idea is to introduce an adaptive constraint parameter to the learning process to improve the topology preservation and mapping quality of the basic SOM. The computational complexity of the CSOM is less than those with the SOM. The proposed algorithm is compared with similar topology preservation algorithms and the numerical results on eight small to large real-world data sets demonstrate the efficiency of the proposed algorithm. © 2015, Springer Science+Business Media New York.
Intelligent packaging in meat industry : An overview of existing solutions
- Authors: Mohebi, Ehsan , Marquez, Leorey
- Date: 2015
- Type: Text , Journal article
- Relation: Journal of Food Science and Technology-Mysore Vol. 52, no. 7 (2015), p. 3947-3964
- Full Text:
- Reviewed:
- Description: Traditional packaging systems are refused since these systems do not provide any information about the quality of food products to the consumers and manufacturers at any stage of supply chain. The essence of a new technology to monitor the food spoilage from farm to fork is emerged to reduce hazards such as food borne diseases. Moreover, the food quality monitoring systems clarify the main factors in food wastage during supply chain. Intelligent packaging is employed to provide information about the history of food handling and storage to enhance food products quality and meet consumer satisfactions. Meat is one of the most perishable foods which causes sever illnesses in the case of spoilage. Variety of indicators and sensors have been proposed to warn about meat spoilage in meat industry. In this paper an overview of proposed approaches as well as commercial technologies to monitor the quality of meat during storage and transportation is presented. Furthermore, the existing technologies are compared in the sense of advantages and disadvantages in meat packaging applications.
Modified self-organising maps with a new topology and initialisation algorithm
- Authors: Mohebi, Ehsan , Bagirov, Adil
- Date: 2015
- Type: Text , Journal article
- Relation: Journal of Experimental and Theoretical Artificial Intelligence Vol. 27, no. 3 (2015), p. 351-372
- Full Text: false
- Reviewed:
- Description: Mapping quality of the self-organising maps (SOMs) is sensitive to the map topology and initialisation of neurons. In this article, in order to improve the convergence of the SOM, an algorithm based on split and merge of clusters to initialise neurons is introduced. The initialisation algorithm speeds up the learning process in large high-dimensional data sets. We also develop a topology based on this initialisation to optimise the vector quantisation error and topology preservation of the SOMs. Such an approach allows to find more accurate data visualisation and consequently clustering problem. The numerical results on eight small-to-large real-world data sets are reported to demonstrate the performance of the proposed algorithm in the sense of vector quantisation, topology preservation and CPU time requirement. © 2014 Taylor & Francis.
Nonsmooth optimization based algorithms in cluster analysis
- Authors: Bagirov, Adil , Mohebi, Ehsan
- Date: 2015
- Type: Text , Book chapter
- Relation: Partitional Clustering Algorithms p. 99-146
- Full Text: false
- Reviewed:
- Description: Cluster analysis is an important task in data mining. It deals with the problem of organization of a collection of objects into clusters based on a similarity measure. Various distance functions can be used to define the similarity measure. Cluster analysis problems with the similarity measure defined by the squared Euclidean distance, which is also known as the minimum sum-of-squares clustering, has been studied extensively over the last five decades. L1 and L1 norms have attracted less attention. In this chapter, we consider a nonsmooth nonconvex optimization formulation of the cluster analysis problems. This formulation allows one to easily apply similarity measures defined using different distance functions. Moreover, an efficient incremental algorithm can be designed based on this formulation to solve the clustering problems. We develop incremental algorithms for solving clustering problems where the similarity measure is defined using the L1; L2 and L1 norms. We also consider different algorithms for solving nonsmooth nonconvex optimization problems in cluster analysis. The proposed algorithms are tested using several real world data sets and compared with other similar algorithms.
- Description: Cluster analysis is an important task in data mining. It deals with the problem of organization of a collection of objects into clusters based on a similarity measure. Various distance functions can be used to define the similarity measure. Cluster analysis problems with the similarity measure defined by the squared Euclidean distance, which is also known as the minimum sum-of-squares clustering, has been studied extensively over the last five decades. However, problems with the L
Nonsmooth optimization models and algorithms for data clustering and visualization
- Authors: Mohebi, Ehsan
- Date: 2015
- Type: Text , Thesis
- Full Text:
- Description: Cluster analysis deals with the problem of organization of a collection of patterns into clusters based on a similarity measure. Various distance functions can be used to define this measure. Clustering problems with the similarity measure defined by the squared Euclidean distance have been studied extensively over the last five decades. However, problems with other Minkowski norms have attracted significantly less attention. The use of different similarity measures may help to identify different cluster structures of a data set. This in turn may help to significantly improve the decision making process. High dimensional data visualization is another important task in the field of data mining and pattern recognition. To date, the principal component analysis and the self-organizing maps techniques have been used to solve such problems. In this thesis we develop algorithms for solving clustering problems in large data sets using various similarity measures. Such similarity measures are based on the squared L
- Description: Doctor of Philosophy
A convolutional recursive modified Self Organizing Map for handwritten digits recognition
- Authors: Mohebi, Ehsan , Bagirov, Adil
- Date: 2014
- Type: Text , Journal article
- Relation: Neural Networks Vol. 60, no. (2014), p. 104-118
- Relation: http://purl.org/au-research/grants/arc/DP140103213
- Full Text: false
- Reviewed:
- Description: It is well known that the handwritten digits recognition is a challenging problem. Different classification algorithms have been applied to solve it. Among them, the Self Organizing Maps (SOM) produced promising results. In this paper, first we introduce a Modified SOM for the vector quantization problem with improved initialization process and topology preservation. Then we develop a Convolutional Recursive Modified SOM and apply it to the problem of handwritten digits recognition. The computational results obtained using the well known MNIST dataset demonstrate the superiority of the proposed algorithm over the existing SOM-based algorithms.
CR-Modified SOM to the problem of handwritten digits recognition
- Authors: Mohebi, Ehsan , Bagirov, Adil
- Date: 2014
- Type: Text , Conference proceedings
- Relation: 34th SGAI International Conference on Innovative Techniques and Applications of Artcificial Intelligence; Cambridge, England; 9th-11th December 2014; published in Research and Development in Intelligent Systems XXXI (Incorporating Applications and Innovations in Intelligent Systems XXII) p. 225-238
- Full Text: false
- Reviewed:
- Description: Recently, researchers show that the handwritten digit recognition is a challenging problem. In this paper first, we introduce a Modified Self Organizing Maps for vector quantization problem then we present a Convolutional Recursive ModifiedSOMto the problem of handwritten digit recognition. TheModifiedSOMis novel in the sense of initialization process and the topology preservation. The experimental result on the well known digit database of MNIST, denotes the superiority of the proposed algorithm over the existing SOM-based methods.
A new modification of Kohonen neural network for VQ and clustering problems
- Authors: Mohebi, Ehsan , Bagirov, Adil
- Date: 2013
- Type: Text , Conference paper
- Relation: Proceedings of the 11-th Australasian Data Mining Conference (AusDM'13) Vol. 146, p. 81-88
- Full Text: false
- Reviewed:
- Description: Vector Quantization (VQ) and Clustering are significantly important in the field of data mining and pattern recognition. The Self Organizing Maps has been widely used for clustering and topology visualization. The topology of the SOM and its initial neurons play an important role in the convergence of the Kohonen neural network. In this paper, in order to improve the convergence of the SOM we introduce an algorithm based on the split and merging of clusters to initialize neurons. We also introduce a topology based on this initialization to optimize the vector quantization error. Such an approach allows one to find global or near global solution to the vector quantization and consequently clustering problem. The numerical results on 4 small to large real-world data sets are reported to demonstrate the performance of the proposed algorithm.
A learning-based approach for fault tolerance on grid resources scheduling
- Authors: Karimi, Mohammadbager , Bouyer, Asgarali , Mohebi, Ehsan , Rajabalipour, Hossein
- Date: 2009
- Type: Text , Conference proceedings
- Relation: 2009 5th IEEE GCC Conference and Exhibition, GCC 2009; Kuwait City; Kuwait; 17th-19th March published in 2009
- Full Text: false
- Reviewed:
- Description: While Grid environment has developed increasingly, unfortunately the importance of fault tolerance has not been remarkable in Grid resource management. On the other hand, the cost of computing by grid is important because grid is an economy-based system. Most organizations intend to spend little on their own computations by grid. Therefore, using a better approach to resource scheduling to avoid fault is necessary. This paper presents a new approach on fault tolerance mechanisms for the resource scheduling on grid by using Case-Based Reasoning technique in a local fashion. This approach applies a specific structure in order to prepare fault tolerance between executer nodes to retain system in a safe state with minimum data transferring. Certainly, this algorithm increases fault tolerant confidence therefore, performance of grid will be high.
- Description: 2009 5th IEEE GCC Conference and Exhibition, GCC 2009
An optimized hybrid kohonen neural network for ambiguity detection in cluster analysis using simulated annealing
- Authors: Mohebi, Ehsan , Sap, Mohd Noor Md Md
- Date: 2009
- Type: Text , Book chapter
- Relation: p.
- Full Text: false
- Reviewed:
Hybrid Kohonen Self Organizing Map for the Uncertainty Involved in Overlapping Clusters Using Simulated Annealing
- Authors: Mohebi, Ehsan , Sap, Mohd Noor Md Md
- Date: 2009
- Type: Text , Conference paper
- Full Text: false
- Reviewed:
- Description: The Kohonen self organizing map is widely used as a popular tool in the exploratory phase of data mining. The SOM (self organizing maps) maps high dimensional space into a 2-dimensional grid by placing similar elements close together, forming clusters. Recently research experiments presented that to capture the uncertainty involved in cluster analysis, it is not necessary to have crisp boundaries in some clustering operations. In this paper to overcome the uncertainty, an optimized clustering algorithm based on SOM which employs the rough set theory and the simulated annealing as a general technique for optimization problems is proposed. The optimized two-level stage SA-Rough SOM (simulated annealing - rough self organizing map) (first using SOM to produce the prototypes that are then clustered in the second stage based on the combination of rough set and simulated annealing) is found to perform well and more accurate compared with the crisp clustering methods (i.e. Incremental SOM) and reduces the errors.
Rough Kohonen neural network for overlapping data detection
- Authors: Mohebi, Ehsan , Sap, Mohd Noor Md Md
- Date: 2009
- Type: Text , Book chapter
- Relation: Communications in Computer and Information Science p. 183-196
- Full Text: false
- Reviewed:
- Description: The Kohonen self organizing map is an excellent tool in exploratory phase of data mining and pattern recognition. The SOM is a popular tool that maps high dimensional space into a small number of dimensions by placing similar elements close together, forming clusters. Recently researchers found that to capture the uncertainty involved in cluster analysis, it is not necessary to have crisp boundaries in some clustering operations. In this paper to overcome the uncertainty, a two-level clustering algorithm based on SOM which employs the rough set theory is proposed. The two-level stage Rough SOM (first using SOM to produce the prototypes that are then clustered in the second stage) is found to perform well and more accurate compared with the proposed crisp clustering method (Incremental SOM) and reduces the errors. © 2009 Springer-Verlag Berlin Heidelberg.
Rough Set Based Clustering of the Self Organizing Map
- Authors: Mohebi, Ehsan , Sap, Mohd Noor Md Md
- Date: 2009
- Type: Text , Conference paper
- Relation: p. 82-85
- Full Text: false
- Reviewed:
Using Self-Announcer Approach for Resource Availability Detection in Grid Environment
- Authors: Bouyer, Asgarali , Mohebi, Ehsan , Abdullah, Abdul Hanan
- Date: 2009
- Type: Text , Conference paper
- Relation: Computing in the Global Information Technology, 2009. ICCGI '09. Fourth International Multi-Conference
- Full Text: false
- Reviewed:
- Description: Since the Grid is a dynamic environment, the detection of available resources and prediction of resource availability in near future is important for resource scheduling. In a Grid environment, prediction and evaluation of resource availability is the prerequisite for a reasonable resource selection and good scheduling guarantee. There are many approaches for discovery and prediction of the resource availability that have some weaknesses (e.g., complexity time, predicting, using out of date data, etc). In this paper, we present a new method for detection of available resource based on online-announcer with no inquiry from Grid scheduler. We use a rough set analysis in each grid node to get some useful rules for predicting about nodepsilas behavior in the grid. The experiment results show that our proposed approach is fast algorithm and it applies effectively a reliable method to predict of proper resources for scheduling.