List of Titles

Detection of Malleefowl Mounds from Point Cloud Data

Authors: Parvin, Nahida , Awrangjeb, Mohammad , Irvin, Marc , Florentine, Singarayer , Murshed, Manzur , Lu, Guojun
Date: 2021
Type: Text , Conference paper
Relation: 2021 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2021, Gold Coast, 29 November to 1 December 2021
Full Text: false
Reviewed:
Description: Airborne light detection and ranging (LiDAR) data have become cost and time-efficient means for estimating the size of timid fauna populations through the identification of artefacts that evidence their occurrence in a large, hostile geographic area. The unobtrusive detection method helps conservation managers to assess the stability of a population and to design appropriate conservation programs. Here we propose a mound (nest) detection method for Australia's native iconic bird, the Malleefowl, from point cloud data, which can be manipulated to act as a surrogate for population data. Existing detection methods are largely through manual observations, and are therefore not efficient for covering large and remote areas. The proposed mound detection method can identify mound feature based on height and intensity values provided by the point cloud data. Each candidate mound point is initially selected by applying a height threshold utilising the classified ground points and their corresponding digital elevation model (DEM). Then, another threshold based on intensity range derived from ground truth mound area analysis is applied on the extracted initial mound points to find the final candidate mound points. These extracted points are then used to generate a binary mask where the potential mound points are found sparse. To connect those points, a morphological filter is applied on the binary image and found the mound separated from other remaining non-mound objects. To obtain the mound from other non-mound objects, a morphological cleaning operation and a connected component analysis are carried out on the mask. The non-mound objects are removed from the mask utilising the area property of mound derived from the empirical analysis of ground-truth observations. Finally, the effectiveness of the proposed technique is calculated based on ground truth. Although the mound shapes and structures are highly variable in nature, our height and intensity-based mound point extraction method detected 55 % of the ground-truthed mounds. © 2021 IEEE.

Integrating line weber local descriptor and deep feature for tire indentation mark image classification

Authors: Liu, Ying , Che, Xin , Dong, Haitao , Li, Daxiang , Teng, Shyh , Lu, Guojun
Date: 2021
Type: Text , Conference paper
Relation: 4th International Conference on Artificial Intelligence and Pattern Recognition, 4th International Conference on Artificial Intelligence and Pattern Recognition, AIPR 2021,Virtual, Online,17-19 September 2021, 2021, ACM International Conference Proceeding Series p. 56-61
Full Text: false
Reviewed:
Description: Tire indentation mark matching is an essential tool used for the investigation of criminal cases and traffic incidents. As such images are unique and uncommon, there is a lack of dedicated databases and relevant research on this topic. This paper presents a feature extraction algorithm effective for tire indentation mark image description. The main contributions include: (1) Line feature Weber local descriptor (LWLD) is proposed, which uses the Gabor orientations instead of the original gradient orientation. This feature can describe texture information of tire indentation mark image more efficiently. (2) An attention model is constructed to produce attention feature map of tire indentation mark image. This attention feature map is then fused with LWLD resulting in a feature with more powerful representation capability. Experimental results prove that the combined use of LWLD and attention model greatly enhances the performance of tire indentation mark image matching tasks. © 2021 ACM.

Reversible data hiding in encrypted images based on image partition and spatial correlation

Authors: Song, Chang , Zhang, Yifeng , Lu, Guojun
Date: 2019
Type: Text , Conference proceedings , Conference paper
Relation: 17th International Workshop on Digital Forensics and Watermarking, IWDW 2018; Jeju Island, South Korea; 22nd-24th October 2018; Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 11378 LNCS, p. 180-194
Full Text: false
Reviewed:
Description: Recently, more and more attention is paid to reversible data hiding (RDH) in encrypted images because of its better protection of privacy compared with traditional RDH methods directly operated in original images. In several RDH algorithms, prediction-error expansion (PEE) is proved to be superior to other methods in terms of embedding capacity and distortion of marked image and multiple histograms modification (MHM) can realize adaptive selection of expansion bins which depends on image content in the modification of a sequence of histograms. Therefore, in this paper, we propose an efficient RDH method in encrypted images by combining PEE and MHM, and design corresponding mode of image partition. We first divide the image into three parts: W (for embedding secret data), B (for embedding the least significant bit(LSB) of W) and G (for generating prediction-error histograms). Then, we apply PEE and MHM to embed the LSB of W to reserve space for secret data. Next, we encrypt the image and change the LSB of W to realize the embedding of secret data. In the process of extraction, the reversibility of image and secret data can be guaranteed. The utilization of correlation between neighbor pixels and embedded order decided by the smoothness of pixel in part W contribute to the performance of our method. Compared to the existing algorithms, experimental results show that the proposed method can reduce distortion to the image at given embedding capacity especially at low embedding capacity.

A kernel-based approach for content-based image retrieval

Authors: Karmakar, Priyabrata , Teng, Shyh , Lu, Guojun , Zhang, Dengsheng
Date: 2018
Type: Text , Conference proceedings , Conference paper
Relation: 2018 International Conference on Image and Vision Computing New Zealand; Auckland, New Zealand; 19th-21st November 2018 p. 1-6
Full Text: false
Reviewed:
Description: Content-based image retrieval (CBIR) is a popular approach to retrieve images based on a query. In CBIR, retrieval is executed based on the properties of image contents (e.g. gradient, shape, color, texture) which are generally encoded into image descriptors. Among the various image descriptors, histogram-based descriptors are very popular. However, they suffer from the limitation of coarse quantization. In contrast, the use of kernel descriptors (KDES) is proven to be more effective than histogram-based descriptors in other applications, e.g. image classification. This is because, in the KDES framework, instead of the quantization of pixel attributes, each pixel equally takes part in the similarity measurement between two images. In this paper, we propose an approach for how the conventional KDES and its improved version can be used for CBIR. In addition, we have provided a detailed insight into the effectiveness of improved kernel descriptors. Finally, our experiment results will show that kernel descriptors are significantly more effective than histogram-based descriptors in CBIR.

A novel perceptual dissimilarity measure for image retrieval

Authors: Shojanazeri, Hamid , Zhang, Dengsheng , Teng, Shyh , Aryal, Sunil , Lu, Guojun
Date: 2018
Type: Text , Conference proceedings , Conference paper
Relation: 2018 International Conference on Image and Vision Computing New Zealand, IVCNZ 2018; Auckland, New Zealand; 19th-21st November 2018 Vol. 2018-November, p. 1-6
Full Text: false
Reviewed:
Description: Similarity measure is an important research topic in image classification and retrieval. Given a type of image features, a good similarity measure should be able to retrieve similar images from the database while discard irrelevant images from the retrieval. Similarity measures in literature are typically distance based which measure the spatial distance between two feature vectors in high dimensional feature space. However, this type of similarity measures do not have any perceptual meaning and ignore the neighborhood influence in the similarity decision making process. In this paper, we propose a novel dissimilarity measure, which can measure both the distance and perceptual similarity of two image features in feature space. Results show the proposed similarity measure has a significant improvement over the traditional distance based similarity measure commonly used in literature.
Description: International Conference Image and Vision Computing New Zealand

An Attention-Based Approach for Single Image Super Resolution

Authors: Liu, Yuan , Wang, Yuancheng , Li, Nan , Cheng, Xu , Zhang, Yifeng , Huang, Yongming , Lu, Guojun
Date: 2018
Type: Text , Conference proceedings , Conference paper
Relation: 2018 24th International Conference on Pattern Recognition, ICPR 2018; Beijing, China; 20th-24th August 2018 Vol. 2018, p. 2777-2784
Full Text:
Reviewed:
Description: The main challenge of single image super resolution (SISR) is the recovery of high frequency details such as tiny textures. However, most of the state-of-the-art methods lack specific modules to identify high frequency areas, causing the output image to be blurred. We propose an attention-based approach to give a discrimination between texture areas and smooth areas. After the positions of high frequency details are located, high frequency compensation is carried out. This approach can incorporate with previously proposed SISR networks. By providing high frequency enhancement, better performance and visual effect are achieved. We also propose our own SISR network composed of DenseRes blocks. The block provides an effective way to combine the low level features and high level features. Extensive benchmark evaluation shows that our proposed method achieves significant improvement over the state-of-the-art works in SISR.

Enhanced colour image retrieval with cuboid segmentation

Authors: Murshed, Manzur , Karmakar, Priyabrata , Teng, Shyh , Lu, Guojun
Date: 2018
Type: Text , Conference proceedings , Conference paper
Relation: 2018 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2018; Canberra, Australia; 10th-13th December 2018
Full Text:
Reviewed:
Description: In this paper, we further investigate our recently proposed cuboid image segmentation algorithm for effective image retrieval. Instead of using all cuboids (i.e. segments), we have proposed two approaches to choose different subsets of cuboids appropriately. With the experimental results on eBay dataset, we have shown that our proposals outperform retrieval performance of the existing technique. In addition, we have investigated how many segments are required for the most effective image retrieval and provide a quick method to determine the suitable number of cuboids.
Description: 2018 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2018

Enhancing the effectiveness of local descriptor based image matching

Authors: Hossain, Md Tahmid , Teng, Shyh , Zhang, Dengsheng , Lim, Suryani , Lu, Guojun
Date: 2018
Type: Text , Conference proceedings , Conference paper
Relation: 2018 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2018; Canberra, Australia; 10th-13th December 2018 p. 1-8
Full Text: false
Reviewed:
Description: Image registration has received great attention from researchers over the last few decades. SIFT (Scale Invariant Feature Transform), a local descriptor-based technique is widely used for registering and matching images. To establish correspondences between images, SIFT uses a Euclidean Distance ratio metric. However, this approach leads to a lot of incorrect matches and eliminating these inaccurate matches has been a challenge. Various methods have been proposed attempting to mitigate this problem. In this paper, we propose a scale and orientation harmony-based pruning method that improves image matching process by successfully eliminating incorrect SIFT descriptor matches. Moreover, our technique can predict the image transformation parameters based on a novel adaptive clustering method with much higher matching accuracy. Our experimental results have shown that the proposed method has achieved averages of approximately 16% and 10% higher matching accuracy compared to the traditional SIFT and a contemporary method respectively.
Description: 2018 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2018

Image clustering using a similarity measure incorporating human perception

Authors: Shojanazeri, Hamid , Aryal, Sunil , Teng, Shyh , Zhang, Dengsheng , Lu, Guojun
Date: 2018
Type: Text , Conference proceedings , Conference paper
Relation: 2018 International Conference on Image and Vision Computing New Zealand, IVCNZ 2018; Auckland, New Zealand; 19th-21st November 2018 p. 1-6
Full Text: false
Reviewed:
Description: Clustering similar images is an important task in image processing and computer vision. It requires a measure to quantify pairwise similarities of images. The performance of clustering algorithm depends on the choice of similarity measure. In this paper, we investigate the effectiveness of data independent (distance-based), data-dependent (mass-based) and hybrid (dis)similarity measures in the image clustering task using three benchmark image collections with different sets of features. Our results of K-Medoids clustering show that uses the hybrid Perceptual Dissimilarity Measure (PMD) produces better clustering results than distance-based l(p) - norm and mass-based m(p) - dissimilarity.

Reversible data hiding based on directional prediction and multiple histograms modification

Authors: Song, Chang , Zhang, Yifeng , Lu, Guojun
Date: 2017
Type: Text , Conference paper
Relation: 9th International Conference on Wireless Communications, WCSP 2017; Nanjing, China; 11th-13th October 2017 p. 1-6
Full Text: false
Reviewed:
Description: The reversible data hiding is an emerging technology that uses the redundancy of the carrier (typically digital images) to embed secret information and ensure the reversibility of the carrier and hidden information. In recent year, a number of reversible data hiding algorithms based on prediction error expansion have been developed. In prediction error expansion, prediction on the center pixel is made based on its neighbor pixels. The data embedding is conducted by the modification on the histogram made from prediction error expansion. Therefore, the accuracy of prediction on pixel is the key to improve the performance of the algorithm. In this paper, we propose a new reversible data hiding based on directional prediction and multiple histograms modification and design the corresponding reversible hiding rules. Compared to the existing algorithms, experimental results show that the proposed method can reduce distortion to the image at given embedding capacity.

A triangulation-based technique for building boundary identification from point cloud data

Authors: Awrangjeb, Mohammad , Lu, Guojun
Date: 2016
Type: Text , Conference proceedings , Conference paper
Relation: 2015 International Conference on Image and Vision Computing New Zealand, IVCNZ 2015; Auckland, New Zealand; 23rd-24th November 2015 Vol. 2016-November, p. 1-6
Full Text: false
Reviewed:
Description: Building boundary identification is an essential prerequisite in building outline generation from point cloud data. In this problem, boundary edges that constitute the building boundary are identified. The existing solutions to the identification of boundary edges from the input point set have one or more of the following problems: ineffective in finding appropriate edges in a concave shape, incapable of determining a 'hole' or 'concavity' inside the shape separately, dependant on additional information such as the scan direction that may be unavailable, and incompetent in determining the boundary of a point set from the boundaries of two or more subsets of the point set. This paper proposes a new solution to the identification of building boundary by using the maximum point-to-point distance in the input data. It properly detects the boundary edges for any type of shape and separately recognises holes, if any, inside the shape. The unique feature of the proposed solution is that it can identify the boundary of a point set from the boundaries of two or more subsets of the point set. It does not require any additional information other than the input point set. Experimental results show that the proposed solution can preserve details along the building boundary and offer high area-based completeness and quality, even in low density input data. © 2015 IEEE.
Description: International Conference Image and Vision Computing New Zealand

Extracting road centrelines from binary road images by optimizing geodesic lines

Authors: Zhou, Shaoguang , Lu, Guojun , Teng, Shyh , Zhang, Dengsheng
Date: 2016
Type: Text , Conference proceedings , Conference paper
Relation: 2015 International Conference on Image and Vision Computing New Zealand, IVCNZ 2015; Auckland, New Zealand; 23rd-24th November 2015 Vol. 2016-November, p. 1-6
Full Text: false
Reviewed:
Description: Binary road images can be obtained from remotely sensed images with the aid of classification and segmentation techniques. Extracting road centrelines from these binary images are crucial to update a Geographic Information System (GIS) database. A current state of art method of centreline extraction needs to remove road junctions and depends on the accuracy of the endpoints, leading to three main limitations: (1) causing small gaps in the roads, (2) wrongly treating short non-road segments as roads, and (3) producing centrelines of low accuracy around the road end regions. To overcome these limitations, we propose to use an iteratively searching scheme to obtain the longest geodesic line in the preprocessed road skeleton images. Several image pixels at each end of the geodesic lines were removed to avoid noise, and the remaining parts were optimized using a dynamic programming snake model. The proposed method is applied to three types of binary road images and compared with the state of art method. It shows that the proposed method is less affected by the end regions of the roads, and is effective in filling the gaps in the roads. It also has an advantage on processing short non-road segments. © 2015 IEEE.
Description: International Conference Image and Vision Computing New Zealand

Robust building roof segmentation using airborne point cloud data

Authors: Gilani, Syed , Awrangjeb, Mohammad , Lu, Guojun
Date: 2016
Type: Text , Conference proceedings , Conference paper
Relation: 23rd IEEE International Conference on Image Processing, ICIP 2016; Phoenix, United States; 25th-28th September 2016; published in Proceedings - International Conferenec on Image Processing, ICIP Vol. 2016-August, p. 859-863
Full Text: false
Reviewed:
Description: Approximation of the geometric features is an essential step in point cloud segmentation and surface reconstruction. Often, the planar surfaces are estimated using principal component analysis (PCA), which is sensitive to noise and smooths the sharp features. Hence, the segmentation results into unreliable reconstructed surfaces. This article presents a point cloud segmentation method for building detection and roof plane extraction. It uses PCA for saliency feature estimation including surface curvature and point normal. However, the point normals around the anisotropic surfaces are approximated using a consistent isotropic sub-neighbourhood by Low-Rank Subspace with prior Knowledge (LRSCPK). The developed segmentation technique is tested using two real-world samples and two benchmark datasets. Per-object and per-area completeness and correctness results indicate the robustness of the approach and the quality of the reconstructed surfaces and extracted buildings. © 2016 IEEE.
Description: Proceedings - International Conference on Image Processing, ICIP

A performance review of recent corner detectors

Authors: Awrangjeb, Mohammad , Lu, Guojun
Date: 2013
Type: Text , Conference paper
Relation: International Conference on Digital Image Computing: Techniques and Applications, 26 November 2013 to 28 November 2013 p. 157-164
Full Text:
Reviewed:
Description: Contour-based corner detectors directly or indirectly estimate a significance measure (eg, curvature) on the points of a planar curve and select the curvature extrema points as corners. A number of promising contour-based corner detectors have recently been proposed. They mainly differ in how the curvature is estimated on each point of the given curve. As the curvature on a digital curve can only be approximated, it is important to estimate a curvature that remains stable against significant noises, for example, geometric transformations and compression, on the curve. Moreover, in many applications, for instance, in content-based image retrieval, a fast corner detector is a prerequisite. So, it is also a primary characteristic that how much time a corner detector takes for corner detection in a given image. In addition, different authors evaluated their detectors on different platforms using different evaluation systems. Evaluation systems that depend on human judgements and visual identification of corners are manual and too subjective. Application of a manual system on a large test database will be expensive. Therefore, it is important to evaluate the detectors on a common platform using an automatic evaluation system. This paper first reviews six most recent and highly performed corner detectors and analyse their theoretical running time. Then it uses an automatic evaluation system to analyse their performance. Both the robustness to noise and efficiency are estimated to rank the detectors.

Building roof plane extraction from LIDAR data

Authors: Awrangjeb, Mohammad , Lu, Guojun
Date: 2013
Type: Text , Conference paper
Relation: 2013 International Conference on Digital Image Computing: Techniques and Applications (DICTA)
Full Text:
Reviewed:
Description: This paper presents a new segmentation technique to use LIDAR point cloud data for automatic extraction of building roof planes. The raw LIDAR points are first classified into two major groups: ground and non-ground points. The ground points are used to generate a 'building mask' in which the black areas represent the ground where there are no laser returns below a certain height. The non-ground points are segmented to extract the planar roof segments. First, the building mask is divided into small grid cells. The cells containing the black pixels are clustered such that each cluster represents an individual building or tree. Second, the non-ground points within a cluster are segmented based on their coplanarity and neighbourhood relations. Third, the planar segments are refined using a rule-based procedure that assigns the common points among the planar segments to the appropriate segments. Finally, another rule-based procedure is applied to remove tree planes which are generally small in size and randomly oriented. Experimental results on three Australian sites have shown that the proposed method offers high building detection and roof plane extraction rates.

Detection of structural similarity for multimodal microscopic image registration

Authors: Lv, Guohua , Teng, Shyh , Lu, Guojun , Lackmann, Martin
Date: 2013
Type: Text , Conference paper
Relation: 2013 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2013
Full Text: false
Reviewed:
Description: In this paper we propose a novel method to detect the structural similarity in registering color and confocal microscopic images. Our prior work [1] presented the basic idea of detecting the structural similarity of such images, which utilizes the intensity relationships among red-green-blue color channels. The work in this paper will make the detection of structural similarity automatic and adaptive to each individual color microscopic image. The experimental results will demonstrate the effectiveness of the proposed method in detecting the structural similarity of these images and significant improvements in the registration performance.

High quality region-of-interest coding for video conferencing based remote general practitioner training

Authors: Murshed, Manzur , Siddique, Md Atiur Rahman , Islam, Saikat , Ali, Mortuza , Lu, Guojun , Villanueva, Elmer , Brown, James
Date: 2013
Type: Text , Conference paper
Relation: Proceedings of the International Conference on eHealth, Telemedicine, and Social Medicine (eTELEMED 2013), Wilmington, DE, 1st October 2013 pg 240-245
Full Text: false
Reviewed:

Integration of LIDAR data and orthoimage for automatic 3D building roof plane extraction

Authors: Awrangjeb, Mohammad , Fraser, Clive , Lu, Guojun
Date: 2013
Type: Text , Conference paper
Relation: 2013 IEEE International Conference on Multimedia and Expo (ICME)
Full Text:
Reviewed:
Description: Automatic 3D extraction of building roofs from remotely sensed data is important for many applications including city modeling. This paper proposes a new method for automatic 3D roof extraction through an effective integration of LIDAR (Light Detection And Ranging) data and multispectral orthoimagery. Using the ground height from a DEM (Digital Elevation Model), the raw LIDAR points are separated into two groups. The first group contains the ground points that are exploited to constitute a `ground mask'. The second group contains the non-ground points which are segmented using an innovative image line guided segmentation technique to extract the roof planes. The image lines extracted from the grey-scale version of the orthoimage are classified into several classes such as `ground', `tree', `roof edge' and `roof ridge' using the ground mask and colour and texture information from the orthoimagery. During roof plane extraction the lines from the later two classes are used to fit roof planes to the neighbouring non-ground LIDAR points. Finally, a new rule-based procedure is applied to remove planes constructed on trees. Experimental results show that the proposed method successfully removes vegetation and offers high extraction rates.

Maximizing structural similarity in multimodal biomedical microscopic images for effective registration

Authors: Lv, Guohua , Teng, Shyh , Lu, Guojun , Lackmann, Martin
Date: 2013
Type: Text , Conference paper
Relation: 2013 IEEE International Conference on Multimedia and Expo (ICME)
Full Text: false
Reviewed:
Description: Multimodal image registration (MMIR) is the alignment of contents in images captured from different sensors or instruments. MMIR is important in medical applications as it enables the visualization of the complementary contents in biomedical microscopic images. The registration for such images can be challenging as the structures of their contents are usually only partially similar. Thus in this paper, we propose a new method to maximize the structural similarity of the contents in such images by utilizing intensity relationships among Red-Green-Blue color channels. Our experimental results will demonstrate that our proposed method substantially improves the accuracy of registering such images as compared to the state-of-the-art methods.

Optimizing cepstral features for audio classification

Authors: Fu, Zhouyu , Lu, Guojun , Ting, Kaiming , Zhang, Dengsheng
Date: 2013
Type: Text , Conference paper
Relation: International Joint Conference on Artificial Intelligence p. 1330-1336
Full Text: false
Reviewed:
Description: Cepstral features have been widely used in audio applications. Domain knowledge has played an important role in designing different types of cepstral features proposed in the literature. In this paper, we present a novel approach for learning optimized cepstral features directly from audio data to better discriminate between different categories of signals in classification tasks. We employ multi-layer feedforward neural networks to model the cepstral feature extraction process. The network weights are initialized to replicate a reference cepstral feature like the mel frequency cepstral coefficient. We then propose a embedded approach that integrates feature learning with the training of a support vector machine (SVM) classifier. A single optimization problem is formulated where the feature and classifier variables are optimized simultaneously so as to refine the initial features and minimize the classification risk. Experimental results have demonstrated the effectiveness of the proposed feature learning approach, outperforming competing methods by a large margin on benchmark data.

Showing items 1 - 20 of 50

An Attention-Based Approach for Single Image Super Resolution

Enhanced colour image retrieval with cuboid segmentation

A performance review of recent corner detectors

Building roof plane extraction from LIDAR data

Integration of LIDAR data and orthoimage for automatic 3D building roof plane extraction