Voxel-based extraction of individual pylons and wires from lidar point cloud data
- Authors: Munir, Nosheen , Awrangjeb, Mohammad , Stantic, Bela , Lu, Guojun , Islam, Syed
- Date: 2019
- Type: Text , Journal article
- Relation: ISPRS annals of the photogrammetry, remote sensing and spatial information sciences Vol. IV-4/W8, no. (2019), p. 91-98
- Full Text:
- Reviewed:
- Description: Extraction of individual pylons and wires is important for modelling of 3D objects in a power line corridor (PLC) map. However, the existing methods mostly classify points into distinct classes like pylons and wires, but hardly into individual pylons or wires. The proposed method extracts standalone pylons, vegetation and wires from LiDAR data. The extraction of individual objects is needed for a detailed PLC mapping. The proposed approach starts off with the separation of ground and non ground points. The non-ground points are then classified into vertical (e.g., pylons and vegetation) and non-vertical (e.g., wires) object points using the vertical profile feature (VPF) through the binary support vector machine (SVM) classifier. Individual pylons and vegetation are then separated using their shape and area properties. The locations of pylons are further used to extract the span points between two successive pylons. Finally, span points are voxelised and alignment properties of wires in the voxel grid is used to extract individual wires points. The results are evaluated on dataset which has multiple spans with bundled wires in each span. The evaluation results show that the proposed method and features are very effective for extraction of individual wires, pylons and vegetation with 99% correctness and 98% completeness.
The impact of global and local features on multiple sequence alignment clustering-based near-duplicate video retrieval
- Authors: Wang, Yandan , Lu, Guojun , Belkhatir, Mohammed , Messom, Christopher
- Date: 2013
- Type: Text , Conference paper
- Relation: 14th Pacific-Rim Conference on Multimedia p. 669-677
- Full Text: false
- Reviewed:
- Description: Traditionally, the performance of Near-Duplicate Video Retrieval (NDVR) is enhanced through different video features, matching scheme and indexing methods. The video features have been intensively investigated and it has been shown that local features outperform global features in terms of accuracy. However, local features have the expensive computational problem. Therefore, indexing structure is introduced to assist in scaling up, whilst the accuracy will drop slightly or dramatically in most time by using indexing approaches. Recent progress shows that NDVR based on clustering could reduce searching space while maintains equivalent retrieval accuracy compared to that of non-clustering based. In this paper, we will continue to evaluate clustering based NDVR, but using popular global and local features. Before conducting NDVR, dataset will be pre-processed offline into groups by using clustering algorithm that near-duplicate videos (NDVs) are assembled in the same cluster. Each cluster will be represented by member video or the centroid. The query video will then be compared to the representative videos instead of all videos in database (non-clustering based). Our experiment shows that clustering-based NDVR using global and local features outperforms than that of non-clustering based in terms of both retrieval accuracy and speed.
Texture classification using multimodal invariant local binary pattern
- Authors: Sadat, Rafi , Teng, Shyh , Lu, Guojun , Hasan, Sheikh
- Date: 2011
- Type: Text , Conference paper
- Relation: IEEE Workshop on Applications of Computer Vision (WACV) p. 315-320
- Full Text: false
- Reviewed:
- Description: As texture information among pixels can be effectively represented using Local binary patterns (LBPs), image descriptors built using LBPs or its variants have been frequently used for various image analysis applications, e.g. medical image and texture image classification and retrieval. However, neither LBP nor any of its existing variants can be used to build descriptors for classifying multimodal images effectively. This is because the same object when captured in different modalities may result in opposite pixel intensity in some corresponding parts of the images, which in turn will cause their descriptors to be very different. To solve this problem, we propose a novel modality invariant texture descriptor which is built by modifying the standard procedure for building LBP. In this paper, we explain how the proposed descriptor can be built efficiently. We also demonstrate empirically that compared to all the state of the art LBP-based descriptors, the proposed descriptor achieves better accuracy for classifying multimodal images
Structural image retrieval using automatic image annotation and region based inverted file
- Authors: Zhang, Dengsheng , Islam, Md , Lu, Guojun
- Date: 2013
- Type: Text , Journal article
- Relation: Journal of Visual Communication and Image Representation Vol. 24, no. 7 (2013), p. 1087-1098
- Full Text: false
- Reviewed:
- Description: Image retrieval has lagged far behind text retrieval despite more than two decades of intensive research effort. Most of the research on image retrieval in the last two decades are on content based image retrieval or image retrieval based on low level features. Recent research in this area focuses on semantic image retrieval using automatic image annotation. Most semantic image retrieval techniques in literature, however, treat an image as a bag of features/words while ignore the structural or spatial information in the image. In this paper, we propose a structural image retrieval method based on automatic image annotation and region based inverted file. In the proposed system, regions in an image are treated the same way as keywords in a structural text document, semantic concepts are learnt from image data to label image regions as keywords and weight is assigned to each keyword according to spatial position and relationship. As the result, images are indexed and retrieved in the same way as structural document retrieval. Specifically, images are broken down to regions which are represented using colour, texture and shape features. Region features are then quantized to create visual dictionaries which are similar to monolingual dictionaries like English or Chinese dictionaries. In the next step, a semantic dictionary similar to a bilingual dictionary like the English–Chinese dictionary is learnt to mapping image regions to semantic concepts. Finally, images are then indexed and retrieved using a novel region based inverted file data structure. Results show the proposed method has significant advantage over the widely used Bayesian annotation models.
Spherical harmonics and distance transform for image representation and retrieval
- Authors: Sajjanhar, Atul , Lu, Guojun , Zhang, Dengsheng , Hou, Jingyu , Chen, Yi-Ping Phoebe
- Date: 2009
- Type: Text , Conference paper
- Relation: Proceedings of the Intelligent Data Engineering and Automated Learning p. 309-316
- Full Text: false
- Reviewed:
- Description: In this paper, we have proposed a method for 2D image retrieval based on object shapes. The method relies on transforming the 2D images into 3D space based on distance transform. Spherical harmonics are obtained for the 3D data and used as descriptors for the underlying 2D images. The proposed method is compared against two existing methods which use spherical harmonics for shape based retrieval of images. MPEG-7 Still Images Content Set is used for performing experiments; this dataset consists of 3621 still images. Experimental results show that the performance of the proposed descriptors is significantly better than other methods in the same category.
Siamese network for object tracking with multi-granularity appearance representations
- Authors: Zhang, Zhuoyi , Zhang, Yifeng , Cheng, Xu , Lu, Guojun
- Date: 2021
- Type: Text , Journal article
- Relation: Pattern Recognition Vol. 118, no. (2021), p.
- Full Text: false
- Reviewed:
- Description: A reliable tracker has the ability to adapt to change of objects over time, and is robust and accurate. We build such a tracker by extracting semantic features using robust Siamese networks and multi-granularity color features. It incorporates a semantic model that can capture high quality semantic features and an appearance model that can describe object at pixel, local and global levels effectively. Furthermore, we propose a novel selective traverse algorithm to allocate weights to semantic models and appearance models dynamically for better tracking performance. During tracking, our tracker updates appearance representations for objects based on the recent tracking results. The proposed tracker operates at speeds that exceed the real-time requirement, and outperforms nearly all other state-of-the-art trackers on OTB-2013/2015 and VOT-2016/2017 benchmarks. © 2021 Elsevier Ltd
Semantic image retrieval using region based inverted file
- Authors: Zhang, Dengsheng , Islam, Md , Lu, Guojun , Hou, Jin
- Date: 2009
- Type: Text , Journal article
- Relation: Journal of Visual Communication and Image Representation Vol. 24, no. 7 (2009), p.242-249
- Full Text: false
- Reviewed:
- Description: Image retrieval has lagged far behind text retrieval despite more than two decades of intensive research effort. Most of the research on image retrieval in the last two decades are on content based image retrieval or image retrieval based on low level features. Recent research in this area focuses on semantic image retrieval using automatic image annotation. Most semantic image retrieval techniques in literature, however, treat an image as a bag of features/words while ignore the structural or spatial information in the image. In this paper, we propose a structural image retrieval method based on automatic image annotation and region based inverted file. In the proposed system, regions in an image are treated the same way as keywords in a structural text document, semantic concepts are learnt from image data to label image regions as keywords and weight is assigned to each keyword according to spatial position and relationship. As the result, images are indexed and retrieved in the same way as structural document retrieval. Specifically, images are broken down to regions which are represented using colour, texture and shape features. Region features are then quantized to create visual dictionaries which are similar to monolingual dictionaries like English or Chinese dictionaries. In the next step, a semantic dictionary similar to a bilingual dictionary like the English–Chinese dictionary is learnt to mapping image regions to semantic concepts. Finally, images are then indexed and retrieved using a novel region based inverted file data structure. Results show the proposed method has significant advantage over the widely used Bayesian annotation models.
Segmentation of airborne point cloud data for automatic building roof extraction
- Authors: Gilani, Syed , Awrangjeb, Mohammad , Lu, Guojun
- Date: 2018
- Type: Text , Journal article
- Relation: GIScience & Remote Sensing Vol. 55, no. 1 (2018), p. 63-89
- Full Text:
- Reviewed:
- Description: Roof plane segmentation is a complex task since point cloud data carry no connection information and do not provide any semantic characteristics of the underlying scanned surfaces. Point cloud density, complex roof profiles, and occlusion add another layer of complexity which often encounter in practice. In this article, we present a new technique that provides a better interpolation of roof regions where multiple surfaces intersect creating non-manifold points. As a result, these geometric features are preserved to achieve automated identification and segmentation of the roof planes from unstructured laser data. The proposed technique has been tested using the International Society for Photogrammetry and Remote Sensing benchmark and three Australian datasets, which differ in terrain, point density, building sizes, and vegetation. The qualitative and quantitative results show the robustness of the methodology and indicate that the proposed technique can eliminate vegetation and extract buildings as well as their non-occluding parts from the complex scenes at a high success rate for building detection (between 83.9% and 100% per-object completeness) and roof plane extraction (between 73.9% and 96% per-object completeness). The proposed method works more robustly than some existing methods in the presence of occlusion and low point sampling as indicated by the correctness of above 95% for all the datasets.
Rotation invariant spatial pyramid matching for image classification
- Authors: Karmakar, Priyabrata , Teng, Shyh , Lu, Guojun , Zhang, Dengsheng
- Date: 2015
- Type: Text , Conference proceedings
- Full Text: false
- Description: This paper proposes a new Spatial Pyramid representation approach for image classification. Unlike the conventional Spatial Pyramid, the proposed method is invariant to rotation changes in the images. This method works by partitioning an image into concentric rectangles and organizing them into a pyramid. Each pyramidal region is then represented using a histogram of visual words. Our experimental results show that our proposed method significantly outperforms the conventional method. © 2015 IEEE.
Rotation invariant curvelet features for texture image retrieval
- Authors: Islam, Md , Zhang, Dengsheng , Lu, Guojun
- Date: 2009
- Type: Text , Conference paper
- Relation: Proceedings of the 2009 IEEE International Conference on Multimedia and Expo p. 562-565
- Full Text: false
- Reviewed:
- Description: Effective texture feature is an essential component in any content based image retrieval system. In the past, spectral features, like Gabor and wavelet, have shown superior retrieval performance than many other statistical and structural based features. Recent researches on multi-resolution analysis have found that curvelet captures texture properties, like curves, lines, and edges, more accurately than Gabor filters. However, the texture feature extracted using curvelet transform is not rotation invariant. This can degrade its retrieval performance significantly, especially in cases where there are many similar images with different orientations. This paper analyses the curvelet transform and derives a useful approach to extract rotation invariant curvelet features. Experimental results show that the new rotation invariant curvelet feature outperforms the curvelet feature without rotation invariance.
Rotation invariant curvelet features for region based image retrieval
- Authors: Zhang, Dengsheng , Islam, Md , Lu, Guojun , Sumana, Ishrat
- Date: 2011
- Type: Text , Journal article
- Relation: International Journal of Computer Vision Vol. 98, no. 2 (2011), p. 187-201
- Full Text: false
- Reviewed:
- Description: There have been much interest and a large amount of research on content based image retrieval (CBIR) in recent years due to the ever increasing number of digital images. Texture features play a key role in CBIR. Many texture features exist in literature, however, most of them are neither rotation invariant nor robust to scale and other variations. Texture features based on Gabor filters have been shown with significant advantages over other methods, and they are adopted by MPEG-7 as one of the texture descriptors for image retrieval. In this paper, we propose a rotation invariant curvelet features for texture representation. With systematic analysis and rigorous experiments, we show that the proposed curvelet texture features significantly outperforms the widely used Gabor texture features. A novel region padding method is also proposed to apply curvelet transform to region based image retrieval. Retrieval results from standard image databases show that curvelet features are promising for both texture and region representation.
Robust image corner detection based on the chord-to-point distance accumulation technique
- Authors: Awrangjeb, Mohammad , Lu, Guojun
- Date: 2008
- Type: Text , Journal article
- Relation: IEEE Transactions on Multimedia, vol. 10, no. 6, IEEE, p. 1059-1072
- Full Text:
Robust image classification using a low-pass activation function and DCT augmentation
- Authors: Hossain, Md Tahmid , Teng, Shyh , Sohel, Ferdous , Lu, Guojun
- Date: 2021
- Type: Text , Journal article
- Relation: IEEE Access Vol. 9, no. (2021), p. 86460-86474
- Full Text:
- Reviewed:
- Description: Convolutional Neural Network's (CNN's) performance disparity on clean and corrupted datasets has recently come under scrutiny. In this work, we analyse common corruptions in the frequency domain, i.e., High Frequency corruptions (HFc, e.g., noise) and Low Frequency corruptions (LFc, e.g., blur). Although a simple solution to HFc is low-pass filtering, ReLU - a widely used Activation Function (AF), does not have any filtering mechanism. In this work, we instill low-pass filtering into the AF (LP-ReLU) to improve robustness against HFc. To deal with LFc, we complement LP-ReLU with Discrete Cosine Transform based augmentation. LP-ReLU, coupled with DCT augmentation, enables a deep network to tackle the entire spectrum of corruption. We use CIFAR-10-C and Tiny ImageNet-C for evaluation and demonstrate improvements of 5% and 7.3% in accuracy respectively, compared to the State-Of-The-Art (SOTA). We further evaluate our method's stability on a variety of perturbations in CIFAR-10-P and Tiny ImageNet-P, achieving new SOTA in these experiments as well. To further strengthen our understanding regarding CNN's lack of robustness, a decision space visualisation process is proposed and presented in this work. © 2013 IEEE.
Robust building roof segmentation using airborne point cloud data
- Authors: Gilani, Syed , Awrangjeb, Mohammad , Lu, Guojun
- Date: 2016
- Type: Text , Conference proceedings , Conference paper
- Relation: 23rd IEEE International Conference on Image Processing, ICIP 2016; Phoenix, United States; 25th-28th September 2016; published in Proceedings - International Conferenec on Image Processing, ICIP Vol. 2016-August, p. 859-863
- Full Text: false
- Reviewed:
- Description: Approximation of the geometric features is an essential step in point cloud segmentation and surface reconstruction. Often, the planar surfaces are estimated using principal component analysis (PCA), which is sensitive to noise and smooths the sharp features. Hence, the segmentation results into unreliable reconstructed surfaces. This article presents a point cloud segmentation method for building detection and roof plane extraction. It uses PCA for saliency feature estimation including surface curvature and point normal. However, the point normals around the anisotropic surfaces are approximated using a consistent isotropic sub-neighbourhood by Low-Rank Subspace with prior Knowledge (LRSCPK). The developed segmentation technique is tested using two real-world samples and two benchmark datasets. Per-object and per-area completeness and correctness results indicate the robustness of the approach and the quality of the reconstructed surfaces and extracted buildings. © 2016 IEEE.
- Description: Proceedings - International Conference on Image Processing, ICIP
Reversible data hiding in encrypted images based on image partition and spatial correlation
- Authors: Song, Chang , Zhang, Yifeng , Lu, Guojun
- Date: 2019
- Type: Text , Conference proceedings , Conference paper
- Relation: 17th International Workshop on Digital Forensics and Watermarking, IWDW 2018; Jeju Island, South Korea; 22nd-24th October 2018; Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 11378 LNCS, p. 180-194
- Full Text: false
- Reviewed:
- Description: Recently, more and more attention is paid to reversible data hiding (RDH) in encrypted images because of its better protection of privacy compared with traditional RDH methods directly operated in original images. In several RDH algorithms, prediction-error expansion (PEE) is proved to be superior to other methods in terms of embedding capacity and distortion of marked image and multiple histograms modification (MHM) can realize adaptive selection of expansion bins which depends on image content in the modification of a sequence of histograms. Therefore, in this paper, we propose an efficient RDH method in encrypted images by combining PEE and MHM, and design corresponding mode of image partition. We first divide the image into three parts: W (for embedding secret data), B (for embedding the least significant bit(LSB) of W) and G (for generating prediction-error histograms). Then, we apply PEE and MHM to embed the LSB of W to reserve space for secret data. Next, we encrypt the image and change the LSB of W to realize the embedding of secret data. In the process of extraction, the reversibility of image and secret data can be guaranteed. The utilization of correlation between neighbor pixels and embedded order decided by the smoothness of pixel in part W contribute to the performance of our method. Compared to the existing algorithms, experimental results show that the proposed method can reduce distortion to the image at given embedding capacity especially at low embedding capacity.
Reversible data hiding based on directional prediction and multiple histograms modification
- Authors: Song, Chang , Zhang, Yifeng , Lu, Guojun
- Date: 2017
- Type: Text , Conference paper
- Relation: 9th International Conference on Wireless Communications, WCSP 2017; Nanjing, China; 11th-13th October 2017 p. 1-6
- Full Text: false
- Reviewed:
- Description: The reversible data hiding is an emerging technology that uses the redundancy of the carrier (typically digital images) to embed secret information and ensure the reversibility of the carrier and hidden information. In recent year, a number of reversible data hiding algorithms based on prediction error expansion have been developed. In prediction error expansion, prediction on the center pixel is made based on its neighbor pixels. The data embedding is conducted by the modification on the histogram made from prediction error expansion. Therefore, the accuracy of prediction on pixel is the key to improve the performance of the algorithm. In this paper, we propose a new reversible data hiding based on directional prediction and multiple histograms modification and design the corresponding reversible hiding rules. Compared to the existing algorithms, experimental results show that the proposed method can reduce distortion to the image at given embedding capacity.
Region-based image retrieval with high-level semantics using decision tree learning
- Authors: Liu, Ying , Zhang, Dengsheng , Lu, Guojun
- Date: 2008
- Type: Text , Journal article
- Relation: Pattern Recognition Vol. 41, no. 8 (2008), p. 2554-2570
- Full Text: false
- Reviewed:
- Description: Semantic-based image retrieval has attracted great interest in recent years. This paper proposes a region-based image retrieval system with high-level semantic learning. The key features of the system are: (1) it supports both query by keyword and query by region of interest. The system segments an image into different regions and extracts low-level features of each region. From these features, high-level concepts are obtained using a proposed decision tree-based learning algorithm named DT-ST. During retrieval, a set of images whose semantic concept matches the query is returned. Experiments on a standard real-world image database confirm that the proposed system significantly improves the retrieval performance, compared with a conventional content-based image retrieval system. (2) The proposed decision tree induction method DT-ST for image semantic learning is different from other decision tree induction algorithms in that it makes use of the semantic templates to discretize continuous-valued region features and avoids the difficult image feature discretization problem. Furthermore, it introduces a hybrid tree simplification method to handle the noise and tree fragmentation problems, thereby improving the classification performance of the tree. Experimental results indicate that DT-ST outperforms two well-established decision tree induction algorithms ID3 and C4.5 in image semantic learning.
Region based color image retrieval using curvelet transform
- Authors: Islam, Md , Zhang, Dengsheng , Lu, Guojun
- Date: 2010
- Type: Text , Conference paper
- Relation: Proceedings of the 9th Asian Conference on Computer Vision p. 448-457
- Full Text: false
- Reviewed:
- Description: Effective texture feature is an essential component in any content based image retrieval system. In the past, spectral features, like Gabor and wavelet, have shown superior retrieval performance than many other statistical and structural based features. Recent researches on multi-resolution analysis have found that curvelet captures texture properties, like curves, lines, and edges, more accurately than Gabor filters. However, the texture feature extracted using curvelet transform is not rotation invariant. This can degrade its retrieval performance significantly, especially in cases where there are many similar images with different orientations. This paper analyses the curvelet transform and derives a useful approach to extract rotation invariant curvelet features. Experimental results show that the new rotation invariant curvelet feature outperforms the curvelet feature without rotation invariance.
Performance comparisons of contour-based corner detectors
- Authors: Awrangjeb, Mohammad , Lu, Guojun , Fraser, Clive
- Date: 2012
- Type: Text , Journal article
- Relation: IEEE Transactions on Image Processing Vol. 21, no. 9 (2012), p. 4167-4179
- Full Text: false
- Reviewed:
- Description: Abstract— Corner detectors have many applications in computer vision and image identification and retrieval. Contour-based corner detectors directly or indirectly estimate a significance measure (e.g., curvature) on the points of a planar curve, and select the curvature extrema points as corners. While an extensive number of contour-based corner detectors have been proposed over the last four decades, there is no comparative study of recently proposed detectors. This paper is an attempt to fill this gap. The general framework of contour-based corner detection is presented, and two major issues – curve smoothing and curvature estimation, which have major impacts on the corner detection performance, are discussed. A number of promising detectors are compared using both automatic and manual evaluation systems on two large datasets. It is observed that while the detectors using indirect curvature estimation techniques are more robust, the detectors using direct curvature estimation techniques are faster.
Optimizing cepstral features for audio classification
- Authors: Fu, Zhouyu , Lu, Guojun , Ting, Kaiming , Zhang, Dengsheng
- Date: 2013
- Type: Text , Conference paper
- Relation: International Joint Conference on Artificial Intelligence p. 1330-1336
- Full Text: false
- Reviewed:
- Description: Cepstral features have been widely used in audio applications. Domain knowledge has played an important role in designing different types of cepstral features proposed in the literature. In this paper, we present a novel approach for learning optimized cepstral features directly from audio data to better discriminate between different categories of signals in classification tasks. We employ multi-layer feedforward neural networks to model the cepstral feature extraction process. The network weights are initialized to replicate a reference cepstral feature like the mel frequency cepstral coefficient. We then propose a embedded approach that integrates feature learning with the training of a support vector machine (SVM) classifier. A single optimization problem is formulated where the feature and classifier variables are optimized simultaneously so as to refine the initial features and minimize the classification risk. Experimental results have demonstrated the effectiveness of the proposed feature learning approach, outperforming competing methods by a large margin on benchmark data.