Arama Sonuçları

Listeleniyor 1 - 6 / 6
  • Yayın
    Disaster damage assessment of buildings using adaptive self-similarity descriptor
    (2016-08) Kahraman, Fatih; İmamoğlu, Mümin; Ateş, Hasan Fehmi
    Assessment of damage caused by a disaster is significant for coordinating emergency response teams and planning emergency aid. In this letter, a robust method for rapid building damage assessment is proposed using pre- and postevent EO images and building footprints. The method uses a local self-similarity descriptor (SSD) for change detection in buildings, which is shown to be robust against variations in global illumination and small local deformations. The use of building footprints helps reduce the false alarms due to changes in nonbuilding areas. Footprint is also used to differentiate small and large buildings, extract the boundary region of a building, and adapt the descriptor computation accordingly. It is shown that the adaptive SSD provides a more accurate measure of local damage on the building. The 2010 Haiti Earthquake and Typhoon Haiyan 2013 Philippines are analyzed with the proposed method, and 75/82% true positive rate and 25/15% false positive rate are obtained for detection of collapsed buildings with respect to the ground truth data of UNITAR/UNOSAT and HOT.
  • Yayın
    Spherical coding algorithm for wavelet image compression
    (IEEE-Inst Electrical Electronics Engineers Inc, 2009-05) Ateş, Hasan Fehmi; Orchard, Michael T.
    In recent literature, there exist many high-performance wavelet coders that use different spatially adaptive coding techniques in order to exploit the spatial energy compaction property of the wavelet transform. Two crucial issues in adaptive methods are the level of flexibility and the coding efficiency achieved while modeling different image regions and allocating bitrate within the wavelet subbands. In this paper, we introduce the "spherical coder," which provides a new adaptive framework for handling these issues in a simple and effective manner. The coder uses local energy as a direct measure to differentiate between parts of the wavelet subband and to decide how to allocate the available bitrate. As local energy becomes available at finer resolutions, i.e., in smaller size windows, the coder automatically updates its decisions about how to spend the bitrate. We use a hierarchical set of variables to specify and code the local energy up to the highest resolution, i.e., the energy of individual wavelet coefficients. The overall scheme is nonredundant, meaning that the subband information is conveyed using this equivalent set of variables without the need for any side parameters. Despite its simplicity, the algorithm produces PSNR results that are competitive with the state-of-art coders in literature.
  • Yayın
    3-D Mesh geometry compression with set partitioning in the spectral domain
    (IEEE-INST Electrical Electronics Engineers Inc, 2010-02) Bayazıt, Uluğ; Konur, Umut; Ateş, Hasan Fehmi
    This paper explains the development of a highly efficient progressive 3-D mesh geometry coder based on the region adaptive transform in the spectral mesh compression method. A hierarchical set partitioning technique, originally proposed for the efficient compression of wavelet transform coefficients in high-performance wavelet-based image coding methods, is proposed for the efficient compression of the coefficients of this transform. Experiments confirm that the proposed coder employing such a region adaptive transform has a high compression performance rarely achieved by other state of the art 3-D mesh geometry compression algorithms. A new, high-performance fixed spectral basis method is also proposed for reducing the computational complexity of the transform. Many-to-one mappings are employed to relate the coded irregular mesh region to a regular mesh whose basis is used. To prevent loss of compression performance due to the low-pass nature of such mappings, transitions are made from transform-based coding to spatial coding on a per region basis at high coding rates. Experimental results show the performance advantage of the newly proposed fixed spectral basis method over the original fixed spectral basis method in the literature that employs one-to-one mappings.
  • Yayın
    Multi-hypothesis contextual modeling for semantic segmentation
    (Elsevier Science BV, 2019-01-01) Ateş, Hasan Fehmi; Sünetci, Sercan
    Semantic segmentation (i.e. image parsing) aims to annotate each image pixel with its corresponding semantic class label. Spatially consistent labeling of the image requires an accurate description and modeling of the local contextual information. Segmentation result is typically improved by Markov Random Field (MRF) optimization on the initial labels. However this improvement is limited by the accuracy of initial result and how the contextual neighborhood is defined. In this paper, we develop generalized and flexible contextual models for segmentation neighborhoods in order to improve parsing accuracy. Instead of using a fixed segmentation and neighborhood definition, we explore various contextual models for fusion of complementary information available in alternative segmentations of the same image. In other words, we propose a novel MRF framework that describes and optimizes the contextual dependencies between multiple segmentations. Simulation results on two common datasets demonstrate significant improvement in parsing accuracy over the baseline approaches.
  • Yayın
    Hierarchical quantization indexing for wavelet and wavelet packet image coding
    (Elsevier Science BV, 2010-02) Ateş, Hasan Fehmi; Tamer, Engin
    In this paper, we introduce the quantization index hierarchy, which is used for efficient coding of quantized wavelet and wavelet packet coefficients. A hierarchical classification map is defined in each wavelet subband, which describes the quantized data through a series of index classes. Going from bottom to the top of the tree, neighboring coefficients are combined to form classes that represent some statistics of the quantization indices of these coefficients. Higher levels of the tree are constructed iteratively by repeating this class assignment to partition the coefficients into larger Subsets. The class assignments are optimized using a rate-distortion cost analysis. The optimized tree is coded hierarchically from top to bottom by coding the class membership information at each level of the tree. Context-adaptive arithmetic coding is used to improve coding efficiency. The developed algorithm produces PSNR results that are better than the state-of-art wavelet-based and wavelet packet-based coders in literature.
  • Yayın
    Rate-distortion and complexity optimized motion estimation for H.264 video coding
    (IEEE-INST Electrical Electronics Engineers Inc, 2008-02) Ateş, Hasan Fehmi; Altunbaşak, Yücel
    11.264 video coding standard supports several inter-prediction coding modes that use macroblock (MB) partitions with variable block sizes. Rate-distortion (R-D) optimal selection of both the motion vectors (MVs) and the coding mode of each MB is essential for an H.264 encoder to achieve superior coding efficiency. Unfortunately, searching for optimal MVs of each possible subblock incurs a heavy computational cost. In this paper, in order to reduce the computational burden of integer-pel motion estimation (ME) without sacrificing from the coding performance, we propose a R-D and complexity joint optimization framework. Within this framework, we develop a simple method that determines for each MB which partitions are likely to be optimal. MV search is carried out for only the selected partitions, thus reducing the complexity of the ME step. The mode selection criteria is based on a measure of spatiotemporal activity within the MB. The procedure minimizes the coding loss at a given level of computational complexity either for the full video sequence or for each single frame. For the latter case, the algorithm provides a tight upper bound on the worst case complexity/execution time of the ME module. Simulation results show that the algorithm speeds up integer-pel ME by a factor of up to 40 with less than 0.2 dB loss in coding efficiency.