Fast algorithm analysis and bit-serial architecture design for sub-pixel motion estimation in H.264
dc.authorid | 0000-0001-9486-3226 | |
dc.authorid | 0000-0002-6842-1528 | |
dc.authorid | 0000-0002-7379-8397 | |
dc.contributor.author | Fatemi, Mohammad Reza Hosseiny | en_US |
dc.contributor.author | Ateş, Hasan Fehmi | en_US |
dc.contributor.author | Salleh, Rosli Bin | en_US |
dc.date.accessioned | 2015-01-15T23:01:34Z | |
dc.date.available | 2015-01-15T23:01:34Z | |
dc.date.issued | 2010-12 | |
dc.department | Işık Üniversitesi, Mühendislik Fakültesi, Elektrik-Elektronik Mühendisliği Bölümü | en_US |
dc.department | Işık University, Faculty of Engineering, Department of Electrical-Electronics Engineering | en_US |
dc.description | This work was supported in part by the Ministry of Higher Education, Malaysia, under Grant FRGS FP094/2007c. We would like to thank the reviewers of this paper for their helpful comments and suggestions which improved our paper. In addition, we would like to thank ARM and Silterra Malaysia for providing the standard cell libraries under the university program and Trans-Dist Engineering for its technical support. | en_US |
dc.description.abstract | Sub-pixel motion estimation (SME), together with the interpolation of reference frames, is a computationally intensive part of the H.264 encoder that increases the memory requirement 16 times for each reference frame. Because of the large computational complexity and memory requirement of H.264 SME, its hardware architecture design is an important issue, especially in high-resolution or low-power applications. To address these difficulties, we propose several optimization techniques at both the algorithm and architecture levels. At the algorithm level, we propose a parabolic-based algorithm for SME with quarter-pixel accuracy which reduces the computational budget by 94.35% and the memory access requirement by 98.5% in comparison to the standard interpolate-and-search method. In addition, a fast version of the proposed algorithm is presented that reduces the computational budget by a further 46.28% while maintaining the video quality. At the architecture level, we propose a novel bit-serial architecture for our algorithm. Owing to the advantages of bit-serial design, the architecture has a low gate count, a high operating frequency, low-density interconnection, and a reduced number of I/O pins. Several optimization techniques, including sum of absolute differences truncation, source sharing, and power-saving techniques, are also applied to the proposed architecture to reduce power consumption and area. Our design saves 57.71-90.01% of the area cost and improves the macroblock (MB) processing speed by 1.7-8.44 times compared to previous designs. Implementation results show that our design supports the HD1080 format in real time with a gate count of 20.3 k at an operating frequency of 144.9 MHz. | en_US |
dc.description.sponsorship | Ministry of Education, Malaysia | en_US |
dc.description.version | Publisher's version | en_US |
dc.identifier.citation | Fatemi, M. R. H., Ateş, H. F. & Salleh, R. B. (2010). Fast algorithm analysis and bit-serial architecture design for sub-pixel motion estimation in H.264. Journal of Circuits, Systems, and Computers, 19(8), 1665-1687. doi:10.1142/S0218126610006980 | en_US |
dc.identifier.doi | 10.1142/S0218126610006980 | |
dc.identifier.endpage | 1687 | |
dc.identifier.issn | 0218-1266 | |
dc.identifier.issn | 1793-6454 | |
dc.identifier.issue | 8 | |
dc.identifier.scopus | 2-s2.0-78650095801 | |
dc.identifier.scopusquality | Q3 | |
dc.identifier.startpage | 1665 | |
dc.identifier.uri | https://hdl.handle.net/11729/356 | |
dc.identifier.uri | http://dx.doi.org/10.1142/S0218126610006980 | |
dc.identifier.volume | 19 | |
dc.identifier.wos | WOS:000285107500003 | |
dc.identifier.wosquality | Q4 | |
dc.indekslendigikaynak | Web of Science | en_US |
dc.indekslendigikaynak | Scopus | en_US |
dc.indekslendigikaynak | Science Citation Index Expanded (SCI-EXPANDED) | en_US |
dc.institutionauthor | Ateş, Hasan Fehmi | en_US |
dc.institutionauthorid | 0000-0002-6842-1528 | |
dc.language.iso | en | en_US |
dc.peerreviewed | Yes | en_US |
dc.publicationstatus | Published | en_US |
dc.publisher | World Scientific Publishing Company | en_US |
dc.relation.ispartof | Journal of Circuits, Systems, and Computers | en_US |
dc.relation.publicationcategory | Article - International Peer-Reviewed Journal - Institutional Faculty Member | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.subject | Video compression | en_US |
dc.subject | Sub-pixel motion estimation | en_US |
dc.subject | H.264 standard | en_US |
dc.subject | Bit-serial architecture | en_US |
dc.subject | VLSI architecture | en_US |
dc.subject | Encoder | en_US |
dc.subject | Reuse | en_US |
dc.subject | Algorithms | en_US |
dc.subject | Budget control | en_US |
dc.subject | Computational complexity | en_US |
dc.subject | Design | en_US |
dc.subject | Estimation | en_US |
dc.subject | Image compression | en_US |
dc.subject | Optimization | en_US |
dc.subject | Pixels | en_US |
dc.subject | Standards | en_US |
dc.subject | Video signal processing | en_US |
dc.subject | Algorithm level | en_US |
dc.subject | Area cost | en_US |
dc.subject | Computational budget | en_US |
dc.subject | Fast algorithms | en_US |
dc.subject | Gate count | en_US |
dc.subject | H.264 encoders | en_US |
dc.subject | H.264 standards | en_US |
dc.subject | Hardware architecture design | en_US |
dc.subject | High resolution | en_US |
dc.subject | High-speed operation | en_US |
dc.subject | I/O pins | en_US |
dc.subject | Low density | en_US |
dc.subject | Low power application | en_US |
dc.subject | Macro block | en_US |
dc.subject | Memory access | en_US |
dc.subject | Memory requirements | en_US |
dc.subject | Operation frequency | en_US |
dc.subject | Optimization techniques | en_US |
dc.subject | Power consumption | en_US |
dc.subject | Power savings | en_US |
dc.subject | Processing speed | en_US |
dc.subject | Proposed architectures | en_US |
dc.subject | Quarter-pixel | en_US |
dc.subject | Real time | en_US |
dc.subject | Reference frame | en_US |
dc.subject | Search method | en_US |
dc.subject | Subpixel motion estimation | en_US |
dc.subject | Sum of absolute differences | en_US |
dc.subject | Video quality | en_US |
dc.subject | Motion estimation | en_US |
dc.title | Fast algorithm analysis and bit-serial architecture design for sub-pixel motion estimation in H.264 | en_US |
dc.type | Article | en_US |
dspace.entity.type | Publication |
Files
Original bundle
- Name: 356.pdf
- Size: 563.62 KB
- Format: Adobe Portable Document Format
- Description: Publisher's Version
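
The abstract describes a parabolic-based algorithm for quarter-pixel SME but this record does not give its details. As a rough illustration of the general idea only, the sketch below fits a one-dimensional parabola through three sum-of-absolute-differences (SAD) values around the best integer-pixel position and snaps the minimum to quarter-pixel precision. The function names, the separate treatment of the x and y axes, and the clamping range of ±0.75 are assumptions made for this example and are not taken from the paper.

```python
def parabolic_offset(s_minus, s_zero, s_plus):
    """Minimum of the parabola through (-1, s_minus), (0, s_zero), (1, s_plus).

    Returns the sub-pixel offset relative to the integer position 0.
    """
    denom = s_minus - 2 * s_zero + s_plus
    if denom <= 0:
        # Flat or non-convex cost: no meaningful minimum, stay at integer position.
        return 0.0
    return 0.5 * (s_minus - s_plus) / denom


def quantize_quarter(offset, limit=0.75):
    """Round an offset to the nearest quarter pixel and clamp it (assumed range)."""
    q = round(offset * 4) / 4
    return max(-limit, min(limit, q))


def sub_pixel_mv(sad_x, sad_y):
    """Estimate a quarter-pixel refinement from SAD triples along each axis.

    sad_x and sad_y are (left/up, center, right/down) SAD values measured at
    integer-pixel positions; this separable treatment is an assumption.
    """
    dx = quantize_quarter(parabolic_offset(*sad_x))
    dy = quantize_quarter(parabolic_offset(*sad_y))
    return dx, dy


if __name__ == "__main__":
    # SAD is lower on the right than on the left, so the minimum shifts right.
    print(sub_pixel_mv((10, 4, 6), (5, 4, 5)))
```

A scheme of this kind needs only SAD values at integer positions, so no interpolated reference frame has to be stored or searched, which is consistent with the savings in computation and memory access that the abstract reports.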