Quarter-Pixel Accuracy Motion Estimation (ME) - A Novel ME Technique in HEVC

Veiw figure View Figure

2.2. Interpolation Process of Luma Sample

In Figure 2.3 the positions labeled with upper-case letters Ai,j, represent the available luma samples at integer sample locations, whereas the other positions labelled with lower-case letters represent samples at non integer sample locations, which need to be generated by interpolation. The samples labelled a_0,0, b_0,0, c_0,0, d_0,0, h_0,0, and n_0,0and are derived from the samples by applying the eight-tap filter for half-sample positions ^[5] and the seven-tap filter for the quarter-sample positions as follows:

(2.1)

(2.2)

(2.3)

(2.4)

(2.5)

(2.6)

where the constant B ≥ 8 is the bit depth of the reference samples (and typically B = 8 for most applications) and the filter coefficient values for luma is given in Table 1 ^{[4, 5]}. In these formulae >> denotes an arithmetic right shift operation.

Table 1. Filter coefficients for luma fractional sample interpolation in HEVC

Download as

View current table in a new window

Tables index

Veiw figure View Table

View next table

The samples labelled e_0,0, i_0,0, p_0,0, f_0,0, j_0,0, q_0,0, g_0,0, k_0,0 and r_0,0can be derived by applying the corresponding filters to samples located at vertically adjacent a_0,_j, b_0,_j and c_0,_j positions as follows:

(2.7)

(2.8)

(2.9)

(2.10)

(2.11)

(2.12)

(2.13)

(2.14)

(2.15)

2.3. Interpolation Process of Chrominance Sample

Figure 2 shows the positions of the integer pixel sample, 1/2 pixel sample, 1/4 pixel sample, 1/8 pixel sample of the chrominance components of the reference image. It is supposed that chrominance sample point B_{i, j} is located at the integer sample point (xB _i,_j, yB_{i, j}), then the predicted value from chrominance point ‘ab_0,0’ to ‘hh _0,0’ at non-integer sample positions can be obtained by the 4-beat filter with the coefficientient is given in Table 2 ^[5].

Table 2. Filter coefficients for chroma sample interpolation in HEVC

Download as

View current table in a new window

Tables index

Veiw figure View Table

View previous table

View next table

The values of 1/2 pixel points ae_0,0, ea_0,0; 1/4 pixel point ac_0,0, ag_0,0, ca_0,0, ga_0,0; and 1/8 pixel point ab_0,0, ad_0,0, af_0,0, ah_0,0, ba_0,0, da_0,0, fa_0,0, ha _0,0 can be obtained by using filter interpolation mentioned in the Table 2 on the nearest integer pixel in the horizontal and vertical directions and similarly the value of sub-pixel sample point bX_0,0, cX_0,0, dX_0,0, eX_0,0, fX_0,0, gX_0,0 and hX_0,0 (among which, X presents any one in b, c, d, e, f, g and h) can be obtained by the 4-beat filter interpolation in the vertical direction.

Figure 2. Positions of Integer Sample Point and Non-integer Sample Point in the Interpolation of Chrominance

Download as

Veiw figure View Figure

3. Simulation Results & Analysis

3.1. Results of Quarter-pixel Motion Estimation

To illustrate the implementation result of quarter-pixel motion estimation in HEVC, experiment have been carried out using MATLAB. Motion vectors are obtained by using simple 2D Logarithmic search algorithm. The characteristics of the motion activities of the blocks in the current frame are predicted using this temporal information. As discussed, we implemented the Quarter-pixel interpolation algorithm with implementation of 2D Logarithmic search to find the motion vector, predicted frame with PSNR and residual with motion compensation and the corresponding 3D mesh plot of residual.

The experiment has been carried out by taking different CTU size i.e. 8X8, 16X16, 32X32, and 64X64 of different video frames such are AVI, DIVX and YUV. For 8x8, 16X16, 32X32, and 64X64 CTU size the search block size 10x10, 20x20, 40x40 and 70x70 respectively. ME results of only 8x8 CTU of AVI, 16x16 CTU of DIVX and 32x32 CTU of YUV video frames are shown in Figure 3, Figure 4, and Figure 5.

3.2. Results of ME by Taking 8x8 CTU of an AVI Video Frame

In order to find the motion vector, predicted frame with PSNR and with motion compensation and the corresponding 3D mesh plot of residual of an AVI video frames, here the CTU size is considered as 8x8 and the searching block size around CTU is 10x10.

Figure 3. Results of ME by taking 8x8 CTU of AVI video Frames

Download as

Veiw figure View Figure

3.3. Results of ME by Taking 16x16 CTU of a DivX Video Frame

In order to find the motion vector, predicted frame with PSNR and with motion compensation and the corresponding 3D mesh plot of residual of a DIVX video frames, here the CTU size is considered as 16x16 and the searching block size around CTU is 20X20.

Figure 4. Results of ME by taking 16x16 CTU of DIVX video Frames

Download as

Veiw figure View Figure

3.4. Results of ME by Taking 32x32 CTU of a YUV Video Frame

In order to find the motion vector, predicted frame with PSNR and with motion compensation and the corresponding 3D mesh plot of residual of a YUV video frames, here the CTU size is considered as 32x32 and the searching block size around CTU is 40X40.

Figure 5. Results of ME by taking 32x32 CTU of YUV video Frames

Download as

Veiw figure View Figure

4. Result Analysis

The H.265 encoding method has been complicated by the development of new coding tools. Among those tools, the quarter pixel accuracy motion estimation and compensation enhance compression gain and to reduce memory area and data bit rate and it requires the implementation of complex interpolation filters and increases the ME complexity. Here the scheme was divided into four steps; in the first step the sub-pixel ME for the 8×8 and 16×16, 32x32 and 64x64 block has been used.

The result shows that with increase of size of CTU block, the PSNR of predicted frame gradually decreases and also the number of motion vector reduces as given in Table 3. The PSNR of original candidate frame was 27.2146dB, 27.7167dB and 28.7961 dB with respect to reference frame for AVI, DIVX and YUV video frames respectively.

Table 3. PSNR of different Video Frames with different size CTU

Download as