Practical Image/Video Quality Metric

   Fan ZHANG, Songnan LI, Lin MA (supervised by Profs. King Ngi NGAN, Weisi LIN, Wenyu LIU)

 

History

     When noting a difference between things that have been compared,

                           we do not perceive the difference between the things,

                                  but the ratio of the difference to their magnitude.

                                                                                                             

                                                                                 -----  Weber (1834)

 

 
Aguiloniusernst_heinrich_weber

 

 

 

The basic principle of photometry

depicted by Rubens in book Aguilonius (1613)

 
 

 

 

 


Category

 

FEATURE

a. Difference between the impaired and the reference

b. Deviance from the impaired to the prior

c. Impairment configurations in processing chain of data

d. Any heuristic features

MODELING

METHOD

A. Casual modeling

PSNR, SSIM, MS-SSIM, CW-SSIM, FISM, VIF, PSNR-HVS-M,VSNR, MAD, SEDLAI, MOVIE, ABT-JND.

JNB

–—

–—

B. Regression

VQM (ITU J.149)

BLIINDS

ITU G.1070

–—

C. Statistical learning

CAEN

DIIVINE, BIQI

QA-CBP, DT

, Q

ACQUINE,PPRSS

Casual models _____      Descriptive models _____        Discriminative models _____

 

Criteria

                                Ground truth of subjective quality databases

Image

LIVE, IRCCyN-IVC, Toyama, A57, TID, WIQ, LAR, CSIQ; Fourier SB, BA, Meerwald

Video

IVP HD, LIVE, LIVE wireless, VQEG HD, EPFL-PolimI, NYU, VQEG FRTV I, NTU

                                         Goodness-of-fitting criterion

RMSE

Assumption of homoscedastic Gaussian distribution of data

Pearson correlation

Invariant with linear transform (fitting)

Spearman correlation

Invariant with monotonic transform. Applicable to ordinal data

Kendall correlation

Invariant with monotonic transform. Applicable to the rank pairs (even breaking the rank transitivity)

Outlier ratios

Based on confidential interval of data

RMSE*

Based on Gaussian distribution and confidential interval of data

Deviance of likelihood [4]

Adaptive to the prior distribution of data

 

Our Proposals

       Our proposals conform to the general form of distortion metric:

X and Y represent the reference and the distorted images respectively. Image difference is calculated at first,

and then is decomposed into multiple channels, denoted by operator H. We use average filter [1], DCT [2], and

KLT [3]. Finally, the image difference in each channel is weighted by the diagonal matrix M. Several HVS related

features may be encapsulated into the weighting matrix M, we only employ texture masking effect [1~3] and

contrast sensitive function (CSF) [2~3]. We learn the CSF associated with DCT domain [2], while find that the

Eigen-values are good CSF in KLT domain and happen to be opposite to Manhalanobis distance [3]. The proposed

metrics are simple, fast to be optimized, and easy to be embedded into perceptual processing system.

The above proposals aim at ordinal match between the objective predictions and the subjective scores.  For

numerical match, it is necessary to monotonically map the objective predictions toward the subjective scores.

Traditionally, metric design and monotonic map are separated. We unified the two steps into a single framework,

i.e.. Additive Log-logistic Model (ALM) framework.

Given totally M independent databases where the m-th database contain totally Nm samples, metric design is

posed as a maximum likelihood problem as

with a general log-logistic quality model as

where feature vectors are x1, x2, …; sum performs the spatial pooling, scale pooling, or impairments combination;

parameters {α,β,γ} are independent with databases while {a,b} are adaptive with each database.

ALM has regularized procedures of feature selection and parameter estimation, and also supports co-training on

multiple databases. Following this methodology, we developed

1)    a no-reference video quality metric mainly based on the impairment-relevant configurations (feature c) [4], which

has won the model competition of standardization work by ITU-T Study Group 12 in June, 2012.

2)    a full-reference image quality metric based on the feature a, which discloses the functional structure of V1 [5].

 

Publications

1. F. Zhang,  S. Li,  L. Ma, and   K. N. Ngan,  “Limitation and challenges of Image Quality Measurement,”

    presented in SPIE Visual Communications and Image Processing conference, 2010, pp. 1-8. [ code ] [ pdf ].

2. F. Zhang, L. Ma,  S. Li,  and  K. N. Ngan,  "Practical image quality metric applied to image coding,"

     IEEE Trans. Multimedia, vol. 13, pp. 615-624, 2011 (recommended by IEEE MMTC R-letter). [ code ]  [ pdf ]

3. F. Zhang, W. Liu, W. Lin, K. N. Ngan, “Spread spectrum image watermarking based on perceptual quality metric,”

     IEEE Trans. Image Processing, vol. 20, pp. 3207-3218, 2011. [ code ] [ pdf ]

4. F. Zhang, W. Lin, Z. Chen and K. N. Ngan, “Additive log-logistic model for video quality assessment,

IEEE Trans. Image Processing. vol. 22, pp. 1536-1547, 2013 (adopted by Standard ITU-T P.1202.2 Mode 2). [ pdf ]

5. F. Zhang, W. Jiang, F. Autrusseau, W. Lin, “Exploring V1 by modeling the perceptual quality of images,

    Journal of Vision [ code (parameter estimation for Additive Log-logisitic Model)]  [pdf]

More Resources

Third-party Matlab toolbox of existing metrics: MeTriX Mux VQA Package

Dr. Stefan Winkler’s link: Image and video quality resources

 

Last update: Dec. 10, 2013