Practical Image Quality Metric

Practical Image/Video Quality Metric

Fan ZHANG, Songnan LI, Lin MA (supervised by Profs. King Ngi NGAN, Weisi LIN, Wenyu LIU)

History

     When noting a difference between things that have been compared,

                           we do not perceive the difference between the things,

                                  but the ratio of the difference to their magnitude.



                                                                                 ----- Weber (1834)

Aguilonius

The basic principle of photometry

depicted by Rubens in book Aguilonius (1613)

Category

		FEATURE
		a. Difference between the impaired and the reference	b. Deviance from the impaired to the prior	c. Impairment configurations in processing chain of data	d. Any heuristic features
MODELING METHOD	A. Casual modeling	PSNR, SSIM, MS-SSIM, CW-SSIM, FISM, VIF, PSNR-HVS-M,VSNR, MAD, SEDLAI, MOVIE, ABT-JND.	JNB	–—	–—
	B. Regression	VQM (ITU J.149)	BLIINDS	ITU G.1070	–—
	C. Statistical learning	CAEN	DIIVINE, BIQI	QA-CBP, DT	, Q ACQUINE,PPRSS

Casual models _____ Descriptive models _____ Discriminative models _____

Criteria

Ground truth of subjective quality databases

Image	LIVE, IRCCyN-IVC, Toyama, A57, TID, WIQ, LAR, CSIQ; Fourier SB, BA, Meerwald
Video	IVP HD, LIVE, LIVE wireless, VQEG HD, EPFL-PolimI, NYU, VQEG FRTV I, NTU

Goodness-of-fitting criterion

RMSE	Assumption of homoscedastic Gaussian distribution of data
Pearson correlation	Invariant with linear transform (fitting)
Spearman correlation	Invariant with monotonic transform. Applicable to ordinal data
Kendall correlation	Invariant with monotonic transform. Applicable to the rank pairs (even breaking the rank transitivity)
Outlier ratios	Based on confidential interval of data
RMSE*	Based on Gaussian distribution and confidential interval of data
Deviance of likelihood [4]	Adaptive to the prior distribution of data

Our Proposals

Our proposals conform to the general form of distortion metric:

X and Y represent the reference and the distorted images respectively. Image difference is calculated at first,

and then is decomposed into multiple channels, denoted by operator H. We use average filter [1], DCT [2], and

KLT [3]. Finally, the image difference in each channel is weighted by the diagonal matrix M. Several HVS related

features may be encapsulated into the weighting matrix M, we only employ texture masking effect [1~3] and

contrast sensitive function (CSF) [2~3]. We learn the CSF associated with DCT domain [2], while find that the

Eigen-values are good CSF in KLT domain and happen to be opposite to Manhalanobis distance [3]. The proposed

metrics are simple, fast to be optimized, and easy to be embedded into perceptual processing system.

The above proposals aim at ordinal match between the objective predictions and the subjective scores. For

numerical match, it is necessary to monotonically map the objective predictions toward the subjective scores.

Traditionally, metric design and monotonic map are separated. We unified the two steps into a single framework,

i.e.. Additive Log-logistic Model (ALM) framework.

Given totally M independent databases where the m-th database contain totally Nm samples, metric design is

posed as a maximum likelihood problem as

with a general log-logistic quality model as

where feature vectors are x1, x2, …; sum performs the spatial pooling, scale pooling, or impairments combination;

parameters {α,β,γ} are independent with databases while {a,b} are adaptive with each database.

ALM has regularized procedures of feature selection and parameter estimation, and also supports co-training on

multiple databases. Following this methodology, we developed

1) a no-reference video quality metric mainly based on the impairment-relevant configurations (feature c) [4], which

has won the model competition of standardization work by ITU-T Study Group 12 in June, 2012.

2) a full-reference image quality metric based on the feature a, which discloses the functional structure of V1 [5].

Publications

1. F. Zhang, S. Li, L. Ma, and K. N. Ngan, “Limitation and challenges of Image Quality Measurement,”

presented in SPIE Visual Communications and Image Processing conference, 2010, pp. 1-8. [ code ] [ pdf ].

2. F. Zhang, L. Ma, S. Li, and K. N. Ngan, "Practical image quality metric applied to image coding,"

IEEE Trans. Multimedia, vol. 13, pp. 615-624, 2011 (recommended by IEEE MMTC R-letter). [ code ] [ pdf ]

3. F. Zhang, W. Liu, W. Lin, K. N. Ngan, “Spread spectrum image watermarking based on perceptual quality metric,”

IEEE Trans. Image Processing, vol. 20, pp. 3207-3218, 2011. [ code ] [ pdf ]

4. F. Zhang, W. Lin, Z. Chen and K. N. Ngan, “Additive log-logistic model for video quality assessment,”

IEEE Trans. Image Processing. vol. 22, pp. 1536-1547, 2013 (adopted by Standard ITU-T P.1202.2 Mode 2). [ pdf ]

5. F. Zhang, W. Jiang, F. Autrusseau, W. Lin, “Exploring V1 by modeling the perceptual quality of images,”

Journal of Vision [ code (parameter estimation for Additive Log-logisitic Model)] [pdf]

More Resources

Third-party Matlab toolbox of existing metrics: MeTriX Mux VQA Package

Dr. Stefan Winkler’s link: Image and video quality resources

Last update: Dec. 10, 2013