Robust Latent Factor Analysis for Precise Representation of High-Dimensional and Sparse Data

doi:10.1109/JAS.2020.1003533

CSpace

	Robust Latent Factor Analysis for Precise Representation of High-Dimensional and Sparse Data
	Wu, Di1,2 ; Luo, Xin1,2
	2021-04-01
摘要	High-dimensional and sparse (HiDS) matrices commonly arise in various industrial applications, e.g., recommender systems (RSs), social networks, and wireless sensor networks. Since they contain rich information, how to accurately represent them is of great significance. A latent factor (LF) model is one of the most popular and successful ways to address this issue. Current LF models mostly adopt L-2-norm-oriented Loss to represent an HiDS matrix, i.e., they sum the errors between observed data and predicted ones with L-2-norm. Yet L-2-norm is sensitive to outlier data. Unfortunately, outlier data usually exist in such matrices. For example, an HiDS matrix from RSs commonly contains many outlier ratings due to some heedless/malicious users. To address this issue, this work proposes a smooth L-1- norm-oriented latent factor (SL-LF) model. Its main idea is to adopt smooth L-1- norm rather than L-2- norm to form its Loss, making it have both strong robustness and high accuracy in predicting the missing data of an HiDS matrix. Experimental results on eight HiDS matrices generated by industrial applications verify that the proposed SL-LF model not only is robust to the outlier data but also has significantly higher prediction accuracy than state-of-the-art models when they are used to predict the missing data of HiDS matrices.
关键词	High-dimensional and sparse matrix L-1-norm L-2-norm latent factor model recommender system smooth L-1-norm
DOI	10.1109/JAS.2020.1003533
发表期刊	IEEE-CAA JOURNAL OF AUTOMATICA SINICA
ISSN	2329-9266
卷号	8 期号:4 页码:796-805
通讯作者	Luo, Xin(luoxin21@cigit.ac.cn)
收录类别	SCI
WOS记录号	WOS:000628913100006
语种	英语

中国科学院重庆绿色智能技术研究院机构知识库