CSpace  > 智能安全技术研究中心
Minimum-risk training for semi-Markov conditional random fields with application to handwritten Chinese/Japanese text recognition
Zhou, Xiang-Dong1; Zhang, Yan-Ming2; Tian, Feng3; Wang, Hong-An3; Liu, Cheng-Lin2
2014-05-01
摘要

Semi-Markov conditional random fields (semi-CRFs) are usually trained with maximum a posteriori (MAP) criterion which adopts the 0/1 cost for measuring the loss of misclassification. In this paper, based on our previous work on handwritten Chinese/Japanese text recognition (HCTR) using semi-CRFs, we propose an alternative parameter learning method by minimizing the risk on the training set, which has unequal misclassification costs depending on the hypothesis and the ground-truth. Based on this framework, three non-uniform cost functions are compared with the conventional 0/1 cost, and training data selection is incorporated to reduce the computational complexity. In experiments of online handwriting recognition on databases CASIA-OLHWDB and THAT Kondate, we compared the performances of the proposed method with several widely used learning criteria, including conditional log-likelihood (CLL), softmax-margin (SMM), minimum classification error (MCE), large-margin MCE (LM-MCE) and max-margin (MM). On the test set (online handwritten texts) of ICDAR 2011 Chinese handwriting recognition competition, the proposed method outperforms the best system in competition. (C) 2013 Elsevier Ltd. All rights reserved.

关键词Semi-markov Conditional Random Fields Minimum-risk Training Character String Recognition
DOI10.1016/j.patcog.2013.12.002
发表期刊PATTERN RECOGNITION
ISSN0031-3203
卷号47期号:5页码:1904-1916
收录类别SCI
WOS记录号WOS:000331667400011
语种英语