KMS Chongqing Institute of Green and Intelligent Technology, CAS
Minimum-risk training for semi-Markov conditional random fields with application to handwritten Chinese/Japanese text recognition | |
Zhou, Xiang-Dong1; Zhang, Yan-Ming2; Tian, Feng3; Wang, Hong-An3; Liu, Cheng-Lin2 | |
2014-05-01 | |
摘要 | Semi-Markov conditional random fields (semi-CRFs) are usually trained with maximum a posteriori (MAP) criterion which adopts the 0/1 cost for measuring the loss of misclassification. In this paper, based on our previous work on handwritten Chinese/Japanese text recognition (HCTR) using semi-CRFs, we propose an alternative parameter learning method by minimizing the risk on the training set, which has unequal misclassification costs depending on the hypothesis and the ground-truth. Based on this framework, three non-uniform cost functions are compared with the conventional 0/1 cost, and training data selection is incorporated to reduce the computational complexity. In experiments of online handwriting recognition on databases CASIA-OLHWDB and THAT Kondate, we compared the performances of the proposed method with several widely used learning criteria, including conditional log-likelihood (CLL), softmax-margin (SMM), minimum classification error (MCE), large-margin MCE (LM-MCE) and max-margin (MM). On the test set (online handwritten texts) of ICDAR 2011 Chinese handwriting recognition competition, the proposed method outperforms the best system in competition. (C) 2013 Elsevier Ltd. All rights reserved. |
关键词 | Semi-markov Conditional Random Fields Minimum-risk Training Character String Recognition |
DOI | 10.1016/j.patcog.2013.12.002 |
发表期刊 | PATTERN RECOGNITION |
ISSN | 0031-3203 |
卷号 | 47期号:5页码:1904-1916 |
收录类别 | SCI |
WOS记录号 | WOS:000331667400011 |
语种 | 英语 |