Automatic Word Ground Truth Generation for Camera Captured Documents
スポンサーリンク
概要
- 論文の詳細を見る
A database for camera captured documents is useful to train OCRs to obtain better performance. However, no dataset exists for camera captured documents because it is very laborious and costly to build these datasets manually. In this paper, a fully automatic approach allowing building the very large scale (i. e., millions of images) labeled camera captured documents dataset is proposed. The proposed approach does not require any human intervention in labeling. Evaluation of samples generated by the proposed approach shows that more than 97% of the images are correctly labeled. Novelty of the proposed approach lies in the use of document image retrieval for automatic labeling, especially for camera captured documents, which contain different distortions specific to camera, e.g., blur, perspective distortion, etc.
- 2013-03-07
著者
-
Dengel Andreas
German Research Center For Artificial Intelligence
-
Kise Koichi
Osaka Prefecture Univ.
-
LIWICKI Marcus
University of Fribourg
-
AHMED Sheraz
German Research Center for Artificial Intelligence (DFKI)
-
IWAMURA Masakazu
Osaka Prefecture University
関連論文
- ドイツ人工知能研究センター(DFKI)の20年 : 成功への道筋とそれを可能とした人々
- Position detection for a camera pen using LLAH and dot patterns (パターン認識・メディア理解)
- Object detection in images with cluttered background by using local features and their configuration (パターン認識・メディア理解)
- Position detection for a camera pen using LLAH and dot patterns (ヒューマン情報処理)
- Effectiveness of Passage-Based Document Retrieval for Short Queries(Special Issue on Text Processing for Information Access)
- Automatic Word Ground Truth Generation for Camera Captured Documents