Laser & Optoelectronics Progress, Vol. 58, Issue 20, 2010020 (2021)
Character Segmentation for Historical Uchen Tibetan Document Based on Structure Attributes
Fig. 1. Tibetan syllable. (a) Structure of a Tibetan syllable; (b) example of a Tibetan syllable; (c) examples of Tibetan transliterations of Sanskrit
Fig. 4. Vertical segmentation process by projection of the historical Tibetan document. (a) Document line and its vertical projection; (b) character blocks in rectangular area
Fig. 6. Examples of the character segmentation challenges. (a) Segmentation challenges above the baseline; (b) segmentation challenges below the baseline
Fig. 11. Examples of multipath segmentation. (a) Combination example; (b) marked skeleton diagram; (c) segmentation path
Fig. 13. Types of broken strokes below the baseline. (a) Cross left and right; (b) cross up and down; (c) separate up and down; (d) contain
Fig. 14. Examples of stroke attribution classification. (a) With no stroke above the baseline and with no broken stroke below the baseline; (b) with strokes above the baseline and with no broken stroke below the baseline; (c) with no stroke above the baseline and with broken strokes below the baseline; (d) with strokes above the baseline and with broken strokes below the baseline
Fig. 15. Process of local baseline detection and horizontal segmentation of character block. (a) Character blocks with syllable points; (b) character blocks with no syllable point and with no stroke above the baseline; (c) character blocks with no syllable point and with strokes above the baseline
Fig. 17. Character segmentation with a touching stroke. (a) Character direction is D1; (b) character direction is D2
Fig. 20. Statistical results of broken strokes below the baseline. (a) Cross left and right; (b) cross up and down; (c) separate up and down; (d) contain
Fig. 21. Attribution based on the horizontal distance of the centroid. (a) Character block; (b) centroid of strokes after attribution
Fig. 25. Results of character segmentation. (a) Character block; (b) block after character segmentation
Fig. 26. Wrong character segmentation caused by strokes attribution. (a) Character block; (b) local baseline and horizontal segmentation; (c) broken stroke mark; (d) result of character segmentation
Fig. 27. Wrong character segmentation caused by the baseline detection. (a) Character block; (b) horizontal projection; (c) Hough straight line detection; (d) local baseline; (e) result of character segmentation
Ce Zhang, Weilan Wang. Character Segmentation for Historical Uchen Tibetan Document Based on Structure Attributes[J]. Laser & Optoelectronics Progress, 2021, 58(20): 2010020
Category: Image Processing
Received: Jan. 8, 2021
Accepted: Mar. 3, 2021
Published Online: Nov. 3, 2021
The Author Email: Weilan Wang (wangweilan@xbmu.edu.cn)