• Farsi machine-printed subwords recognition using contour-based Fourier descriptors

    More details about the paper
    • Presentation date: 1391/01/01
    • Publication date on TPBin: 1391/01/01
     This paper presents a fast and simple method for recognizing Farsi/Arabic subwords from a large lexicon. By omitting the dots and complementary parts of machine-printed characters, a dataset of 9445 Farsi/Arabic subwords printed in a single font and a single size was obtained. Omitting these parts not only reduces the number of distinct subwords but also makes the dataset suitable for both Farsi and Arabic. After the boundary points of each subword are normalized, Fourier descriptor features are extracted. Experimental results on 30 plain texts show an accuracy of 82.1% at the subword level. Given the size and comprehensiveness of the dataset, these results are promising, and they could be improved in the future by using Farsi/Arabic grammar to connect subwords.
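
The abstract does not spell out how the features are computed, so the following is only a minimal sketch of one common way to build contour-based Fourier descriptors, not the authors' implementation. It assumes that "normalizing boundary points" means resampling each closed contour to a fixed number of points equally spaced along its arc length, and that each contour is already available as a list of (x, y) points traced in a consistent direction. The function names `resample_contour` and `fourier_descriptors` and the parameters `n_samples` and `n_coeffs` are hypothetical.

```python
import cmath
import math


def resample_contour(points, n):
    """Resample a closed contour to n points equally spaced by arc length."""
    closed = list(points) + [points[0]]  # repeat first point to close the loop
    cum = [0.0]                          # cumulative arc length at each vertex
    for (x0, y0), (x1, y1) in zip(closed, closed[1:]):
        cum.append(cum[-1] + math.hypot(x1 - x0, y1 - y0))
    total = cum[-1]
    if total == 0:
        raise ValueError("contour has zero length")
    out, seg = [], 0
    for i in range(n):
        target = total * i / n
        while cum[seg + 1] < target:     # advance to the segment holding target
            seg += 1
        span = cum[seg + 1] - cum[seg]
        t = (target - cum[seg]) / span if span > 0 else 0.0
        (x0, y0), (x1, y1) = closed[seg], closed[seg + 1]
        out.append((x0 + t * (x1 - x0), y0 + t * (y1 - y0)))
    return out


def fourier_descriptors(points, n_samples=64, n_coeffs=8):
    """Return n_coeffs descriptor magnitudes for a closed contour.

    Assumption: invariance is obtained the textbook way, which the abstract
    does not confirm. The DC term is dropped (translation), magnitudes are
    used (rotation and start point), and values are divided by the dominant
    first harmonic (scale).
    """
    z = [complex(x, y) for x, y in resample_contour(points, n_samples)]
    n = len(z)
    # Plain O(n^2) discrete Fourier transform of the complex boundary signal.
    coeffs = [
        sum(z[k] * cmath.exp(-2j * math.pi * u * k / n) for k in range(n)) / n
        for u in range(n)
    ]
    # The first harmonic sits at index 1 or n-1 depending on trace direction.
    scale = max(abs(coeffs[1]), abs(coeffs[n - 1]))
    if scale == 0:
        raise ValueError("degenerate contour")
    return [abs(coeffs[u]) / scale for u in range(2, 2 + n_coeffs)]
```

Under these assumptions, two contours of the same shape give nearly identical vectors even when one is shifted, scaled, rotated, or traced from a different starting point, so a lexicon lookup could, for instance, compare a query vector against stored vectors by Euclidean distance. The classifier used in the paper is not stated in the abstract.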
