رکورد قبلیرکورد بعدی

" Adaptive analysis and processing of structured multilingual documents "


Document Type : Latin Dissertation
Language of Document : English
Record Number : 52512
Doc. No : TL22466
Call number : ‭3202600‬
Main Entry : Huanfeng Ma
Title & Author : Adaptive analysis and processing of structured multilingual documents\ Huanfeng Ma
College : University of Maryland, College Park
Date : 2006
Degree : Ph.D.
student score : 2006
Page No : 182
Abstract : Digital document processing is becoming popular for applications to office and library automation, bank and postal services, publishing houses and communication management. In recent years, the demand for tools capable of searching written and spoken sources of multilingual information has increased tremendously, where the bilingual dictionary is one of the important resource to provide the required information. Processing and analysis of bilingual dictionaries brought up the challenges of dealing with many different scripts, some of which are unknown to the designer. A framework is presented to adaptively analyze and process structured multilingual documents, where adaptability is applied to every step. The proposed framework involves: (1) General word-level script identification using Gebor filter. (2) Font classification using the grating cell operator. (3) General word-level style identification using Gaussian mixture model. (4) An adaptable Hindi OCR based on generalized Hausdorff image comparison. (5) Retargetable OCR with automatic training sample creation and its applications to different scripts. (6) Bootstrapping entry segmentation, which segments each page into functional entries for parsing. Experimental results working on different scripts, such as Chinese, Korean, Arabic, Devanagari, and Khmer, demonstrate that the proposed framework can save human efforts significantly by making each phase adaptive.
Subject : Applied sciences; Computer vision; Documents; Multilingual; Pattern recognition; Electrical engineering; Computer science; Artificial intelligence; 0984:Computer science; 0544:Electrical engineering; 0800:Artificial intelligence
Added Entry : R. D. Chellappa, David S.
Added Entry : University of Maryland, College Park
کپی لینک

پیشنهاد خرید
پیوستها
عنوان :
نام فایل :
نوع عام محتوا :
نوع ماده :
فرمت :
سایز :
عرض :
طول :
3202600_8682.pdf
3202600.pdf
پایان نامه لاتین
متن
application/octet-stream
7.72 MB
85
85
نظرسنجی
نظرسنجی منابع دیجیتال

1 - آیا از کیفیت منابع دیجیتال راضی هستید؟