رکورد قبلیرکورد بعدی

" Knowledge integration into language models: A random forest approach "


Document Type : Latin Dissertation
Language of Document : English
Record Number : 54970
Doc. No : TL24924
Call number : ‭3357015‬
Main Entry : Yi Su
Title & Author : Knowledge integration into language models: A random forest approach\ Yi Su
College : The Johns Hopkins University
Date : 2009
Degree : Ph.D.
student score : 2009
Page No : 89
Abstract : A language model (LM) is a probability distribution over all possible word sequences. It is a vital component of many natural language processing tasks, such as automatic speech recognition, statistical machine translation, information retrieval and so on. The art of language modeling has been dominated by a simple yet powerful model family, the n -gram language models. Many attempts have been made to go beyond n -grams either by proposing a new mathematical framework or by integrating more knowledge of human language, preferably both. The random forest language model (RFLM)--a collection of randomized decision tree language models--has distinguished itself as a successful effort of the former kind; we explore its potential of the latter. We start our quest by advancing our understanding of the RFLM through explorative experimentation. To facilitate further investigation, we address the problem of training the RFLM on large amount of data through an efficient disk swapping algorithm. We formalize our method of integrating various knowledge sources into language models with random forests and illustrate its applicability with three innovative applications: morphological LMs of Arabic, prosodic LMs for speech recognition and combination of syntactic and topic information in LMs.
Subject : Applied sciences; Pure sciences; Language modeling; Natural language processing; Random forest; Speech recognition; Random forest language model; Decision tree; Statistics; Electrical engineering; Computer science; 0984:Computer science; 0463:Statistics; 0544:Electrical engineering
Added Entry : F. Jelinek
Added Entry : The Johns Hopkins University
کپی لینک

پیشنهاد خرید
پیوستها
عنوان :
نام فایل :
نوع عام محتوا :
نوع ماده :
فرمت :
سایز :
عرض :
طول :
3357015_13594.pdf
3357015.pdf
پایان نامه لاتین
متن
application/octet-stream
1.08 MB
85
85
نظرسنجی
نظرسنجی منابع دیجیتال

1 - آیا از کیفیت منابع دیجیتال راضی هستید؟