One of you asked me what are the prospects of NLP? In [1] PC Magazine calls NL one of the top ten technologies of tomorrow.
Introductory Papers on Statistical Methods in NLP:
[1] takes a detailed look at the current statistical techniques in NLP. In [2] and [3] stochastic grammars are the main focus. [4] is a summary of current research in statistical methods in NLP. It does a good job of placing most current work in this field in perspective.
n-gram Language Models:
[1] presents a tutorial introduction to n-gram language modeling, and surveys the most widely-used smoothing algorithms for such models. The presentation slides in [2] cover the state-of-the-art in language modeling.
Search Engines:
[1] presents the technical details of Google. [2] describes a new technique for summarising the information found in Web pages in a coherent snippet. Currently, for example, in Yahoo! and the Open Directory Project, this is done by thousands of human editors.
Latent Semantic Analysis:
[1] gives good insight on SVD and latent semantic representation of documents with applications in speech recognition and understanding. In [2] a cognitive science perspective on LSA and its applications is presented. [3] covers an application of LSA to comprehend students' answers in an automatic tutoring system.
Hidden Markov Models:
A comprehensive coverage of HMMs along with extensive references.
Topics in Information Retrieval:
The presentation slides in [1] give an introduction to IR. [2] gives the technical details of the Porter Stemmer.
Statistical Machine Translation:
[2] gives the details for SMT. It explains the IBM Models in detail. [1] is a tutorial based on this paper.
| HOME | RESEARCH | TEACHING | PUBLICATIONS | PATENTS | PERSONAL |