Stochastic grammar
From Wikipedia, the free encyclopedia
A stochastic grammar (statistical grammar) is a grammar framework with a probabilistic notion of grammaticality:
- Stochastic context-free grammar
- Statistical parsing
- Data-oriented parsing
- Hidden Markov model
- Estimation theory
The grammar is realized as a language model. Allowed sentences are stored in a database together with the frequency how common a sentence is.[1] Statistical natural language processing uses stochastic, probabilistic and statistical methods, especially to resolve difficulties that arise because longer sentences are highly ambiguous when processed with realistic grammars, yielding thousands or millions of possible analyses. Methods for disambiguation often involve the use of corpora and Markov models. "A probabilistic model consists of a non-probabilistic model plus some numerical quantities; it is not true that probabilistic models are inherently simpler or less structural than non-probabilistic models."[2]