#############################################################################
## Copyright (c) 1996, Carnegie Mellon University, Cambridge University,
## Ronald Rosenfeld and Philip Clarkson
## Version 3, Copyright (c) 2006, Carnegie Mellon University
## Contributors includes Wen Xu, Ananlada Chotimongkol,
## David Huggins-Daines, Arthur Chan and Alan Black
#############################################################################
=============================================================================
=============== This file was produced by the CMU-Cambridge ===============
=============== Statistical Language Modeling Toolkit ===============
=============================================================================
This is a 3-gram language model, based on a vocabulary of 8 words,
which begins "", "", "backward"...
This is an OPEN-vocabulary model (type 1)
(OOVs were mapped to UNK, which is treated as any other vocabulary word)
Good-Turing discounting was applied.
1-gram frequency of frequency : 5
2-gram frequency of frequency : 11 0 0 0 1 0 0
3-gram frequency of frequency : 16 0 0 0 0 0 0
1-gram discounting ratios : 0.62
2-gram discounting ratios :
3-gram discounting ratios :
This file is in the ARPA-standard format introduced by Doug Paul.
p(wd3|wd1,wd2)= if(trigram exists) p_3(wd1,wd2,wd3)
else if(bigram w1,w2 exists) bo_wt_2(w1,w2)*p(wd3|wd2)
else p(wd3|w2)
p(wd2|wd1)= if(bigram exists) p_2(wd1,wd2)
else bo_wt_1(wd1)*p_1(wd2)
All probs and back-off weights (bo_wt) are given in log10 form.
Data formats:
Beginning of data mark: \data\
ngram 1=nr # number of 1-grams
ngram 2=nr # number of 2-grams
ngram 3=nr # number of 3-grams
\1-grams:
p_1 wd_1 bo_wt_1
\2-grams:
p_2 wd_1 wd_2 bo_wt_2
\3-grams:
p_3 wd_1 wd_2 wd_3
end of data mark: \end\
\data\
ngram 1=9
ngram 2=12
ngram 3=16
\1-grams:
-1.1761 0.0000
-0.2730 -0.8451
-98.9720 -0.8921
-1.1761 backward 0.0000
-1.1761 down -0.1461
-1.1761 forward -0.1461
-1.1761 left -0.1461
-1.1761 right -0.1461
-1.1761 up -0.1461
\2-grams:
-0.0669 -0.1413
-0.8129 backward 0.0300
-0.8129 down 0.1761
-0.8129 forward 0.1761
-0.8129 left 0.1761
-0.8129 right 0.1761
-0.8129 up 0.1761
-0.1761 down 0.5441
-0.1761 forward 0.5441
-0.1761 left 0.5441
-0.1761 right 0.5441
-0.1761 up 0.5441
\3-grams:
-0.7782 backward
-0.7782 down
-0.7782 forward
-0.7782 left
-0.7782 right
-0.3010 backward
-0.3010 down
-0.3010 forward
-0.3010 left
-0.3010 right
-0.3010 up
-0.3010 down
-0.3010 forward
-0.3010 left
-0.3010 right
-0.3010 up
\end\