#include <lm.h>
◆ access_type
Updated on every lm_{tg,bg,ug}_score call to reflect the kind of n-gram accessed: 3 for 3-gram, 2 for 2-gram and 1 for 1-gram
◆ bg
◆ bg32
Bigram 32 bits, NULL iff disk-based
◆ bg_seg_sz
◆ bgoff
BG offsets into DMP file (used iff disk-based)
◆ bgprob
Table of actual bigram probs
◆ byteswap
Whether this file is in the WRONG byte order
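As a rough illustration of what this flag implies for code that reads the file, the sketch below reverses the bytes of a 32-bit value only when byteswap is set. The helper names swap_u32 and load_u32 are hypothetical and are not part of lm.h; this is a minimal sketch under that assumption, not the library's own loading code.

```c
#include <stdint.h>

/* Hypothetical helper (not from lm.h): reverse the byte order of a 32-bit value. */
static uint32_t swap_u32(uint32_t x)
{
    return ((x & 0x000000FFu) << 24) |
           ((x & 0x0000FF00u) << 8)  |
           ((x & 0x00FF0000u) >> 8)  |
           ((x & 0xFF000000u) >> 24);
}

/* Hypothetical helper: convert a raw value from the file to host byte order.
 * byteswap plays the role of lm_s::byteswap (nonzero means "wrong" order). */
static uint32_t load_u32(uint32_t raw, int byteswap)
{
    return byteswap ? swap_u32(raw) : raw;
}
```

Every multi-byte field read from such a file would go through a helper of this kind, so the rest of the code can ignore the file's byte order.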
◆ dict2lmwid
a mapping from dictionary word to LM word
◆ dict_size
Only used in class-based LM, because class-based LM is addressed in the dictionary space.
◆ finishlwid
S3_FINISH_WORD id, if it exists
◆ fp
◆ HT
hash table for word-string->word-id map
◆ inclass_ugscore
int32* lm_s::inclass_ugscore
An array of inter-class unigram probabilities
◆ inputenc
◆ is32bits
Whether the current LM is 32 bits or not. Derived from version and n_ug
◆ isLM_IN_MEMORY
int32 lm_s::isLM_IN_MEMORY
Whether the LM is in memory. Because this is a per-LM property, the code could potentially allow some models to be disk-based while others are not.
◆ lmclass
◆ log_bg_seg_sz
uint32 lm_s::log_bg_seg_sz
◆ logmath
◆ lw
Language weight currently in effect for this LM
◆ max_sorted_entries
int32 lm_s::max_sorted_entries
Temporary Variable: 2x the maximum size of the MAX_SORTED_ENTRIES
◆ max_ug
Maximum value to which n_ug can grow with dynamic addition of words
◆ membg
membg[w1] = bigrams for lm wid w1 (used iff disk-based)
◆ membg32
membg, 32 bits: membg[w1] = bigrams for lm wid w1 (used iff disk-based)
◆ n_bg
◆ n_bg_bo
#bg_score ops backed off to ug
◆ n_bg_fill
◆ n_bg_inmem
◆ n_bg_score
◆ n_bgprob
◆ n_lmclass
◆ n_ng
If unigram, n_ng=1; if bigram, n_ng=2; and so on
◆ n_tg
◆ n_tg_bo
#tg_score ops backed off to bg
◆ n_tg_fill
Similar stats for trigrams
◆ n_tg_inmem
◆ n_tg_score
◆ n_tgbowt
◆ n_tgcache_hit
int32 lm_s::n_tgcache_hit
# of trigram cache hit ops backed off to bg
◆ n_tgprob
◆ n_ug
◆ name
◆ outputenc
◆ sorted_bowt2
Temporary Variable: Sorted list
◆ sorted_prob2
Temporary Variable: Sorted list
◆ sorted_prob3
Temporary Variable: Sorted list
◆ startlwid
S3_START_WORD id, if it exists
◆ tg
◆ tg32
Trigram 32 bits, NULL iff disk-based
◆ tg_segbase
tg_segbase[i>>lm_t.log_bg_seg_sz] = index of 1st trigram for bigram segment (i>>lm_t.log_bg_seg_sz)
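The indexing rule above can be sketched as follows. The struct seg_view_t and both functions are hypothetical stand-ins written for this illustration, not declarations from lm.h. The sketch also assumes that each bigram stores its first-trigram position as an offset relative to its segment's base, which the description above suggests but does not state.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical, simplified view of the two lm_s fields involved. */
typedef struct {
    uint32_t       log_bg_seg_sz; /* log2 of the number of bigrams per segment */
    const int32_t *tg_segbase;    /* index of the 1st trigram for each bigram segment */
} seg_view_t;

/* Segment number of bigram i: i >> log_bg_seg_sz. */
static int32_t bg_segment(const seg_view_t *v, int32_t bg_index)
{
    assert(bg_index >= 0);
    return bg_index >> v->log_bg_seg_sz;
}

/* Absolute index of the first trigram of bigram i, ASSUMING the bigram
 * carries a small offset (rel_first_tg) relative to its segment's base. */
static int32_t first_trigram(const seg_view_t *v, int32_t bg_index,
                             uint16_t rel_first_tg)
{
    return v->tg_segbase[bg_segment(v, bg_index)] + (int32_t)rel_first_tg;
}
```

With an illustrative log_bg_seg_sz of 9 (512 bigrams per segment), bigrams 0 to 511 share tg_segbase[0] and bigram 512 is the first to use tg_segbase[1]. Storing one base per segment lets the per-bigram offset stay narrow even when the total trigram count needs 32 bits.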
◆ tgbowt
Table of actual trigram backoff weights
◆ tgcache
<w0,w1,w2> is hashed to an entry in this array. Only the last trigram mapping to any given hash entry is kept in that entry. (The cache doesn't have to be super-efficient.)
◆ tgcache32
◆ tginfo
tginfo[w2] = fast trigram access info for bigrams (*,w2)
◆ tginfo32
tginfo, 32 bits: tginfo[w2] = fast trigram access info for bigrams (*,w2)
◆ tgoff
TG offsets into DMP file (used iff disk-based)
◆ tgprob
Table of actual trigram probs
◆ ug
◆ version
The version number of the LM; specifically, the version of the LM that was most recently read in.
◆ wip
logs3(word insertion penalty) in effect for this LM
◆ wordstr
The LM word list (in unigram order)
The documentation for this struct was generated from the following file:
- /builddir/build/BUILD/sphinx3-0.8/include/lm.h