Following the idea of EM, a language model is built incrementally by collecting the fractional counts of patterns (such as bigram pairs) from all the segmentation candidates of a sentence.
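- Following the idea of EM, all of the segmentation results for each sentence (or those within a certain range) form the training set; from this training set and an initial language model, a new language model can be estimated.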
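
The procedure described in the sentence above can be sketched as one EM iteration. This is a minimal illustration, not the method of any particular paper: the function names (segmentations, score, em_step), the max_len limit on word length, the floor probability for unseen bigrams, and the "<s>" / "</s>" boundary markers are all assumptions made for the example. It enumerates every candidate exhaustively, which is only practical for short sentences.

```python
from collections import defaultdict

def segmentations(sentence, max_len=4):
    """Yield every split of `sentence` into words of at most `max_len` characters."""
    if not sentence:
        yield []
        return
    for i in range(1, min(max_len, len(sentence)) + 1):
        for rest in segmentations(sentence[i:], max_len):
            yield [sentence[:i]] + rest

def bigrams(words):
    """Bigram pairs of a segmentation, padded with boundary markers (assumed)."""
    return list(zip(["<s>"] + words, words + ["</s>"]))

def score(words, bigram_prob, floor=1e-6):
    """Probability of one segmentation under the current bigram model.
    Unseen bigrams receive the assumed `floor` probability."""
    p = 1.0
    for pair in bigrams(words):
        p *= bigram_prob.get(pair, floor)
    return p

def em_step(sentences, bigram_prob, max_len=4):
    """One EM iteration: returns a re-estimated bigram model."""
    counts = defaultdict(float)
    # E-step: weight each candidate by its posterior under the current model
    # and add that weight to every bigram it contains (fractional counts).
    for sentence in sentences:
        candidates = list(segmentations(sentence, max_len))
        weights = [score(c, bigram_prob) for c in candidates]
        total = sum(weights)
        for cand, w in zip(candidates, weights):
            posterior = w / total
            for pair in bigrams(cand):
                counts[pair] += posterior
    # M-step: normalize the fractional counts per left-hand context.
    context_totals = defaultdict(float)
    for (prev, _), c in counts.items():
        context_totals[prev] += c
    return {pair: c / context_totals[pair[0]] for pair, c in counts.items()}
```

Calling em_step repeatedly, feeding each returned model back in as bigram_prob, gives the incremental build-up the sentence refers to. A practical system would typically restrict the candidates (for example to an n-best list) rather than enumerate all of them.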