jeremykun.com
Word Segmentation, or Makingsenseofthis
A First Look at Google’s N-Gram Corpus In this post we will focus on the problem of finding the appropriate word boundaries in strings like “homebuiltairplanes”, as is common in w…