I am trying to build a lexical model for Nepal Bhasa. I followed the instructions, collected about 21k words (I don't have word frequencies), and built one: https://github.com/sapradhan/lexical-models/tree/newa
However I am getting unexpected predictions.
After typing two characters, I was expecting suggestions that begin with those two characters. There are such words in the wordlist (on the right); however, I get something else.
The typed in letters are
The suggested words are:
- U+1140e U+11400 U+11443
- U+1140e U+11402 U+11410 U+11438
- U+1140e U+11411 U+11435 U+11411
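To make my expectation concrete, here is a minimal sketch of the prefix matching I had in mind. This is only my assumption about the behaviour, not how the model works internally. The function name `words_with_prefix` is made up for illustration, the sample wordlist is just the three suggestions above, and the typed prefix is hypothetical (it is not the actual pair of letters from my test).

```python
def words_with_prefix(wordlist, typed):
    """Return the words that start with the typed characters, in wordlist order."""
    return [w for w in wordlist if w.startswith(typed)]


# The three suggested words from this post, written as code points.
wordlist = [
    "\U0001140E\U00011400\U00011443",
    "\U0001140E\U00011402\U00011410\U00011438",
    "\U0001140E\U00011411\U00011435\U00011411",
]

# Hypothetical typed prefix: U+1140E U+11402 (an assumption for illustration).
typed = "\U0001140E\U00011402"

# Only words beginning with both typed characters should remain.
matches = words_with_prefix(wordlist, typed)

# With nothing typed, every word matches, so a plain prefix filter
# cannot decide which words to show on its own.
no_input_matches = words_with_prefix(wordlist, "")
```

With this kind of filter I would expect only the second word for that prefix, not all three.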
Also, how does the model suggest words when nothing is typed? This is what I currently get:
Maybe my understanding is wrong; please let me know.