[Solved] What representation of chat text data should I use for user classification? [closed]

You’re asking what ML representation you should use for user-classification of chat text. bag-of-words and word-vector are the main representations generally used in text-processing. However user-classification of chat is not the usual text-processing task, we look for telltale features indicative of a specific user. Here are some: character length, word length, sentence length of each … Read more

[Solved] ValueError: shapes (4155,1445) and (4587,7) not aligned: 1445 (dim 1) != 4587 (dim 0)

Have a look at the sci-kit learn documentation for Multinomial NB. It clearly specifies the structure of the input data while trainig model.fit() must match the structure of the input data while testing or scoring model.predict(). This means that you cannot use the same model for different dataset. The only way this is possible is … Read more