I am building a rule based transliteration method for Arabizi-Arabic:
https://github.com/AMR-KELEG/Franco-Arabic-Transliterator
and the notebook here:
https://gist.github.com/AMR-KELEG/3b63d5966087aee8b37cbf39c4c1830e
uses the method to filter away stopwords. I haven't tested the stopwords removal's effect on the classification model but I would appreciate if you can let me know if this notebook is useful or not.
All the best,
Amr
okay. Great idea.