I am building a rule based transliteration method for Arabizi-Arabic:
and the notebook here:
uses the method to filter away stopwords. I haven't tested the stopwords removal's effect on the classification model but I would appreciate if you can let me know if this notebook is useful or not.
All the best,