Kurdish font is like Arabic and Persian when I put Kurdish documents into weka after apply "StringtoWordvector" filter the text change to symbol as shown in the above picture.
Linux uses UTF-8 by default and will have no problem displaying languages other than English. If you are on Windows (which uses a different encoding for historic reasons), please have a look at the FAQ Can I process UTF-8 datasets or files?