Search code examples
textwekatext-processing

Kurdish font convert to symbol in weka software


enter image description hereKurdish font is like Arabic and Persian when I put Kurdish documents into weka after apply "StringtoWordvector" filter the text change to symbol as shown in the above picture.


Solution

  • Linux uses UTF-8 by default and will have no problem displaying languages other than English. If you are on Windows (which uses a different encoding for historic reasons), please have a look at the FAQ Can I process UTF-8 datasets or files?