Search code examples
javaunicodeutf-8character-encodingfile-encodings

Is there way to check charset encoding of .txt file with Java?


Is there way to check is text file (.txt) encoded with Unicode or UTF-8 with Java?


Solution

  • You cannot know with absolute certainty which charset is used in the general case. I found this to be a good read.

    http://illegalargumentexception.blogspot.co.uk/2009/05/java-rough-guide-to-character-encoding.html

    Especially the section Automatic detection of encoding.