java string url inputstream bufferedreader

Read site source: � characters

I'm trying to read the source code of a web page, but when it contains characters like ã, á, à, õ, I get � instead.

I've tried applying java.nio.charset.Charset.encode to the lines I read, but the result is the same.

My code is:

URLConnection connection = ...;
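// No charset is given here, so InputStreamReader decodes with the platform default.
BufferedReader reader = new BufferedReader(new InputStreamReader(connection.getInputStream()));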
String s = null;

while ((s = reader.readLine()) != null) {
  // got new source line...
}

The site I'm trying to read is this one (PT-BR).

Solution

According to the meta tag, the charset on that page is ISO-8859-1, so the bytes have to be decoded with that charset rather than the platform default. Try using:

Scanner scanner = new Scanner(connection.getInputStream(), "ISO-8859-1");
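
If you would rather keep the BufferedReader loop from the question, the same idea can be sketched by passing the charset to an InputStreamReader. This is only a minimal sketch: the class name PageSourceReader and the method readLines are made up for illustration, and it assumes the page really is served as ISO-8859-1.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.nio.charset.Charset;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of any library.
final class PageSourceReader {

    // Reads all lines from the stream, decoding bytes with the given
    // charset instead of the platform default.
    static List<String> readLines(InputStream in, Charset charset) throws IOException {
        List<String> lines = new ArrayList<>();
        try (BufferedReader reader =
                 new BufferedReader(new InputStreamReader(in, charset))) {
            String line;
            while ((line = reader.readLine()) != null) {
                lines.add(line);
            }
        }
        return lines;
    }
}
```

With the connection from the question, it could be called as PageSourceReader.readLines(connection.getInputStream(), StandardCharsets.ISO_8859_1). Encoding the strings after reading does not help, because the wrong characters were already produced when the bytes were decoded.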
