I'm trying to pull in form-10k from SEC using Beautifulsoup. Unfortunately, the following codes are not displaying all of the html. It is printing starting from somewhere in the middle of the html. However, it works fine when applied for several other web pages I tried. Any help will be much appreciated. I'm very new to python coding and I'm hoping to learn more as it is starting to grow on me :)
import urllib.request, urllib.error
from bs4 import BeautifulSoup
import ssl
ctx = ssl.create_default_context()
ctx.check_hostname = False
ctx.verify_mode = ssl.CERT_NONE
url = "https://www.sec.gov/Archives/edgar/data/920148/000092014820000011/lh10-k2019.htm"
html = urllib.request.urlopen(url, context=ctx).read()
soup = BeautifulSoup(html, "html.parser")
print(soup.prettify().encode("utf-8"))
What is probably happening is that there is not enough space in your terminal so what you see is only part of it, but in fact the whole page is there. I'm guessing the pages that worked are much shorter.