Calling html2text iteratively on a set of htmls not working

This is the code snippet:

for i in obj:
    url = "someurl" + i
    oars = requests.get(url, timeout=1)
    soup = BeautifulSoup(oars.content)
    fout = open(i + ".html", "wt")
    print((type(soup.prettify)))
    fout.write(oars.text)
    oars.close
    #fout.write(soup.get_text())
    # Still not working, using zsh for now
    if call("html2text " + i + ".html" + ">" + i + ".txt", shell=True) == 0:
        print("yay")
        #call("rm -f " + i + ".html", shell=True)
    else:
        print(i)

But html2text is just creating empty txt files rather than properly piping the output. I even tried replacing html2text with elinks -dump but to no avail.

Solution

Not sure, but this might be what you're after

import subprocess
import sys

outfile = i + ".txt"


cmd = sys.path[0] + "/htmltotext " + i + ".html"

with open(outfile, "w") as output_f:
    p = subprocess.Popen(cmd, stdout=output_f, shell=True)

How do I get the current IPython / Jupyter Notebook name
Python - AttributeError: 'NoneType' object has no attribute 'findAll'
Django Invalid HTTP_HOST header: 'testserver'. You may need to add u'testserver' to ALLOWED_HOSTS
Geopandas : sort a sample of points like a cycle graph
_tkinter.TclError: can't use "pyimage1" as iconphoto: not a photo image
toomanyrequests: You have reached your pull rate limit. You may increase the limit by authenticating and upgrading
How to update imshow() window for Python OpenCV CV2
What's a fast way to identify all overlapping sets?
http.client works but requests throws read timeout
Elegant way to unpack limited dict values into local variables in Python
How to see if a widget exists in Tkinter?
ROS1 catkin_make failed: catkin_install_python() called without required DESTINATION argument
Custom permissions in rests framework
Python: Sort XML attributes alphabetically within element without sorting elements
Cplex Python how to avoid printing the output
How to use the cl command?
Run the same Python script with different arguments?
Can't raise an exception with user input
How can I silence logs of a command in .ipynb file?
Unable to use Selenium Webdriver. Getting two exceptions
How do I perform a function A set number of times and countdown each time it is performed?
g++ linking and swig
Django CSRF failing with .env file for Docker
dask map_partitions strange behaviour
How can I place a line exactly on the Y-axis?
Python: Convert PDF to DOC
Import local function from a module housed in another directory with relative imports in Jupyter Notebook using Python 3
The equivalent of tf.contrib.image.transform in tensorflow 2.0?
How to use character or string as operator placed in between operands?
How to access a WhatsApp template on Twilio using Python?