I've been working on a shopping app for a while (which is working), and now looking to expand the features, however my understanding of website element/file hierarchies etc. is lackluster to say the least!
I'm reading this url: https://minetilbud.dk/tilbudsaviser/aldi/1?id={E2193C06-1136-4ED7-9B66-693C3EB175FF}
What I'm looking for is the text part that includes "30,- per kg". However when I read the HTML programmatically I'm getting a ton of text that doesn't contain this particular wanted text.
Sorry for my poor explained question but I'm not really sure what I'm looking for here, could the text I'm looking for be hidden in another file or another url?
Any tips that can help me in the right direction will be greatly appreciated!
It looks like each clickable product has a product id that can be used to make an api request with to get the info in structured JSON. The page loads these id's with that second script tag towards the top of the html via a url that starts with something like this:
https://cdn.ipaper.io/iPaper/Papers/0996f096-0e32-4fd4-b62b-f40e59e490ea/Enrichments/v1/
If you scrape this url then do a get against that you should get back json that contains the product id's. Then you would want to store those product id's and then do more get requests with those product id's in this format.
https://minetilbud.dk/api/rest/product/%7BD1E5927C-6EDD-4F3C-A925-1141077F3D96%7D
So https://minetilbud.dk/api/rest/product/%7{product id}%7D
You can see all this in the chrome dev tools or fiddler.