Does not parse the page vk.com #4

Vponed · 2022-01-19T04:15:47Z

raw_html = requests.get('https://vk.com/neurosciencenews').text
results = Extractor().extract(raw_html)

It does not return almost anything. Why it can be? It works great with other sites.
Also, I would like to know more about manipulations with the extractor. It is very interesting whether it is possible to obtain from it not only data, but also the way in which he extracted them.

The text was updated successfully, but these errors were encountered:

theblackcat102 · 2022-05-03T01:05:25Z

My guess is this page is a client side generated site which the content are loaded after the website was loaded. Using requests only returns empty web page ( contents are not yet loaded ). You might need to render the page and try again.

You can view these two files for understanding how the extraction works

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Does not parse the page vk.com #4

Does not parse the page vk.com #4

Vponed commented Jan 19, 2022 •

edited

Loading

theblackcat102 commented May 3, 2022

Does not parse the page vk.com #4

Does not parse the page vk.com #4

Comments

Vponed commented Jan 19, 2022 • edited Loading

theblackcat102 commented May 3, 2022

Vponed commented Jan 19, 2022 •

edited

Loading