scraper error? #5

tschrist · 2021-05-24T16:34:17Z

2021-05-24 09:32:27 [scrapy.core.scraper] ERROR: Spider error processing <GET https://www.goodreads.com/giveaway?sort=recently_listed&tab=recently_listed> (referer: https://www.goodreads.com/)
Traceback (most recent call last):
File "/usr/lib/python3/dist-packages/scrapy/utils/defer.py", line 102, in iter_errback
yield next(it)
File "/usr/lib/python3/dist-packages/scrapy/core/spidermw.py", line 84, in evaluate_iterable
for r in iterable:
File "/usr/lib/python3/dist-packages/scrapy/spidermiddlewares/offsite.py", line 29, in process_spider_output
for x in result:
File "/usr/lib/python3/dist-packages/scrapy/core/spidermw.py", line 84, in evaluate_iterable
for r in iterable:
File "/usr/lib/python3/dist-packages/scrapy/spidermiddlewares/referer.py", line 339, in
return (_set_referer(r) for r in result or ())
File "/usr/lib/python3/dist-packages/scrapy/core/spidermw.py", line 84, in evaluate_iterable
for r in iterable:
File "/usr/lib/python3/dist-packages/scrapy/spidermiddlewares/urllength.py", line 37, in
return (r for r in result or () if _filter(r))
File "/usr/lib/python3/dist-packages/scrapy/core/spidermw.py", line 84, in evaluate_iterable
for r in iterable:
File "/usr/lib/python3/dist-packages/scrapy/spidermiddlewares/depth.py", line 58, in
return (r for r in result or () if _filter(r))
File "/root/Goodreads-Giveaway-BOT/goodreads/spiders/giveaway.py", line 106, in giveaway_pages
pages_list.pop()
IndexError: pop from empty list

DanielSmith1239 · 2021-06-06T16:11:39Z

I'm having this issue as well.

DanielSmith1239 · 2021-06-06T16:12:47Z

I believe it's caused by goodreads making some modifications to their website so the bot can't parse the pages.

DanielSmith1239 · 2021-06-06T22:23:11Z

@kaushikthedeveloper just forked and made some changes that fixes the issue: https://github.com/DanielSmith1239/GoodreadsMaster

kaushikthedeveloper · 2021-11-01T12:35:58Z

Hey folks. Am happy to see this project is still running. While I created this project for personal usage and for anyone else to try it out, since Goodreads stopped Giveaways across India (basically anywhere outside USA), I stopped looking into this.

Will check it out and give it a try once more to keep it up to date.

@DanielSmith1239 , awesome fork buddy. Will take a look.

danielkadosh10 · 2022-05-24T06:29:29Z

@DanielSmith1239 I know this isnt your fork but I couldn't find a way to comment on your fork, but the bot is not working currently could you maybe fix it?

DanielSmith1239 · 2022-05-24T10:57:05Z

@DanielSmith1239 I know this isnt your fork but I couldn't find a way to comment on your fork, but the bot is not working currently could you maybe fix it?

Sure I’ll take a look, thanks for letting me know. I have the bot automated so I didn’t realize it wasn’t working. Do you have any info that might help me pinpoint the issue?

danielkadosh10 · 2022-05-24T15:08:52Z

@DanielSmith1239 yes I'm not that good at network but from what I could gather their original sign in page was was https://goodreads.com/user/sign_in that's where you made the form request but now you need to press log in with email on that page and it takes you to another sign in page

DanielSmith1239 · 2022-05-25T02:07:07Z

@DanielSmith1239 yes I'm not that good at network but from what I could gather their original sign in page was was https://goodreads.com/user/sign_in that's where you made the form request but now you need to press log in with email on that page and it takes you to another sign in page

Yup it should be all fixed now! That was the main issue, and they changed how the authentication worked behind the scenes (I think) so I had to add some cookie stuff. I learned a lot more about scrapy today lol

danielkadosh10 · 2022-05-25T02:35:46Z

@DanielSmith1239 cool thank you, if you dont mind me asking but where did you learn how to really use scrapy

DanielSmith1239 · 2022-05-25T02:46:39Z

Uhhh Google pretty much lol, I wouldn’t really say I know how to use it. I just kept googling “how to do x with scrapy”, which is how I learn most of my coding stuff

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

scraper error? #5

scraper error? #5

tschrist commented May 24, 2021

DanielSmith1239 commented Jun 6, 2021

DanielSmith1239 commented Jun 6, 2021

DanielSmith1239 commented Jun 6, 2021

kaushikthedeveloper commented Nov 1, 2021

danielkadosh10 commented May 24, 2022

DanielSmith1239 commented May 24, 2022

danielkadosh10 commented May 24, 2022

DanielSmith1239 commented May 25, 2022

danielkadosh10 commented May 25, 2022

DanielSmith1239 commented May 25, 2022

scraper error? #5

scraper error? #5

Comments

tschrist commented May 24, 2021

DanielSmith1239 commented Jun 6, 2021

DanielSmith1239 commented Jun 6, 2021

DanielSmith1239 commented Jun 6, 2021

kaushikthedeveloper commented Nov 1, 2021

danielkadosh10 commented May 24, 2022

DanielSmith1239 commented May 24, 2022

danielkadosh10 commented May 24, 2022

DanielSmith1239 commented May 25, 2022

danielkadosh10 commented May 25, 2022

DanielSmith1239 commented May 25, 2022