-
Notifications
You must be signed in to change notification settings - Fork 31
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Extending the web scraper. #18
Comments
@invinciblycool I like the thought, I would suggest, a detailed list of missing components you find in the current code of scraper, then we will assign you the work. |
@invinciblycool |
@ashwini0529 I have added the XML response to |
@ashwini0529 @shubhodeep9 Couldn't resist the excitement 😄
{
"assets":
{
"images":
[
"link of image1 on the page",
"link of image2 on the page"
],
"videos":
[
"link to embedded video1",
"link to embedded video2"
]
},
"content":
{
"text": "all raw text from the page",
"html": "all html from the page"
}
}
|
Hey @invinciblycool Sounds good. |
@invinciblycool Add a TO-DO with your PR, and we will keep this issue alive until we feel satisfied. So that whenever someone gets a new idea on web-scraping, they can add to that TO-DO |
Also, please add a [WIP] tag in your PR message. 😄 |
@ashwini0529 To start working if you could make it clear that should the function be returning a response or should create folders and save the content locally. Thanks. |
Hey @invinciblycool you can take a look at the QR Code function. I think you can make something like that. |
I guess then we agree on saving all the content locally. |
Hey @invinciblycool Updates? |
Sorry for the delay, I will try opening a PR by this week. |
Perfect @invinciblycool |
There hasn't been much work on the web scraping part.
I am interested to work on this.
Since this is going to be a generic one, what I have thought as of now includes:
Still a beginner, any tips or corrections?
The text was updated successfully, but these errors were encountered: