Would it make sense to parse the HTML to only "analyze" the text? #40
Closed
rasulkireev started this conversation in Ideas
Hi Rasul, converting to text loses a lot of context that can be used to determine page structure. Imagine a page like
When converted to text or Markdown you'd wind up with "John Smith D 33", and the helpful information is lost.
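To make the point concrete, here is a small sketch using only Python's standard library. The markup is hypothetical, invented for illustration (it is not the page from the comment above), and `TextOnly` and `to_text` are names made up for this example.

```python
from html.parser import HTMLParser

# Hypothetical markup, invented for illustration only.
# The class names tell a reader (or a model) what each value means.
HTML = (
    '<div class="person">'
    '<h2 class="name">John Smith</h2>'
    '<span class="party">D</span>'
    '<span class="district">33</span>'
    '</div>'
)


class TextOnly(HTMLParser):
    """Collects text nodes and ignores every tag and attribute."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        # Keep non-empty text; tag names and attributes never reach this method.
        if data.strip():
            self.chunks.append(data.strip())


def to_text(html):
    """Flatten HTML to a single line of plain text."""
    parser = TextOnly()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


flattened = to_text(HTML)
print(flattened)  # John Smith D 33
```

The labels "name", "party", and "district" live only in the attributes, so they are gone once the page is flattened.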
I was thinking: would it make sense to add to scrapeghost the ability to parse HTML into text (or even Markdown) and let GPT analyze that? That would probably reduce the cost even further, right?
The logic should be simple. Something like this:
This could be added as an option in the Parser. @jamesturk, if you think this is something that will help scrapeghost, I can try making a PR.
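A minimal sketch of what such a conversion step might look like, assuming only Python's standard library. This is not the snippet from the original post and not part of scrapeghost's API; `html_to_text` and `_TextExtractor` are hypothetical names, and character count is used here only as a rough stand-in for token count.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Hypothetical helper: keeps visible text, skips <script> and <style>."""

    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self._chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Ignore whitespace-only nodes and anything inside a skipped element.
        if not self._skip_depth and data.strip():
            self._chunks.append(data.strip())

    def text(self):
        return "\n".join(self._chunks)


def html_to_text(html: str) -> str:
    """Reduce an HTML document to its visible text, one text node per line."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()


sample = (
    "<html><head><style>p{color:red}</style></head>"
    "<body><h1>Title</h1><p>Hello <b>world</b></p>"
    "<script>var x = 1;</script></body></html>"
)
text = html_to_text(sample)
print(text)
# Rough size comparison: fewer characters generally means fewer tokens sent to the model.
print(len(sample), len(text))
```

The output is much shorter than the input, which is where the cost saving would come from, but every tag and attribute is discarded along the way.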