Convert mhtml pages saved in Chrome to SingleFile's html format #1586

Open
defcon79 opened this issue Oct 7, 2024 · 2 comments

Comments

@defcon79

defcon79 commented Oct 7, 2024

Is your feature request related to a problem? Please describe.
I have a lot of web pages that I saved in Chrome using the browser's built-in "Save as" MHTML option. That file format is different from the one SingleFile produces.

Describe the solution you'd like
I'd like a way for SingleFile to operate on local MHTML files (and perhaps also "HTML complete" files) and apply the same processing. It seems to remove a lot of extra HTML elements and do cleanup, and its output seems a lot smaller than Chrome's default MHTML.

Describe alternatives you've considered (optional)
I have not found any other solutions that can do this.

Additional context (optional)
(Actually, I suppose this is more for single-file-cli, but I am requesting it here as it's the main project.)

@gildas-lormeau
Owner

gildas-lormeau commented Oct 10, 2024

Maybe extracting MHTML files into HTML and saving the result with SingleFile could work. Have you tried, for example, https://github.com/rumpeltux/python-unmht?
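To illustrate what the extraction step involves: an MHTML file is a MIME multipart message, so Python's standard `email` module can split it into its parts. The following is only a minimal sketch under that assumption, not how python-unmht or SingleFile work internally. The function names `extract_mhtml_parts` and `main_html` are hypothetical, and the sketch does not rewrite resource URLs inside the HTML, which a real converter would need to do before the page renders correctly offline.

```python
import email


def extract_mhtml_parts(raw: bytes):
    """Return (content_location, content_type, payload_bytes) for each leaf MIME part.

    Hypothetical helper: it only splits the archive and undoes the transfer
    encoding (quoted-printable or base64); it does not touch the HTML itself.
    """
    msg = email.message_from_bytes(raw)
    parts = []
    for part in msg.walk():
        if part.is_multipart():
            continue  # container node, its children are visited separately
        parts.append(
            (
                part["Content-Location"],        # original URL of the resource, if present
                part.get_content_type(),         # e.g. "text/html", "image/png"
                part.get_payload(decode=True),   # decoded bytes
            )
        )
    return parts


def main_html(raw: bytes) -> str:
    """Return the text of the first text/html part (assumed to be the page itself)."""
    msg = email.message_from_bytes(raw)
    for part in msg.walk():
        if not part.is_multipart() and part.get_content_type() == "text/html":
            charset = part.get_content_charset() or "utf-8"
            return part.get_payload(decode=True).decode(charset, errors="replace")
    raise ValueError("no text/html part found")
```

Reading a saved page with `open(path, "rb").read()` and passing the bytes to `main_html` would give the page markup; the other parts returned by `extract_mhtml_parts` are the images, stylesheets and scripts that still have to be written out or inlined before the result can be opened and saved again with SingleFile.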

@dsiddens

This would be appreciated.
