Skip to content

langchain: Replace lxml and XSLT with BeautifulSoup in HTMLHeaderTextSplitter for Improved Large HTML File Processing #35609

langchain: Replace lxml and XSLT with BeautifulSoup in HTMLHeaderTextSplitter for Improved Large HTML File Processing

langchain: Replace lxml and XSLT with BeautifulSoup in HTMLHeaderTextSplitter for Improved Large HTML File Processing #35609

Triggered via pull request December 19, 2024 22:53
Status Failure
Total duration 2m 51s
Artifacts

check_diffs.yml

on: pull_request
Matrix: compile-integration-tests
Matrix: extended-tests
Matrix: lint
Matrix: test-doc-imports
Waiting for pending jobs
Matrix: test-pydantic
Matrix: test
CI Success
0s
CI Success
Fit to window
Zoom out
Zoom in

Annotations

21 errors and 51 warnings
Ruff (I001): libs/text-splitters/langchain_text_splitters/html.py#L1
langchain_text_splitters/html.py:1:1: I001 Import block is un-sorted or un-formatted
Ruff (D212): libs/text-splitters/langchain_text_splitters/html.py#L39
langchain_text_splitters/html.py:39:5: D212 Multi-line docstring summary should start at the first line
Ruff (D205): libs/text-splitters/langchain_text_splitters/html.py#L58
langchain_text_splitters/html.py:58:5: D205 1 blank line required between summary line and description
Ruff (D212): libs/text-splitters/langchain_text_splitters/html.py#L58
langchain_text_splitters/html.py:58:5: D212 Multi-line docstring summary should start at the first line
Ruff (D212): libs/text-splitters/langchain_text_splitters/html.py#L78
langchain_text_splitters/html.py:78:9: D212 Multi-line docstring summary should start at the first line
Ruff (D212): libs/text-splitters/langchain_text_splitters/html.py#L96
langchain_text_splitters/html.py:96:9: D212 Multi-line docstring summary should start at the first line
Ruff (D212): libs/text-splitters/langchain_text_splitters/html.py#L111
langchain_text_splitters/html.py:111:9: D212 Multi-line docstring summary should start at the first line
Ruff (D212): libs/text-splitters/langchain_text_splitters/html.py#L126
langchain_text_splitters/html.py:126:9: D212 Multi-line docstring summary should start at the first line
Ruff (D212): libs/text-splitters/langchain_text_splitters/html.py#L152
langchain_text_splitters/html.py:152:9: D212 Multi-line docstring summary should start at the first line
Ruff (D212): libs/text-splitters/langchain_text_splitters/html.py#L169
langchain_text_splitters/html.py:169:9: D212 Multi-line docstring summary should start at the first line
Ruff (I001): libs/text-splitters/langchain_text_splitters/html.py#L1
langchain_text_splitters/html.py:1:1: I001 Import block is un-sorted or un-formatted
Ruff (D212): libs/text-splitters/langchain_text_splitters/html.py#L39
langchain_text_splitters/html.py:39:5: D212 Multi-line docstring summary should start at the first line
Ruff (D205): libs/text-splitters/langchain_text_splitters/html.py#L58
langchain_text_splitters/html.py:58:5: D205 1 blank line required between summary line and description
Ruff (D212): libs/text-splitters/langchain_text_splitters/html.py#L58
langchain_text_splitters/html.py:58:5: D212 Multi-line docstring summary should start at the first line
Ruff (D212): libs/text-splitters/langchain_text_splitters/html.py#L78
langchain_text_splitters/html.py:78:9: D212 Multi-line docstring summary should start at the first line
Ruff (D212): libs/text-splitters/langchain_text_splitters/html.py#L96
langchain_text_splitters/html.py:96:9: D212 Multi-line docstring summary should start at the first line
Ruff (D212): libs/text-splitters/langchain_text_splitters/html.py#L111
langchain_text_splitters/html.py:111:9: D212 Multi-line docstring summary should start at the first line
Ruff (D212): libs/text-splitters/langchain_text_splitters/html.py#L126
langchain_text_splitters/html.py:126:9: D212 Multi-line docstring summary should start at the first line
Ruff (D212): libs/text-splitters/langchain_text_splitters/html.py#L152
langchain_text_splitters/html.py:152:9: D212 Multi-line docstring summary should start at the first line
Ruff (D212): libs/text-splitters/langchain_text_splitters/html.py#L169
langchain_text_splitters/html.py:169:9: D212 Multi-line docstring summary should start at the first line
CI Success
Process completed with exit code 1.
build
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/text-splitters / make extended_tests #3.9
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/langchain / poetry run pytest -m compile tests/integration_tests #3.9
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/text-splitters / make test # pydantic: ~=2.5.0, python: 3.11,
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/text-splitters / make test # pydantic: ~=2.10.0, python: 3.11,
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/partners/prompty / make test # pydantic: ~=2.9.0, python: 3.11,
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/text-splitters / make test #3.9
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/partners/prompty / make test #3.9
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/partners/prompty / poetry run pytest -m compile tests/integration_tests #3.9
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/partners/prompty / make test # pydantic: ~=2.5.0, python: 3.11,
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/partners/prompty / make test # pydantic: ~=2.6.0, python: 3.11,
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/text-splitters / make lint #3.9
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/partners/prompty / make test # pydantic: ~=2.8.0, python: 3.11,
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/community / poetry run pytest -m compile tests/integration_tests #3.9
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/text-splitters / make test # pydantic: ~=2.8.0, python: 3.11,
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/community / poetry run pytest -m compile tests/integration_tests #3.11
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/text-splitters / make test # pydantic: ~=2.6.0, python: 3.11,
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/text-splitters / make test # pydantic: ~=2.9.0, python: 3.11,
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/langchain / poetry run pytest -m compile tests/integration_tests #3.13
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/text-splitters / make test # pydantic: ~=2.7.0, python: 3.11,
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/text-splitters / make extended_tests #3.13
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/text-splitters / poetry run pytest -m compile tests/integration_tests #3.13
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/langchain / make test # pydantic: ~=2.9.0, python: 3.11,
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/langchain / make test # pydantic: ~=2.8.0, python: 3.11,
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/partners/prompty / make test # pydantic: ~=2.7.0, python: 3.11,
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/text-splitters / make test #3.13
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/partners/prompty / make test #3.13
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/partners/prompty / poetry run pytest -m compile tests/integration_tests #3.13
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/langchain / make test # pydantic: ~=2.7.0, python: 3.11,
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/community / make test # pydantic: ~=2.9.0, python: 3.11,
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/langchain / make extended_tests #3.9
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/community / make test # pydantic: ~=2.8.0, python: 3.11,
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/text-splitters / make lint #3.13
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/community / make test # pydantic: ~=2.7.0, python: 3.11,
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/langchain / make test # pydantic: ~=2.10.0, python: 3.11,
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/community / make lint #3.9
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/langchain / make lint #3.9
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/langchain / make extended_tests #3.13
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/community / make test # pydantic: ~=2.10.0, python: 3.11,
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/langchain / make lint #3.13
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/langchain / make test #3.9
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/community / make lint #3.13
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/partners/prompty / make lint #3.9
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/langchain / make test #3.13
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/community / make test #3.13
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/community / make test #3.9
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/community / make extended_tests #3.9
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/text-splitters / poetry run pytest -m compile tests/integration_tests #3.9
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/partners/prompty / make lint #3.13
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
cd libs/community / make extended_tests #3.12
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
CI Success
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636