Skip to content

FineWeb-2: multilingual, numpy 2.0, minhash improvements #338

FineWeb-2: multilingual, numpy 2.0, minhash improvements

FineWeb-2: multilingual, numpy 2.0, minhash improvements #338

Workflow file for this run

name: Test & Check Code Quality
on:
pull_request:
branches:
- main
push:
branches:
- main
workflow_call:
jobs:
check_code_quality:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v3
- name: Set up Python
uses: actions/setup-python@v4
with:
python-version: "3.10"
- name: Install dependencies
run: |
python -m pip install uv
uv pip install --system .[quality]
- name: Check quality
run: |
ruff check tests src examples # linter
ruff format --check tests src examples # formatter
test:
runs-on: ubuntu-latest
strategy:
matrix:
python-version: ["3.10", "3.11", "3.12"]
steps:
- uses: actions/checkout@v3
- name: Set up Python ${{ matrix.python-version }}
uses: actions/setup-python@v4
with:
python-version: ${{ matrix.python-version }}
- name: Install dependencies
run: |
python -m pip install uv
uv pip install --system .[testing]
python -m nltk.downloader punkt
- name: Test with pytest
run: |
python -m pytest -sv ./tests/