Skip to content

Commit

Permalink
Fix the CI.
Browse files Browse the repository at this point in the history
  • Loading branch information
LaurentMazare committed Dec 4, 2024
1 parent 58b9fe9 commit cfd7b75
Show file tree
Hide file tree
Showing 3 changed files with 14 additions and 3 deletions.
2 changes: 1 addition & 1 deletion .github/workflows/ykpy-ci.yml
Original file line number Diff line number Diff line change
Expand Up @@ -23,7 +23,7 @@ jobs:
runs-on: ubuntu-latest
strategy:
matrix:
target: [x86_64, x86, aarch64, armv7, s390x, ppc64le]
target: [x86_64, x86, aarch64, armv7]
steps:
- uses: actions/checkout@v3
- uses: actions/setup-python@v4
Expand Down
2 changes: 1 addition & 1 deletion Cargo.toml
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@ members = [
resolver = "2"

[workspace.package]
version = "0.2.0"
version = "0.3.0"
edition = "2021"
description = "Dataloader for training large text models."
repository = "https://github.com/kyutai-labs/yomikomi"
Expand Down
13 changes: 12 additions & 1 deletion yomikomi-pyo3/py_src/yomikomi/__init__.pyi
Original file line number Diff line number Diff line change
Expand Up @@ -103,7 +103,18 @@ class YkIterable:
""" """
pass

def tokenize(self, path, *, in_field=..., out_field=None, report_bpb=True, include_bos=True, include_eos=False):
def tokenize(
self,
path,
*,
in_field=...,
out_field=None,
report_bpb=True,
include_bos=True,
include_eos=False,
bos_id=None,
eos_id=None
):
"""
Loads a sentencepiece tokenizer, and use it to tokenize the field passed as an argument of
this function.
Expand Down

0 comments on commit cfd7b75

Please sign in to comment.