feat: check document store and retriever dimensions before calculating embeddings for all documents #7323

Closed
wants to merge 397 commits into from
397 commits
80c3e68
fix: serialize/deserialize torch dtype in the components that need it…
anakin87 Jan 12, 2024
dbdeb82
feat: rename `model_name` or `model_name_or_path` to `model` in gener…
sahusiddharth Jan 12, 2024
24c71bd
rename model_name_or_path to model in test (#6732)
anakin87 Jan 12, 2024
ce7abc9
feat!: Rename `model_name` or `model_name_or_path` to `model` in all …
ZanSara Jan 12, 2024
3156343
fix leftover model_name_or_path param (#6737)
ZanSara Jan 12, 2024
288ed15
feat!: Rename `model_name` or `model_name_or_path` to `model` in all …
ZanSara Jan 12, 2024
e0afe27
feat: Add method to set a Component input type with default value (#6…
silvanocerza Jan 12, 2024
0616197
feat!: Rename `model_name_or_path` to `model` in `TransformersSimilar…
ZanSara Jan 12, 2024
4647f2a
fix: `ComponentMeta.__call__` handles keyword- and positional-only pa…
shadeMe Jan 12, 2024
a5189dd
fix!: `InMemoryBM25Retriever` no longer returns documents that have a…
shadeMe Jan 12, 2024
24afc2a
feat: Highlight optional connections in `Pipeline.draw()` (#6724)
ZanSara Jan 15, 2024
8eba053
fix pipeline test (#6741)
anakin87 Jan 15, 2024
b236ea4
fix: hybrid pipeline e2e test (#6740)
ZanSara Jan 15, 2024
96c0b59
feat!: Rename `model_name_or_path` to `model` in `ExtractiveReader` (…
ZanSara Jan 15, 2024
8cafff0
refactor: Extract HF stop words handling in `hf_utils.py` (#6745)
vblagoje Jan 15, 2024
20f04f6
feat: MetaFieldRanker update (#6742)
sjrl Jan 16, 2024
d6cafea
test: Rename RAG E2E test file (#6750)
shadeMe Jan 16, 2024
909c1eb
fix a few docstrings (#6743)
ZanSara Jan 16, 2024
b8b8b5d
feat!: rename `model_name_or_path` to `model` in `NamedEntityExtracto…
ZanSara Jan 16, 2024
7376838
feat!: Framework-agnostic device management (#6748)
shadeMe Jan 17, 2024
88191e7
chore: Fix lazy import in HuggingFaceLocalGenerator (#6752)
vblagoje Jan 17, 2024
6a15145
test: Update E2E tests to use `Pipeline.dump/load` (#6756)
shadeMe Jan 17, 2024
a7ac4ed
feat: added split by page to `DocumentSplitter` (#6753)
sahusiddharth Jan 17, 2024
d1bdb8c
chore: bump Haystack version to beta5 (#6757)
julian-risch Jan 17, 2024
eaec5bf
refactor: Move HF-specific model serde code to a new submodule. (#6754)
shadeMe Jan 17, 2024
c0b6743
feat: Add page breaks to default PDF to Document converter (#6755)
sjrl Jan 18, 2024
5d66d04
feat: Add serde methods to `HTMLToDocument` (#6758)
shadeMe Jan 18, 2024
1c76aa0
Fix __version__ handling (#6765)
silvanocerza Jan 18, 2024
8079501
Speed up Document dataclass import (#6767)
silvanocerza Jan 18, 2024
ac353c4
chore(deps): bump actions/cache from 3 to 4 (#6775)
dependabot[bot] Jan 18, 2024
8d65a86
chore(deps): bump tj-actions/changed-files from 41 to 42 (#6774)
dependabot[bot] Jan 18, 2024
fea1428
feat: Add `HuggingFaceLocalChatGenerator` (#6751)
vblagoje Jan 18, 2024
0b177b3
feat: Improve OpenAPIServiceConnector service response serialization …
vblagoje Jan 18, 2024
40a8b2b
Move import to lazy import section (#6778)
vblagoje Jan 18, 2024
d4f6531
feat: Refactor `Pipeline.run()` (#6729)
silvanocerza Jan 18, 2024
f47439c
Use forward references for type hints, avoid NameError (#6780)
vblagoje Jan 18, 2024
cad30b0
add .haystack_debug to .gitignore (#6782)
augchan42 Jan 19, 2024
8a08ab5
add telemetry overview (#6785)
dfokina Jan 19, 2024
a238c6d
feat: Add Exact Match metric (#6696)
awinml Jan 22, 2024
5c8feea
proposal: Integration of 3rd party evaluation frameworks (#6784)
shadeMe Jan 22, 2024
f44f123
chore: mention integrations in the README (#6805)
masci Jan 22, 2024
df2a23d
chore: cleanup unused code (#6804)
masci Jan 23, 2024
5d300a7
add missing components to docs (#6813)
dfokina Jan 23, 2024
6d8f369
chore: mention cookbook repo in README (#6814)
dfokina Jan 23, 2024
1825140
Readme updates (#6817)
TuanaCelik Jan 23, 2024
4efe406
use haystack-pydoc-tools package instead of local code (#6818)
masci Jan 23, 2024
c47b82c
Remove pipeline_utils package and dependent code (#6806)
vblagoje Jan 23, 2024
6e86f4e
Update embedding integration tests (#6823)
vblagoje Jan 24, 2024
7358b91
feat: Weights and score normalization for DocumentJoiner with recipro…
robpasternak Jan 24, 2024
3bea3b1
feat: Add query and document prefix options for the TransformerSimila…
sjrl Jan 25, 2024
b176750
improve reno config (#6827)
anakin87 Jan 26, 2024
7217f9d
feat: Add F1 metric (#6822)
awinml Jan 26, 2024
fdf844f
fix: Fix missing format string prefixes in `pipeline.py` (#6834)
shadeMe Jan 26, 2024
acf4cd5
refact: Rename helper function (#6831)
masci Jan 26, 2024
b1ec32d
Simplify Pipeline.__eq__ logic (#6840)
silvanocerza Jan 29, 2024
9211f53
Remove unnecessary Connection class (#6842)
silvanocerza Jan 29, 2024
f5e6133
chore: Remove all mentions of Canals (#6844)
silvanocerza Jan 29, 2024
d90b0de
Update README.md (#6850)
anakin87 Jan 30, 2024
76d324a
feat: Change `Pipeline.add_component` to fail when reusing `Component…
silvanocerza Jan 30, 2024
ceda4cd
feat: Add support for `device_map` (#6679)
sjrl Jan 30, 2024
537107b
ci: bump transformers to 4.37.2 in test_requirements (#6848)
anakin87 Jan 30, 2024
b772c11
feat: Implement `Secret` for structured authentication (#6855)
shadeMe Jan 31, 2024
9af6c7e
add some tolerance to Roberta test (#6880)
ZanSara Jan 31, 2024
1039c73
feat: Allow setting metadata for `ByteStream` when created from file …
ZanSara Feb 1, 2024
27d0b28
chore: rename categories in the API docs (#6885)
masci Feb 1, 2024
461556c
fix: Fix language servers never working with Components (#6893)
silvanocerza Feb 2, 2024
393a799
feat: Add Semantic Answer Similarity metric (#6877)
awinml Feb 2, 2024
27d1af3
feat!: Use `Secret` for passing authentication secrets to components …
shadeMe Feb 5, 2024
c3a9dac
chore: Tick version to 2.0.0-beta.6 (#6914)
shadeMe Feb 5, 2024
0fbb065
Create breaking-change-proposal.md issue template (#6892)
bilgeyucel Feb 5, 2024
3bd6ba9
feat:Add dimensions parameter to OpenAI Embedders to fully support th…
sahusiddharth Feb 5, 2024
0191b1e
feat: Change Component's I/O dunder type (#6916)
silvanocerza Feb 5, 2024
b9d7a98
Rework Pipeline.connect() tests and structure (#6917)
silvanocerza Feb 5, 2024
7d29ddb
chore: merge hf utils modules into one (#6921)
masci Feb 6, 2024
289aa44
Add flaky as dev dependency (#6924)
silvanocerza Feb 6, 2024
3d5c329
Change Pipeline.connect() to return Pipeline instance (#6922)
silvanocerza Feb 6, 2024
1182c08
fix: Dont filter negative scores when using `BM25Okapi` and `scale_sc…
ZanSara Feb 6, 2024
95dce5b
Change HuggingFaceTGIChatGenerator default model to HuggingFaceH4/zep…
vblagoje Feb 6, 2024
b875eda
refactor: Rename `Pipeline.connect()` arguments (#6919)
silvanocerza Feb 6, 2024
9e6a2e3
fix: HuggingFaceTGIGenerator gets stuck when model is not supported …
vblagoje Feb 6, 2024
929baf3
chore(deps): bump aws-actions/configure-aws-credentials (#6933)
dependabot[bot] Feb 7, 2024
b49e86d
fix: Change `PromptBuilder` to have default values for all inputs (#6…
silvanocerza Feb 7, 2024
a771d7f
bump version (#6935)
anakin87 Feb 7, 2024
74683fe
Feat: Add FilterRetriever (#6836)
bglearning Feb 8, 2024
2f965fb
feat: Add `__repr__` method to all Components (#6927)
silvanocerza Feb 8, 2024
24b23a2
docs: Add missing docstrings to `NamedEntityExtractor` (#6958)
shadeMe Feb 8, 2024
d2497d5
Update to use the default Secret.from_env_var(OPENAI_API_KEY) approac…
vblagoje Feb 9, 2024
a7f36fd
feat: Enhance `Pipeline.draw()` to show image directly in Jupyter not…
silvanocerza Feb 9, 2024
d2d01f9
feat: Enhance `Pipeline.__repr__()` (#6963)
silvanocerza Feb 9, 2024
37d9de3
feat: Add service_credentials to OpenAPIServiceConnector run (#6962)
vblagoje Feb 9, 2024
21488ac
fix: Fix `Pipeline.draw()` failing cause of missing socket (#6965)
silvanocerza Feb 9, 2024
55785d5
fixing typo (#6967)
davidsbatista Feb 9, 2024
549021d
Use better cache key for dependencies in tests.yml (#6966)
silvanocerza Feb 9, 2024
f96eb38
refactor: Merge `Pipeline`s definition in `core` package (#6973)
silvanocerza Feb 12, 2024
a311d82
feat: Externalize callable serialization so it can be reused (#6979)
vblagoje Feb 13, 2024
97a0df6
feat: Add OutputAdapter (#6936)
vblagoje Feb 13, 2024
ea72759
feat: Meta field ranker add `meta_value_type` (#6977)
sjrl Feb 13, 2024
6a776e6
Add OutputAdapter sede for custom filters (#6985)
vblagoje Feb 13, 2024
0e044a8
Reintroduce serialize_callback_handler and deserialize_callback_handl…
vblagoje Feb 14, 2024
8d46a28
feat: Make system_messages optional in OpenAPIServiceToFunctions run …
vblagoje Feb 14, 2024
9297fca
feat: Add `SASEvaluator` component (#6980)
silvanocerza Feb 14, 2024
36ab23d
feat: Add `StatisticalEvaluator` component (#6982)
silvanocerza Feb 14, 2024
cb42ac1
docs: Add eval Components docs config (#6984)
silvanocerza Feb 14, 2024
cf221a9
Delete old eval API (#6983)
silvanocerza Feb 14, 2024
5a8d020
feat: Add JsonSchemaValidator (#6937)
vblagoje Feb 15, 2024
06a9349
Change mermaid integration test to avoid comparing received image (#7…
silvanocerza Feb 15, 2024
2a4e6a1
refactor: Refactor `SASEvaluator` (#6998)
silvanocerza Feb 15, 2024
c82f787
feat: Add `TextCleaner` component (#6997)
silvanocerza Feb 15, 2024
2b8a606
refactor: Refactor `StatisticalEvaluator` (#6999)
silvanocerza Feb 15, 2024
6fe1d3b
refactor: Clean eval components (#7005)
silvanocerza Feb 15, 2024
e2cee46
fix: Adding `api_base_url` to `OpenAITextEmbeder` self assignments (#…
TuanaCelik Feb 15, 2024
8a10170
fix docs config for evaluators (#7006)
anakin87 Feb 15, 2024
a7209f6
Mark OpenAPIServiceConnector integration test as flaky (#7007)
silvanocerza Feb 15, 2024
0aa788f
refactor!: LocalWhisperTranscriber - new devices mgmt (#7008)
anakin87 Feb 16, 2024
b552b0b
fix: Ensure `Secret` types are immutable (#6994)
shadeMe Feb 16, 2024
44b5ae2
specify CPU device in warm_up test (#7014)
anakin87 Feb 16, 2024
b645c16
Fix multiplexer importing from wrong package (#7015)
silvanocerza Feb 16, 2024
3f85a63
fix: deepcopy the inputs of components (#6987)
anakin87 Feb 16, 2024
3ce6b97
feat: Add ByteStream to_string method (#7009)
vblagoje Feb 17, 2024
3cc8e54
ci: update "Sync docs with Readme" (#7029)
anakin87 Feb 18, 2024
5f97e08
feat: Reintroduce `max_loops_allowed` check in `Pipeline.run()` (#7010)
silvanocerza Feb 19, 2024
f1a6b2a
feat: Add `is_greedy` argument in `@component` decorator (#7016)
silvanocerza Feb 19, 2024
29acffc
Add the Contributions wanted project (#7035)
TuanaCelik Feb 19, 2024
cb01cb4
feat: Add PipelineTemplate for ready made pipelines (#7001)
vblagoje Feb 19, 2024
d00f171
refactor!: Sentence Transformers Embedders - new devices mgmt (#7033)
anakin87 Feb 19, 2024
5910b4a
fix: Pipeline templates with new devices mgmt (#7036)
anakin87 Feb 19, 2024
9215882
Add Recall Multi Hit and Single Hit metric (#7038)
silvanocerza Feb 19, 2024
925dfeb
chore: add `accelerate` to lazy imports (#7040)
anakin87 Feb 20, 2024
05af9c3
test: Simplify OpenAPIServiceConnector run test (#7043)
silvanocerza Feb 20, 2024
327c2d2
feat: Add Mean Reciprocal Rank (MRR) metric to `StatisticalEvaluator`…
awinml Feb 20, 2024
b335b5d
feat: Add Lost In The Middle Ranker (#6995)
vrunm Feb 20, 2024
f8a06b6
chore: collect templates in a dedicated folder (#7048)
masci Feb 21, 2024
f3be576
refact: remove the concept of `TemplateSource` from the public API (#…
masci Feb 21, 2024
8ca4bf4
Remove all evaluator components (#7053)
silvanocerza Feb 21, 2024
7f4d11c
chore(deps): bump fossas/fossa-action from 1.3.1 to 1.3.3 (#7019)
dependabot[bot] Feb 22, 2024
088aa50
update version (#7057)
ZanSara Feb 22, 2024
34edaa1
fix formatting (#7059)
ZanSara Feb 22, 2024
62ab83a
remove pydoc config (#7060)
ZanSara Feb 22, 2024
d6ef2b5
ci: only only skip tests if no code changes at all (#7061)
wochinge Feb 22, 2024
6e580e4
feat: implement pipeline tracing (#7046)
wochinge Feb 22, 2024
cb6389d
feat: Improve OpenAPI integration (#7034)
vblagoje Feb 22, 2024
6d0d373
feat: opentelemetry tracer (#7052)
wochinge Feb 22, 2024
49cad21
chore: Adjust json_schema.py slightly (#7055)
vblagoje Feb 22, 2024
0507fce
forward-declare opentelemetry (#7067)
masci Feb 22, 2024
08e97d8
Export (de)serialize_callable and (de)serialize_type in haystack.util…
vblagoje Feb 22, 2024
bc8a48c
feat: datadog tracer (#7058)
wochinge Feb 23, 2024
b4011af
Add `AnswerExactMatchEvaluator` (#7050)
silvanocerza Feb 23, 2024
41b9310
feat: allow content tracing (#7066)
wochinge Feb 23, 2024
0a7dfc1
Revert "Add `AnswerExactMatchEvaluator` (#7050)" (#7075)
silvanocerza Feb 23, 2024
f5645a4
Update API docs to include OutputAdapter, OpenAPIServiceConnector and…
vblagoje Feb 23, 2024
79b32cd
build: fix CVE-2022-40897 (#7080)
wochinge Feb 23, 2024
3b8af36
restore transparent background for Pipeline images (#7086)
anakin87 Feb 26, 2024
d66b535
Remove eval end to end tests (#7093)
silvanocerza Feb 26, 2024
ba49905
ci: unify dependency management + hatch scripts (#7079)
wochinge Feb 26, 2024
22e9def
chore: Remove deprecated GPTGenerator and GPTChatGenerator (#7125)
julian-risch Feb 26, 2024
8838c02
Fix snippets_tests.yml (#7208)
silvanocerza Feb 26, 2024
e0b6924
`TopPSampler` docstrings (#7205)
anakin87 Feb 26, 2024
2a59128
feat: implement support for structured logging (#7126)
wochinge Feb 27, 2024
6fce2db
feat: log trace correlation (#7206)
wochinge Feb 27, 2024
87727e4
chore(deps): bump iterative/setup-cml from 2 to 3 (#7223)
dependabot[bot] Feb 27, 2024
9b1d792
preprocessors: review docstrings (#7219)
anakin87 Feb 27, 2024
2580e05
fix wrong docs config (#7224)
anakin87 Feb 27, 2024
e194c08
docs: review `DocumentLanguageClassifier` docstrings (#7210)
anakin87 Feb 27, 2024
93acdc1
refactor: Refactor pipeline templates (#7213)
masci Feb 27, 2024
3fc7797
fixing docstrings (#7225)
davidsbatista Feb 27, 2024
d871bbb
feat: Add complex types in OpenAPI support (#7065)
vblagoje Feb 27, 2024
14f65d5
feat: check tty / jupyter availability for logging config detection (…
wochinge Feb 27, 2024
f9db75c
Fix/docstrings generators (#7218)
davidsbatista Feb 27, 2024
380052a
replace text with content (#7211)
anakin87 Feb 28, 2024
419009b
fix: move sensitive log to debug mode (#7230)
wochinge Feb 28, 2024
8549143
ci: Add script to delete documentation that don't exist anymore (#7209)
silvanocerza Feb 28, 2024
ac4f458
docs: review and normalize `haystack.components.fetchers` (#7232)
wochinge Feb 28, 2024
7b9704a
docs: review `Routers` docstrings (#7234)
anakin87 Feb 28, 2024
f812048
remove the override feature (#7227)
masci Feb 28, 2024
61eb143
Fix delete outdated docs job in readme_sync.yml (#7241)
silvanocerza Feb 28, 2024
2fea67d
Fix delete_outdated_docs.py to get the correct api key (#7242)
silvanocerza Feb 28, 2024
c2a9528
build: pin typing-extensions (#7245)
wochinge Feb 28, 2024
20ebb46
Fix/docstrings dataclasses (#7221)
davidsbatista Feb 28, 2024
f22d499
docs: review and normalize `haystack.components.websearch` (#7236)
wochinge Feb 28, 2024
0ddb48b
docs: docstrings - `generators.chat` (#7239)
davidsbatista Feb 28, 2024
f492836
fixing docstrings (#7249)
davidsbatista Feb 28, 2024
c4b54bc
docs: Update docstrings for `haystack.components.caching` (#7247)
shadeMe Feb 28, 2024
e5f0e24
docs: review docstrings in `haystack.components.validators` (#7238)
wochinge Feb 28, 2024
890c613
chore: adjust docstrings in the audio package (#7246)
masci Feb 28, 2024
cf1e284
fix docstrings for the builder package (#7248)
masci Feb 28, 2024
4766efb
adding missing imports in __init__.py
davidsbatista Feb 29, 2024
e7809b6
feat: Add `from_template` class method to `Pipeline` (#7240)
masci Feb 29, 2024
14191de
docs: Update docstrings for `haystack.components.writers` (#7260)
julian-risch Feb 29, 2024
25a1a97
restore to_dict method (#7261)
masci Feb 29, 2024
fe0ac5c
chore: enforce kwarg logging (#7207)
wochinge Feb 29, 2024
ee4d479
chore: run ruff with fix during precommit hook (#7264)
wochinge Feb 29, 2024
2325444
feat: Add new predefined template: chat with website (#7259)
masci Feb 29, 2024
69d5438
tests: make 3.12 compatible (#7265)
wochinge Feb 29, 2024
4065a75
Update converters docstrings (#7250)
silvanocerza Feb 29, 2024
408be73
Update embedders docstrings (#7268)
silvanocerza Feb 29, 2024
d659795
fix: Update `Component` protocol to fix some type checking issues (#7…
silvanocerza Mar 1, 2024
9fe2aae
ci: add ruff to CI + bring config up to date (#7266)
wochinge Mar 1, 2024
b011bfc
chore: update the INDEXING and RAG pipeline templates (#7272)
masci Mar 1, 2024
221bfb0
fix: Update pyproject.toml (#7281)
masci Mar 1, 2024
4eced44
fix output types in WebSearch components (#7274)
anakin87 Mar 1, 2024
a57ed6d
docs: Update docstrings for `haystack.core` (#7263)
julian-risch Mar 1, 2024
b554e4c
docs: Update docstrings and API doc config for `haystack.components.e…
shadeMe Mar 1, 2024
7ffb21d
docs: Update docstrings for `haystack.components.connectors` (#7267)
vblagoje Mar 1, 2024
34dac5f
Update test_pipeline.py (#7284)
masci Mar 1, 2024
6325015
docs: Update docstrings for haystack.components.retrievers (#7252)
julian-risch Mar 1, 2024
d17f180
docs: Add warmup() call to HuggingFaceLocalGenerator usage example (#…
julian-risch Mar 1, 2024
9ab5c95
docs: Multiplexer - review and normalize docstrings for `component.ot…
vblagoje Mar 1, 2024
247716f
docs: Update `joiners` docstrings and configs (#7280)
silvanocerza Mar 1, 2024
3077a08
docs: Update docstrings for `haystack.utils` (#7269)
vblagoje Mar 1, 2024
72d776c
fix: Fix run order of variadic greedy components in Pipeline.run() (#…
silvanocerza Mar 1, 2024
727794c
pin pytest (#7295)
anakin87 Mar 4, 2024
f49dbdc
docs: last fixes on non-chat generators (#7287)
davidsbatista Mar 4, 2024
721691c
replace flaky with pytest-rerunfailures (#7298)
anakin87 Mar 4, 2024
0e7c41b
feat: Improve OpenAPIServiceToFunctions signature (#7257)
vblagoje Mar 4, 2024
9a0e2e5
docs: Added LostInTheMiddleRanker usage example and updated docstring…
julian-risch Mar 4, 2024
15373de
chore: Update ruff pre-commit version (#7293)
julian-risch Mar 4, 2024
312ba9e
chore(deps): bump dorny/paths-filter from 3.0.1 to 3.0.2 (#7300)
dependabot[bot] Mar 4, 2024
f0fb71d
docs: Add LostInTheMiddleRanker to API docs (#7302)
julian-risch Mar 5, 2024
c1c0cbf
docs: Update docs of MetaFieldRanker, TransformersSimilarityRanker (#…
julian-risch Mar 5, 2024
b86490b
docs: Update `ExtractiveReader` docstrings (#7308)
julian-risch Mar 5, 2024
38a80b0
fix: `MetaFieldRanker` - use `weight` if passed in the `run` method (…
anakin87 Mar 5, 2024
655d4a1
test: test for missing dependencies (#7278)
wochinge Mar 5, 2024
50ad1fa
fix: Remove pipeline serialization from telemetry code (#7289)
julian-risch Mar 5, 2024
23c65c2
chore: migrate `ExtractiveReader` to use secret management (#7309)
wochinge Mar 5, 2024
c3caadb
Add haystack.utils API docs (#7311)
vblagoje Mar 5, 2024
2c7d1ea
docs: edit audio docstrings order (#7317)
dfokina Mar 6, 2024
0db95fb
docs: `haystack.utils` docfixes (#7318)
shadeMe Mar 6, 2024
1380429
docs: Prepare the readme for 2.0.0 (#7314)
masci Mar 6, 2024
cfc5253
docs: documentation search -> document search in Readme (#7320)
julian-risch Mar 6, 2024
82cd317
Update unstable to 2.1.0-rc0
silvanocerza Mar 6, 2024
ede768a
Fix VERSION.txt
silvanocerza Mar 6, 2024
3dbde84
test: monkeypatch some env vars in Predefined Pipelines tests (#7321)
anakin87 Mar 7, 2024
d5336e9
Fix pypi_release.yml workflow to skip rc0 versions (#7327)
silvanocerza Mar 7, 2024
bbdc52e
Fix readme_sync.yml workflow to handle unstable 2.x versions (#7330)
silvanocerza Mar 7, 2024
a3a2194
docs: disable class def rendering (#7329)
wochinge Mar 7, 2024
95837ab
Fix readme sync workflow (#7332)
silvanocerza Mar 7, 2024
8d7a583
fix: `HuggingFaceTEITextEmbedder` returning embedding of incorrect sh…
awinml Mar 7, 2024
0beb6e1
chore: update banner (#7340)
masci Mar 11, 2024
ad2366c
link using absolute URI (#7342)
masci Mar 11, 2024
f8b9f71
make weight defined in run to be used even if 0 (#7343)
anakin87 Mar 11, 2024
6239b60
Update README.md (#7344)
bilgeyucel Mar 11, 2024
38b3472
feat: Add `SentenceTransformersDiversityRanker` (#7095)
awinml Mar 11, 2024
41dbbdb
feat: Add support for matching mime types using regex (#7303)
Lord-Haji Mar 11, 2024
2bb1742
add conda badge (#7347)
masci Mar 12, 2024
e4f37e9
Fixed pipeline import statement (#7348)
dcbark01 Mar 12, 2024
5aee378
chore: Remove all examples and point to cookbooks repo (#7350)
silvanocerza Mar 12, 2024
762e8c8
extend stalebot to community triage (#7352)
masci Mar 13, 2024
2a83ecc
Update docs renderer (#7349)
silvanocerza Mar 13, 2024
de4fca4
ci: Skip collection of `test_json_schema.py` to fix CI failures (#7353)
silvanocerza Mar 13, 2024
4b814b2
remove references to test_requirements (#7351)
anakin87 Mar 13, 2024
27 changes: 27 additions & 0 deletions .github/ISSUE_TEMPLATE/breaking-change-proposal.md
@@ -0,0 +1,27 @@
---
name: Breaking change proposal
about: Track a breaking change in Haystack
title: ''
labels: breaking change
assignees: ''

---

## Summary and motivation

Briefly explain how the change is breaking and why it is needed.

## Checklist

```[tasklist]
### Tasks
- [ ] The changes are merged in the `main` branch (Code + Docstrings)
- [ ] Release notes have documented the breaking change
- [ ] A new version of `haystack-ai` has been released on PyPI
- [ ] Docs at https://docs.haystack.deepset.ai/ were updated
- [ ] Integrations on [haystack-core-integrations](https://github.com/deepset-ai/haystack-core-integrations) were updated (if needed) - This step might require a [Breaking change proposal](https://github.com/deepset-ai/haystack-core-integrations/issues/new?assignees=&labels=breaking+change&projects=&template=breaking-change-proposal.md&title=) on the repo
- [ ] Notebooks on https://github.com/deepset-ai/haystack-cookbook were updated (if needed)
- [ ] Tutorials on https://github.com/deepset-ai/haystack-tutorials were updated (if needed)
- [ ] Articles on https://github.com/deepset-ai/haystack-home/tree/main/content were updated (if needed)
- [ ] Integration tile on https://github.com/deepset-ai/haystack-integrations was updated (if needed)
```
125 changes: 30 additions & 95 deletions .github/labeler.yml
@@ -1,105 +1,40 @@
# Release lines
2.x:
- base-branch: 'main'

1.x:
- base-branch: 'v1.x'

# Proposals
proposal:
- proposals/text/*

# 2.x
2.x:
- haystack/preview/**/*
- test/preview/**/*
- changed-files:
- any-glob-to-any-file: proposals/text/*

# Topics
topic:tests:
- test/**/*
- test/*
- conftest.py
- changed-files:
- any-glob-to-any-file: ['test/**/*','test/*']

topic:docker:
- docker/*
- docker-compose.yml
- changed-files:
- any-glob-to-any-file: docker/*

topic:CI:
- .github/*
- .github/**/*
- changed-files:
- any-glob-to-any-file: ['.github/*','.github/**/*']

topic:DX:
- CONTRIBUTING.md
- .pre-commit-config.yaml
- .gitignore
topic:dependencies:
- pyproject.toml
- changed-files:
- any-glob-to-any-file: ["CONTRIBUTING.md", ".pre-commit-config.yaml",".gitignore"]

topic:build/distribution:
- pyproject.toml
topic:modeling:
- haystack/modeling/*
- haystack/modeling/**/*
- test/modeling/*
topic:reader:
- haystack/nodes/reader/*
- test/nodes/test_reader.py
topic:retriever:
- haystack/nodes/retriever/*
- test/nodes/test_retriever.py
topic:agent:
- haystack/agents/*
- test/agents/*
topic:pipeline:
- haystack/pipelines/*
- haystack/nodes/other/*
- test/pipelines/*
- test/nodes/test_other.py
topic:rest_api:
- rest_api/*
- rest_api/**/*
topic:LLM:
- haystack/nodes/prompt/*
- test/nodes/test_prompt_node.py
topic:promptnode:
- haystack/nodes/prompt/*
- test/prompt/*
topic:file_converter:
- haystack/nodes/file_converter/*
- test/nodes/test_file_converter.py
topic:crawler:
- haystack/nodes/connector/*
- test/nodes/test_connector.py
topic:preprocessing:
- haystack/nodes/preprocessor/*
- test/nodes/test_preprocessor.py
topic:images:
- haystack/nodes/image_to_text/*
- test/nodes/test_image_to_text.py
topic:audio:
- haystack/nodes/audio/*
- test/nodes/test_audio.py
- changed-files:
- any-glob-to-any-file: pyproject.toml

topic:security:
- SECURITY.md
topic:dc-api:
- haystack/utils/deepsetcloud.py
topic:document_store:
- haystack/document_stores/*
- test/document_stores/*
topic:weaviate:
- haystack/document_stores/weaviate.py
- test/document_stores/test_weaviate.py
topic:elasticsearch:
- haystack/document_stores/search_engine.py
- haystack/document_stores/elasticsearch/*
- test/document_stores/test_elasticsearch.py
topic:opensearch:
- haystack/document_stores/search_engine.py
- haystack/document_stores/opensearch.py
- test/document_stores/test_opensearch.py
topic:sql:
- haystack/document_stores/sql.py
- test/document_stores/test_sql.py
- test/document_stores/test_sql_based.py
topic:faiss:
- haystack/document_stores/faiss.py
- test/document_stores/test_faiss.py
- test/document_stores/test_sql_based.py
topic:pinecone:
- haystack/document_stores/pinecone.py
- test/document_stores/test_pinecone.py
topic:knowledge_graph:
- haystack/document_stores/memory_knowledgegraph.py
- test/document_stores/test_knowledge_graph.py
topic:dc_document_store:
- haystack/document_stores/deepsetcloud.py
- test/document_stores/test_deepsetcloud.py
- changed-files:
- any-glob-to-any-file: SECURITY.md

topic:core:
- changed-files:
- any-glob-to-any-file: haystack/core/**/*
50 changes: 0 additions & 50 deletions .github/release.yml

This file was deleted.

72 changes: 72 additions & 0 deletions .github/utils/delete_outdated_docs.py
@@ -0,0 +1,72 @@
import argparse
import base64
import os
import re
from pathlib import Path
from typing import List

import requests
import yaml

VERSION_VALIDATOR = re.compile(r"^[0-9]+\.[0-9]+$")


def readme_token():
    api_key = os.getenv("README_API_KEY", None)
    if not api_key:
        raise Exception("README_API_KEY env var is not set")

    api_key = f"{api_key}:"
    return base64.b64encode(api_key.encode("utf-8")).decode("utf-8")


def create_headers(version: str):
    return {"authorization": f"Basic {readme_token()}", "x-readme-version": version}


def get_docs_in_category(category_slug: str, version: str) -> List[str]:
    """
    Returns the slugs of all documents in a category for the specific version.
    """
    url = f"https://dash.readme.com/api/v1/categories/{category_slug}/docs"
    headers = create_headers(version)
    res = requests.get(url, headers=headers, timeout=10)
    return [doc["slug"] for doc in res.json()]


def delete_doc(slug: str, version: str):
    url = f"https://dash.readme.com/api/v1/docs/{slug}"
    headers = create_headers(version)
    res = requests.delete(url, headers=headers, timeout=10)
    res.raise_for_status()


if __name__ == "__main__":
    parser = argparse.ArgumentParser(
        description="Delete outdated documentation from Readme.io. "
        "It will delete all documents that are not present in the current config files."
    )
    parser.add_argument(
        "-c", "--config-path", help="Path to folder containing YAML documentation configs", required=True, type=Path
    )
    parser.add_argument("-v", "--version", help="The version that will have its documents deleted", required=True)
    args = parser.parse_args()

    configs = [yaml.safe_load(c.read_text()) for c in args.config_path.glob("*.yml")]

    remote_docs = {}
    for config in configs:
        category_slug = config["renderer"]["category_slug"]
        if category_slug in remote_docs:
            continue
        docs = get_docs_in_category(category_slug, args.version)

        remote_docs[category_slug] = docs

    for config in configs:
        doc_slug = config["renderer"]["slug"]
        category_slug = config["renderer"]["category_slug"]
        if doc_slug in remote_docs[category_slug]:
            continue

        delete_doc(doc_slug, args.version)
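
The parser description above says the script deletes every document that is not present in the current config files. One way to read that is as a set difference: slugs that exist remotely minus slugs declared in the configs. The sketch below illustrates that reading only; it is not the code in this diff. `find_outdated_docs` and its parameters are hypothetical names, and it assumes `remote_docs` has the same shape as the dictionary built in the script (category slug mapped to a list of document slugs). It does no network I/O, so the result can be inspected before anything is passed to `delete_doc`.

```python
from typing import Dict, Iterable, List, Tuple


def find_outdated_docs(
    remote_docs: Dict[str, List[str]],
    configured: Iterable[Tuple[str, str]],
) -> List[Tuple[str, str]]:
    """
    Hypothetical helper (not part of the PR): returns the (category_slug, doc_slug)
    pairs that exist remotely but are not declared in any config file.
    """
    configured_set = set(configured)
    outdated = []
    # Sort categories so the result order is deterministic.
    for category_slug in sorted(remote_docs):
        for doc_slug in remote_docs[category_slug]:
            if (category_slug, doc_slug) not in configured_set:
                outdated.append((category_slug, doc_slug))
    return outdated


if __name__ == "__main__":
    # Made-up slugs, used only to illustrate the set difference.
    remote = {"api": ["retrievers", "old-node"], "core": ["pipeline"]}
    declared = [("api", "retrievers"), ("core", "pipeline")]
    for category, slug in find_outdated_docs(remote, declared):
        print(f"would delete {slug} from {category}")
```

Keeping the comparison in a pure function makes it possible to log or dry-run the list of slugs before issuing any `DELETE` request.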
9 changes: 0 additions & 9 deletions .github/utils/pydoc-markdown.sh
@@ -1,19 +1,10 @@
#!/bin/bash

set -e # Fails on any error in the following loop
export PYTHONPATH=$PWD/docs/pydoc # Make the renderers available to pydoc
cd docs/pydoc
rm -rf temp && mkdir temp
cd temp
for file in ../config/* ; do
    echo "Converting $file..."
    pydoc-markdown "$file"
done
# render preview markdown docs
cd ..
rm -rf temp-preview && mkdir temp-preview
cd temp-preview
for file in ../config-preview/* ; do
    echo "Converting $file..."
    pydoc-markdown "$file"
done