Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Check indexer handling of rendered items or metadata URLs #72

Open
anjackson opened this issue Apr 29, 2020 · 0 comments
Open

Check indexer handling of rendered items or metadata URLs #72

anjackson opened this issue Apr 29, 2020 · 0 comments

Comments

@anjackson
Copy link
Contributor

anjackson commented Apr 29, 2020

e.g. are screenshot:https handled properly.

2020-04-29 15:19:18,753 INFO: attempt_202002261158_0423_m_000224_1: Apr 29, 2020 3:10:48 PM org.archive.wayback.resourcestore.indexer.WARCRecordToSearchResultAdapter generi
cResult
2020-04-29 15:19:18,753 INFO: attempt_202002261158_0423_m_000224_1: WARNING: FAILED canonicalize(har:https://twitter.com/InterbankLGBT/):BL-NPLD-WEBRENDER-frequent-npld-202
00227133858-20200425151705068-03362-0o4xyiz2.warc.gz 143220215
2020-04-29 15:19:18,754 INFO: attempt_202002261158_0423_m_000224_1: Apr 29, 2020 3:10:48 PM org.archive.wayback.resourcestore.indexer.WARCRecordToSearchResultAdapter adaptI
nner
2020-04-29 15:19:18,754 INFO: attempt_202002261158_0423_m_000224_1: INFO: Skipping record type : resource

Also: CDX indexer should convert metadata:// URIs to urn:embeds:

@anjackson anjackson changed the title Check indexer handling of extended URLs Check indexer handling of rendered items or metadata URLs Feb 3, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant