Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Quick evaluation results #147

Open
aazaff opened this issue Apr 29, 2021 · 5 comments
Open

Quick evaluation results #147

aazaff opened this issue Apr 29, 2021 · 5 comments

Comments

@aazaff
Copy link

aazaff commented Apr 29, 2021

Here is an assessment of COSMOS returned results on the geothermal dataset (bigram model) for the search terms "thermal conductivity", "geochemistry", and "porosity" WITH the permalink to each success/failure included. Is there an ideal place to put this information?

table checks.xlsx

@cambro
Copy link
Contributor

cambro commented Apr 30, 2021

Right here is fine.
FYI, the "bigram model" has no bearing on the COSMOS results, only the embedding model used to identify similar terms. These similar terms are not used in retrieval at the present time.

@cambro
Copy link
Contributor

cambro commented Apr 30, 2021

Side note:
Some of returns identified as "incorrect" here are actually correct returns (i.e., the returned object has the search term) but the returned object is in some way truncated. This is either because the table spans pages, something we do not handle currently or because of some other segmentation error. These should ideally be flagged as correct but incomplete.
Example: https://xdd.wisc.edu/set_visualizer/sets/geothermal/object/07b481b52a9000ac82d1ff121bf549ec0df62a43

@cambro
Copy link
Contributor

cambro commented Apr 30, 2021

Likewise this one: this is flagged incorrect. But. if asked to classify what this visual object is and asked whether it contains "porosity" or not, the answer is Table and yes.
https://xdd.wisc.edu/set_visualizer/sets/geothermal/object/935b76e310960c798bf0d94c1449bdbe0977b245

@aazaff
Copy link
Author

aazaff commented Apr 30, 2021

I agree. I told anna to be extra conservative in her determinations which was maybe too strict. I will see if she has time to go through them today and revise the numbers.

@aazaff
Copy link
Author

aazaff commented Apr 30, 2021

contents_separated.xlsx

Revised version... now table of contents and truncation are flagged with "2" so this way you can decide the best way.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants