-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
User-Acceptance Testing for Milestone 1 #8
Comments
Close issue since user-acceptance tests are scheduled for milestone 6. |
This looks wrong - we're closing issues because they are scheduled in the future? Each milestone has to be accepted by users. And where do we keep these tests documented? |
Might have been too fast here. I thought it was done and that user acceptance would be reworked in milestone 6. I can discuss with William to see if we need to reopen the issue and come up with a way to record and keep a trace of the tests that have been done. |
For Nachet-Interactive to be considered "in production" by the end Milestone 1 (2023Q4), a series of tests will have to be passed. I'm trying to cover all angles of the confusion matrix.
TRUE POSITIVES (TP)
TP TEST 1
Classify an image containing ONE of EACH seed which the model WAS trained to recognize.
Expected result: All seeds are identified correctly with high confidence score.
TP TEST 2
Classify a group photo of seeds, where some seeds are partially covered or overlapped by others.
Expected result: Visible portions of the seeds are correctly identified with high confidence score.
TRUE NEGATIVES (TN)
TN Test 1
Classify an image containing seeds which the model was NOT trained to recognize.
Expected result: Model offers guesses with low confidence score.
TN Test 2
Classify an image containing nothing.
Expected result: Model does not return results.
TN Test 3
Classify an image with complex patterns or textures that might mimic the appearance of seeds.
Expected result: Model does not return results.
FALSE POSITIVES (FP)
FP Test 1
Classify an image containing similar-looking seeds which the model is NOT trained to recognize.
Expected result: Identifies the seeds as being the seed it CAN recognize but gives it a low confidence score.
The text was updated successfully, but these errors were encountered: