-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Term needed to designate when an image or ROI is a label #200
Comments
Also relevant to audio - there are plenty of recordings on BioAoustica with spoken introductions (some are already annotated as such). |
Was there consideration of this going in subjectPart? |
We talked about it. However, I think there is an understanding that the "subject" of subjectPart is an organism. In the task group, we've been trying to group all of the parts in collections that include parts that are appropriate for particular organism groups. So I think we just decided this was out of scope. However, it's an important thing and maybe we need to broaden the concept of "subject" to include things like labels and sound descriptions. If it doesn't go in "subjectPart", then where does it go? I'm going to be focused on the Views TG for the immediate future, so I'll keep this issue in mind. |
In principle I agree. However, there is a subtle but important aspect of this that should be considered. Technically, in this context, the media documents not just an This seems abstract, but ultimately it fits into the idea that media can serve as evidence of occurrence. If there is a spoken introduction included on the audio recording, then that also represents evidence of occurrence (i.e., of an organism identifiable as Homo sapiens, at whatever time/place the spoken commentary was recorded). Granted, that's not the sort of |
Just to list some potential usage examples - and potentially expand the scope.
An example of an image with lots of non-organism regions: https://data.nhm.ac.uk/media/334690/preview Perhaps a term called something like |
I have a (potentially stupid and/or answered) question: This question was prompted by the post from @edwbaker referencing "Perhaps a term called something like I'm just wondering if all properties related to content depicted/represented within any media item should be rooted to an ROI instance, rather than the media item itself. By default, any properties related to contents of a media item in general would be associated with the "root ROI" (entire media item), until a more restricted/precise ROI is defined for the media item. I vaguely remember we discussed this on one of the calls, but I can't find my notes or any other documentation that addresses this question. Apologies if I am out of the loop... |
In my head a ROI is a way of applying ac terms to a region (or regions) within a media item. Which might be the other way round to what you are thinking? (i.e. I could create a new image by cropping an image to a ROI; the ROI effectively defines a new media item (virtually), and inherits things like bitrate). While there is a clear philosophical difference between the two, would they vary in a practical sense to people using ac? |
Thanks, yeah -- that's sort of what I'm thinking as well. But in my mind, some ac terms represent properties of the media item itself, and other represent properties of the content depicted within the media. I guess what I'm thinking is that an ROI is potentially a different thing (class?) from the media item, and potentially serves as a "container" for properties about specific content items. In other words, I imagine it sort of as ROI being the bridge between a media item and the things depicted or represented as content by the media item. I guess I'm just trying to sort out in my mind how to tease apart properties that apply to a media item, vs. properties that apply to content represented within the media item. I don't see a problem with derived media items mapping to an ROI of a parent Media Item. But for years I've wrestled with how to model properties of a media item compared to properties of content items represented by the media item. |
I've been pondering this since @edwbaker posted his earlier use cases. It seems to me that perhaps the way to handle these questions would be to apply some generic property to the ROIs that would indicate what they represented. At first I was thinking The thing that's a bit mushy in these examples is that the example of "FOAF Depicts to describe who and what is in the image", one of the depicted things is a particular instance of a person, while the other examples are generic concepts of "Hotel" and "Car". In an example of an herbarium sheet, if we said that a particular ROI depicted a label, would we want to say it depicted a particular instance of a label (which could have properties like the specific text on it) or do we want to say that it generically depicts "a label". These seem to be two different kinds of things. In the first case, we might actually want to assign an identifier to the label so that we could link out to its OCR verbatim data and cleaned up fields. In the second case, we might have a controlled vocabulary for generic things that could be depicted on an herbarium sheet image: a label, an organism, a color scale, a ruler. I think in the end there are multiple ways to skin this cat and perhaps we just need to experiment to see which is the most practical. |
I'll try and describe an example. One thing I have just noticed is ac:framerate is in the ROI vocabulary and mo:sample_rate is in Resource Creation - I will male a new issue. |
Just pondering the examples in the ROI recipes document, I see that we used In that case, a representation might look something like this for an audio recording:
and this for an herbarium specimen
The controlled values for |
With respect to whether there is an implied ROI that includes the entire media item, it seems like we pretty much have to allow that since we can hardly demand that providers who can only handle flat records support the one-to-many image:ROI relationship. For example, I think some people will want to (or already are) apply I think it would be reasonable for a provider who wanted to aggregate both kinds of records to apply a "default ROI" assumption similar to what @edwbaker and @deepreef were discussing to split off the fields that would normally go into an ROI if it were there (such as ac:subjectPart) into an ROI whose bounds were the entire image. |
This makes sense to me. I understand the point made by @deepreef about separating things about media, and things depicted in the media, into classes - but can this be solved in a similar way? (some terms are clearly about the media itself, while others are clearly about what is depicted?) Maybe we can have this as a philosophical assumption, and ideally document this assumption, and only formalise it if there becomes a pressing reason to do so?
|
@edwbaker I think that it would be appropriate to make the philosophical distinction about what properties belong with particular classes (media item, ROI, SAP). Whether people would sort the properties out that way would depend on how fully normalized their data management system is. I think what we may need is an overhaul of the Audubon Core Structure document. For better or worse, it sets the precedent of allowing people to flatten out SAP records in a number of ways if they want to cram their normalized data into a single table. Now that we've added ROIs as another class under which properties can be organized, the waters are even muddier. So documenting the assumptions (without necessarily making them normative, at least for the time being) would probably help. |
This issue is handled by #246, which is going to public comment |
This came up in assembling the use cases for the Views TG, but was determined to be out of scope for that group. However, there is a need for it and it is related to the new work on Regions Of Interest since an ROI could be flagged as label and not part of an organism.
The text was updated successfully, but these errors were encountered: