Efficient Model #2

Open
ClashLuke opened this issue Jun 26, 2022 · 0 comments
Labels
ml requires in-depth machine learning knowledge

Comments

@ClashLuke
Member

When training our model to "see" screenshots, it's essential that it sees them at full resolution rather than massively downscaled, because downscaling destroys information very quickly. For example, text on a screen can become unreadable after just one 2x reduction. It's therefore critical that our model can process these large states efficiently.
The difficulty comes from the fact that a "full screenshot" is composed of ~9 million pixels (in the case of a 4k monitor), each with three features. In total, that's about 27 million features per frame, so a theoretical maximum of 500 million features per sample would fit only roughly 18 frames (500 / 27 ≈ 18.5). Since seeing only the past 18 or so frames is not enough context, we need to improve the memory efficiency of our model by feeding it something other than raw frames.
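To make the budget arithmetic above easy to re-check, here is a minimal sketch. The 500 million feature budget and the three features per pixel come from the description above; the concrete resolutions (3840x2160 and 4096x2160 for "4k", 1920x1080 for one 2x reduction per side) and the helper name `frames_per_sample` are my own assumptions for illustration, not part of the project.

```python
# Rough memory-budget arithmetic for raw screenshot frames.
# Assumptions (not from the project): the resolutions below and the helper name.

FEATURE_BUDGET = 500_000_000  # theoretical maximum features per sample (from the issue)
CHANNELS = 3                  # three features (RGB) per pixel (from the issue)


def features_per_frame(width: int, height: int, channels: int = CHANNELS) -> int:
    """Number of scalar features in one uncompressed frame."""
    return width * height * channels


def frames_per_sample(width: int, height: int, budget: int = FEATURE_BUDGET) -> int:
    """How many whole frames fit into the feature budget."""
    return budget // features_per_frame(width, height)


if __name__ == "__main__":
    resolutions = {
        "UHD 4K (3840x2160)": (3840, 2160),
        "DCI 4K (4096x2160)": (4096, 2160),
        "one 2x reduction (1920x1080)": (1920, 1080),
    }
    for name, (w, h) in resolutions.items():
        print(f"{name}: {features_per_frame(w, h):,} features/frame, "
              f"{frames_per_sample(w, h)} frames/sample")
```

Under these assumptions a full-resolution 4k frame leaves room for only about 18 to 20 frames, while a single 2x reduction per side would quadruple that at the cost of readable text, which is the trade-off described above.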

@ClashLuke ClashLuke added the ml requires in-depth machine learning knowledge label Jun 26, 2022