Skip to content

Latest commit

 

History

History
81 lines (47 loc) · 7.29 KB

collaborations.md

File metadata and controls

81 lines (47 loc) · 7.29 KB

Working with the AI for Earth Camera Trap Tools

Table of contents

  1. Overview
  2. How people use MegaDetector
  3. Questions about specific camera trap use cases
  4. Learn more

Overview

Conservation biologists invest a huge amount of time reviewing camera trap images, and – even worse – a huge fraction of that time is spent reviewing images they aren't interested in. This primarily includes empty images, but for many projects, images of people and vehicles are also "noise", or at least need to be handled separately from animals.

Machine learning can accelerate this process, letting biologists spend their time on the images that matter.

To this end, we've trained an AI model – called "MegaDetector" – to detect animals, people, and vehicles in camera trap images. It does not identify animals, it just finds them. We've also done a little bit of work on training species classifiers, but 99% of what we do is related to MegaDetector.

This page summarizes what we do with that model to help our collaborators, typically ecologists, more specifically ecologists who are overwhelmed by camera trap images. This page also includes some questions we ask new collaborators, to help assess whether our tools are useful, and – if so – what the right set of tools is for a particular project.

Basically this page is the response we give when someone emails us and says "I have too many camera trap images! Can you help me?!?!". If you're an ecologist reading this page, and that sounds familiar, feel free to answer the questions below in an email to [email protected].

You can see a list of some of the organizations who have used our tools here.

If you are looking for a more technical description of our MegaDetector model, see this page.

How people use MegaDetector

An AI model isn't useful to most ecologists by itself, so we "package" this model in a variety of ways. For most of our collaborators, they send us images (anywhere from thousands to millions), which we run through this model in the cloud, then we send back a results file. We've integrated with a variety of tools that camera trap researchers already use, to make it relatively painless to use our results in the context of a real workflow. Our most mature integration is with an open-source tool called Timelapse; read more about how to use MegaDetector results with Timelapse here.

We have somewhat-less-complete integrations with the eMammal desktop application and with dikiKam.

For more programming-inclined users, or for real-time applications (primarily anti-poaching scenarios), we also package our MegaDetector model into two different APIs: a batch processing API for processing large volumes of images with some latency, and a real-time API.

Whether you are using an image review tool like Timelapse or calling our APIs yourself, usually the first step with a new user is just running our model on a few thousand images and seeing what happens, so if you're interested in trying this on your images, we can work out a way to transfer a set of example images.

After that, we'll typically send back a page of sample results; depending on whether you already know the "right" answer for these images, the results will look like one of these:

Those pages aren't quite what a real results page would look like: rather than just "detections" and "non-detections", a real results page would have images broken out into separate links for empty/people/vehicle/animal. But we can't use a data set with people in it for a public demo, so the samples above are simplified to just include images that have animals or are empty (both sample pages are based on public data from the Snapshot Serengeti project.

Questions about specific camera trap use cases

These questions help us assess how we can best help a new collaborator, and which of our tools will be most applicable to a particular project.

  1. Can you provide a short overview of your project? What ecosystem are you working in, and what are the key species of interest?

  2. About how many images do you have that you've already annotated, from roughly the same environments as the photos you need to process in the future?

  3. If you have some images you've already annotated:

  • Did you keep all the empty images, or only the images with animals?
  • Are they from exactly the same camera locations that you need to process in the future (as in, cameras literally bolted in place), or from similar locations?
  1. About how many images do you have waiting for processing right now?

  2. About how many images do you expect to process in the next, e.g., 1 year?

  3. What tools do you use to process and annotate images? For example, do you:

  • Move images to folders named by species
  • Keep an Excel spreadsheet open and fill it with filenames and species IDs
  • Use a tool like Timelapse or Reconyx MapView that's specifically for camera traps
  • Use a tool like Adobe Bridge or digiKam that's for general-purpose image management
  1. About what percentage of your images are empty?

  2. About what percentage of your images typically contain vehicles or people, and what do you want to do with those images? I.e., do you consider those "noise" (the same as empty images), or do you need those labeled as well?

  3. What is your level of fluency in Python?

  4. Do you have a GPU available (or access to cloud-based GPUs)? "I don't know what a GPU is" is a perfectly good answer.

Learn more