Hello,
Before continuing, and for us to avoid talking in circles, I would like to underline that I am not saying that thresholding is unreasonable, my guess is that it is faster than the DLC method. Nevertheless, there are scenarios where it just won't work with respect to the environmental lighting, animal and the contrast between the animal and the environment. The speed increase from the DLC method is well documented, and I think its performance improvement is beyond any question at this point. Both methods should be in Bonsai, to not have one but the other is a lost opportunity for experiment design. I also misrepresented bruocruz'es method, and for that, I would like to apologize, I should've worded myself better.
My thoughts on some of the statements
brunocruz
>The function you are describing is a pre-processing function from DLC that does not exist in Bonsai.
Is this a problem? If nobody can do this I can do it, I just thought it would be no-brainer to implement this function (apparently it isn't), and goncalocpes would probably implement it faster than I can. Moreover, I am always eager to learn, and I am up for it if nobody wants to implement it.
goncalocpes
>If you want to compute a crop based on the last detected pose, then you could compute a bounding rectangle with the detected pose points and feed that back into Crop. This could also be included in the Pose output, as it would be the same to what is in the LargestBinaryRegion node (Contour > Rect property gives you the contour bounding box so you could use that rect to dynamically determine your crop based on animal size for example).
This is a very cool idea; however, it is not what I need at the moment. Do you have a project file with this? I might use it down the line.
>Thresholding and DLC inference have different sources of error, which can lead to quite serious failure modes in both cases"
This would imply that the DLC model being used for inferencing isn't competent and acquiret better/more data for the training. With that said, thresholding only provides speed, and not robustness.
>Of course using the output of pose itself means you have to first analyse the video using the full-frame until you detect an animal, otherwise there is no information to help you crop.
This is certainly true; however, it is still better than thresholding in my opinion. This method ensures that the animal isn't cropped out , which might occur in thresholding.
Thank you for your time guys, we have been discussing and using mind space for hours. Means a lot :)
Can