Hi @hinro and thank you for your kind words
Letâs be honest: no, it is not matching what this company developed.
I do not have the knowledge, nor the expertise in Computer Vision nor ML to achieve what they did, at a professional scale, and on my free time only.
So by no means am I trying to compete, I am just exploring and trying to give back to the community.
This being said, progress is made every day, trying to tackle unexpected/unwanted behavior of the algorithm wrapping the yolo model detection layer. And as you know, while you close one tap, two others opened in your back and the water is pouring.
As we covered it in the discord, the dataset is lacking of specific cases (it is struggling with latex scenesâŚ), so it would need to be augmented, but as you mentioned it, this is really not the funny part of the job.
I am still struggling with some very specific scenes (grinding moves for instance), where your approach could be leveraged and optimized by focusing on a very specific part of the video, and on a very specific part of the frame (ROI). Happy to discuss with you if we could join efforts somehow.
Regarding the weights, since I had no clue what I was doing in the beginning (and mostly, even now), I first thought that it might be better to not expose them right away on GitHub, and that GitHub was mainly for code, not for actual weights. I initially uploaded an apple version of the weight, and it was seen as a directory, that was a mess, and it wouldnât be cloned, making a mess
AnywayâŚ
Second reason is I was trying to foster engagement, and having people join the Discord actually helped:
- Not spamming this thread with installation troubleshooting (and I can tell you I spent time helping a couple people with their install), nor with questions on how the tool works, nor suggestions, food for thought, etc.
- Getting interest from developers, and already some Pull Requests have been merged (thank you!) and more are coming !
- Posting videos, images, etc. that would have made this thread heavy
- etc.
I was more envisioning this thread (which looks quite messy now, my bad, sorry guys) as a place to cover new releases, enhancements and such, even if we had kind of a breakthrough in this very thread thanks to people like @Zalunda , @jambavant and @fenderwq with the ffmpeg v360 filter to unwarp the VR frames for instance.
So there is absolutely no hidden agenda with the Discord, no pay wall to get the weights at all.
There is no monetization goal -at all-, but once again, if someone wants to put a coin in the ko-fi piggy bank, that would easen up my walletâs pain from buying a computer to handle all this development, model training, inferring, etc. haha
And a simple thank you, a nice word from newcomers or veterans of this forum simply made my day a couple times already!
This is a very transparent post, like I am in real life, I hope it makes things clear
Looking forward to having you onboard!