I'd like to ask a few questions about the training of the model.
Which video did you use as the source and which as the target? From my understanding, your own video is the source and the Bruno Mars video is the target. Is that correct?
How many images did you have in your original (or landmarks) folder for each of the source and target videos?
What kind of hardware did you use to train the model?
How much time did it take to train and test the model on this hardware?
What else do you think could be done to improve the results?