Motion is one of the most conspicuous visual cues in the natural world, and humans possess a remarkable sensitivity to it. While humans effortlessly perceive motion, training a model to learn realistic scene motion presents substantial challenges due to the intricacies involved in measuring and capturing physical properties on a large scale. Fortunately, recent advancements