NeuROK proposes Neural Object Kinematics, a data-driven approach to generating realistic 4D dynamics (temporal deformations) of static 3D objects under physical conditions. Unlike prior methods requiring predefined physical models and system identification, NeuROK learns a latent kinematic state space and decoder that maps latent states to plausibly deformed shapes, using a transformer-based encoder-decoder trained on a curated large-scale 4D dataset. The approach simplifies dynamics generation by operating in a low-dimensional latent space from a Lagrangian mechanics perspective.
- Neural Object Kinematics parameterization: Learns a latent space representing all possible object states plus a decoder mapping any latent sample to a deformed shape, avoiding predefined physical models
The method learns both:
The model is a transformer-based encoder-decoder trained on a curated large-scale 4D dataset with diverse dynamic object types. The formulation treats dynamics generation as sampling from the latent space rather than simulating from first principles.
- Demonstrated across diverse dynamic object types
- Limited to object-centric physical systems (single object focus)
NeuROK addresses a key challenge in world models: generating physically plausible temporal deformations of objects. The data-driven kinematic approach provides an alternative to explicit physics simulation, relevant for world models that need to predict object dynamics over time. The CVPR 2026 acceptance indicates strong interest in neural approaches to physical world modeling.