RynnVLA-002 is an autoregressive action world model that unifies action and image understanding and generation. RynnVLA-002 intergrates Vision-Language-Action (VLA) model (action model) and world ...
The code has been validated using pytorch 1.10.1, Python 3.8, CUDA 11.3, and cuDNN 8.2.0_0. You can directly replicate the conda environment using the environment.yml file. Additional dependencies are ...