Spell can generate entire 3D scenes or “Worlds” from an image in just a few minutes. The worlds are consistent with the initial image input and are represented as a volume that can be rendered using Gaussian Splatting (or other methods, like NeRFs).
More