• oce 🐆@jlai.lu
    link
    fedilink
    arrow-up
    1
    ·
    3 months ago

    It seems there’s a fundamental incapacity for the model to produce an ordered series of images inside one image. What if you ask to describe each step with a separate image?