Now it is possible to feed graphic for the VLM as issue of generations! This is different from image2video where the impression grow to be the main body of the video. IP2V uses image as a A part of the prompt, to extract the notion and magnificence on the picture. https://rap22110.tblogz.com/helping-the-others-realize-the-advantages-of-rap-47597954