We compare optical flow prediction and cinemagraph generation without mask and/or text conditioning against our full method. The results highlight their importance in accurate flow prediction and subsequent plausible cinemagraph generation.
Single Image
Ours (w/o text and mask) [Video]
Ours (w/o mask) [Video]
Ours (w/o text) [Video]
Ours [Video]
Ground-Truth Flow
Ours (w/o text and mask) [Optical Flow]
Ours (w/o mask) [Optical Flow]
Ours (w/o text) [Optical Flow]
Ours [Optical Flow]
Single Image
Ours (w/o text and mask) [Video]
Ours (w/o mask) [Video]
Ours (w/o text) [Video]
Ours [Video]
Ground-Truth Flow
Ours (w/o text and mask) [Optical Flow]
Ours (w/o mask) [Optical Flow]
Ours (w/o text) [Optical Flow]
Ours [Optical Flow]
Single Image
Ours (w/o text and mask) [Video]
Ours (w/o mask) [Video]
Ours (w/o text) [Video]
Ours [Video]
Ground-Truth Flow
Ours (w/o text and mask) [Optical Flow]
Ours (w/o mask) [Optical Flow]
Ours (w/o text) [Optical Flow]
Ours [Optical Flow]
Single Image
Ours (w/o text and mask) [Video]
Ours (w/o mask) [Video]
Ours (w/o text) [Video]
Ours [Video]
Ground-Truth Flow
Ours (w/o text and mask) [Optical Flow]
Ours (w/o mask) [Optical Flow]
Ours (w/o text) [Optical Flow]
Ours [Optical Flow]
Single Image
Ours (w/o text and mask) [Video]
Ours (w/o mask) [Video]
Ours (w/o text) [Video]
Ours [Video]
Ground-Truth Flow
Ours (w/o text and mask) [Optical Flow]
Ours (w/o mask) [Optical Flow]
Ours (w/o text) [Optical Flow]
Ours [Optical Flow]
Single Image
Ours (w/o text and mask) [Video]
Ours (w/o mask) [Video]
Ours (w/o text) [Video]
Ours [Video]
Ground-Truth Flow
Ours (w/o text and mask) [Optical Flow]
Ours (w/o mask) [Optical Flow]
Ours (w/o text) [Optical Flow]
Ours [Optical Flow]
Single Image
Ours (w/o text and mask) [Video]
Ours (w/o mask) [Video]
Ours (w/o text) [Video]
Ours [Video]
Ground-Truth Flow
Ours (w/o text and mask) [Optical Flow]
Ours (w/o mask) [Optical Flow]
Ours (w/o text) [Optical Flow]
Ours [Optical Flow]
Single Image
Ours (w/o text and mask) [Video]
Ours (w/o mask) [Video]
Ours (w/o text) [Video]
Ours [Video]
Ground-Truth Flow
Ours (w/o text and mask) [Optical Flow]
Ours (w/o mask) [Optical Flow]
Ours (w/o text) [Optical Flow]
Ours [Optical Flow]
Single Image
Ours (w/o text and mask) [Video]
Ours (w/o mask) [Video]
Ours (w/o text) [Video]
Ours [Video]
Ground-Truth Flow
Ours (w/o text and mask) [Optical Flow]
Ours (w/o mask) [Optical Flow]
Ours (w/o text) [Optical Flow]
Ours [Optical Flow]
Single Image
Ours (w/o text and mask) [Video]
Ours (w/o mask) [Video]
Ours (w/o text) [Video]
Ours [Video]
Ground-Truth Flow
Ours (w/o text and mask) [Optical Flow]
Ours (w/o mask) [Optical Flow]
Ours (w/o text) [Optical Flow]
Ours [Optical Flow]