This is a cool setup and demo of the Clip library . So , What is this library ? The Clip library is a cool model that can build pairs of images and instructed text. This Pytorch pre-trained model predict the most relevant text, given an image, without directly optimizing for the task. Pretty cool. Watch this video for more info. You can find the link for the video tutorial here: Moreover, you may find in the video description an instructions file with the setup process, reference for the Github library Enjoy Eran #python #computervision #pytorch