Methexis-Inc/img2prompt is a tool designed to generate approximate text prompts that match an image. This tool is particularly optimized for stable-diffusion (clip ViT-L/14).
The tool is based on the open-source CLIP Interrogator notebook created by @pharmapsychotic and utilizes the OpenAI CLIP models to match an image to a variety of artists, mediums, and styles.
The results of the comparison are then combined with BLIP captions to generate a text prompt that can be used to create additional images similar to the original.
The tool can be run via an API, or the GitHub repository and license can be accessed for more information. Predictions typically complete within 24 seconds and run on Nvidia T4 GPU hardware.