About img2prompt
Methexis-Inc/img2prompt is a tool designed to generate approximate text prompts that match an image. This tool is particularly optimized for stable-diffusion (clip ViT-L/14). The tool is based on the open-source CLIP Interrogator notebook created by @pharmapsychotic and utilizes the OpenAI CLIP models to match an image to a variety of artists, mediums, and styles. The results of the comparison are then combined with BLIP captions to generate a text prompt that can be used to create additional images similar to the original. The tool can be run via an API, or the GitHub repository and license can be accessed for more information. Predictions typically complete within 24 seconds and run on Nvidia T4 GPU hardware.
Key Features
- ✅ Stable
- ✅ diffusion optimized
- ✅ Uses CLIP models
- ✅ Comparative image analysis
- ✅ Integration with BLIP
- ✅ Generates text prompts
- ✅ Creates similar images
- ✅ API available
- ✅ GitHub repository access
- ✅ Rapid prediction time
- ✅ Runs on Nvidia GPU
- ✅ Image
- ✅ based prompt generation
- ✅ Includes a variety of styles
- ✅ Matches image to artists
Pricing
Free to use
Rating & Reviews
3/5 stars based on 1 reviews
Categories & Tags
Category: Image To Text
Tags: image, text, prompt