license: apache-2.0
language:
- en
tags:
- image-processing
- deep-learning
- AI
IMageDucHaiten
Version: V1.1.2
Table of Contents
- Recent Updates
- Introduction
- Key Features
- Installation
- Usage
- Community and Support
- Support and Contributions
Recent Updates
Version V1.1.2
Version V1.1.2
introduces significant optimizations and new features, especially in the Image To Caption module. The key enhancements include:
- Added Bit Precision Options: Users can now select between different bit precision levels when processing images:
- 4-bit: Requires at least 20GB RAM and 12GB VRAM.
- 8-bit: Requires at least 20GB RAM and 16GB VRAM.
- 16-bit: Requires at least 50GB RAM and 24GB VRAM.
- 32-bit: Although the previous version required 40GB VRAM for 32-bit float32, this update optimizes it to run even with 24GB VRAM, though at significantly reduced speeds. This lower VRAM option is not recommended for performance-critical tasks.
NOTE: Although optimization has been made to allow GPUs with lower VRAM to run at higher bit precision, the speed will be significantly reduced.
Introduction
IMageDucHaiten is a powerful tool specifically designed for AI professionals to assist in the preparation of image data for training generative AI models. Version V1.1.2
offers a suite of automated and flexible tools for processing, tagging, and captioning images, making the data preparation process more efficient and streamlined.
Whether you're an AI researcher, data engineer, or someone who enjoys working with images, IMageDucHaiten can be an invaluable tool. It not only excels in supporting AI data preparation but is also accessible and beneficial to anyone looking to work with images effortlessly and effectively.
Key Features
- Image Converter: Convert image formats to match the requirements of your AI model, supporting a wide range of formats such as PNG, JPG, GIF, BMP, and more.
- Image Filter: Filter and remove images that do not meet quality criteria such as size and resolution, ensuring a clean dataset for training.
- Rotate & Flip: Rotate and flip images to create various training data variations, enhancing dataset diversity.
- Image Error Fix: Automatically detect and fix errors in images, reducing the risk of using incomplete data.
- Image To Tag: Automatically tag images, optimizing the data labeling process for AI models.
- Image To Caption: Automatically generate captions, providing context to images, useful in image recognition and captioning tasks.
- Photo Fantasy: Create artistic effects on images, useful for data augmentation or enhancing creative content.
Installation
To install and set up IMageDucHaiten, you have three options:
Option 1: Full Package Installation via Direct Download
Download the Full Package:
- Download the full package from the following link: IMageDucHaiten-Full.zip
Extract the Package:
- Extract the contents of the
.zip
file to your desired location.
- Extract the contents of the
Install Visual Studio Code (if not installed):
- Download and install Visual Studio Code from here.
Activate the Virtual Environment:
- Open the extracted folder in Visual Studio Code by selecting
File > Open Folder
and navigating to the extracted folder. - Open the integrated terminal within Visual Studio Code by selecting
View > Terminal
. - Activate the pre-configured virtual environment:
- On Windows:
.\venv\Scripts\activate
- On macOS/Linux:
source venv/bin/activate
- On Windows:
- Open the extracted folder in Visual Studio Code by selecting
Run the Application:
- After activating the virtual environment, you can start the application by running:
python main.py
- After activating the virtual environment, you can start the application by running:
Option 2: Full Package Installation via Git Clone
Install Git and Git LFS:
- Ensure Git is installed on your system. If not, download and install it from Git's official site and Git lfs
- Install Git LFS (Large File Storage) by running:
git lfs install
Clone the Repository:
- Use Git to clone the repository with all necessary files:
git clone https://huggingface.co/DucHaiten/IMageDucHaiten
- Use Git to clone the repository with all necessary files:
Open the Project in Visual Studio Code:
- If you haven't installed Visual Studio Code, download and install it from here.
- Open Visual Studio Code, then select
File > Open Folder
and navigate to the cloned folder.
Activate the Virtual Environment:
- Open the integrated terminal by selecting
View > Terminal
. - Activate the virtual environment:
- On Windows:
.\venv\Scripts\activate
- On macOS/Linux:
source venv/bin/activate
- On Windows:
- Open the integrated terminal by selecting
Run the Application:
- Start the application by running:
python main.py
- Start the application by running:
Option 3: Manual Installation by Downloading Individual Files
Download the Files Manually:
- Download individual files and directories from the repository as needed.
Set up a Virtual Environment:
- Create and activate a virtual environment:
- On Windows:
python -m venv venv .\venv\Scripts\activate
- On macOS/Linux:
python3 -m venv venv source venv/bin/activate
- On Windows:
- Create and activate a virtual environment:
Install Python Dependencies:
- Install the required Python packages using
requirements.txt
:pip install -r requirements.txt
- Install the required Python packages using
Install External Dependencies (Ghostscript, ImageMagick, Visual C++ Redistributable):
- Follow the instructions provided earlier to ensure all dependencies are properly installed.
Open the Project in Visual Studio Code:
- If you haven't installed Visual Studio Code, download and install it from here.
- Open Visual Studio Code, then select
File > Open Folder
and navigate to the project folder.
Run the Application:
- Open the integrated terminal by selecting
View > Terminal
. - Activate the virtual environment, then start the application by running:
python main.py
- Open the integrated terminal by selecting
Usage
After completing the installation, you can run IMageDucHaiten as follows:
Running the Application via Command Line (CMD)
Activate the Virtual Environment:
- On Windows:
.\venv\Scripts\activate
- On macOS/Linux:
source venv/bin/activate
- On Windows:
Run the Application:
- In the terminal, navigate to the folder where
main.py
is located. - Execute the following command to start the application:
python main.py
- In the terminal, navigate to the folder where
Running the Application via Visual Studio Code
Open the Project in Visual Studio Code:
- Launch Visual Studio Code.
- Open the project folder by selecting
File > Open Folder
and navigating to the folder wheremain.py
is located.
Activate the Virtual Environment:
- Open the integrated terminal within Visual Studio Code by selecting
View > Terminal
. - Activate the virtual environment:
- On Windows:
.\venv\Scripts\activate
- On macOS/Linux:
source venv/bin/activate
- On Windows:
- Open the integrated terminal within Visual Studio Code by selecting
Run the Application:
- In the terminal, start the application by running:
python main.py
- Alternatively, you can press F5 to run the application with debugging.
- In the terminal, start the application by running:
Installing Visual Studio Code
If you don't have Visual Studio Code installed, follow these steps:
Download Visual Studio Code:
- Go to Visual Studio Code's official site.
- Download the appropriate version for your operating system.
Install Visual Studio Code:
- Run the installer and follow the on-screen instructions to complete the installation.
Install the Python Extension:
- Open Visual Studio Code and go to the Extensions view by clicking on the square icon in the sidebar or pressing
Ctrl+Shift+X
. - Search for "Python" and install the extension provided by Microsoft.
- Open Visual Studio Code and go to the Extensions view by clicking on the square icon in the sidebar or pressing
Community and Support
Join our community to stay updated, share your experiences, and get support from other users:
- Discord Server: Join our Discord community for discussions, support, and updates.
Support and Contributions
If you find IMageDucHaiten useful and would like to support the project, consider making a donation or becoming a patron. Your contributions help in maintaining and improving the project:
PayPal: You can make a one-time donation through PayPal.
Patreon: Become a patron and support ongoing development with monthly contributions.
Thank you for your support!