v2.0+ • Multi-API • Model-Optimized

Professional LoRA Dataset Prep

Upload images and get enterprise-grade dataset preparation: intelligent multi-API captioning, checkpoint-specific optimization for SDXL, Flux, SD-1.5, Qwen, and more. Trainer-ready exports with complete control over caption quality and style.

See Features Join Skool

✓

Gemini, OpenAI, Grok APIs

Choose the best API for your budget and quality needs

✓

Checkpoint-Specific Prompts

Auto-optimized captions for SDXL, Flux, SD-1.5, Qwen, WAN

✓

Advanced Caption Controls

Quality levels, temperature, token limits, quality/token scoring

✓

Kohya Export Ready

Proper formatting for immediate training use

What Makes It Amazing

Built from the ground up for professional LORA training with intelligence at every step.

🧠

new in v2

Multi-API Vision

Switch between Gemini, OpenAI, and Grok vision APIs. Optimize for cost (Gemini at $0.14/1K images) or quality. Smart routing ensures you get the best results for your workflow.

⚙️

new in v2

Checkpoint Optimization

Model-specific captioning strategies for SDXL, Flux, SD-1.5, Qwen-Image, and WAN-2.2. Each checkpoint receives tailored prompts for maximum training effectiveness.

🎛️

new in v2

Advanced Caption Controls

Fine-tune quality levels, temperature settings, token limits, and caption scoring. Create perfectly balanced captions for character, style, concept, and product LoRAs.

📊

Smart Trainer Naming

Files automatically renamed for Kohya format with bucket/repeat awareness. Compatible with all popular training applications. No manual renaming needed.

📝

Rich Text Descriptions

Generates trainer-friendly .txt files per image. Goal-aligned phrasing, consistent token ordering, and ready for immediate training use.

🔍

Comprehensive Logging

Full transparency into every step of the process. See API responses, caption generation, and scoring. Debug and optimize with confidence.

Perfect For

Character LoRAs

Automatically avoid captioning "baked features" like hair/eye color to prevent training conflicts.

Style & Concept

Optimized prompting for artistic styles, visual concepts, and aesthetic training.

Product & Architecture

Detailed captioning for product design, interior architecture, and technical subjects.

Clothing & Fashion

Precise garment descriptions and style metadata for fashion-focused training.

Pose & Cinematography

Automatic pose detection and cinematic element tagging for dynamic subjects.

Mixed Checkpoints

Handle multiple model types in one dataset with automatic optimization per checkpoint.

FAQ

Which API should I choose?

Gemini is most cost-effective at ~$0.14 per 1,000 images. OpenAI is ~$2.16/1K images, and Grok is ~$4.05/1K images. The tool recommends the best API for your checkpoint type, but you can override manually. For budget-conscious workflows, Gemini is unbeatable; for maximum quality, try OpenAI.

Does it work with my checkpoint?

The tool includes optimized configurations for SDXL, Flux, SD-1.5, Qwen-Image, and WAN-2.2. Each checkpoint receives model-specific captioning strategies and prompt engineering. If you need support for additional checkpoints, you can customize the settings or reach out for feature requests.

What training formats does it export?

The tool exports in Kohya format with proper filename patterns for bucket and repeat awareness. Files are compatible with all popular training applications including Kohya, AI Toolkit, and Civitai Trainer. Each image gets a matching .txt caption file ready for immediate training.

Can I control caption quality and style?

Yes. The advanced caption controls let you adjust quality levels (standard, detailed, expert), temperature for creative variation, token limits for conciseness, and quality/token scoring to balance detail vs. efficiency. You can also set different controls per LoRA training goal (character, style, concept, etc.).

Is my data private?

Your images are sent to your chosen API (Gemini, OpenAI, or Grok) for captioning. All processing is handled per your API provider's terms. The tool itself doesn't store or log your images—everything runs in your browser session.

Professional LoRA Dataset Prep

What Makes It Amazing

Multi-API Vision

Checkpoint Optimization

Advanced Caption Controls

Smart Trainer Naming

Rich Text Descriptions

Comprehensive Logging

Perfect For

Character LoRAs

Style & Concept

Product & Architecture

Clothing & Fashion

Pose & Cinematography

Mixed Checkpoints

FAQ

Which API should I choose?

Does it work with my checkpoint?

What training formats does it export?

Can I control caption quality and style?

Is my data private?

Ready to Prep Your Dataset?