Agent skill

ai-visual-generation

Stars 5
Forks 0

Install this agent skill to your Project

npx add-skill https://github.com/Gaku52/claude-code-skills/tree/main/07-ai/ai-visual-generation

SKILL.md

日本語版

AI Visual Generation

AI is revolutionizing image and video production. This skill covers all aspects of AI visual generation — from Stable Diffusion, DALL-E, and Midjourney to video generation (Sora) and 3D modeling.

Target Audience

  • Creators looking to learn AI image and video generation technologies
  • Engineers integrating AI visual generation into their products
  • Those interested in AI art and design

Prerequisites

  • Foundational AI/ML concepts
  • Basic knowledge of image processing

Learning Guide

00-fundamentals — Image Generation AI Fundamentals

# File Description

01-image — Image Generation

# File Description

02-video — Video Generation

# File Description

03-3d — 3D Generation

# File Description

Quick Reference

AI Image Generation Service Comparison:
  Midjourney:        Highest quality, Discord-based
  DALL-E 3:          Easy API integration, ChatGPT integration
  Stable Diffusion:  Open source, fully customizable
  Adobe Firefly:     Commercially safe, Adobe integration
  Flux:              Latest open model, high quality

References

  1. Rombach, R. et al. "High-Resolution Image Synthesis with Latent Diffusion Models." CVPR, 2022.
  2. OpenAI. "DALL-E 3." openai.com, 2024.
  3. Stability AI. "Stable Diffusion." stability.ai, 2024.

Expand your agent's capabilities with these related and highly-rated skills.

Didn't find tool you were looking for?

Be as detailed as possible for better results