AIGC Awesome List - April 2026

TechScan - Artificial Intelligence Generated Content (AIGC) - April 2026

· 3D · Agent · Benchmark · Company · Datasets · DepthMap · FaceSwap · Image Captioning · Lighting · Models · Music · Platforms · Performance · Prompts · Read · Segmentation · Simulation / 3D-World · TalkingHead · Training · TechScan · Text / OCR · TTS · Upscale · Vibe-code · Video · Online

Companies

Benchmarks & Leaderboards

Keywords: InoReader - Benchmarking

GenAI - Platforms

GenAI - Reading Material

Tool - Performance

Keywords: Awesome VLM Architectures

Tool / Training / Utility

Tool - Prompt Engineering

adieyal/comfyui-dynamicprompts
AIrjen/OneButtonPrompt
HunyuanVideo 1.5 Prompt Handbook
MushroomFleet/LLM-Base-Prompts (mixed)
AI Video Creation Guide
PixelPruner theallyprompts , civitai
Prompt Lists fofr , ai-prompts/prompt-lists , marduk191/ComfyUI-Fluxpromptenhancer , PromptHero - portraits-prompts , PromptMania

Tool - General / TechScan / Research / DeepResearch

Face Restoration & Realism

Keywords: 1ai

Image-To-Text (i2t) Captioning

AllenAI Molmo 7B D

Joy Caption

Microsoft , Microsoft Florence2 , Florence-2 , MiaoshouAI , ComfyUI-Miaoshouai-Tagger

Microsoft Phi - alexisrolland/ComfyUI-Phi (Phi-3.5-mini-instruct, Phi-3.5-vision-instruct) , Phi 3.5

MiniCPM-Plus , MiniCPM v2.6 Prompt Generator

Moondream (Visual Q&A, Caption, Object Detection) , Moondream blog , vikhyatk/moondream2 , vikhyat/moondream , kijai/ComfyUI-moondream , Hangover3832/ComfyUI-Hangover-Moondream

OmniVLM-968M (no ComfyUI)

Pixtral Llama Molmo Vision

PromptCraft

RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards (qwen-edit-2509 LORA)

QwenVL for ComfyUI (image & video) , Qwen2-VL-Instruct

Searge-LLM

WD14-Tagger

gokaygokay/Flux Prompt Generator , Flux-Florence-2 , fairy-root

IuvenisSapiens (miniCPM, QWEN, QWEN Audio)

Zhipu GLM , GitHub - JcandZero/ComfyUI_GLM4Node , GitHub - Nojahhh/ComfyUI_GLM4_Wrapper

Models

Keywords: HuggingFace - text-generation , InoReader - Algorithm

AIModels.fyi
Comfy-Org
HuggingFace
ModelScope
AlexGeNovese checkpoint , clip , clip_vision , controlnet , facerestore , ipadapters , loras , sams , vae , ultralytics
city96 GGUF Qwen-Image , LTX , HunyuanVideo-I2V
DiffBot diffbot-llm-inference , diffy.chat demo
Edge Models Falcon Falcon-H1 , HuggingFace Vision Language Model - SmolVLM-500M-Instruct-WebGPU , SmolLM3-3B (web) , Liquid.AI LFM2: On-Device Models , Edge Models , LFM2.5 Models , MiroMind MiroThinker
HuiHui-ai abliterated models , Huihui-Qwen3-VL-8B-Instruct-abliterated , coder3101/Qwen3-VL-8B-Thinking-heretic
In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer , spaces/RiverZ/ICEdit
IamCreateAI/Ruyi
ByteDance - 1.58-bit FLUX
Hugging Face for Legal , HFforLegal/datasets
IPAdapter (FaceID, clip-vision, LORA)
Kijai Skyreels , LTXV , HunyuanVideo
MonsterMMORPG Wan - GGUF , Upscale , FaceSegments , Yolo
Mohamed bin Zayed University of Artificial Intelligence (MBZUAI) , UAE Institute of Foundation Models (IFM) , Sherkala (English, Russian, and Turkish) , K2-Think
Ostris qwen_edit_inpainting
PowerInfer , SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local , Paper
QuantStack GGUF Wan2.2-I2V-A14B , Qwen-Image-Distill , FLUX.1-Kontext-dev , LTXV-13B-0.9.8-distilled , Wan2.1_I2V_14B_FusionX
Reaslim TensorArt - Extra-Realistic-Flux , TensorArt - kg_09
StrangerZone StrangerZone LORA (Flux-Super-Realism-LoRA, Super 3D - Engine)
Swiss-AI - Apertus , Swiss-AI - Projects
SVDQuant , mit-han-lab/ComfyUI-nunchaku
TheBloke (>4K)
TildeOpen LLM: Europe's Sovereign Multilingual AI , TildeAI/TildeOpen-30b
Unsloth.ai Unsloth.ai , UnSloth (>300) , GitHub - UnSloth AI , unsloth/deepseek-v3 , phi-4-all-versions , Fine-tune & Run Qwen3 , Fine-tuning TTS models (Sesame's CSM, Orpheus)

Upscale SUPIR

Keywords: Awesome-video-super-resolution-diffusion , Awesome Diffusion Models for Video Super-Resolution , OpenModelDB , HuggingFace - Phips , realistic skin

Video

Keywords: GitHub - Awesome Video Diffusion , GitHub - Awesome-LLMs-for-Video-Understanding

3D OpenPose / PoseNet / DepthMap

Keywords: VAST-AI-Research/repositories , CivitAI - poses , CivitAI - openpose

3D - 2D to 3D Monocular / NeRF / Gaussian Splatting / Multi-view

Keywords: GitHub - awesome-gaussians , 3D Gaussian Splatting Papers , Awesome 3D Diffusion , GitHub - awesome-3D-gaussian-splatting

Agents

Keywords: Awesome Adaptation of Agentic AI , GitHub - LLM-Agents-Papers , Google Scholar , GitHub - restyler/awesome-n8n , GitHub - enescingoz/awesome-n8n-templates

Simulation Worlds, GIS & World Models

Keywords: Awesome World Models for Robotics , Benchmark - WorldScore

Datasets

Amazon Berkeley Objects (ABO) Dataset (household items)
HumanRig - Learning Automatic Rigging for Humanoid Character in a Large Scale Dataset
CivitAI-As-Characters
FineVision: Open Data Is All You Need (200 datasets containing 17M images, 89M question-answer turns, and 10B answer tokens, totaling 5TB of high-quality data)
Yuan-ManX/ai-audio-datasets
Cartoon Movement (Kenny Tosh)
Data No Culture Left Behind: ArtELingo-28, a Benchmark of WikiArt with Captions in 28 Languages
Data Common Pile v0.1
Data Meta Omnilingual ASR Corpus
Data - Faces CelebV-HQ: A Large-scale Video Facial Attributes Dataset , TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis
Data - Humans HUMOTO: A 4D Dataset of Mocap Human Object Interactions
Data - Movies Movie-Drama scripts
Data - Occupations O*NET database (800 US occupations)
HuggingFace FineWeb dataset consists of more than 18.5T tokens (originally 15T tokens) of cleaned and deduplicated English web data from CommonCrawl , FineWeb-Edu dataset consists of 1.3T tokens (FineWeb-Edu) and 5.4T tokens (FineWeb-Edu-score-2)
Images Eigen-Banana-Qwen-Image-Edit: Lightning-Fast Instruction-Based Image Editing with Pico-Banana-400K
MOSEv2: A More Challenging Dataset for Video Object Segmentation in Complex Scenes
NTU NTU EEE - Digital Signal Processing Laboratory , Research Data
Nvidia Granary - Multilingual Speech AI , nvidia/PhysicalAI-Autonomous-Vehicles-NuRec
UniqueData , UniqueData/facial-emotion-recognition-dataset
Cartoons Cartoon Movement - Israeli-Palestinian-Conflict , Israel-War-Cycle , Paresh Nath, India , Marian Kamensky, Austria , Kenny Tosh, Nigeria , ThinkChina

Lighting

Keywords: CivitAI - lighting

Text - Translation / OCR / Storyboarding

Keywords: OmniDocBench , Hybrid OCR-LLM Framework for Enterprise-Scale Document Information Extraction Under Copy-heavy Task

OCR DeepSeek-OCR-2 , dots.ocr: Multilingual Document Layout Parsing in a Single Vision-Language Model , FireRed-OCR , GLM-OCR , HunyuanOCR-1B , LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family , PaddleOCR
Doc/Text To LORA Doc-to-LoRA and Text-to-LoRA
Translation Tencent-Hunyuan/HY-MT , Google TranslateGemma , Cohere Tiny-aya
