Name	Name	Last commit message	Last commit date
Latest commit History 12 Commits
assets	assets
external	external
include	include
resources	resources
src	src
vc2022	vc2022
.gitignore	.gitignore
.gitmodules	.gitmodules
LICENSE	LICENSE
README.md	README.md

AIDrawing: Real-time Human-AI Collaborative Drawing

ai_drawing_4x.mp4

An interactive C++ drawing application that combines AI-powered visual completion through real-time canvas interpretation and image generation.
Built with the Cinder framework, this application allows users to draw naturally without entering prompts — an AI model analyzes the artwork in real time and collaborates with an image diffusion pipeline (StreamDiffusion) to augment and auto-complete the drawing.

Features

Interactive Vector Drawing: Cursor-based drawing
AI Canvas Analysis: Real-time interpretation of drawings using Ollama vision models
AI Image Generation: Real-time augmentation of drawings through diffusion models
Real-time Communication:
- Spout integration for texture sharing between applications
- OSC messaging for communication with TouchDesigner and other tools
Advanced Drawing System:
- Undo/redo via command pattern
- Stroke smoothing and dynamic width
- Customizable colors and brushes

Examples

(All GIFs below show AI-assisted drawing auto-completion in real time.)

Final result	Animation (4X speed)

Application Architecture

Class Hierarchy

App (Cinder) → CinderApp → DrawingApp → App

Core Components

Drawing System (`vdraw` namespace)

Vec2 — 2D vector math
Color — RGBA color representation
StrokePoint — Points with position, pressure, timestamp
Stroke — Collection of stroke points with styling
Drawing — Main canvas for stroke management
DrawingCommand — Command pattern for undo/redo

AI Integration

OllamaClient — Interface to Ollama local AI models for vision analysis
ThreadSafeList — Thread-safe container for AI results

Communication

Spout — Real-time texture sharing (Windows)
OSC — Open Sound Control messaging

System Overview

 ┌────────────────────────────────────────────────────────────┐
 │ User │
 │ (Interactive Drawing) │
 └───────────────┬────────────────────────────────────────────┘
 │
 ▼
 ┌────────────────────┐
 │ AIDrawing │
 │ (Cinder App) │
 │────────────────────│
 │ - Captures strokes │
 │ - Renders drawing │
 │ - Manages AI I/O │
 └─────────┬──────────┘
 │
 Drawing image │ Vision prompt
 ┌────────┴────────┐
 │ │
 ▼ ▼
 ┌───────────────────┐ ┌───────────────────────────┐
 │ OllamaClientCinder│ │ StreamDiffusionSpoutService│
 │ (AI Interpretation│ │ (AI Image Generation) │
 │ via Ollama) │ │ via Diffusion Models) │
 │───────────────────│ │───────────────────────────│
 │ - Sends drawing │ │ - Receives image+prompt │
 │ to Ollama model │ │ - Generates enhanced img │
 │ - Gets description│ │ - Outputs via Spout │
 └─────────┬─────────┘ └────────────┬──────────────┘
 │ │
 └───────────┬──────────────┘
 ▼
 ┌──────────────────────────────┐
 │ Real-time Compositing │
 │ (Cinder + Spout Integration) │
 │ - Combines AI output + drawing│
 │ - Displays augmented canvas │
 └──────────────────────────────┘

Dependencies & Libraries

Core Framework

Cinder — Creative coding framework for C++
- OpenGL rendering
- Cross-platform windowing and input handling
- Image processing and file I/O

AI Integration

Ollama — Local AI model inference
- Supports vision models like LLaVA for image analysis
- REST API for local communication
- Runs locally, no cloud dependencies
OllamaClientCinder — Cross-framework C++ client for Ollama API
- Works with Cinder and OpenFrameworks
- Supports text and vision models
- Async/sync APIs for flexible integration
- Windows support via WinHTTP
- No external dependencies (includes Base64 + JSON)
StreamDiffusionSpoutService — Real-time image generation server
- Receives drawing image + prompt
- Performs real-time image-to-image diffusion
- GPU-accelerated output shared via Spout
- Controlled via OSC
- Windows-only (requires Spout)

Real-time Communication

Spout — Real-time texture sharing framework
- GPU texture sharing between applications
- DirectX/OpenGL interoperability
- Popular in VJ and creative coding communities
OSC (Open Sound Control) — Network protocol
- UDP-based messaging
- Used for communication with TouchDesigner, Max/MSP, etc.

Project Structure

ai-drawing/
├── src/ # Source code
│ ├── main.cpp # Application entry point
│ ├── AIDrawingApp.cpp/.h # Main application class with AI integration
│ ├── DrawingApp.cpp/.h # Base drawing functionality
│ ├── CinderApp.cpp/.h # Cinder framework wrapper
│ ├── VectorDrawing.cpp/.h # Vector drawing system
│ ├── ThreadSafeList.* # Thread-safe data structures
│ └── CinderConsole.cpp/.h # Console utilities
├── external/ # External dependencies (git submodules)
│ └── ollama-client/ # C++ client for Ollama API
├── vc2022/ # Visual Studio 2022 project files
│ ├── AIDrawing.sln # Solution file
│ ├── AIDrawing.vcxproj # Project file
│ └── AIDrawing.vcxproj.filters # Project filters
├── include/ # Header files
├── blocks/ # Third-party Cinder blocks (from Cinder install)
├── assets/ # Demo images and animations
├── resources/ # Application resources (icons, etc.)
└── README.md # This file

Getting Started

Prerequisites

Visual Studio 2022 or newer with C++ support
Cinder Framework — Download from libcinder.org and include the Spout block
Ollama — Install from ollama.com
LLaVA model — Run ollama pull llava:7b to download vision model
StreamDiffusionSpoutService — Real-time image generation server (separate application, clone and run separately)
Spout — GPU texture sharing (included via Cinder Spout block)

Note: OllamaClientCinder is included automatically as a git submodule.

Building

Clone this repository with submodules:
```
git clone --recursive https://github.com/olwal/ai-drawing.git
```
If you already cloned without --recursive, initialize submodules:
```
git submodule update --init --recursive
```
Open vc2022/AIDrawing.sln in Visual Studio
Configure Cinder paths in project settings if needed
Build and run (F5)

Setup AI Models

# Install Ollama
# Download from https://ollama.com/
# Pull vision model
ollama pull llava:7b
# Verify installation
ollama list

Running the AI Services

Before launching AIDrawing, make sure both AI backends are running:

Start Ollama — This runs automatically after installation.
- Handles vision analysis of your drawing.
- Provides semantic descriptions that guide the diffusion model.
Run StreamDiffusionSpoutService — Must be active for image generation.
- Receives both the user’s drawing (as an image) and the AI-generated description (as a text prompt).
- Performs real-time image-to-image diffusion to enhance and complete your artwork.
- Sends the generated result back to AIDrawing via Spout (GPU texture sharing).
When both services are active, AIDrawing forms a real-time creative loop:

User draws → AIDrawing captures strokes → OllamaClientCinder analyzes drawing → StreamDiffusionSpoutService generates enhanced imagery → Spout sends result → AIDrawing composites drawing + AI output in real time

Usage

Controls

Mouse: Draw on canvas
T: Toggle text overlay
Escape: Exit application

AI Models

The application supports any Ollama vision model, for example:

llava:7b (default) - Good balance of speed and quality
llava:13b - Higher quality, slower inference
granite3.2-vision - Alternative vision model

Change models by modifying the model variable in AIDrawingApp::setup().

Performance Notes

Closed-Loop AI Augmentation
AIDrawing forms a continuous feedback cycle where each user stroke is interpreted by the vision model and enhanced by diffusion — no text prompting required.
Real-Time Vision Analysis
OllamaClientCinder communicates with Ollama for multimodal inference, asynchronously generating semantic descriptions of the drawing.
AI Image Generation
StreamDiffusionSpoutService takes the current drawing and its interpreted prompt to perform real-time image-to-image diffusion, producing AI-augmented content streamed back via Spout.
Concurrent Pipeline
Drawing, inference, and generation operate in separate threads, coordinated by a ThreadSafeList for synchronized data sharing.
GPU Optimization
Spout enables zero-copy GPU texture sharing for seamless real-time compositing between the Cinder app and the diffusion server.
Recommended Hardware
- GPU: ≥ 8 GB VRAM (for stable diffusion models)
- CPU: ≥ 6 cores for smooth concurrency and real-time updates

Contributing

The codebase follows standard C++ practices with:

Command pattern for undo/redo
Namespace organization (vdraw)
Clear separation of concerns between drawing, AI, and communication

Author

Alex Olwal

Related works: Closed-Loop AI Drawing Tools with VLM Interpretation

Graffiti-X - Real-time generative drawing system that interprets both formal intent (line quality, proportion) and contextual intent (semantic meaning) using VLMs for multi-user collaborative creation.
SketchAgent - Language-driven sequential sketch generation using multimodal LLMs that enables stroke-by-stroke collaboration and chat-based editing of sketches.
AutoSketch - VLM-assisted style-aware vector sketch completion that iteratively analyzes and adjusts partial sketches to match the original drawing style.
CadVLM - End-to-end vision language model for parametric CAD sketch generation that performs autocompletion by interpreting partial sketches as both image and text inputs.

Folders and files

Latest commit

History

Repository files navigation

AIDrawing: Real-time Human-AI Collaborative Drawing

Features

Examples

Application Architecture

Class Hierarchy

Core Components

Drawing System (vdraw namespace)

AI Integration

Communication

System Overview

Dependencies & Libraries

Core Framework

AI Integration

Real-time Communication

Project Structure

Getting Started

Prerequisites

Building

Setup AI Models

Running the AI Services

Usage

Controls

AI Models

Performance Notes

Contributing

Author

Related works: Closed-Loop AI Drawing Tools with VLM Interpretation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Drawing System (`vdraw` namespace)

Packages