Introduction
Generative models have made producing images relatively easy. A persistent weakness remains: text and numbers embedded in those images are often wrong. Sam Collins's "underdrawing" method addresses this problem by fixing the precise details first and leaving only the visual styling to the model.
The Challenge of Generative Models
Generative models such as Gemini 3.0 Pro and ChatGPT Images are very good at creating appealing visuals. However, when asked to embed text or keep numbers in the correct order, they often fail. For instance, rendering a game board with 50 stones numbered in a precise order is a significant challenge for them.
The "Underdrawing" Method
Step 1: Create the "Underdrawing"
The "underdrawing" method begins by creating an accurate base with deterministic tools. SVG or HTML lets you define the exact position and orientation of every number and piece of text. This step produces a base image in which the pixels of the numbers and text are already correctly placed.
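As an illustration of this step, here is a minimal sketch in Python that writes an SVG with each label at an exact position and rotation. The function name make_underdrawing, the canvas size, and the font settings are assumptions made for this example and are not part of the original method description.

```python
from xml.sax.saxutils import escape


def make_underdrawing(labels, width=400, height=200):
    """Return an SVG string with each label drawn at its exact position.

    `labels` is a list of (text, x, y, rotation_in_degrees) tuples.
    The name and defaults are hypothetical, chosen for this sketch only.
    """
    parts = [
        f'<svg xmlns="http://www.w3.org/2000/svg" width="{width}" '
        f'height="{height}" viewBox="0 0 {width} {height}">',
        f'<rect width="{width}" height="{height}" fill="white"/>',
    ]
    for text, x, y, angle in labels:
        # Center the label on (x, y) and rotate it around that same point.
        parts.append(
            f'<text x="{x}" y="{y}" font-family="sans-serif" font-size="24" '
            f'text-anchor="middle" dominant-baseline="central" '
            f'transform="rotate({angle} {x} {y})">{escape(str(text))}</text>'
        )
    parts.append("</svg>")
    return "\n".join(parts)


svg = make_underdrawing([("START", 60, 100, 0), ("7", 200, 100, 15), ("END", 340, 100, 0)])
```

The resulting string can be saved as a .svg file and rasterized with whatever tool you prefer before it is handed to the image model. That conversion step is left out here because it depends on your toolchain.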
Step 2: The Generative Application
Next, this base image is passed to a generative image model as an underlayer. A multimodal model such as Gemini 3.0 Pro generates the final image on top of this base, preserving the precision of its details.
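How the base image reaches the model depends on the provider, so the following is only a sketch of the idea: bundle the rasterized underdrawing with an instruction that asks the model to restyle the image while leaving the text untouched. The function build_restyle_request, the payload layout, and the prompt wording are all hypothetical and do not correspond to any specific API.

```python
import base64


def build_restyle_request(underdrawing_png: bytes, style: str) -> dict:
    """Bundle the underdrawing and a style instruction into one payload.

    The dict layout is hypothetical; adapt it to your image model's real API.
    """
    prompt = (
        f"Restyle the attached image as {style}. "
        "Keep every number and piece of text exactly as drawn: "
        "same characters, same positions, same orientation, same order. "
        "Do not add, remove, or renumber anything."
    )
    return {
        "prompt": prompt,
        "image": {
            "mime_type": "image/png",
            "data_base64": base64.b64encode(underdrawing_png).decode("ascii"),
        },
    }


# Placeholder bytes stand in for a real rasterized underdrawing.
request = build_restyle_request(b"\x89PNG placeholder", "a diorama of artisanal chocolate")
```

Keeping the instruction explicit about what must not change is a design choice of this sketch, not a documented requirement of any model.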
Use Case: Spiral Game Board
Take the example of a game board with 50 stones arranged in a spiral. With the "underdrawing" method, you start by creating an SVG in which the stones are numbered from 1 to 50. The model then transforms this base into a diorama of artisanal chocolate in which each stone becomes a candy, while the order and numbering stay correct.
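A possible way to build the base for this board is sketched below in Python. It places 50 points roughly evenly along an Archimedean spiral and draws a numbered circle at each one. The spacing values, the stone radius, and the choice to put stone 1 at the outer end of the spiral are assumptions for this example; the original description does not specify them.

```python
import math


def spiral_points(count=50, spacing=46.0, ring_gap=50.0, start_radius=40.0):
    """Return `count` (x, y) points spaced roughly evenly along a spiral.

    Points are centered on (0, 0) and ordered from outermost to innermost.
    """
    growth = ring_gap / (2 * math.pi)  # radius gained per radian
    theta = start_radius / growth
    points = []
    for _ in range(count):
        radius = growth * theta
        points.append((radius * math.cos(theta), radius * math.sin(theta)))
        theta += spacing / radius  # step about `spacing` units along the curve
    points.reverse()  # assumption: stone 1 sits at the outer end of the spiral
    return points


def board_svg(points, stone_radius=18, margin=20):
    """Draw one numbered circle per point; numbering follows the list order."""
    extent = max(math.hypot(x, y) for x, y in points) + stone_radius + margin
    size = round(2 * extent)
    parts = [
        f'<svg xmlns="http://www.w3.org/2000/svg" width="{size}" '
        f'height="{size}" viewBox="0 0 {size} {size}">',
        f'<rect width="{size}" height="{size}" fill="white"/>',
    ]
    for number, (x, y) in enumerate(points, start=1):
        # Shift from spiral-centered coordinates to SVG canvas coordinates.
        cx, cy = round(x + extent, 1), round(y + extent, 1)
        parts.append(
            f'<circle cx="{cx}" cy="{cy}" r="{stone_radius}" '
            f'fill="#ddd" stroke="black"/>'
        )
        parts.append(
            f'<text x="{cx}" y="{cy}" font-family="sans-serif" font-size="16" '
            f'text-anchor="middle" dominant-baseline="central">{number}</text>'
        )
    parts.append("</svg>")
    return "\n".join(parts)


points = spiral_points()
svg = board_svg(points)
```

Stepping the angle by spacing divided by the current radius keeps the stones about the same distance apart along the curve, so they do not crowd together near the center as they would with equal angular steps.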
Why It Works
The method works because it combines deterministic tools for precision with generative models for aesthetics. It closes the gap that current AI models have in rendering text and numbers accurately.
Conclusion
The "underdrawing" method opens new avenues for developers and entrepreneurs working with AI-generated images. It offers a concrete solution to a long-standing frustration and makes possible a level of precision that generative models alone struggle to reach.