Generating Sequential CAD Designs from
Beginner-to-Expert Level Text Prompts
Text2CAD is the first AI framework for generating parametric CAD designs from multi-level textual descriptions, supporting prompts from abstract shape descriptions to detailed parametric instructions.
Key Contributions
Novel Data Annotation Pipeline — Leverages open-source LLMs and VLMs to annotate the DeepCAD dataset with multi-level text prompts of varying complexity and parametric detail.
Text2CAD Transformer — An end-to-end Transformer-based autoregressive architecture for generating complete CAD design history from natural language prompts.
Our pipeline generates multi-level text prompts describing CAD construction workflow with varying complexities via a two-stage approach:
The Text2CAD Transformer converts natural language into parametric 3D CAD models by deducing intermediate design steps autoregressively. Given text prompt \(T\) and CAD subsequence \(\mathbf{C}_{1:t-1}\), a pretrained BERT Encoder with trainable Adaptive layer extracts \(T_{adapt}\), which passes through \(\mathbf{L}\) decoder blocks alongside CAD sequence embedding \(F^0_{t-1}\).
Two evaluation strategies assess Text2CAD performance:
Select a tab to explore results. Hover over charts for values.
Win-rate breakdown by prompt complexity — GPT-4 evaluation
Abstract
Beginner
Intermediate
Expert
Win-rate breakdown by prompt complexity — Human evaluation
Abstract
Beginner
Intermediate
Expert
This work was in parts supported by the EU Horizon Europe Framework under grant agreement 101135724 (LUMINOUS).
If you find our work useful, please cite: